ModelDeployment
API reference for ModelDeployment
ModelDeployment
Model deployment management for serving infrastructure.
This class manages containerized model deployments including resource allocation, scaling, and activation status.
Parameters
model_id (UUID | str)
Examples
Get and update deployment:
# Get deployment
deployment = ModelDeployment.of(model_id=model.id)
# Update resources
deployment.replicas = 3
deployment.cpu = 1000
deployment.memory = 2048
job = deployment.update()
job.wait()Initialize ModelDeployment instance.
Parameters
Parameter
Type
Required
Default
Description
model_id
UUID | str
✗
None
Model identifier
update()
Update an existing model deployment. Return type: Job
classmethod of(model_id)
Get model deployment instance of the given model
Parameters
Parameter
Type
Required
Default
Description
model_id
UUID | str
✗
None
Model identifier
Returns
ModelDeployment instance Return type: ModelDeployment
Last updated
Was this helpful?