Skip to main content
Model deployment management for serving infrastructure. This class manages containerized model deployments including resource allocation, scaling, and activation status.

Examples

Get and update deployment:
# Get deployment
deployment = ModelDeployment.of(model_id=model.id)

# Update resources
deployment.replicas = 3
deployment.cpu = 1000
deployment.memory = 2048
job = deployment.update()
job.wait()
Initialize ModelDeployment instance.

Parameters

model_id
UUID | str
required
Model identifier

update()

Update an existing model deployment.

Returns

Job

classmethod of()

Get model deployment instance of the given model

Parameters

model_id
UUID | str
required
Model identifier

Returns

ModelDeployment instance