sagemaker.core.serverless_inference_config

This module contains code related to the ServerlessInferenceConfig class.

The code here configures serverless inference endpoints. Use it when deploying a model to an endpoint.

Classes

ServerlessInferenceConfig([...])

Configuration object passed in when deploying models to Amazon SageMaker Endpoints.

class sagemaker.core.serverless_inference_config.ServerlessInferenceConfig(memory_size_in_mb: int = 2048, max_concurrency: int = 5, provisioned_concurrency: int | None = None)

Bases: object

Configuration object passed in when deploying models to Amazon SageMaker Endpoints.

This object specifies the configuration for a serverless endpoint. Use it when creating a serverless endpoint and running serverless inference.
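As a rough illustration of how the three constructor arguments fit together, the sketch below checks a set of values before they are handed to ServerlessInferenceConfig. The helper serverless_config_kwargs and the limits it enforces are assumptions made for this example, not part of the module: the allowed memory sizes and the concurrency ceiling reflect the SageMaker serverless inference service documentation as understood at the time of writing and should be verified against current limits. Only the parameter names and defaults come from the signature above.

```python
from typing import Optional

# Assumed service limits (not defined by this module); check the current
# SageMaker serverless inference documentation before relying on them.
ALLOWED_MEMORY_MB = (1024, 2048, 3072, 4096, 5120, 6144)
MAX_CONCURRENCY_LIMIT = 200


def serverless_config_kwargs(
    memory_size_in_mb: int = 2048,
    max_concurrency: int = 5,
    provisioned_concurrency: Optional[int] = None,
) -> dict:
    """Hypothetical helper: validate values and return constructor kwargs."""
    if memory_size_in_mb not in ALLOWED_MEMORY_MB:
        raise ValueError(
            f"memory_size_in_mb must be one of {ALLOWED_MEMORY_MB}, "
            f"got {memory_size_in_mb}"
        )
    if not 1 <= max_concurrency <= MAX_CONCURRENCY_LIMIT:
        raise ValueError(
            f"max_concurrency must be between 1 and {MAX_CONCURRENCY_LIMIT}, "
            f"got {max_concurrency}"
        )
    # Provisioned concurrency is optional; when set, it is assumed here
    # that it cannot exceed max_concurrency.
    if provisioned_concurrency is not None and not (
        1 <= provisioned_concurrency <= max_concurrency
    ):
        raise ValueError(
            "provisioned_concurrency must be between 1 and max_concurrency, "
            f"got {provisioned_concurrency}"
        )
    return {
        "memory_size_in_mb": memory_size_in_mb,
        "max_concurrency": max_concurrency,
        "provisioned_concurrency": provisioned_concurrency,
    }


kwargs = serverless_config_kwargs(memory_size_in_mb=4096, max_concurrency=10)

# With the SDK installed, the validated values would be passed straight through:
# from sagemaker.core.serverless_inference_config import ServerlessInferenceConfig
# config = ServerlessInferenceConfig(**kwargs)
```

Validating up front is a design choice for this sketch: it surfaces an out-of-range value locally rather than at endpoint creation time. Whether the class performs its own checks is not stated on this page.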