sagemaker.core.serverless_inference_config#
This module contains code related to the ServerlessInferenceConfig class.
Codes are used for configuring serverless inference endpoint. Use it when deploying the model to the endpoints.
Classes
|
Configuration object passed in when deploying models to Amazon SageMaker Endpoints. |
- class sagemaker.core.serverless_inference_config.ServerlessInferenceConfig(memory_size_in_mb: int = 2048, max_concurrency: int = 5, provisioned_concurrency: int | None = None)[source]#
Bases:
objectConfiguration object passed in when deploying models to Amazon SageMaker Endpoints.
This object specifies configuration related to serverless endpoint. Use this configuration when trying to create serverless endpoint and make serverless inference