Tuesday, January 21, 2025

What is Variant Weight in SageMaker?

You can host multiple production variants behind a single Amazon SageMaker endpoint. Each production variant has an initial variant weight, and SageMaker distributes invocation requests across the variants according to the ratio of each variant's weight to the sum of all weights. For example, if you have only one production variant with a weight of 1, all traffic goes to that variant. If you add a second production variant with an initial weight of 2, the new variant receives 2/3 of the traffic and the first variant receives 1/3.
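The ratio arithmetic can be sketched in a few lines of Python. This is only an illustration of the weight-to-total calculation, not SageMaker's routing code: the helper `traffic_shares` and the variant names `variant-a` and `variant-b` are made up for the example, while the `VariantName` and `InitialVariantWeight` keys mirror the fields of the ProductionVariant API.

```python
from fractions import Fraction


def traffic_shares(variants):
    """Return each variant's expected share of traffic: weight / sum of weights.

    Hypothetical helper for illustration; SageMaker does this routing internally.
    """
    # Convert via str() so that int and float weights are both handled exactly.
    weights = {
        v["VariantName"]: Fraction(str(v["InitialVariantWeight"]))
        for v in variants
    }
    total = sum(weights.values())
    if total <= 0:
        raise ValueError("at least one variant must have a positive weight")
    return {name: weight / total for name, weight in weights.items()}


# Same scenario as in the text: an existing variant with weight 1,
# plus a newly added variant with weight 2.
variants = [
    {"VariantName": "variant-a", "InitialVariantWeight": 1},
    {"VariantName": "variant-b", "InitialVariantWeight": 2},
]

shares = traffic_shares(variants)
for name, share in shares.items():
    print(f"{name}: {share} ({float(share):.1%})")
```

With the weights above, the first variant's share works out to 1/3 and the second's to 2/3. These are expected proportions over many requests, so a small sample of calls may not split exactly this way.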

References:

https://docs.aws.amazon.com/sagemaker/latest/APIReference/API_ProductionVariant.html

