Skip to content

Quantize and host a Mixtral with SageMaker LMI and evaluate it with SageMaker Clarify

License

Notifications You must be signed in to change notification settings

bhorev/sagemaker_mixtral_quantize_and_eval

Repository files navigation

sagemaker_mixtral_quantize_and_eval

Quantize and host a Mixtral with SageMaker LMI and evaluate it with SageMaker Clarify

mixtral_LMI-8bit.ipynb - deploy an 8bit quantized Mixtral model on a SageMaker Endpoint

mixtral_LMI-bf16.ipynb - deploy an 16bit (bf16) Mixtral model on a SageMaker Endpoint

eval_mixtral.ipynb - Use SageMaker Clarify with fmeval to evaluate the 8bit model

Refereneces

ml.g5.12xlarge

4x NVIDIA A10G

image

About

Quantize and host a Mixtral with SageMaker LMI and evaluate it with SageMaker Clarify

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published