AWS Trainium & Inferentia documentation
# Notebooks

## EC2
| Notebook | Task | Model Architectures |
|---|---|---|
| Qwen embedding notebook | feature-extraction | Qwen3 |
| Sentence Transformers notebook | sentence-transformers | Sentence Transformers |
| How to generate images with Stable Diffusion | stable-diffusion | Stable Diffusion |
| How to generate images with Stable Diffusion XL | stable-diffusion-xl | Stable Diffusion XL |
| Fine-tune BERT for text classification | text-classification | BERT |
| How to compile (if needed) and generate text with CodeLlama 7B | text-generation | CodeLlama |
| Create your own chatbot with llama-2-13B on AWS Inferentia | text-generation | Llama 2 |
| Fine-tune llama-2-7B on AWS Trainium | fine-tuning | Llama 2 |
## Inference Providers
| Notebook | Task | Model Architectures |
|---|---|---|
| Compare book translations | feature-extraction | Embedding model |
## SageMaker
| Notebook | Task | Model Architectures |
|---|---|---|
| Deploy Llama 3.3 70B on SageMaker | sagemaker | Llama 3.3 |
| Deploy Mixtral 8x7B on SageMaker | sagemaker | Mixtral |