Meta: Llama 3.3 70B Instruct
MetaID: meta-llama/llama-3.3-70b-instruct
The Meta Llama 3.3 multilingual large language model (LLM) is a pretrained and instruction tuned generative model in 70B (text in/text out). The Llama 3.3 instruction tuned text only model is optimized for multilingual dialogue use cases and outperforms many of the available open source and closed chat models on common industry benchmarks. Supported languages: English, German, French, Italian, Portuguese, Hindi, Spanish, and Thai. [Model Card](https://github.com/meta-llama/llama-models/blob/main/models/llama3_3/MODEL_CARD.md)
Pricing per 1M Tokens
| Input (Prompt) | $0.10 |
| Output (Completion) | $0.32 |
| Cache Read | Free |
| Cache Write | Free |
| Image | N/A |
Specifications
| Context Length | 131K |
| Max Output Tokens | 16K |
| Input Modalities | Text |
| Output Modalities | Text |
| Tokenizer | Llama3 |
| Instruct Type | llama3 |
| Top Provider Context | 131K |
| Top Provider Max Output | 16K |
| Moderated | No |
Compare this model
See how Meta: Llama 3.3 70B Instruct stacks up against other models.
More from Meta
Last updated: March 23, 2026
First tracked: March 23, 2026