Meta: Llama 4 Scout
Llama 4 Scout 17B Instruct (16E) is a mixture-of-experts (MoE) language model developed by Meta, activating 17 billion parameters out of a total of 109B. It supports native multimodal input (text and image) and multilingual output (text and code) across 12 supported languages. Designed for assistant-style interaction and visual reasoning, Scout uses 16 experts per forward pass and features a context length of 10 million tokens, with a training corpus of ~40 trillion tokens. Built for high efficiency and local or commercial deployment, Llama 4 Scout incorporates early fusion for seamless modality integration. It is instruction-tuned for use in multilingual chat, captioning, and image understanding tasks. Released under the Llama 4 Community License, it was last trained on data up to August 2024 and launched publicly on April 5, 2025.
Pricing per 1M Tokens
| Input (Prompt) | $0.08 |
| Output (Completion) | $0.30 |
| Cache Read | Free |
| Cache Write | Free |
| Image | N/A |
Specifications
| Context Length | 328K |
| Max Output Tokens | 16K |
| Input Modalities | Text + Image |
| Output Modalities | Text |
| Tokenizer | Llama4 |
| Instruct Type | N/A |
| Top Provider Context | 328K |
| Top Provider Max Output | 16K |
| Moderated | No |
Compare this model
See how Meta: Llama 4 Scout stacks up against other models.
More from Meta
Last updated: March 23, 2026
First tracked: March 23, 2026