Meta: Llama 4 Scout

MetaID: meta-llama/llama-4-scout

Llama 4 Scout 17B Instruct (16E) is a mixture-of-experts (MoE) language model developed by Meta, activating 17 billion parameters out of a total of 109B. It supports native multimodal input (text and image) and multilingual output (text and code) across 12 supported languages. Designed for assistant-style interaction and visual reasoning, Scout uses 16 experts per forward pass and features a context length of 10 million tokens, with a training corpus of ~40 trillion tokens. Built for high efficiency and local or commercial deployment, Llama 4 Scout incorporates early fusion for seamless modality integration. It is instruction-tuned for use in multilingual chat, captioning, and image understanding tasks. Released under the Llama 4 Community License, it was last trained on data up to August 2024 and launched publicly on April 5, 2025.

Pricing per 1M Tokens

Input (Prompt)	$0.08
Output (Completion)	$0.30
Cache Read	Free
Cache Write	Free
Image	N/A

Specifications

Context Length	328K
Max Output Tokens	16K
Input Modalities	Text + Image
Output Modalities	Text
Tokenizer	Llama4
Instruct Type	N/A
Top Provider Context	328K
Top Provider Max Output	16K
Moderated	No