NVIDIA: Nemotron 3 Nano 30B A3B

ID: nvidia/nemotron-3-nano-30b-a3b

NVIDIA Nemotron 3 Nano 30B A3B is a small mixture-of-experts (MoE) language model designed for high compute efficiency and accuracy, aimed at developers building specialized agentic AI systems. The model is fully open, with open weights, datasets, and recipes, so developers can customize, optimize, and deploy it on their own infrastructure for maximum privacy and security.

Pricing per 1M Tokens

Input (Prompt)	$0.05
Output (Completion)	$0.20
Cache Read	Free
Cache Write	Free
Image	N/A
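
As a rough illustration of how the per-token rates above translate into a per-request cost, the following is a minimal sketch. The function name `estimate_cost` and the example token counts are hypothetical; only the two rates come from the table. It ignores caching, since cache reads and writes are listed as free.

```python
# Rates taken from the pricing table above (USD per 1M tokens).
INPUT_PRICE_PER_M = 0.05   # prompt tokens
OUTPUT_PRICE_PER_M = 0.20  # completion tokens


def estimate_cost(prompt_tokens: int, completion_tokens: int) -> float:
    """Return the estimated cost in USD for one request.

    Hypothetical helper for illustration; not part of any provider API.
    """
    total = (prompt_tokens * INPUT_PRICE_PER_M
             + completion_tokens * OUTPUT_PRICE_PER_M)
    return total / 1_000_000


if __name__ == "__main__":
    # Example: 200,000 prompt tokens and 50,000 completion tokens.
    cost = estimate_cost(200_000, 50_000)
    print(f"Estimated cost: ${cost:.4f}")
```

Because output tokens cost four times as much as input tokens, the completion length tends to dominate the bill once responses grow beyond a quarter of the prompt size.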

Specifications

Context Length	262K
Max Output Tokens	N/A
Input Modalities	Text
Output Modalities	Text
Tokenizer	Other
Instruct Type	N/A
Top Provider Context	262K
Top Provider Max Output	N/A
Moderated	No