NVIDIA: Nemotron 3 Nano 30B A3B

NVIDIAID: nvidia/nemotron-3-nano-30b-a3b

NVIDIA Nemotron 3 Nano 30B A3B is a small language MoE model with highest compute efficiency and accuracy for developers to build specialized agentic AI systems. The model is fully open with open-weights, datasets and recipes so developers can easily customize, optimize, and deploy the model on their infrastructure for maximum privacy and security.

Pricing per 1M Tokens

Input (Prompt)$0.05
Output (Completion)$0.20
Cache ReadFree
Cache WriteFree
ImageN/A

Specifications

Context Length262K
Max Output TokensN/A
Input ModalitiesText
Output ModalitiesText
TokenizerOther
Instruct TypeN/A
Top Provider Context262K
Top Provider Max OutputN/A
ModeratedNo

More from NVIDIA

Last updated: March 23, 2026

First tracked: March 23, 2026