NVIDIA: Nemotron 3 Nano 30B A3B
ID: nvidia/nemotron-3-nano-30b-a3b
NVIDIA Nemotron 3 Nano 30B A3B is a small mixture-of-experts (MoE) language model designed for high compute efficiency and accuracy, aimed at developers building specialized agentic AI systems. The model is fully open: its weights, datasets, and recipes are all available, so developers can customize, optimize, and deploy it on their own infrastructure for privacy and security.
Pricing per 1M Tokens
| Token Type | Price (USD) |
| --- | --- |
| Input (Prompt) | $0.05 |
| Output (Completion) | $0.20 |
| Cache Read | Free |
| Cache Write | Free |
| Image | N/A |
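As a rough illustration of how the per-million-token prices translate into a per-request cost, the following minimal sketch multiplies token counts by the listed input and output rates. The helper name `estimate_cost` and the example token counts are hypothetical, and the sketch assumes billing is strictly linear in token count with no minimum charge; cache reads and writes are listed as free, so they are left out.

```python
from decimal import Decimal

# Prices taken from the pricing table, in USD per 1,000,000 tokens.
INPUT_PRICE_PER_M = Decimal("0.05")
OUTPUT_PRICE_PER_M = Decimal("0.20")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(prompt_tokens: int, completion_tokens: int) -> Decimal:
    """Estimate the USD cost of one request (hypothetical helper).

    Assumes cost scales linearly with token count; actual billing
    may round or apply rules not described on this page.
    """
    if prompt_tokens < 0 or completion_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = INPUT_PRICE_PER_M * prompt_tokens / TOKENS_PER_M
    output_cost = OUTPUT_PRICE_PER_M * completion_tokens / TOKENS_PER_M
    return input_cost + output_cost


if __name__ == "__main__":
    # Illustrative request: 200,000 prompt tokens, 10,000 completion tokens.
    print(f"Estimated cost: ${estimate_cost(200_000, 10_000)}")
```

Using `Decimal` rather than `float` avoids binary rounding noise when summing many small per-request amounts.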
Specifications
| Specification | Value |
| --- | --- |
| Context Length | 262K |
| Max Output Tokens | N/A |
| Input Modalities | Text |
| Output Modalities | Text |
| Tokenizer | Other |
| Instruct Type | N/A |
| Top Provider Context | 262K |
| Top Provider Max Output | N/A |
| Moderated | No |
Last updated: March 23, 2026
First tracked: March 23, 2026