NVIDIA: Nemotron Nano 9B V2

NVIDIAID: nvidia/nemotron-nano-9b-v2

NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA, and designed as a unified model for both reasoning and non-reasoning tasks. It responds to user queries and tasks by first generating a reasoning trace and then concluding with a final response. The model's reasoning capabilities can be controlled via a system prompt. If the user prefers the model to provide its final answer without intermediate reasoning traces, it can be configured to do so.

Pricing per 1M Tokens

Input (Prompt)$0.04
Output (Completion)$0.16
Cache ReadFree
Cache WriteFree
ImageN/A

Specifications

Context Length131K
Max Output TokensN/A
Input ModalitiesText
Output ModalitiesText
TokenizerOther
Instruct TypeN/A
Top Provider Context131K
Top Provider Max OutputN/A
ModeratedNo

More from NVIDIA

Last updated: March 23, 2026

First tracked: March 23, 2026