NVIDIA: Nemotron Nano 9B V2 (free)

Free
NVIDIAID: nvidia/nemotron-nano-9b-v2:free

NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA, and designed as a unified model for both reasoning and non-reasoning tasks. It responds to user queries and tasks by first generating a reasoning trace and then concluding with a final response. The model's reasoning capabilities can be controlled via a system prompt. If the user prefers the model to provide its final answer without intermediate reasoning traces, it can be configured to do so.

Pricing per 1M Tokens

Input (Prompt)Free
Output (Completion)Free
Cache ReadFree
Cache WriteFree
ImageN/A

Specifications

Context Length128K
Max Output TokensN/A
Input ModalitiesText
Output ModalitiesText
TokenizerOther
Instruct TypeN/A
Top Provider Context128K
Top Provider Max OutputN/A
ModeratedNo

More from NVIDIA

Last updated: March 23, 2026

First tracked: March 23, 2026