NVIDIA: Nemotron Nano 9B V2 (free)

Free

NVIDIAID: nvidia/nemotron-nano-9b-v2:free

NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA, and designed as a unified model for both reasoning and non-reasoning tasks. It responds to user queries and tasks by first generating a reasoning trace and then concluding with a final response. The model's reasoning capabilities can be controlled via a system prompt. If the user prefers the model to provide its final answer without intermediate reasoning traces, it can be configured to do so.

Pricing per 1M Tokens

Input (Prompt)	Free
Output (Completion)	Free
Cache Read	Free
Cache Write	Free
Image	N/A

Specifications

Context Length	128K
Max Output Tokens	N/A
Input Modalities	Text
Output Modalities	Text
Tokenizer	Other
Instruct Type	N/A
Top Provider Context	128K
Top Provider Max Output	N/A
Moderated	No