OpenAI: gpt-oss-20b

OpenAIID: openai/gpt-oss-20b

gpt-oss-20b is an open-weight 21B parameter model released by OpenAI under the Apache 2.0 license. It uses a Mixture-of-Experts (MoE) architecture with 3.6B active parameters per forward pass, optimized for lower-latency inference and deployability on consumer or single-GPU hardware. The model is trained in OpenAI’s Harmony response format and supports reasoning level configuration, fine-tuning, and agentic capabilities including function calling, tool use, and structured outputs.

Pricing per 1M Tokens

Input (Prompt)	$0.03
Output (Completion)	$0.11
Cache Read	$0.01
Cache Write	Free
Image	N/A

Specifications

Context Length	131K
Max Output Tokens	131K
Input Modalities	Text
Output Modalities	Text
Tokenizer	GPT
Instruct Type	N/A
Top Provider Context	131K
Top Provider Max Output	131K
Moderated	No