OpenAI: GPT Audio

OpenAIID: openai/gpt-audio

The gpt-audio model is OpenAI's first generally available audio model. The new snapshot features an upgraded decoder for more natural sounding voices and maintains better voice consistency. Audio is priced at $32 per million input tokens and $64 per million output tokens.

Pricing per 1M Tokens

Input (Prompt)	$2.50
Output (Completion)	$10
Cache Read	Free
Cache Write	Free
Image	N/A

Specifications

Context Length	128K
Max Output Tokens	16K
Input Modalities	Text + Audio
Output Modalities	Text + Audio
Tokenizer	GPT
Instruct Type	N/A
Top Provider Context	128K
Top Provider Max Output	16K
Moderated	Yes