Skip to content
No models found
OpenRouter
© 2026 OpenRouter, Inc

Product

  • Chat
  • Rankings
  • Apps
  • Models
  • Providers
  • Pricing
  • Enterprise
  • Labs

Company

  • About
  • Announcements
  • CareersHiring
  • Privacy
  • Terms of Service
  • Support
  • State of AI
  • Works With OR
  • Data

Developer

  • Documentation
  • API Reference
  • SDK
  • Status

Connect

  • Discord
  • GitHub
  • LinkedIn
  • X
  • YouTube
Favicon for openai

OpenAI: GPT Audio Mini

openai/gpt-audio-mini

Compare

A cost-efficient version of GPT Audio. The new snapshot features an upgraded decoder for more natural sounding voices and maintains better voice consistency. Input is priced at $0.60 per million tokens and output is priced at $2.40 per million tokens.

Modalities

Input Price

$0.60/M

Output Price

$2.40/M

Context

128K

Weekly Tokens

74.6M

Released

Jan 19, 2026

OverviewProvidersPerformancePricingAppsActivityUptimeAPI

Sample code and API for GPT Audio Mini

OpenRouter normalizes requests and responses across providers for you.

1

Get your API key

Create an API key from your OpenRouter dashboard and set it as an environment variable:

2

Make your first request

Use openai/gpt-audio-mini with the OpenRouter API:

OpenRouter provides an OpenAI-compatible completion API to 400+ models & providers that you can call directly, or using the OpenAI SDK. Additionally, some third-party SDKs are available.

In the examples below, the OpenRouter-specific headers are optional. Setting them allows your app to appear on the OpenRouter leaderboards.

Using third-party SDKs

For information about using third-party SDKs and frameworks with OpenRouter, please see our frameworks documentation.

3

Enable streaming

Add "stream": true to your request body to receive responses as server-sent events:

Endpoint

Sends a request for a model response for the given chat conversation. Supports both streaming and non-streaming modes.

POSThttps://openrouter.ai/api/v1/chat/completions
AuthorizationBearer $OPENROUTER_API_KEY
Content-Typeapplication/json
HTTP-Refereroptional — your site URL, for rankings
X-Titleoptional — your site name, for rankings
Modelopenai/gpt-audio-mini

Creates a streaming or non-streaming response using the OpenAI Responses API format.

Docs
POSThttps://openrouter.ai/api/v1/responses
AuthorizationBearer $OPENROUTER_API_KEY
Content-Typeapplication/json
HTTP-Refereroptional — your site URL, for rankings
X-Titleoptional — your site name, for rankings
Modelopenai/gpt-audio-mini

Creates a message using the Anthropic Messages API format. Supports text, images, PDFs, tools, and extended thinking.

Docs
POSThttps://openrouter.ai/api/v1/messages
AuthorizationBearer $OPENROUTER_API_KEY
Content-Typeapplication/json
HTTP-Refereroptional — your site URL, for rankings
X-Titleoptional — your site name, for rankings
Modelopenai/gpt-audio-mini

Parameters

NameTypeDefaultDescription
seedinteger—If specified, the inferencing will sample deterministically, such that repeated requests with the same seed and parameters should return the same result.
max_tokensinteger—This sets the upper limit for the number of tokens the model can generate in response.
response_formatmap—Forces the model to produce specific output format.
temperaturefloat1This setting influences the variety in the model's responses.
top_pfloat1This setting limits the model's choices to a percentage of likely tokens: only the top tokens whose probabilities add up to P.
stoparray—Stop generation immediately if the model encounter any token specified in the stop array.
frequency_penaltyfloat0This setting aims to control the repetition of tokens based on how often they appear in the input.
presence_penaltyfloat0Adjusts how often the model repeats specific tokens already used in the input.
logit_biasmap—Accepts a JSON object that maps tokens (specified by their token ID in the tokenizer) to an associated bias value from -100 to 100.
logprobsboolean—Whether to return log probabilities of the output tokens or not.
top_logprobsinteger—An integer between 0 and 20 specifying the number of most likely tokens to return at each token position, each with an associated log probability.
tool_choicestring or object—Controls which (if any) tool is called by the model.
toolsarray—Tool calling parameter, following OpenAI's tool calling request shape.