RealtimeResponseCreateParams

idstring

For an item of type (message | function_call | function_call_output)
this field allows the client to assign the unique ID of the item. It is
not required because the server will generate one if not provided.

For an item of type item_reference, this field is required and is a
reference to any item that has previously existed in the conversation.

typestring

The type of the item (message, function_call, function_call_output, item_reference).

Allowed values:messagefunction_callfunction_call_output

objectstring

Identifier for the API object being returned - always realtime.item.

Allowed values:realtime.item

statusstring

The status of the item (completed, incomplete). These have no effect
on the conversation, but are accepted for consistency with the
conversation.item.created event.

Allowed values:completedincomplete

rolestring

The role of the message sender (user, assistant, system), only
applicable for message items.

Allowed values:userassistantsystem

contentarray[object]

The content of the message, applicable for message items.

Message items of role system support only input_text content
Message items of role user support input_text and input_audio
content
Message items of role assistant support text content.

Show Child Parameters

call_idstring

The ID of the function call (for function_call and
function_call_output items). If passed on a function_call_output
item, the server will check that a function_call item with the same
ID exists in the conversation history.

namestring

The name of the function being called (for function_call items).

argumentsstring

The arguments of the function call (for function_call items).

outputstring

The output of the function call (for function_call_output items).

Example

idstring

The unique ID of the response.

objectstring

The object type, must be realtime.response.

Allowed values:realtime.response

statusstring

The final status of the response (completed, cancelled, failed, or
incomplete).

Allowed values:completedcancelledfailedincomplete

status_detailsobject

Additional details about the status.

Show Child Parameters

outputarray[object]

The item to add to the conversation.

Show Child Parameters

metadataobject

Set of 16 key-value pairs that can be attached to an object. This can be
useful for storing additional information about the object in a structured
format, and querying for objects via API or the dashboard.

Keys are strings with a maximum length of 64 characters. Values are strings
with a maximum length of 512 characters.

usageobject

Usage statistics for the Response, this will correspond to billing. A
Realtime API session will maintain a conversation context and append new
Items to the Conversation, thus output from previous turns (text and
audio tokens) will become the input for later turns.

Show Child Parameters

conversation_idstring

Which conversation the response is added to, determined by the conversation
field in the response.create event. If auto, the response will be added to
the default conversation and the value of conversation_id will be an id like
conv_1234. If none, the response will not be added to any conversation and
the value of conversation_id will be null. If responses are being triggered
by server VAD, the response will be added to the default conversation, thus
the conversation_id will be an id like conv_1234.

voicestring

The voice the model used to respond.
Current voice options are alloy, ash, ballad, coral, echo sage,
shimmer and verse.

Allowed values:alloyashballadcoralechosageshimmerverse

modalitiesarray[string]

The set of modalities the model used to respond. If there are multiple modalities,
the model will pick one, for example if modalities is ["text", "audio"], the model
could be responding in either text or audio.

Allowed values:textaudio

output_audio_formatstring

The format of output audio. Options are pcm16, g711_ulaw, or g711_alaw.

Allowed values:pcm16g711_ulawg711_alaw

temperaturenumber

Sampling temperature for the model, limited to [0.6, 1.2]. Defaults to 0.8.

max_output_tokensOne Of

Maximum number of output tokens for a single assistant response,
inclusive of tool calls, that was used in this response.

Variant 1integer

Example

modalitiesarray[string]

The set of modalities the model can respond with. To disable audio,
set this to [“text”].

Allowed values:textaudio

instructionsstring

The default system instructions (i.e. system message) prepended to model
calls. This field allows the client to guide the model on desired
responses. The model can be instructed on response content and format,
(e.g. “be extremely succinct”, “act friendly”, “here are examples of good
responses”) and on audio behavior (e.g. “talk quickly”, “inject emotion
into your voice”, “laugh frequently”). The instructions are not guaranteed
to be followed by the model, but they provide guidance to the model on the
desired behavior.

Note that the server sets default instructions which will be used if this
field is not set and are visible in the session.created event at the
start of the session.

voicestring

The voice the model uses to respond. Voice cannot be changed during the
session once the model has responded with audio at least once. Current
voice options are alloy, ash, ballad, coral, echo sage,
shimmer and verse.

Allowed values:alloyashballadcoralechosageshimmerverse

output_audio_formatstring

The format of output audio. Options are pcm16, g711_ulaw, or g711_alaw.

Allowed values:pcm16g711_ulawg711_alaw

toolsarray[object]

Tools (functions) available to the model.

Show Child Parameters

tool_choicestring

How the model chooses tools. Options are auto, none, required, or
specify a function, like {"type": "function", "function": {"name": "my_function"}}.

temperaturenumber

Sampling temperature for the model, limited to [0.6, 1.2]. Defaults to 0.8.

max_response_output_tokensOne Of

Maximum number of output tokens for a single assistant response,
inclusive of tool calls. Provide an integer between 1 and 4096 to
limit output tokens, or inf for the maximum available tokens for a
given model. Defaults to inf.

Variant 1integer

conversationOne Of

Controls which conversation the response is added to. Currently supports
auto and none, with auto as the default value. The auto value
means that the contents of the response will be added to the default
conversation. Set this to none to create an out-of-band response which
will not add items to default conversation.

Variant 1string

metadataobject

Keys are strings with a maximum length of 64 characters. Values are strings
with a maximum length of 512 characters.

inputarray[object]

The item to add to the conversation.

Show Child Parameters

Example

event_idstringrequired

The unique ID of the server event.

typestringrequired

The event type, must be conversation.created.

Allowed values:conversation.created

conversationobjectrequired

The conversation resource.

Show Child Parameters

Example

Returned when a conversation item is created. There are several scenarios that
produce this event:

The server is generating a Response, which if successful will produce
either one or two Items, which will be of type message
(role assistant) or type function_call.
The input audio buffer has been committed, either by the client or the
server (in server_vad mode). The server will take the content of the
input audio buffer and add it to a new user message Item.
The client has sent a conversation.item.create event to add a new Item
to the Conversation.

event_idstringrequired

The unique ID of the server event.

typestringrequired

The event type, must be conversation.item.created.

Allowed values:conversation.item.created

previous_item_idstringrequired

The ID of the preceding item in the Conversation context, allows the
client to understand the order of the conversation.

itemobjectrequired

The item to add to the conversation.

Show Child Parameters

Example

RealtimeConversationItemWithReference

RealtimeResponse

RealtimeServerEventConversationCreated

RealtimeServerEventConversationItemCreated