OpenAI API

CompletionUsage

object

Usage statistics for the completion request.

completion_tokensintegerrequired

Number of tokens in the generated completion.

Default:0

prompt_tokensintegerrequired

Number of tokens in the prompt.

Default:0

total_tokensintegerrequired

Total number of tokens used in the request (prompt + completion).

Default:0

completion_tokens_detailsobject

Breakdown of tokens used in a completion.

Show Child Parameters
prompt_tokens_detailsobject

Breakdown of tokens used in the prompt.

Show Child Parameters
Example

CostsResult

object

The aggregated costs details of the specific time bucket.

objectstringrequired

Allowed values:organization.costs.result

amountobject

The monetary value in its associated currency.

Show Child Parameters
line_itemstring

When group_by=line_item, this field provides the line item of the grouped costs result.

project_idstring

When group_by=project_id, this field provides the project ID of the grouped costs result.

Example

CreateAssistantRequest

object
* Additional properties are NOT allowed.
modelAny Of
required

ID of the model to use. You can use the List models API to see all of your available models, or see our Model overview for descriptions of them.

Example:gpt-4o

Variant 1string
namestring

The name of the assistant. The maximum length is 256 characters.

<= 256 characters

descriptionstring

The description of the assistant. The maximum length is 512 characters.

<= 512 characters

instructionsstring

The system instructions that the assistant uses. The maximum length is 256,000 characters.

<= 256000 characters

reasoning_effortstring

o1 and o3-mini models only

Constrains effort on reasoning for
reasoning models.
Currently supported values are low, medium, and high. Reducing
reasoning effort can result in faster responses and fewer tokens used
on reasoning in a response.

Allowed values:lowmediumhigh

Default:medium

toolsOne Of
array

A list of tool enabled on the assistant. There can be a maximum of 128 tools per assistant. Tools can be of types code_interpreter, file_search, or function.

Default:[]

<= 128 items

Code interpreter toolobject
Show Child Parameters
tool_resourcesobject

A set of resources that are used by the assistant’s tools. The resources are specific to the type of tool. For example, the code_interpreter tool requires a list of file IDs, while the file_search tool requires a list of vector store IDs.

Show Child Parameters
metadataobject

Set of 16 key-value pairs that can be attached to an object. This can be
useful for storing additional information about the object in a structured
format, and querying for objects via API or the dashboard.

Keys are strings with a maximum length of 64 characters. Values are strings
with a maximum length of 512 characters.

temperaturenumber

What sampling temperature to use, between 0 and 2. Higher values like 0.8 will make the output more random, while lower values like 0.2 will make it more focused and deterministic.

Default:1

>= 0<= 2

Example:1

top_pnumber

An alternative to sampling with temperature, called nucleus sampling, where the model considers the results of the tokens with top_p probability mass. So 0.1 means only the tokens comprising the top 10% probability mass are considered.

We generally recommend altering this or temperature but not both.

Default:1

>= 0<= 1

Example:1

response_formatOne Of
AssistantsApiResponseFormatOptionstring

auto is the default value

Allowed values:auto

Example

CreateChatCompletionFunctionResponse

object

Represents a chat completion response returned by model, based on the provided input.

idstringrequired

A unique identifier for the chat completion.

choicesarray[object]required

A list of chat completion choices. Can be more than one if n is greater than 1.

Show Child Parameters
createdintegerrequired

The Unix timestamp (in seconds) of when the chat completion was created.

modelstringrequired

The model used for the chat completion.

system_fingerprintstring

This fingerprint represents the backend configuration that the model runs with.

Can be used in conjunction with the seed request parameter to understand when backend changes have been made that might impact determinism.

objectstringrequired

The object type, which is always chat.completion.

Allowed values:chat.completion

usageobject

Usage statistics for the completion request.

Show Child Parameters
Example

CreateChatCompletionImageResponse

object

Represents a streamed chunk of a chat completion response returned by model, based on the provided input.