CreateAndRunThreadOptions interface
The details used when creating and immediately running a new agent thread.
Properties
assistant |
The ID of the agent for which the thread should be created. |
instructions | The overridden system instructions the agent should use to run the thread. |
max |
The maximum number of completion tokens that may be used over the course of the run. The run will make a best effort to use only
the number of completion tokens specified, across multiple turns of the run. If the run exceeds the number of completion tokens
specified, the run will end with status |
max |
The maximum number of prompt tokens that may be used over the course of the run. The run will make a best effort to use only
the number of prompt tokens specified, across multiple turns of the run. If the run exceeds the number of prompt tokens specified,
the run will end with status |
metadata | A set of up to 16 key/value pairs that can be attached to an object, used for storing additional information about that object in a structured format. Keys may be up to 64 characters in length and values may be up to 512 characters in length. |
model | The overridden model that the agent should use to run the thread. |
parallel |
If |
response |
Specifies the format that the model must output. |
stream | If |
temperature | What sampling temperature to use, between 0 and 2. Higher values like 0.8 will make the output more random, while lower values like 0.2 will make it more focused and deterministic. |
thread | The details used to create the new thread. If no thread is provided, an empty one will be created. |
tool |
Controls whether or not and which tool is called by the model. |
tool |
Override the tools the agent can use for this run. This is useful for modifying the behavior on a per-run basis |
tools | The overridden list of enabled tools the agent should use to run the thread. |
topP | An alternative to sampling with temperature, called nucleus sampling, where the model considers the results of the tokens with top_p probability mass. So 0.1 means only the tokens comprising the top 10% probability mass are considered. We generally recommend altering this or temperature but not both. |
truncation |
The strategy to use for dropping messages as the context windows moves forward. |
Property Details
assistantId
The ID of the agent for which the thread should be created.
assistantId: string
Property Value
string
instructions
The overridden system instructions the agent should use to run the thread.
instructions?: null | string
Property Value
null | string
maxCompletionTokens
The maximum number of completion tokens that may be used over the course of the run. The run will make a best effort to use only
the number of completion tokens specified, across multiple turns of the run. If the run exceeds the number of completion tokens
specified, the run will end with status incomplete
. See incomplete_details
for more info.
maxCompletionTokens?: null | number
Property Value
null | number
maxPromptTokens
The maximum number of prompt tokens that may be used over the course of the run. The run will make a best effort to use only
the number of prompt tokens specified, across multiple turns of the run. If the run exceeds the number of prompt tokens specified,
the run will end with status incomplete
. See incomplete_details
for more info.
maxPromptTokens?: null | number
Property Value
null | number
metadata
A set of up to 16 key/value pairs that can be attached to an object, used for storing additional information about that object in a structured format. Keys may be up to 64 characters in length and values may be up to 512 characters in length.
metadata?: null | Record<string, string>
Property Value
null | Record<string, string>
model
The overridden model that the agent should use to run the thread.
model?: null | string
Property Value
null | string
parallelToolCalls
If true
functions will run in parallel during tool use.
parallelToolCalls?: boolean
Property Value
boolean
responseFormat
Specifies the format that the model must output.
responseFormat?: null | AgentsApiResponseFormatOption
Property Value
stream
If true
, returns a stream of events that happen during the Run as server-sent events,
terminating when the Run enters a terminal state with a data: [DONE]
message.
stream?: boolean
Property Value
boolean
temperature
What sampling temperature to use, between 0 and 2. Higher values like 0.8 will make the output more random, while lower values like 0.2 will make it more focused and deterministic.
temperature?: null | number
Property Value
null | number
thread
The details used to create the new thread. If no thread is provided, an empty one will be created.
thread?: AgentThreadCreationOptions
Property Value
toolChoice
Controls whether or not and which tool is called by the model.
toolChoice?: null | AgentsApiToolChoiceOption
Property Value
null | AgentsApiToolChoiceOption
toolResources
Override the tools the agent can use for this run. This is useful for modifying the behavior on a per-run basis
toolResources?: null | UpdateToolResourcesOptions
Property Value
null | UpdateToolResourcesOptions
tools
The overridden list of enabled tools the agent should use to run the thread.
tools?: null | ToolDefinition[]
Property Value
null | ToolDefinition[]
topP
An alternative to sampling with temperature, called nucleus sampling, where the model considers the results of the tokens with top_p probability mass. So 0.1 means only the tokens comprising the top 10% probability mass are considered.
We generally recommend altering this or temperature but not both.
topP?: null | number
Property Value
null | number
truncationStrategy
The strategy to use for dropping messages as the context windows moves forward.
truncationStrategy?: null | TruncationObject
Property Value
null | TruncationObject