`streamObject()`

Streams a typed, structured object for a given prompt and schema using a language model.

It can be used to force the language model to return structured data, e.g. for information extraction, synthetic data generation, or classification tasks.

Example: stream an object using a schema

import { openai } from '@ai-sdk/openai';
import { streamObject } from 'ai';
import { z } from 'zod';

const { partialObjectStream } = streamObject({
  model: openai('gpt-4-turbo'),
  schema: z.object({
    recipe: z.object({
      name: z.string(),
      ingredients: z.array(z.string()),
      steps: z.array(z.string()),
    }),
  }),
  prompt: 'Generate a lasagna recipe.',
});

for await (const partialObject of partialObjectStream) {
  console.clear();
  console.log(partialObject);
}

Example: stream an array using a schema

For arrays, you specify the schema of the array items. You can use elementStream to get the stream of complete array elements.

import { openai } from '@ai-sdk/openai';
import { streamObject } from 'ai';
import { z } from 'zod';

const { elementStream } = streamObject({
  model: openai('gpt-4-turbo'),
  output: 'array',
  schema: z.object({
    name: z.string(),
    class: z
      .string()
      .describe('Character class, e.g. warrior, mage, or thief.'),
    description: z.string(),
  }),
  prompt: 'Generate 3 hero descriptions for a fantasy role playing game.',
});

for await (const hero of elementStream) {
  console.log(hero);
}

Example: generate JSON without a schema

import { openai } from '@ai-sdk/openai';
import { streamObject } from 'ai';

const { partialObjectStream } = streamObject({
  model: openai('gpt-4-turbo'),
  output: 'no-schema',
  prompt: 'Generate a lasagna recipe.',
});

for await (const partialObject of partialObjectStream) {
  console.clear();
  console.log(partialObject);
}

To see streamObject in action, check out the additional examples.

Import

import { streamObject } from "ai"

API Signature

model:

LanguageModel

The language model to use. Example: openai('gpt-4-turbo')

output:

'object' | 'array' | 'no-schema' | undefined

The type of output to generate. Defaults to 'object'.

mode:

'auto' | 'json' | 'tool'

The mode to use for object generation. Not every model supports all modes. Defaults to 'auto' for 'object' output and to 'json' for 'no-schema' output. Must be 'json' for 'no-schema' output.

schema:

Zod Schema | JSON Schema

The schema that describes the shape of the object to generate. It is sent to the model to generate the object and used to validate the output. You can either pass in a Zod schema or a JSON schema (using the `jsonSchema` function). In 'array' mode, the schema is used to describe an array element. Not available with 'no-schema' output.

schemaName:

string | undefined

Optional name of the output that should be generated. Used by some providers for additional LLM guidance, e.g. via tool or schema name. Not available with 'no-schema' output.

schemaDescription:

string | undefined

Optional description of the output that should be generated. Used by some providers for additional LLM guidance, e.g. via tool or schema name. Not available with 'no-schema' output.

system:

string

The system prompt to use that specifies the behavior of the model.

prompt:

string

The input prompt to generate the text from.

messages:

Array<CoreSystemMessage | CoreUserMessage | CoreAssistantMessage | CoreToolMessage> | Array<UIMessage>

A list of messages that represent a conversation. Automatically converts UI messages from the useChat hook.

CoreSystemMessage

role:

'system'

The role for the system message.

content:

string

The content of the message.

CoreUserMessage

role:

'user'

The role for the user message.

content:

string | Array<TextPart | ImagePart | FilePart>

The content of the message.

TextPart

type:

'text'

The type of the message part.

text:

string

The text content of the message part.

ImagePart

type:

'image'

The type of the message part.

image:

string | Uint8Array | Buffer | ArrayBuffer | URL

The image content of the message part. String are either base64 encoded content, base64 data URLs, or http(s) URLs.

mimeType?:

string

The mime type of the image. Optional.

FilePart

type:

'file'

The type of the message part.

data:

string | Uint8Array | Buffer | ArrayBuffer | URL

The file content of the message part. String are either base64 encoded content, base64 data URLs, or http(s) URLs.

mimeType:

string

The mime type of the file.

CoreAssistantMessage

role:

'assistant'

The role for the assistant message.

content:

string | Array<TextPart | ReasoningPart | RedactedReasoningPart | ToolCallPart>

The content of the message.

TextPart

type:

'text'

The type of the message part.

text:

string

The text content of the message part.

ReasoningPart

type:

'reasoning'

text:

string

The reasoning text.

signature?:

string

The signature for the reasoning.

RedactedReasoningPart

type:

'redacted-reasoning'

data:

string

The redacted data.

ToolCallPart

type:

'tool-call'

The type of the message part.

toolCallId:

string

The id of the tool call.

toolName:

string

The name of the tool, which typically would be the name of the function.

args:

object based on schema

Parameters generated by the model to be used by the tool.

CoreToolMessage

role:

'tool'

The role for the assistant message.

content:

Array<ToolResultPart>

The content of the message.

ToolResultPart

type:

'tool-result'

The type of the message part.

toolCallId:

string

The id of the tool call the result corresponds to.

toolName:

string

The name of the tool the result corresponds to.

result:

unknown

The result returned by the tool after execution.

isError?:

boolean

Whether the result is an error or an error message.

maxTokens?:

number

Maximum number of tokens to generate.

temperature?:

number

Temperature setting. The value is passed through to the provider. The range depends on the provider and model. It is recommended to set either `temperature` or `topP`, but not both.

topP?:

number

Nucleus sampling. The value is passed through to the provider. The range depends on the provider and model. It is recommended to set either `temperature` or `topP`, but not both.

topK?:

number

Only sample from the top K options for each subsequent token. Used to remove "long tail" low probability responses. Recommended for advanced use cases only. You usually only need to use temperature.

presencePenalty?:

number

Presence penalty setting. It affects the likelihood of the model to repeat information that is already in the prompt. The value is passed through to the provider. The range depends on the provider and model.

frequencyPenalty?:

number

Frequency penalty setting. It affects the likelihood of the model to repeatedly use the same words or phrases. The value is passed through to the provider. The range depends on the provider and model.

seed?:

number

The seed (integer) to use for random sampling. If set and supported by the model, calls will generate deterministic results.

maxRetries?:

number

Maximum number of retries. Set to 0 to disable retries. Default: 2.

abortSignal?:

AbortSignal

An optional abort signal that can be used to cancel the call.

headers?:

Record<string, string>

Additional HTTP headers to be sent with the request. Only applicable for HTTP-based providers.

experimental_telemetry?:

TelemetrySettings

Telemetry configuration. Experimental feature.

TelemetrySettings

isEnabled?:

boolean

Enable or disable telemetry. Disabled by default while experimental.

recordInputs?:

boolean

Enable or disable input recording. Enabled by default.

recordOutputs?:

boolean

Enable or disable output recording. Enabled by default.

functionId?:

string

Identifier for this function. Used to group telemetry data by function.

metadata?:

Additional information to include in the telemetry data.

providerOptions?:

Record<string,Record<string,JSONValue>> | undefined

Provider-specific options. The outer key is the provider name. The inner values are the metadata. Details depend on the provider.

onError?:

(event: OnErrorResult) => Promise<void> |void

Callback that is called when an error occurs during streaming. You can use it to log errors.

OnErrorResult

error:

unknown

The error that occurred.

onFinish?:

(result: OnFinishResult) => void

Callback that is called when the LLM response has finished.

OnFinishResult

usage:

CompletionTokenUsage

The token usage of the generated text.

CompletionTokenUsage

promptTokens:

number

The total number of tokens in the prompt.

completionTokens:

number

The total number of tokens in the completion.

totalTokens:

number

The total number of tokens generated.

providerMetadata:

Record<string,Record<string,JSONValue>> | undefined

Optional metadata from the provider. The outer key is the provider name. The inner values are the metadata. Details depend on the provider.

object:

T | undefined

The generated object (typed according to the schema). Can be undefined if the final object does not match the schema.

error:

unknown | undefined

Optional error object. This is e.g. a TypeValidationError when the final object does not match the schema.

warnings:

Warning[] | undefined

Warnings from the model provider (e.g. unsupported settings).

response?:

Response

Response metadata.

Response

id:

string

The response identifier. The AI SDK uses the ID from the provider response when available, and generates an ID otherwise.

model:

string

The model that was used to generate the response. The AI SDK uses the response model from the provider response when available, and the model from the function call otherwise.

timestamp:

Date

The timestamp of the response. The AI SDK uses the response timestamp from the provider response when available, and creates a timestamp otherwise.

headers?:

Record<string, string>

Optional response headers.

Returns

usage:

Promise<CompletionTokenUsage>

The token usage of the generated text. Resolved when the response is finished.

CompletionTokenUsage

promptTokens:

number

The total number of tokens in the prompt.

completionTokens:

number

The total number of tokens in the completion.

totalTokens:

number

The total number of tokens generated.

providerMetadata:

Promise<Record<string,Record<string,JSONValue>> | undefined>

Optional metadata from the provider. Resolved whe the response is finished. The outer key is the provider name. The inner values are the metadata. Details depend on the provider.

object:

Promise<T>

The generated object (typed according to the schema). Resolved when the response is finished.

partialObjectStream:

AsyncIterableStream<DeepPartial<T>>

Stream of partial objects. It gets more complete as the stream progresses. Note that the partial object is not validated. If you want to be certain that the actual content matches your schema, you need to implement your own validation for partial results.

elementStream:

AsyncIterableStream<ELEMENT>

Stream of array elements. Only available in "array" mode.

textStream:

AsyncIterableStream<string>

Text stream of the JSON representation of the generated object. It contains text chunks. When the stream is finished, the object is valid JSON that can be parsed.

fullStream:

AsyncIterableStream<ObjectStreamPart<T>>

Stream of different types of events, including partial objects, errors, and finish events. Only errors that stop the stream, such as network errors, are thrown.

ObjectPart

type:

'object'

object:

DeepPartial<T>

The partial object that was generated.

TextDeltaPart

type:

'text-delta'

textDelta:

string

The text delta for the underlying raw JSON text.

ErrorPart

type:

'error'

error:

unknown

The error that occurred.

FinishPart

type:

'finish'

finishReason:

FinishReason

logprobs?:

Logprobs

usage:

Usage

Token usage.

response?:

Response

Response metadata.

Response

id:

string

The response identifier. The AI SDK uses the ID from the provider response when available, and generates an ID otherwise.

model:

string

The model that was used to generate the response. The AI SDK uses the response model from the provider response when available, and the model from the function call otherwise.

timestamp:

Date

The timestamp of the response. The AI SDK uses the response timestamp from the provider response when available, and creates a timestamp otherwise.

request?:

Promise<RequestMetadata>

Request metadata.

RequestMetadata

body:

string

Raw request HTTP body that was sent to the provider API as a string (JSON should be stringified).

response?:

Promise<ResponseMetadata>

Response metadata. Resolved when the response is finished.

ResponseMetadata

id:

string

The response identifier. The AI SDK uses the ID from the provider response when available, and generates an ID otherwise.

model:

string

The model that was used to generate the response. The AI SDK uses the response model from the provider response when available, and the model from the function call otherwise.

timestamp:

Date

The timestamp of the response. The AI SDK uses the response timestamp from the provider response when available, and creates a timestamp otherwise.

headers?:

Record<string, string>

Optional response headers.

warnings:

Warning[] | undefined

Warnings from the model provider (e.g. unsupported settings).

pipeTextStreamToResponse:

(response: ServerResponse, init?: ResponseInit => void

Writes text delta output to a Node.js response-like object. It sets a `Content-Type` header to `text/plain; charset=utf-8` and writes each text delta as a separate chunk.

ResponseInit