Fireworks.ai
The legacy Fireworks integration is not compatible with the AI SDK 3.1 functions. It is recommended to use the AI SDK Fireworks Provider instead.
The AI SDK provides a set of utilities to make it easy to use Fireworks.ai's APIs and models. In this guide, we'll walk through how to use the utilities to create a chat bot and a text completion app.
Fireworks.ai's REST APIs are compatible with OpenAI's, so we will use OpenAI's JavaScript SDK to make the requests. This makes it easy to migrate to Fireworks.ai's models or simply try them out.
Guide: Llama 2 Chatbot
Create a Next.js app
Create a Next.js application and install ai and openai, the AI SDK and the OpenAI API client respectively.
pnpm dlx create-next-app my-ai-app
cd my-ai-app
pnpm add ai openai
Add your Fireworks API Key to .env
Create a .env file in your project root and add your Fireworks API Key:
FIREWORKS_API_KEY=xxxxxxxxx
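The route handlers later in this guide fall back to an empty string when the key is missing, which defers the failure to the first API call. As a minimal sketch of failing early instead, the helper below checks the variable up front. Note that requireEnv is a hypothetical helper introduced here for illustration; it is not part of the AI SDK or the OpenAI client.

```typescript
// Hypothetical helper (not part of the AI SDK or the OpenAI client):
// returns the named variable, or throws a descriptive error if it is unset or blank.
type EnvRecord = Record<string, string | undefined>;

function requireEnv(env: EnvRecord, name: string): string {
  const value = env[name];
  if (value === undefined || value.trim() === '') {
    throw new Error(`Missing environment variable: ${name}`);
  }
  return value;
}

// Possible usage in a route handler (assumes the runtime exposes process.env):
// const apiKey = requireEnv(process.env, 'FIREWORKS_API_KEY');
```

Passing the environment in as an argument keeps the helper easy to test without touching the real process environment.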
Create a Route Handler
Create a Next.js Route Handler that uses the Edge Runtime. It generates a chat completion via Fireworks and streams the result back to our Next.js app.
For this example, we'll create a route handler at app/api/chat/route.ts that accepts a POST request with a messages array of chat messages (objects with a role and content):
import OpenAI from 'openai';
import { OpenAIStream, StreamingTextResponse } from 'ai';

// Create an OpenAI API client
// but configure it to point to fireworks.ai
const fireworks = new OpenAI({
  apiKey: process.env.FIREWORKS_API_KEY || '',
  baseURL: 'https://api.fireworks.ai/inference/v1',
});

export async function POST(req: Request) {
  // Extract the `messages` from the body of the request
  const { messages } = await req.json();

  // Ask Fireworks for a streaming chat completion using Llama 2 70b model
  // @see https://app.fireworks.ai/models/fireworks/llama-v2-70b-chat
  const response = await fireworks.chat.completions.create({
    model: 'accounts/fireworks/models/llama-v2-70b-chat',
    stream: true,
    max_tokens: 1000,
    messages,
  });
  // Convert the response into a friendly text-stream.
  const stream = OpenAIStream(response);
  // Respond with the stream
  return new StreamingTextResponse(stream);
}
The AI SDK provides two utility helpers to make the above seamless. First, we pass the streaming response we receive from Fireworks to OpenAIStream. This method decodes and extracts the text tokens in the response, then re-encodes them for simple consumption. We can then pass that new stream directly to StreamingTextResponse. This is another utility class that extends the normal Node/Edge Runtime Response class with the default headers you probably want (hint: 'Content-Type': 'text/plain; charset=utf-8' is already set for you).
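To make the two steps concrete, here is a conceptual sketch built only on standard web APIs. It is not the AI SDK's implementation of OpenAIStream or StreamingTextResponse; the names toTextStream and toTextResponse are hypothetical, and the ChatChunk shape is an assumption modelled on OpenAI-style streaming chat chunks.

```typescript
// Illustrative only: NOT the AI SDK's implementation.
// Assumed chunk shape, modelled on OpenAI-style streaming chat completion chunks.
interface ChatChunk {
  choices: { delta: { content?: string | null } }[];
}

// Step 1: extract the text token from each chunk and re-encode it as UTF-8 bytes.
function toTextStream(chunks: AsyncIterable<ChatChunk>): ReadableStream<Uint8Array> {
  const encoder = new TextEncoder();
  const iterator = chunks[Symbol.asyncIterator]();
  return new ReadableStream<Uint8Array>({
    async pull(controller) {
      // Keep reading until there is text to emit or the source is exhausted,
      // so that chunks without text never leave a pending read unanswered.
      while (true) {
        const { value, done } = await iterator.next();
        if (done) {
          controller.close();
          return;
        }
        const text = value.choices[0]?.delta?.content ?? '';
        if (text) {
          controller.enqueue(encoder.encode(text));
          return;
        }
      }
    },
  });
}

// Step 2: wrap the byte stream in a Response with a plain-text content type.
function toTextResponse(stream: ReadableStream<Uint8Array>): Response {
  return new Response(stream, {
    status: 200,
    headers: { 'Content-Type': 'text/plain; charset=utf-8' },
  });
}
```

In the real route handler you would keep using OpenAIStream and StreamingTextResponse; the sketch only shows the general idea of turning structured chunks into a plain-text byte stream.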
Wire up the UI
Create a Client component with a form that gathers the prompt from the user and displays the completion as it streams back.
By default, the useChat hook will use the POST Route Handler we created above (it defaults to /api/chat). You can override this by passing an api option: useChat({ api: '...' }).
'use client';

import { useChat } from 'ai/react';

export default function Chat() {
  const { messages, input, handleInputChange, handleSubmit } = useChat();

  return (
    <div className="mx-auto w-full max-w-md py-24 flex flex-col stretch">
      {messages.map(m => (
        <div key={m.id}>
          {m.role === 'user' ? 'User: ' : 'AI: '}
          {m.content}
        </div>
      ))}

      <form onSubmit={handleSubmit}>
        <label>
          Say something...
          <input
            className="fixed w-full max-w-md bottom-0 border border-gray-300 rounded mb-8 shadow-xl p-2"
            value={input}
            onChange={handleInputChange}
          />
        </label>
        <button type="submit">Send</button>
      </form>
    </div>
  );
}
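The hook takes care of sending the request and reading the stream. For intuition only, the sketch below shows how any plain-text streaming Response can be consumed incrementally with standard web APIs. It is a generic illustration, not a description of how useChat works internally, and readTextStream is a hypothetical name.

```typescript
// Generic illustration (hypothetical helper, not part of the AI SDK):
// reads a plain-text Response body chunk by chunk and reports each piece of text.
async function readTextStream(
  response: Response,
  onChunk: (text: string) => void,
): Promise<string> {
  if (!response.body) {
    throw new Error('Response has no body');
  }
  const reader = response.body.getReader();
  const decoder = new TextDecoder();
  let fullText = '';

  while (true) {
    const { value, done } = await reader.read();
    if (done) break;
    // `stream: true` keeps multi-byte characters intact across chunk boundaries.
    const text = decoder.decode(value, { stream: true });
    if (text) {
      fullText += text;
      onChunk(text);
    }
  }

  // Flush any bytes the decoder was still holding.
  const tail = decoder.decode();
  if (tail) {
    fullText += tail;
    onChunk(tail);
  }
  return fullText;
}
```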
Guide: Text Completion
Use the Completion API
Similar to the Chatbot example above, we'll create a Next.js Route Handler that generates a text completion via Fireworks and streams it back to our Next.js app. It accepts a POST request with a prompt string. To match the default useCompletion endpoint, place it at app/api/completion/route.ts:
import OpenAI from 'openai';
import { OpenAIStream, StreamingTextResponse } from 'ai';

// Create an OpenAI API client
// but configure it to point to fireworks.ai
const fireworks = new OpenAI({
  apiKey: process.env.FIREWORKS_API_KEY || '',
  baseURL: 'https://api.fireworks.ai/inference/v1',
});

export async function POST(req: Request) {
  // Extract the `prompt` from the body of the request
  const { prompt } = await req.json();

  // Ask Fireworks for a streaming text completion using Llama 2 70b model
  // @see https://app.fireworks.ai/models/fireworks/llama-v2-70b-chat
  const response = await fireworks.completions.create({
    model: 'accounts/fireworks/models/llama-v2-70b-chat',
    stream: true,
    max_tokens: 1000,
    prompt,
  });
  // Convert the response into a friendly text-stream.
  const stream = OpenAIStream(response);
  // Respond with the stream
  return new StreamingTextResponse(stream);
}
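The handler above forwards prompt to Fireworks without checking it. As a minimal sketch, the request body could be validated first. Both parsePrompt and the 4000-character cap are hypothetical choices made for illustration; neither comes from the AI SDK or from Fireworks.

```typescript
// Hypothetical helper: narrows an untrusted JSON body to a non-empty prompt string.
// MAX_PROMPT_CHARS is an arbitrary illustrative limit, not a Fireworks constraint.
const MAX_PROMPT_CHARS = 4000;

function parsePrompt(body: unknown): string {
  if (typeof body !== 'object' || body === null) {
    throw new Error('Request body must be a JSON object');
  }
  const prompt = (body as { prompt?: unknown }).prompt;
  if (typeof prompt !== 'string' || prompt.trim() === '') {
    throw new Error('prompt must be a non-empty string');
  }
  if (prompt.length > MAX_PROMPT_CHARS) {
    throw new Error('prompt must be at most ' + MAX_PROMPT_CHARS + ' characters');
  }
  return prompt;
}

// Inside POST, this could replace the bare destructuring:
// const prompt = parsePrompt(await req.json());
```

A handler using this sketch would typically catch the error and return a 400 response rather than letting it surface as a 500.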
Wire up the UI
We can use the useCompletion hook to wire up the UI. By default, the useCompletion hook will use the POST Route Handler we created above (it defaults to /api/completion). You can override this by passing an api option: useCompletion({ api: '...' }).
'use client';

import { useCompletion } from 'ai/react';

export default function Completion() {
  const {
    completion,
    input,
    stop,
    isLoading,
    handleInputChange,
    handleSubmit,
  } = useCompletion({
    api: '/api/completion',
  });

  return (
    <div className="mx-auto w-full max-w-md py-24 flex flex-col stretch">
      <form onSubmit={handleSubmit}>
        <label>
          Say something...
          <input
            className="fixed w-full max-w-md bottom-0 border border-gray-300 rounded mb-8 shadow-xl p-2"
            value={input}
            onChange={handleInputChange}
          />
        </label>
        <output>Completion result: {completion}</output>
        <button type="button" onClick={stop}>
          Stop
        </button>
        <button disabled={isLoading} type="submit">
          Send
        </button>
      </form>
    </div>
  );
}
Guide: Save to Database After Completion
It’s common to want to save the result of a completion to a database after streaming it back to the user. The OpenAIStream adapter accepts optional callbacks (onStart, onToken, and onCompletion) that can be used to do this.
export async function POST(req: Request) {
  // ...

  // Convert the response into a friendly text-stream
  const stream = OpenAIStream(response, {
    onStart: async () => {
      // This callback is called when the stream starts
      // You can use this to save the prompt to your database
      await savePromptToDatabase(prompt);
    },
    onToken: async (token: string) => {
      // This callback is called for each token in the stream
      // You can use this to debug the stream or save the tokens to your database
      console.log(token);
    },
    onCompletion: async (completion: string) => {
      // This callback is called when the stream completes
      // You can use this to save the final completion to your database
      await saveCompletionToDatabase(completion);
    },
  });

  // Respond with the stream
  return new StreamingTextResponse(stream);
}
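The guide does not define savePromptToDatabase or saveCompletionToDatabase, and it does not show when each callback fires relative to the tokens. The sketch below mimics the callback order described in the comments above, using an async generator over already-decoded tokens and in-memory stand-ins for the two database functions. It is an illustration under those assumptions, not the AI SDK's implementation; withCallbacks and inMemoryDb are hypothetical names.

```typescript
// Illustrative only: NOT the AI SDK's implementation of OpenAIStream callbacks.
interface StreamCallbacks {
  onStart?: () => Promise<void> | void;
  onToken?: (token: string) => Promise<void> | void;
  onCompletion?: (completion: string) => Promise<void> | void;
}

// Forwards each token to the consumer while invoking the callbacks around it.
async function* withCallbacks(
  tokens: AsyncIterable<string>,
  callbacks: StreamCallbacks,
): AsyncGenerator<string> {
  let completion = '';
  await callbacks.onStart?.(); // once, before the first token is read
  for await (const token of tokens) {
    completion += token;
    await callbacks.onToken?.(token); // once per token
    yield token; // pass the token through to the consumer
  }
  await callbacks.onCompletion?.(completion); // once, with the full text
}

// In-memory stand-ins for the guide's undefined database helpers.
const inMemoryDb: { prompts: string[]; completions: string[] } = {
  prompts: [],
  completions: [],
};

async function savePromptToDatabase(prompt: string): Promise<void> {
  inMemoryDb.prompts.push(prompt);
}

async function saveCompletionToDatabase(completion: string): Promise<void> {
  inMemoryDb.completions.push(completion);
}
```

In a real application the two stand-ins would be replaced by calls to your database client. Because onCompletion receives the full text, a single write at the end is usually enough.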