# ai-sdk-memory

Semantic and intent-based memory for the Vercel AI SDK.

Embeddings turn text into numeric vectors that represent meaning, so similar prompts have similar vectors. ai-sdk-memory uses that idea to avoid paying tokens twice for similar questions.
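To make the vector intuition concrete, here is a small self-contained sketch of the cosine-similarity comparison such caches typically rely on. The vectors are toy values, not real embeddings, and the function is illustrative rather than the library's actual implementation:

```typescript
// Cosine similarity between two vectors: values near 1 mean "similar meaning".
function cosineSimilarity(a: number[], b: number[]): number {
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

// Toy "embeddings": the first two point in similar directions, the third does not.
const whatIsAnAgent = [0.9, 0.1, 0.2];
const explainAgents = [0.85, 0.15, 0.25];
const pizzaRecipe = [0.05, 0.9, 0.1];

console.log(cosineSimilarity(whatIsAnAgent, explainAgents)); // high, ~1.0
console.log(cosineSimilarity(whatIsAnAgent, pizzaRecipe)); // low, unrelated
```

Two prompts that ask the same thing in different words land close together in vector space, which is what lets a semantic cache serve one prompt's answer for the other.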
It exposes two entry points:

- `createSemanticMemory`: for one-shot conversations (FAQs, docs, single queries)
- `createIntentMemory`: for multi-turn conversations with context and intent understanding

## Installation

```bash
npm add ai-sdk-memory
```

## createSemanticMemory

For one-shot conversations: FAQs, docs, single queries.
```ts
import { createSemanticMemory } from "ai-sdk-memory";

const semantic = createSemanticMemory({
  model: "text-embedding-3-small",
});

const result = await semantic.streamText({
  model: "openai/gpt-5-mini",
  messages: [{ role: "user", content: "What is an agent?" }],
});
```

## createIntentMemory

For multi-turn conversations: chatbots, contextual assistants, complex interactions.
```ts
import { createIntentMemory } from "ai-sdk-memory";

const intent = createIntentMemory({
  intentExtractor: {
    model: "openai/gpt-5-mini",
    windowSize: 5,
  },
  model: "text-embedding-3-small",
  threshold: 0.95,
  onStepFinish: ({ step, userIntention, cacheScore }) => {
    console.log(step, userIntention, cacheScore);
  },
});

const result = await intent.streamText({
  model: "openai/gpt-5-mini",
  messages: [
    { role: "user", content: "I need help with React" },
    { role: "assistant", content: "What issue are you facing?" },
    { role: "user", content: "Components re-rendering too often" },
  ],
});
```

## Environment variables

Upstash Vector and Redis credentials are read from the environment by default:

```bash
VECTOR_REST_URL=
VECTOR_REST_TOKEN=
REDIS_REST_URL=
REDIS_REST_TOKEN=
```

## Options

You can pass options to `createSemanticMemory()` to customize caching behavior.
| Option | Type | Default | Description |
|---|---|---|---|
| `model` | `string` or `EmbeddingModel` | Required | Embedding model used to compare prompts, e.g. `"openai:text-embedding-3-small"` |
| `vector.url` | `string` | `process.env.VECTOR_REST_URL` | URL of your Upstash Vector database |
| `vector.token` | `string` | `process.env.VECTOR_REST_TOKEN` | Access token for Upstash Vector |
| `redis.url` | `string` | `process.env.REDIS_REST_URL` | URL of your Upstash Redis instance |
| `redis.token` | `string` | `process.env.REDIS_REST_TOKEN` | Access token for Upstash Redis |
| `threshold` | `number` | `0.92` | Minimum similarity (0–1) to reuse cached responses |
| `ttl` | `number` | `60 * 60 * 24 * 14` | Cache expiration in seconds (default 14 days) |
| `debug` | `boolean` | `false` | Print logs for cache hits, misses, and writes |
| `cacheMode` | `'default'` or `'refresh'` | `'default'` | `default` uses the cache if a match is found; `refresh` forces regeneration |
| `simulateStream.enabled` | `boolean` | `true` | Simulate streaming when reading from cache |
| `simulateStream.initialDelayInMs` | `number` | `0` | Delay before first chunk (ms) |
| `simulateStream.chunkDelayInMs` | `number` | `10` | Delay between chunks (ms) |
| `useFullMessages` | `boolean` | `false` | If `true`, embeds the entire conversation instead of only the last message |
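As an illustration of how `threshold` and `ttl` interact, here is a hedged sketch of the cache-hit decision. The helper names (`CachedEntry`, `shouldReuse`) are hypothetical, and the library's internals may differ:

```typescript
// Illustrative cache-hit gating: a stored response is reused only if the
// best similarity score clears `threshold` AND the entry is within `ttl`.
interface CachedEntry {
  response: string;
  storedAtMs: number; // when the entry was written
}

function shouldReuse(
  similarity: number, // best match score from the vector DB, 0-1
  entry: CachedEntry,
  threshold = 0.92, // the `threshold` option
  ttlSeconds = 60 * 60 * 24 * 14, // the `ttl` option (14 days)
  nowMs = Date.now(),
): boolean {
  const fresh = (nowMs - entry.storedAtMs) / 1000 < ttlSeconds;
  return similarity >= threshold && fresh;
}

const entry = { response: "An agent is...", storedAtMs: Date.now() - 1000 };
console.log(shouldReuse(0.95, entry)); // true: similar enough and fresh
console.log(shouldReuse(0.8, entry)); // false: below threshold
```

Raising `threshold` toward 1 trades fewer cache hits for stricter matching; a longer `ttl` keeps answers around longer at the risk of staleness.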
## How it works

When you send a prompt:

1. The text is turned into an embedding (a vector of numbers).
2. The vector database searches for similar stored embeddings.
3. If a match clears the similarity threshold, the previous response is reused.
4. Otherwise, the model runs and the result is stored for next time.
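The steps above can be sketched as a minimal in-memory cache. The `embed` and `generate` parameters are hypothetical stand-ins for the embedding model and the LLM call, and the array replaces the Upstash Vector and Redis stores the library actually uses:

```typescript
// Minimal in-memory sketch of the lookup flow (illustrative only).
type Vec = number[];

const store: { vec: Vec; response: string }[] = [];

function cosine(a: Vec, b: Vec): number {
  const dot = a.reduce((s, v, i) => s + v * b[i], 0);
  const norm = (v: Vec) => Math.sqrt(v.reduce((s, x) => s + x * x, 0));
  return dot / (norm(a) * norm(b));
}

async function cachedGenerate(
  embed: (text: string) => Promise<Vec>, // step 1: embedding model
  generate: (text: string) => Promise<string>, // step 4: the LLM call
  prompt: string,
  threshold = 0.92,
): Promise<string> {
  const vec = await embed(prompt);
  // Steps 2-3: find the most similar stored embedding and reuse it if close enough.
  let best: { score: number; response: string } | null = null;
  for (const entry of store) {
    const score = cosine(vec, entry.vec);
    if (!best || score > best.score) best = { score, response: entry.response };
  }
  if (best && best.score >= threshold) return best.response; // cache hit
  // Step 4: cache miss, so run the model and store the result for next time.
  const response = await generate(prompt);
  store.push({ vec, response });
  return response;
}
```

A second prompt that embeds close to an earlier one returns the stored answer without a model call, which is where the token savings come from.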
Learn more about embeddings → Cloudflare: What are embeddings?
## License

MIT
