Now in public beta — Try it free

Give your AI
permanent memory

Memra gives your AI a permanent memory — whether you're building AI apps with our API or using AI tools in VS Code with our extension.

Two products, one platform

Memory API

For AI apps you build

VS Code Extension

Auto-captures AI chats

SOON

Dead simple to integrate

Three steps. Two minutes. Production ready.

1

Install

Add the npm package to your project

npm install @memra-client/client
2

Save conversations

Store every exchange with one line

await memory.save(userId, userMsg, aiReply)
3

Retrieve context

Semantic search finds relevant memories

const { context } = await memory.getContext(userId, query)

Your AI forgets. Memra fixes that.

Without Memra

I run a barber shop in Mumbai
That's great! How can I help?
— later —
What was my business again?
I don't have any information about your business.

User frustrated. Context lost.

With Memra

I run a barber shop in Mumbai
That's great! How can I help?
— later —
What was my business again?
You run a barber shop in Mumbai! How's it going?

AI remembers. User delighted.

Everything your AI needs to remember

Battle-tested infrastructure for production AI memory.

Lightning Fast

~125ms average. Semantic search powered by pgvector. Fast enough for real-time conversations.

🧠

Semantic Search

Not just last N messages. We find the most RELEVANT memories using vector similarity, not recency.

🔌

Any AI Provider

Works with GPT-4, Claude, Gemini, Groq, Llama. One SDK. Any model.

🔑

Per-Agent Memory

Separate memory spaces per agent ID. Customer support bot ≠ coding assistant.

🔒

Secure by Default

API key auth. Data isolated per key owner. GDPR-compliant deletion built in.

📦

npm Package

Two lines of code. No infrastructure to manage. Just install and go.

One platform. Two powerful products.

Whether you're building AI apps or using AI in VS Code — Memra gives your AI a memory.

For AI App Developers

Memory API

Add persistent memory to any AI application. Save conversations, retrieve context, and give every user their own memory layer.

// npm install @memra/client import { MemoryClient } from '@memra/client' const memory = new MemoryClient({ apiKey: 'mk_mem_...' }) // Save + retrieve in 2 lines await memory.save(userId, msg, reply) const ctx = await memory.getContext(userId, query)
  • Semantic search over conversation history
  • Per-user memory isolation
  • Works with any LLM or framework
  • REST API — no SDK required
Get Memory API key
For VS Code UsersCOMING SOON

VS Code Extension

Auto-captures every AI conversation in VS Code. Works with Copilot, Claude Code, Cline, and Continue. Zero setup — install and forget.

// Install from VS Code Marketplace ext install memra.vscode-extension // Add your extension key { memra.apiKey: 'mk_ext_...' } // That's it. Sessions auto-save.
  • Auto-saves every chat session
  • Injects context on new sessions
  • Works with all VS Code AI tools
  • Cross-session memory with semantic search
Join Waitlist

Not sure which? Building an AI app Memory API · Using AI in VS Code Extension

Never lose your AI context again

Install once. The extension handles everything.

🔌
1

Install the extension + enter API key

One-click install from the VS Code Marketplace. Paste your Memra extension key and you're done.

💬
2

Code normally with any AI in VS Code

Use GitHub Copilot, Claude Code, Cline, Continue — the extension works silently in the background.

🧠
3

Context follows you across sessions and tools

Every conversation is saved. When you start a new session, your AI picks up exactly where you left off.

Works with

GitHub Copilot · Claude Code · Cline · Continue · and more

Without Memra

Re-explain your project every session — 10-20 minutes wasted

With Memra

AI picks up exactly where you left off — instantly

Works with your entire stack

Drop in one function call. Remove the rest.

import { MemoryClient } from '@memra-client/client'

const memory = new MemoryClient({ apiKey: process.env.MEMRA_KEY })

export async function chat(userId: string, userMessage: string) {
  // Get relevant past context
  const { context } = await memory.getContext(userId, userMessage)

  // Memory goes in system prompt — NOT in the user message
  // This tells the AI what the context is and how to use it
  const systemPrompt = context.length > 0
    ? `You are a helpful assistant with memory of past conversations.
Here is what you remember:
${context.map((m, i) => `${i + 1}. [${m.role}]: ${m.content}`).join('\n')}
Use this memory to give personalized responses.`
    : `You are a helpful assistant.`

  // Your AI call — works with any provider
  const reply = await openai.chat.completions.create({
    model: 'gpt-4o',
    messages: [
      { role: 'system', content: systemPrompt },
      { role: 'user',   content: userMessage  },
    ],
  })

  // Save to memory
  await memory.save(userId, userMessage, reply.choices[0].message.content ?? '')

  return reply
}

Simple, transparent pricing

Start free. Scale when you're ready.

Free

Perfect for side projects

$0
  • Memory API
  • 1,000 memories
  • 1,000 API calls/month
  • 1 agent
  • Semantic search
  • npm package
  • Priority support
  • Custom retention
  • VS Code ExtensionComing Soon
  • 10 extension sessions
  • 100 messages/session
  • AI resume prompts
Start free
Coming soon

Pro

For production applications

$29/ month
  • Memory API
  • 500,000 memories
  • 100,000 API calls/month
  • Unlimited agents
  • Semantic search
  • npm package
  • Priority support
  • 90-day retention
  • VS Code ExtensionComing Soon
  • Unlimited extension sessions
  • 10,000 messages/session
  • AI resume prompts
Coming soon

Enterprise

For teams and high volume

Custom
  • Memory API
  • Unlimited everything
  • Self-host option
  • SLA guarantee
  • Dedicated support
  • Custom retention
  • SSO
  • VS Code ExtensionComing Soon
  • Unlimited everything
  • AI resume prompts
Contact us

Get early access to the VS Code Extension

Be the first to know when it launches

0+

memories stored

and counting

0ms

avg latency

semantic search

0%

uptime

SLA guaranteed

0 lines

to integrate

seriously, that's it

Ready to give your AI a memory?

Join developers building smarter AI applications with Memra.

Get started for free

No credit card required · Free forever plan