A Model-Agnostic Memory System

AI Use Case Meeting · 2026-06-08

A Model-Agnostic
Memory System

Building personal AI memory that doesn’t belong to a vendor

Brett Pollak · Executive Director, Workplace Technology

UC San Diego

The Problem

Generic AI is generic.
Useful AI knows your world.

▸ Anthropic, OpenAI, and Google are racing to make context the moat — by integrating your personal and enterprise data into their memory systems.
▸ The more useful they get, the harder it is to leave.
▸ This talk is about building that memory layer as yours instead.

What is OpenClaw?

Open Source · MIT

An open-source agent runtime that connects chat, tools, memory, and automation into one self-hosted system. 378k+ GitHub stars · MIT licensed · runs entirely on your infrastructure.

Self-Hosted Gateway

Runs on your hardware — Mac, Linux, server. Your data never leaves your control. MIT licensed, open source.

Multi-Channel

One gateway serves WhatsApp, Telegram, Discord, iMessage, Slack, Teams — simultaneously.

35+ Model Providers

Anthropic, OpenAI, Google, self-hosted (Ollama, vLLM, SGLang), or any OpenAI-compatible endpoint. Multi-model routing with automatic failover.

Agent-Native Runtime

Built-in tool use, sessions, persistent memory, cron scheduling, sub-agent orchestration, workflow pipelines.

MCP & Tool Ecosystem

First-class Model Context Protocol support. Wire in calendars, email, internal APIs, and custom tools — extensible without forking the runtime.

Workspace Files

SOUL.md, AGENTS.md, USER.md, MEMORY.md, plus daily memory logs — identity, rules, context, and curated long-term memory in plain markdown.

The Architecture

One pipeline, three components

Data sources → Knowledge & memory → Agent actions pipeline

Three components, one pipelineData sources → knowledge & memory → agent actions. Each piece is independently swappable.
Knowledge is the moatMost personal-AI projects stop at “connect to email.” The interesting work is what comes next: capture, synthesize, retrieve.
Synthesis is where the value lives78 cron jobs turn raw signals into durable knowledge — not just chat, a daily institutional memory build.
Mission Control — the surface I work fromNext.js dashboard at mission-control.vercel.app. Cron outputs land as task cards, decisions queue for review, jobs report status. Telegram pushes the urgent; the dashboard holds the rest.
Open formats, not vendor lockMarkdown + JSON on disk. Switch tools, keep your knowledge.

Context Loading

Three layers — and the one most personal AI projects skip

Three concentric layers: always loaded, triggered, explicit

Layer 1 — Always loaded (~15 KB)Identity, user profile, work patterns, today + yesterday. Cheap, fixed, every interaction.
Layer 2 — Triggered (270 wiki pages)Mention a colleague → reads their wiki page. Mention a vendor → reads tech-stack page. The biggest unlock.
Layer 3 — Explicit onlyFull transcripts, raw graph JSON, dated memory files. Rare deep-dives.
Most projects skip Layer 2Without triggered retrieval, an accumulating wiki sits unused. The agent only knows what fits in the opening prompt.

The Pipeline

78 cron jobs on a predictable daily rhythm

24-hour clock showing accumulation, synthesis, and consolidation jobs

🌅 Morning (6–9 AM)Calendar briefing, AI news digest, opportunity scan, campus pain monitor, wiki ingest. Everything ready before I open a laptop.
🌆 Evening (5:30–7:35 PM)Email triage with drafts, meeting debrief, daily reflection, context promotion to long-term memory.
🌙 Night (3 AM)Memory consolidation — short-term → long-term, the way sleep does for humans.
📅 Weekly (Sunday)Cross-day pattern extraction. “VPN failed Mon, Wed, Fri” → “VPN is systemically broken this week.”
The rhythm is the pipelineNot the individual jobs — the order they fire in is what turns raw files into institutional knowledge.

What lands on my screen — Morning

Six briefings before I open a laptop

📱 Telegram · OpenClaw Today · 6:30 AM → 9:00 AM

📅 Calendar briefing · 6:30 AM

4 meetings today. Hari at 10 — project decision pending. 11–1 free for deep work. Coffee with Doris cancelled, want me to rebook?

📰 AI news digest · 7:00 AM

9 items today. Top three:
• OpenAI quietly bumps GPT-5.5 reasoning — 2M context now default
• Anthropic ships MCP gateway updates relevant to your auth audit
• Google deprecates Gemma 3 (you migrated last week — no action)
brettcpollak.com/ai-digest ↗

🎯 Opportunity scanner · 7:30 AM

Campus IT is rolling out a new SSO config — relevant to your OpenClaw auth audit thread (last touched Apr 22). Linking the wiki page in case it’s useful in your 10 AM with Hari.

🚨 Campus pain monitor · 8:00 AM

VPN failed for 12 staff over 24h — mostly engineering school, 3rd time this week. Pattern looks systemic, not user-error. Worth flagging in your 1:1 with Wayne.

📚 Wiki ingest queue · 8:30 AM

3 new wiki pages drafted from yesterday’s interactions: people/sarah-chen.md, vendors/snowflake.md, decisions/2026-04-29-budget.md — review when ready.

📨 UCSD AI Weekly · Mondays 9:00 AM

This week’s edition is published — 8 tool updates, 2 trainings, TritonAI migration recap. brettcpollak.com/ucsd-ai-news ↗

Same items also queue as cards on Mission Control ↗ — Telegram for the urgent, the dashboard for the rest.

The Inference Layer

Model allocation — matching jobs to capabilities

All 69 active jobs run on TritonAI (UCSD’s institutional gateway, on-prem at SDSC) — zero vendor lock-in. 78 total configured, 9 disabled/deferred. 100% model-agnostic routing through TritonAI’s LiteLLM gateway.

Tier	Alias	Provider · Model	Use & why this model fits	Jobs
Heavy agentic	gpt-oss-120b	TritonAI · OpenAI gpt-oss 120B	Briefings · opportunity scans · weekly signal synthesis · backlog triage · LinkedIn candidates — Best value: powerful single-shot reasoning at TritonAI open-source cost. Handles multi-step agentic work across the full pipeline.	22
Lightweight sync	mistral-small-3.2	TritonAI · Mistral Small 3.2	Token refresh · Gmail/Drive sync · confluence sync · disk watchdog · API health pings — Lowest latency on TritonAI, cheap per-call — perfect for stateless hygiene work that fires every 15-60 min.	18
Mid-tier synthesis	gemma-4	TritonAI · Gemma 4 26B	Daily reflection · nightly context promotion · wiki ingest · team knowledge ops · evening wrap — Strong multimodal + 1M context window — handles synthesis, image understanding, and multi-source reasoning at low cost.	16
Long-form generation	mistral-large-3	TritonAI · Mistral Large 3 675B	UCSD AI newsletter · LinkedIn post drafts · architecture reviews · vision tracking — Best narrative flow on TritonAI — preserves voice across long-form prose without sounding generic.	6
Deep reasoning	sonnet	TritonAI · Claude Sonnet 4.6	Email triage · weekly AI deep-dive · identity drift review · provider quota monitoring — Reserved for jobs that need careful analysis, multi-source verification, and nuanced judgment — worth the premium for low-frequency high-stakes work.	5
Specialized code	deepseek-v4	TritonAI · DeepSeek V4 Flash Max	Overnight code maintenance — Code-specific reasoning that handles refactoring, dependency updates, and repo hygiene without drift.	1
Health check	gpt-oss-120b	TritonAI · OpenAI gpt-oss 120B	Henry API health ping — Same model as heavy agentic tier; just a single hourly status check that doesn’t need its own category.	1
System default	default	TritonAI · DeepSeek V4 Flash Max	Feature reminder — Unpinned job inherits the system default model. One catch-all that keeps complexity low.	1

Cost across the ecosystem

Same 69 active jobs, three pricing scenarios

Read each column top-to-bottom — vendor totals up top, then how Core memory and Brett-specific jobs break down underneath.

Model-agnostic gateway

Anthropic only

OpenAI only

~$16/mo

69 jobs · actual TritonAI published API rates

Core memory · 23 jobs

~$5/mo

Mostly open-weight on-prem (Gemma 4, GPT-OSS, Mistral Small) · 1 Sonnet escalation · near-zero marginal cost

Brett-specific · 46 jobs

~$11/mo

Open-weight synthesis + agentic work on GPT-OSS · Mistral Large for long-form · 2 Sonnet escalations · zero vendor lock-in

~$130/mo 8×

69 jobs · single-vendor lock-in

Core memory · 23 jobs

~$35/mo 7×

Haiku · Sonnet · Opus across all 23 jobs · no open-source savings · no cross-provider diversity

Brett-specific · 46 jobs

~$95/mo 8.6×

All synthesis on Sonnet · long-form on Sonnet · Opus deep-dive · no open-source fallback

~$160/mo 10×

69 jobs · single-vendor lock-in

Core memory · 23 jobs

~$40/mo 8×

gpt-5.5-mini for hygiene · gpt-5.5 for agentic · no open-source savings

Brett-specific · 46 jobs

~$120/mo 11×

Synthesis + long-form on gpt-5.5 · gpt-5.5 for deep-dives · no model diversity

On-prem open-weight models on UCSD GPUs at SDSC have ~$0/mo marginal cost (sunk institutional infra). TritonAI column uses actual published API rates from model_hub_table as of June 2026. Comparison columns use published Anthropic/OpenAI API rates on the same estimated token consumption. Core vs Brett-specific split estimated from job allocation patterns.

What lands — Evening & weekly

End-of-day synthesis becomes long-term memory

Each card below is what actually lands in the corresponding output file or feed.

5:30 PM📧 Email triage + drafts

reply-priority5 messages need a response today — Doris (intake), Hari (project), Wayne (1:1 prep), 2 vendor follow-ups.

drafts-ready3 drafts saved to my Outlook Drafts folder via MS Graph — open the email, review, send. Review beats writing from scratch.

unsubscribe8 newsletters proposed for cleanup based on 30-day open rate.

6:00 PM🎙 Meeting debrief

From today’s meeting transcripts → 4 action items extracted, 2 attendee wikis updated (sarah-chen, doris-tao), 5 new edges added to the knowledge graph, 1 architecture decision logged to decisions/.

7:00 PM🪞 Daily reflection

Today: 3 meetings, 1 architecture decision, 2 follow-ups due Friday.
Pattern noticed: third week of intake delays — worth a process retro.
Promoted to long-term wiki: 4 facts, 2 relationship updates.

3:00 AM🌙 Memory consolidation

12 short-term notes promoted, 3 stale entries pruned, 47 wiki backlinks + 23 knowledge-graph edges reindexed. The way sleep does it for humans — runs while I’m off.

SUN 8 AM📊 Weekly pattern extraction

“VPN issues spiked Mon/Wed/Fri — systemic, not user error.” Drafted a Henry ticket. Cross-day patterns surfaced by traversing the knowledge graph — connections no daily report would catch.

SUN 10 AM✍️ LinkedIn weekly drafts

3 post drafts in my voice, grounded in this week’s actual decisions and work — review beats drafting from scratch.

All of these also land as reviewable cards on Mission Control ↗ — the dashboard where I triage, accept, defer, or kill each item.

Outcomes

What the system actually delivers

2–3

hours / week saved

Research, prep, triage, news scanning, follow-up tracking — the crons gather; I do the judgment.

270

wiki pages indexed

Triggered context loading — the layer most personal-AI projects skip. 80 new pages since May 1.

cron jobs configured

Synthesis pipeline running 24/7 — 69 active, 9 disabled/deferred. Morning briefings, evening reflection, night consolidation.

∞

cross-session continuity

Decisions write back. Patterns persist. Future sessions inherit what past ones learned.

📈Pattern recognition I’d miss — “VPN failed 5 of 7 days” doesn’t surface in any daily report. Weekly synthesis surfaces it automatically.

🧠Institutional memory — “Why did we pick this over that?” has an answer. “When did we last discuss X?” has a date and a transcript link.

🎯Never starting from zero — every conversation begins with 48 hours of context, curated long-term memory, and triggered wiki lookups.

Forward-Looking

What else becomes possible — the foundation is in place

📍 Now · Proven daily

78 cron jobs configured (69 active)Morning briefings, evening synthesis, night consolidation, weekly patterns — plus 9 experimental or seasonal jobs ready to enable.

Triggered wiki retrieval270 pages, loaded on demand based on what the conversation references.

Cross-session continuityDecisions and patterns persist across days, weeks, and threads.

⏭ Next 30 days · Implementation

📋 Meeting prep automationAuto-compile attendee wikis, recent interactions, and open commitments before every meeting. Walk in warm.

🔔 Proactive nudges“This problem has come up three weeks in a row — here are the options you’ve already explored.”

✍️ Draft-first comms expansionMemos, project briefs, status updates — grounded in actual context, not generic AI writing.

🌅 Beyond · What unlocks

🏛 Decision archaeology“Why did we pick this two years ago?” → immediate answer with full reasoning, the voices that disagreed, the tradeoffs.

👥 Team-level memoryOpt-in shared wikis. Team patterns surface without crossing personal boundaries.

🏫 Department insightsAnonymized aggregates: “trending pain points this week” from many agents, never individual records.

Most of these are implementation work, not research. The hard part — the knowledge layer — is already built.

The Vision

Scaling to UC San Diego — what if every staff member had one?

Four-tier scaling: personal, team, department, campus

1 · Personal — own agent, own dataProven daily. Every staff member gets dedicated runtime, isolated data store, governance default of “this is yours.”
2 · Team — opt-in sharing onlyTeam wiki visible to members; 1:1s and personal notes stay private. Nothing crosses the boundary automatically.
3 · Department — anonymized aggregates“Trending pain points this week” — patterns from many agents, not records. Identity stripped before aggregation.
4 · Campus — statistical patterns onlyConnective tissue across departments. Never individual records, never attributable content.
🛡️ Privacy is the preconditionSharing boundaries are first-class features, not bolt-ons. Without that guarantee at every level, nothing ships.

Discussion

Where do we go from here?

1. Data sources & MCP

These tools are clearly valuable for this kind of work — which campus data sources should we be connecting through MCP, and where would those integrations move the needle most?

2. Governing access

How do we govern access to those data sources at the campus level — sanctioned MCP servers, scoped credentials, audit trails, and a sharing model staff and faculty can trust?

3. Over to Shawn Munro

Shawn is going to talk next about opportunities to address this through our APIs and hosting platforms — making these patterns first-class on UCSD infrastructure rather than one-off personal experiments.

      brettcpollak.com/ai-agent-architecture
      github.com/bpollak/mission-control
    

Generic AI is generic.Useful AI knows your world.

What is OpenClaw?

One pipeline, three components

Three layers — and the one most personal AI projects skip

78 cron jobs on a predictable daily rhythm

Six briefings before I open a laptop

Model allocation — matching jobs to capabilities

Same 69 active jobs, three pricing scenarios

End-of-day synthesis becomes long-term memory

What the system actually delivers

What else becomes possible — the foundation is in place

Scaling to UC San Diego — what if every staff member had one?

Generic AI is generic.
Useful AI knows your world.