An open-source agent runtime that connects chat, tools, memory, and automation into one self-hosted system. 378k+ GitHub stars · MIT licensed · runs entirely on your infrastructure.
Same items also queue as cards on Mission Control ↗ — Telegram for the urgent, the dashboard for the rest.
All 69 active jobs run on TritonAI (UCSD’s institutional gateway, on-prem at SDSC) — zero vendor lock-in. 78 total configured, 9 disabled/deferred. 100% model-agnostic routing through TritonAI’s LiteLLM gateway.
| Tier | Alias | Provider · Model | Use & why this model fits | Jobs |
|---|---|---|---|---|
| Heavy agentic | gpt-oss-120b | TritonAI · OpenAI gpt-oss 120B | Briefings · opportunity scans · weekly signal synthesis · backlog triage · LinkedIn candidates — Best value: powerful single-shot reasoning at TritonAI open-source cost. Handles multi-step agentic work across the full pipeline. |
22 |
| Lightweight sync | mistral-small-3.2 | TritonAI · Mistral Small 3.2 | Token refresh · Gmail/Drive sync · confluence sync · disk watchdog · API health pings — Lowest latency on TritonAI, cheap per-call — perfect for stateless hygiene work that fires every 15-60 min. |
18 |
| Mid-tier synthesis | gemma-4 | TritonAI · Gemma 4 26B | Daily reflection · nightly context promotion · wiki ingest · team knowledge ops · evening wrap — Strong multimodal + 1M context window — handles synthesis, image understanding, and multi-source reasoning at low cost. |
16 |
| Long-form generation | mistral-large-3 | TritonAI · Mistral Large 3 675B | UCSD AI newsletter · LinkedIn post drafts · architecture reviews · vision tracking — Best narrative flow on TritonAI — preserves voice across long-form prose without sounding generic. |
6 |
| Deep reasoning | sonnet | TritonAI · Claude Sonnet 4.6 | Email triage · weekly AI deep-dive · identity drift review · provider quota monitoring — Reserved for jobs that need careful analysis, multi-source verification, and nuanced judgment — worth the premium for low-frequency high-stakes work. |
5 |
| Specialized code | deepseek-v4 | TritonAI · DeepSeek V4 Flash Max | Overnight code maintenance — Code-specific reasoning that handles refactoring, dependency updates, and repo hygiene without drift. |
1 |
| Health check | gpt-oss-120b | TritonAI · OpenAI gpt-oss 120B | Henry API health ping — Same model as heavy agentic tier; just a single hourly status check that doesn’t need its own category. |
1 |
| System default | default | TritonAI · DeepSeek V4 Flash Max | Feature reminder — Unpinned job inherits the system default model. One catch-all that keeps complexity low. |
1 |
Read each column top-to-bottom — vendor totals up top, then how Core memory and Brett-specific jobs break down underneath.
On-prem open-weight models on UCSD GPUs at SDSC have ~$0/mo marginal cost (sunk institutional infra). TritonAI column uses actual published API rates from model_hub_table as of June 2026. Comparison columns use published Anthropic/OpenAI API rates on the same estimated token consumption. Core vs Brett-specific split estimated from job allocation patterns.
Each card below is what actually lands in the corresponding output file or feed.
All of these also land as reviewable cards on Mission Control ↗ — the dashboard where I triage, accept, defer, or kill each item.
Most of these are implementation work, not research. The hard part — the knowledge layer — is already built.