// personal project LIVE

AgentOS

A personal agent runtime: models propose, code decides

A task-graph engine fans audits out across the project portfolio, a deterministic verifier rejects incomplete work before it is accepted, and every file write is a replay of a verified report through code-enforced gates. Four executable workflows run on the engine today, operated from Studio, a hub inside the VS Code dashboard that prices a run before launching it and observes it live. Underneath: a context portfolio served over MCP, with shared skills and memory across Claude, Codex and Cursor. Context becomes infrastructure, not a paste job.
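
A minimal sketch of the "models propose, code decides" rule described above: a deterministic check rejects an incomplete report, and writes are only ever produced by replaying a report that passed. The field names (`summary`, `findings`, `writes`), the `Verdict` type and the path allow-list gate are assumptions for illustration, not the actual AgentOS schema or gate set.

```python
from dataclasses import dataclass, field
from pathlib import PurePosixPath

# Hypothetical required fields; the real verifier's rules are not shown in the source.
REQUIRED_FIELDS = ("summary", "findings", "writes")


@dataclass
class Verdict:
    ok: bool
    reasons: list[str] = field(default_factory=list)


def verify(report: dict) -> Verdict:
    """Deterministic check: the same report always yields the same verdict."""
    reasons = [
        f"missing or empty field: {name}"
        for name in REQUIRED_FIELDS
        if not report.get(name)
    ]
    return Verdict(ok=not reasons, reasons=reasons)


def gated_writes(report: dict, allowed_root: str) -> list[tuple[str, str]]:
    """Replay a report's proposed writes, but only if every gate passes."""
    verdict = verify(report)
    if not verdict.ok:
        raise PermissionError("; ".join(verdict.reasons))
    root = PurePosixPath(allowed_root)
    approved = []
    for write in report["writes"]:
        path = PurePosixPath(write["path"])
        # Gate (assumed): refuse '..' segments and anything outside the allowed root.
        if ".." in path.parts or root not in path.parents:
            raise PermissionError(f"path outside gate: {path}")
        approved.append((str(path), write["content"]))
    return approved  # the caller performs the actual file writes
```

The point of the split is that the model only ever produces the report; whether anything reaches disk is decided by plain code that can be tested on its own.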

03 ai clients · 10 portfolio files · 04 executable workflows · v0.5.79 dashboard
continuous_feedback [01]

Self-improving showcase

The system that builds this website can now audit its own output, identify where it undersells the project, and propose concrete improvements. The content-audit loop compares git evidence against the live site, generates a synthesis layer with repositioning recommendations, and ranks proposals by impact, effort, and risk, so 'what to do first' is a code decision, not a guess. A grounding split separates repo-backed claims from speculative ideas.

Synthesis layer
Evidence-grounded strategy

The audit generates fields like whatFeelsUnderstated and recommendedHeroShift, each backed by citations to commits and changelog entries. The model proposes; the evidence decides.
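
One way to picture the grounding split, as a sketch only: a synthesis field counts as repo-backed when it carries at least one citation and every citation resolves against the known git evidence; anything else is treated as speculative. The `citations` key, the shape of each entry and the example identifiers are assumptions; only the field names `whatFeelsUnderstated` and `recommendedHeroShift` come from the text.

```python
def split_by_grounding(synthesis: dict, known_evidence: set[str]) -> tuple[dict, dict]:
    """Split synthesis fields into (repo_backed, speculative)."""
    backed, speculative = {}, {}
    for name, entry in synthesis.items():
        cites = entry.get("citations", [])
        # Repo-backed only if there is at least one citation and all of them resolve.
        if cites and all(c in known_evidence for c in cites):
            backed[name] = entry
        else:
            speculative[name] = entry
    return backed, speculative


# Usage with made-up evidence identifiers.
evidence = {"commit:abc123", "changelog:0.5.79"}
synthesis = {
    "whatFeelsUnderstated": {"text": "...", "citations": ["commit:abc123"]},
    "recommendedHeroShift": {"text": "...", "citations": []},
}
backed, speculative = split_by_grounding(synthesis, evidence)
```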

Decision engine
What to do first

Proposals are ranked by a deterministic nextBestActionScore: impact weighted by confidence, minus penalties for effort and risk, plus a bonus for grounding. Quick-wins clustering surfaces changes worth doing together.
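
The scoring rule above can be sketched as follows. The structure (impact times confidence, minus effort and risk penalties, plus a grounding bonus) follows the description; the 0 to 1 scales, the weight values and the title tie-break are assumptions chosen for illustration, not the real `nextBestActionScore` constants.

```python
from dataclasses import dataclass

# Hypothetical weights; the actual values are not given in the source.
EFFORT_PENALTY = 0.3
RISK_PENALTY = 0.3
GROUNDING_BONUS = 0.1


@dataclass(frozen=True)
class Proposal:
    title: str
    impact: float      # assumed scale 0..1
    confidence: float  # assumed scale 0..1
    effort: float      # assumed scale 0..1
    risk: float        # assumed scale 0..1
    grounded: bool     # True when the proposal is repo-backed


def next_best_action_score(p: Proposal) -> float:
    """Impact weighted by confidence, minus penalties, plus a grounding bonus."""
    score = p.impact * p.confidence
    score -= EFFORT_PENALTY * p.effort + RISK_PENALTY * p.risk
    if p.grounded:
        score += GROUNDING_BONUS
    return round(score, 4)


def rank(proposals: list[Proposal]) -> list[Proposal]:
    """Highest score first; ties broken by title so the order is reproducible."""
    return sorted(proposals, key=lambda p: (-next_best_action_score(p), p.title))
```

Because the function has no randomness and no model call, the same set of proposals always produces the same ordering, which is what makes "what to do first" a code decision.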

Latest proof point
Salesforce CI/CD onboarded

Salesforce CI/CD is the latest project to join the showcase. It was onboarded through the formal process and already has multiple audit reports generated, which shows the loop working on a project other than the one it was built for.

why_it_exists [02]

Context as infrastructure

Every time a new Claude session opens, or a new agent spins up, the same question hits: does it know who I am, how I work, what the project context is? Without a system, the answer is always no, and the first ten minutes of every session go to re-establishing ground that was covered yesterday.

AgentOS is the answer to that. A personal operating system for the agentic era: a persistent context layer that travels across tools, sessions, and projects.

The foundation is a Context Portfolio: ten structured markdown files covering identity, role, projects, team, communication style, decisions, and more. Deployed as an MCP server so any connected agent can read any file on demand. Versioned in git. Synced automatically at the end of every session.

The aim is a personal system that compounds in value over time, session by session, as it gets refined and extended.

What it covers
01 Context Portfolio (10 files, MCP-served)
02 Three AI clients: Claude, Codex, Cursor
03 Shared skills and memory layer
04 Task-graph runtime with verification gates
05 Live token tracking and analytics
06 VS Code dashboard with the Studio hub

Status

Active build. Context portfolio complete and served under pm2. Claude, Codex and Cursor wired in via the shared HTTP server. Task-graph runtime live with four executable workflows, deterministic verification, and two real gated applies committed. Dashboard at v0.5.79 with the Studio hub.

architecture [03]

Three layers and a runtime

The portfolio is the source of truth. The MCP gateway is the access layer. The clients are the consumers. The task-graph runtime lives inside the gateway and does the actual agent work. Git is the backup for everything.

Foundation
Context Portfolio

Ten markdown files covering identity, role, tools, projects, communication style, goals, and domain knowledge. Machine-readable. Versioned in git. Changes are live immediately.

Access layer
MCP Server

Node.js server exposing portfolio files as typed MCP resources at 127.0.0.1:3000/mcp. Runs under pm2, auto-starts on login. Tracks every read, skill call, and tool invocation with token counts.
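
For orientation, reading a portfolio file from such a server comes down to an MCP `resources/read` call, which is a JSON-RPC 2.0 request. The sketch below only builds the HTTP request and does not send it. The endpoint host, port and path come from the text and the bearer header reflects the "bearer-authed" note in the system map; the `http` scheme, the `portfolio://identity.md` URI and the token value are assumptions. A real session would also begin with an `initialize` handshake, which is omitted here.

```python
import json
import urllib.request

# Host, port and path are from the text; the http scheme is assumed (loopback).
MCP_URL = "http://127.0.0.1:3000/mcp"


def build_read_request(uri: str, token: str, request_id: int = 1) -> urllib.request.Request:
    """Build (but do not send) a JSON-RPC 2.0 `resources/read` request."""
    body = {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "resources/read",
        "params": {"uri": uri},
    }
    return urllib.request.Request(
        MCP_URL,
        data=json.dumps(body).encode("utf-8"),
        method="POST",
        headers={
            "Content-Type": "application/json",
            # Streamable HTTP clients accept both plain JSON and an event stream.
            "Accept": "application/json, text/event-stream",
            "Authorization": f"Bearer {token}",
        },
    )


# Hypothetical resource URI and placeholder token.
req = build_read_request("portfolio://identity.md", token="example-token")
```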

Consumers
Three Clients

Claude Code, Codex and Cursor all connect to the same MCP server. Shared memory and 16 shared commands are deployed to all three via per-client sync scripts. One canonical source; three runtime instances.
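
The "one canonical source; three runtime instances" idea can be sketched as a copy from a shared directory into each client's own location. This is an illustration under assumptions: the source describes per-client sync scripts but does not show them, and the target directories passed in here are hypothetical.

```python
import shutil
from pathlib import Path


def sync_shared(shared_root: Path, targets: dict[str, Path]) -> dict[str, int]:
    """Copy shared memory/ and skills/ into each client's directory.

    Returns the number of files present in each target after the sync.
    """
    counts = {}
    for client, dest in targets.items():
        for kind in ("memory", "skills"):
            src = shared_root / kind
            if src.is_dir():
                # dirs_exist_ok lets a re-run overwrite files from an earlier sync.
                shutil.copytree(src, dest / kind, dirs_exist_ok=True)
        counts[client] = sum(1 for f in dest.rglob("*") if f.is_file())
    return counts
```

Keeping the copy one-directional means edits are made in the canonical source only, and a re-run brings every client back in line.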

system_map
Claude Code
Codex
Cursor
Studio (VS Code)
MCP gateway
loopback only · bearer-authed · pm2
Context portfolio
10 files · read live from disk
Task-graph runtime
fan-out · verify · gated apply
Run telemetry
JSONL canonical + SQLite ledger
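
A sketch of what "JSONL canonical + SQLite ledger" could mean in practice: events are appended to a JSONL file, which stays the source of truth, and the SQLite table is a derived index that can be dropped and rebuilt from that file at any time. The event fields (`run_id`, `kind`, `tokens`) and the table layout are assumptions, not the actual AgentOS telemetry schema.

```python
import json
import sqlite3
from pathlib import Path


def append_event(log_path: Path, event: dict) -> None:
    """Append one event as a single JSON line (the canonical record)."""
    with log_path.open("a", encoding="utf-8") as fh:
        fh.write(json.dumps(event, sort_keys=True) + "\n")


def rebuild_ledger(log_path: Path, db: sqlite3.Connection) -> int:
    """Recreate the derived SQLite ledger from the JSONL log; returns row count."""
    db.execute("DROP TABLE IF EXISTS events")
    db.execute("CREATE TABLE events (run_id TEXT, kind TEXT, tokens INTEGER)")
    rows = []
    with log_path.open(encoding="utf-8") as fh:
        for line in fh:
            if line.strip():  # skip blank lines
                event = json.loads(line)
                rows.append((event["run_id"], event["kind"], event.get("tokens", 0)))
    db.executemany("INSERT INTO events VALUES (?, ?, ?)", rows)
    db.commit()
    return len(rows)
```

With the ledger rebuilt, per-run token totals become a single query, for example `SELECT run_id, SUM(tokens) FROM events GROUP BY run_id`.
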
explore_the_system [04]
build_log [05]

From context files to a verified runtime

Built session by session since April 2026: context portfolio, local MCP server, multi-client wiring, multi-provider models, cost tracking, a VS Code extension, and then a task-graph runtime with deterministic verification and gated applies on top. The build log records every phase.

read the build log →
context-portfolio/ (10 md files)
└── identity, role, projects, tools, style …
mcp-server/ (Node.js + MCP SDK)
├── POST /mcp ──▶ StreamableHTTP transport
├── runner/ ──▶ graph engine, verifier, apply gates
└── run-control API ──▶ workflows, runs, live events
clients/shared/
├── memory/ ──▶ all 3 clients
└── skills/ ──▶ all 3 clients (16 commands)
vscode-agentos-dashboard/ (v0.5.79)
└── sidebar + webview, 11 tabs including Studio, 0 dependencies
resources [06]

The conceptual and practical foundations draw from the AI Daily Brief's work on agentic systems and context management.

Want to build your own?

The AIDB AgentOS programme is free and open. Start with the context portfolio. Everything else follows from there.

View the programme →