Bay Area

Ritesh Mishra

I build platforms, scale teams, and stay hands-on. Currently obsessed with making AI actually useful in developer workflows. I like taking people on the journey, not just showing them the destination.

About

Engineering leader who still writes code. I get energy from hard problems, whether that's distributed systems architecture, figuring out how AI fits into real workflows, or helping engineers level up.

My thing is taking complex technical problems and making them approachable. I've done this across geospatial systems, real-time data pipelines, API platforms, and now AI tooling. The common thread: good abstractions, composable systems, and always asking "how do we make this easier for the next person?"

I read a lot, research even more, and genuinely enjoy bringing people along for the ride. Most of my best ideas come from whiteboard sessions that start with "okay hear me out..."

Books that shaped me
  • Who Says Elephants Can't Dance
  • Trillion Dollar Coach
  • Fahrenheit 451
  • The Age of Intelligent Machines
Education
UC Davis
Managerial Economics turned Engineer
Math nerd at heart
Thoughts
May 17, 2026draft

Sidecar Context Architectures for Model Portability

A systems design problem: how do you maintain persistent state across stateless inference runtimes? This is less about AI and more about building durable architectures when your compute layer is fundamentally ephemeral.

Designing for Stateless Compute

The Statefulness Problem

Modern inference runtimes are stateless by design. Every request reconstructs context from scratch. This is the same pattern we solved in web services with session stores, caches, and databases. Why are we relearning it?

The Problem

When you swap between different models locally, all accumulated context evaporates. Retrieval systems fetch fragments. Memory systems inject tokens. But neither maintains relational structure across runtime switches.

Sidecar as a Persistence Layer

What if there was a persistent graph that lives alongside your local models? A sidecar architecture that maintains entity state, timeline, and retrieval policies independent of which runtime is active.

The Architecture

User
  ↓
Inference Runtime (Ollama)
  ↓
Context Sidecar
  ├── Semantic Graph
  ├── Entity Memory
  ├── Timeline State
  └── Retrieval Policies

The sidecar intercepts requests, enriches context from the graph, and persists new relationships back. Runtime-agnostic. Local-first. The graph survives runtime switches, updates, even complete swaps.

Retrieval vs Structure

Vector similarity finds related chunks but doesn't encode why they're related. The graph stores relationships explicitly: causality, temporal ordering, entity connections. Any runtime can reason over structure, not just surface similarity.

This is a systems architecture thesis. Local-first compute needs persistent identity infrastructure. The runtimes become interchangeable; the context layer becomes the product.

Systems DesignLocal-FirstDistributed State
2026published

Context Lineage for Evolving Systems

A formal framework for versioned knowledge persistence. How do you maintain state integrity when your underlying systems are constantly evolving? This applies to AI agents but the patterns are older than that.

ResearchState Management
2026live

AgentIndexc

A side project exploring structured context layers. Scores website readability for automated systems and experiments with real-time context aggregation.

Side ProjectExperimentation

More half-baked ideas brewing. The best ones usually start as scribbles.

Building

At Work

Alkami Code Studio

AI-native development platform for building banking widgets. Agent-driven workflows with shared context architecture.

AI PlatformDeveloper Tools

Side Projects

llama-nest

Local-first memory sidecar for Ollama. Persistent semantic graphs that survive model switches.

RustOllamaLocal-First

AgentIndexc

A side project exploring context lineage for AI agents. Scores websites for AI readiness.

AISide Project

ohdh-context-arbiter

Local-first context arbitration for resolving conflicting instructions in memory stores.

AIpgvector
Journey
2023

Alkami

Engineering Leadership

Leading AI and platform strategy. Shipped Code Studio, scaled the developer platform, and somehow convinced everyone that AI tooling is actually worth the investment.

Presenting Code Studio at the keynote
Presenting Code Studio at the keynote
2021

Varo

Engineering Manager

Built the core platform integrating Zelle, Direct Deposit, Micro Deposits, and Card Management. Real-time money movement at scale. Learned that highly regulated systems can still be elegant if you fight for it.

A day in the life
A day in the life
2017

Zillow

Senior Software Engineer

Built an internal geocoding system using TIGER, OpenStreetMap, and proprietary geospatial datasets. Powered Zillow Group location intelligence and Zestimate spatial data. Helped take Zillow international with the Canada launch.

Scheming on internal geocoding
Scheming on internal geocoding
2015

NASA Ames

Intern

WorldWind virtual globe SDK. Built software that helped people look at Earth from space. Peak childhood dream energy.

The early days
The early days