About us
We make the frontier model market disappear behind one durable endpoint.
Direct Inference started with a frustration its founders kept hitting in production: every team building on AI was quietly running a second engineering project on the side. That project meant tracking model launches, renames, and retirements; hand-wiring capability rules; rebuilding failover trees; and re-pricing traffic every time a lab shipped a new model. The model layer had crept into the codebase, and it never stopped moving. We built Direct Inference to take that layer back out.
The part we care about most is what the caller doesn’t see. Direct Inference is zero-knowledge by design: your users, and your own logs, never learn which model, provider, or version served a request. That isn’t a limitation we tolerate — it’s the point. When the serving path is an internal detail instead of part of your product surface, an upstream rename can’t break a branch you forgot you wrote, and a provider change becomes our operational event instead of your security review.
We think inference should feel finished. One endpoint, one key, one cost-and-quality knob, hard spend caps that fail closed, and an interface that puts capability ahead of the model name. The frontier will keep moving. Our job is to make sure your integration doesn’t have to.
Leadership
The team behind the endpoint.
Keith Coleman
Co-founder & CEO
Keith spent a decade building and operating large-scale inference and search systems, most recently leading platform engineering for a frontier AI lab, where he saw firsthand how quickly model churn turns into customer pain.
Dr. Elena Varga
Co-founder & CTO
Elena led model-serving infrastructure at a hyperscale cloud provider, running multi-tenant inference at the scale of billions of requests a day. She owns the engine that serves every Direct Inference request.
Devin Asante
Head of Engineering
Devin built payments and risk infrastructure at a global fintech, where a single missed guardrail meant real money lost. He brought that discipline to Direct Inference’s spend controls and reliability surface.
Hannah Reyes
Head of Security & Trust
Hannah ran security and compliance programs at two enterprise SaaS companies through SOC 2 and ISO audits. She owns the zero-knowledge contract end to end.
Backed by
Direct Inference is a venture-backed company. We raised a $27M Series A to build the durable endpoint teams run their production AI on.
Angel investors: Priya Venkataraman and Marcus Oyelaran.
Newsroom
Announcements and press.
April 14, 2026
Direct Inference raises $27M Series A to make the AI model market disappear behind one endpoint
March 3, 2026
Zero-knowledge inference: why Direct Inference won’t tell you which model answered — and why customers prefer it that way
January 22, 2026
As frontier models churn, Direct Inference bets enterprises want a durable surface, not a model marketplace
Building the layer the AI market runs on.