Inference, elegantly engineered

About us

We make the frontier model market disappear behind one durable endpoint.

Direct Inference started with a frustration its founders kept hitting in production: every team building on AI was quietly running a second engineering project on the side — tracking model launches, renames, and retirements; hand-wiring capability rules; rebuilding failover trees; and re-pricing traffic every time a lab shipped. The model layer had crept into the codebase, and it never stopped moving. We built Direct Inference to take that layer back out.

The part we care about most is what the caller doesn’t see. Direct Inference is zero-knowledge by design: your users, and your own logs, never learn which model, provider, or version served a request. That isn’t a limitation we tolerate — it’s the point. When the serving path is an internal detail instead of part of your product surface, an upstream rename can’t break a branch you forgot you wrote, and a provider change becomes our operational event instead of your security review.

We think inference should feel finished. One endpoint, one key, one cost-and-quality knob, hard spend caps that fail closed, and an endpoint that puts capability ahead of the model name. The frontier will keep moving. Our job is to make sure your integration doesn’t have to.

Leadership

The team behind the endpoint.

Keith Coleman

Co-founder & CEO

Keith spent a decade building and operating large-scale inference and search systems, most recently leading platform engineering for a frontier AI lab where he saw firsthand how quickly model churn turns into customer pain.

Dr. Elena Varga

Co-founder & CTO

Elena led model-serving infrastructure at a hyperscale cloud provider, running multi-tenant inference at the scale of billions of requests a day. She owns the engine that serves every Direct Inference request.

Devin Asante

Head of Engineering

Devin built payments and risk infrastructure at a global fintech, where a single missed guardrail meant real money lost. He brought that discipline to DI’s spend controls and reliability surface.

Hannah Reyes

Head of Security & Trust

Hannah ran security and compliance programs at two enterprise SaaS companies through SOC 2 and ISO audits. She owns the zero-knowledge contract end to end.

Backed by

Direct Inference is a venture-backed company. We raised a $27M Series A to build the durable endpoint teams run their production AI on.

Coastal Ridge VenturesInflection Point CapitalLatchford & Co.Northbound Labs

With angel investors Priya Venkataraman and Marcus Oyelaran.

Newsroom

Announcements and press.

TechCrunch
April 14, 2026

Direct Inference raises $27M Series A to make the AI model market disappear behind one endpoint

The New Stack
March 3, 2026

Zero-knowledge inference: why Direct Inference won’t tell you which model answered — and why customers prefer it that way

VentureBeat
January 22, 2026

As frontier models churn, Direct Inference bets enterprises want a durable surface, not a model marketplace

Building the layer the AI market runs on.

See open roles Get in touch