top of page

Research on the Next Generation of Agentic Infrastructure

Benchmarking, evaluating, and optimizing real-world AI systems across cost, latency, and quality

Product-driven Research. Research-driven Product.

We believe the next wave of AI progress will come not only from stronger models, but from better systems around those models: how inference is deployed, how workflows are routed, how outputs are grounded in external information, and how performance is measured in production. OpenMesh is built around that view. Our approach is to treat product and research as mutually reinforcing. Real-world usage exposes failure modes, performance bottlenecks, and cost-quality tradeoffs that do not appear in isolated benchmarks. Research helps us convert those observations into better architectures, stronger routing policies, and more reliable production systems.

OpenMesh: A Lab for Production AI Systems

May 17, 2026

Distribution-Free Uncertainty for Continuous Agent Evaluation

Accepted to International Conference in Machine Learning (ICML) 2026 AgenticUQ Workshop 

May 7, 2026

DecisionBench: Benchmarking Skill-Aware Emergent Orchestration in Long-Horizon Agentic Workflows

In Submission to NeurIPS 2026 

Interested in researching with us?

We are interested in speaking with researchers, engineers, and early partners who care about the future of agentic infrastructure systems.

Contact Us
bottom of page