Research on the Next Generation of Agentic Infrastructure
Benchmarking, evaluating, and optimizing real-world AI systems across cost, latency, and quality
Product-driven Research. Research-driven Product.
We believe the next wave of AI progress will come not only from stronger models, but from better systems around those models: how inference is deployed, how workflows are routed, how outputs are grounded in external information, and how performance is measured in production. OpenMesh is built around that view. Our approach is to treat product and research as mutually reinforcing. Real-world usage exposes failure modes, performance bottlenecks, and cost-quality tradeoffs that do not appear in isolated benchmarks. Research helps us convert those observations into better architectures, stronger routing policies, and more reliable production systems.
OpenMesh: A Lab for Production AI Systems
May 17, 2026
Distribution-Free Uncertainty for Continuous Agent Evaluation
Accepted to International Conference in Machine Learning (ICML) 2026 AgenticUQ Workshop
May 7, 2026
DecisionBench: Benchmarking Skill-Aware Emergent Orchestration in Long-Horizon Agentic Workflows
In Submission to NeurIPS 2026
Accepted to Conference on AI and Agentic Systems (CAIS) 2026 AgenticSE Workshop
Interested in researching with us?
We are interested in speaking with researchers, engineers, and early partners who care about the future of agentic infrastructure systems.
