<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0"><channel><title><![CDATA[OpenMesh]]></title><description><![CDATA[OpenMesh]]></description><link>https://www.openmesh.ai/blogs</link><generator>RSS for Node</generator><lastBuildDate>Sat, 23 May 2026 16:52:18 GMT</lastBuildDate><atom:link href="https://www.openmesh.ai/blog-feed.xml" rel="self" type="application/rss+xml"/><item><title><![CDATA[DecisionBench: Measuring the Agent Handoff, Not Just the Answer]]></title><description><![CDATA[We introduce DecisionBench, a benchmark for emergent delegation in long-horizon agentic workflows. It measures not just whether a task gets solved, but whether an agent hands its subtasks to the right peer model along the way. We characterize the benchmark with a five-condition reference sweep across an 11-model, 7-vendor pool, covering 23,375 task instances, and release the substrate, the annotation layer, the analysis...]]></description><link>https://www.openmesh.ai/news/decisionbench-measuring-whether-agents-delegatewell</link><guid isPermaLink="false">6a10ba170ac12a65423b768c</guid><pubDate>Fri, 22 May 2026 20:56:21 GMT</pubDate><enclosure url="https://static.wixstatic.com/media/47365e_0a60868acbcd41ebbc6981ec9ce6ee04~mv2.png/v1/fit/w_1000,h_839,al_c,q_80/file.png" length="0" type="image/png"/><dc:creator>Michael Gao</dc:creator></item><item><title><![CDATA[Introducing IntelligenceArena]]></title><description><![CDATA[A new way to measure intelligence in the age of AI agents. Artificial intelligence has entered a new phase. Models are no longer evaluated in isolation, and systems are no longer defined by a single benchmark score. Today’s AI landscape is composed of agents, workflows, and continuously evolving tools that operate in real-world environments.
Yet the way we evaluate these systems has not kept pace. Most benchmarks remain static, capturing capability at a fixed moment while ignoring how systems...]]></description><link>https://www.openmesh.ai/news/introducing-intelligencearena</link><guid isPermaLink="false">69ed4827b502c05c43b7cf2e</guid><pubDate>Sat, 25 Apr 2026 23:03:46 GMT</pubDate><enclosure url="https://static.wixstatic.com/media/604407_91e95ca3ef4d4124a6fbbcedadfae591~mv2.jpg/v1/fit/w_1000,h_1000,al_c,q_80/file.png" length="0" type="image/png"/><dc:creator>meganwwl123</dc:creator></item><item><title><![CDATA[OpenMesh: Building the Infrastructure Layer for Production AI Systems]]></title><description><![CDATA[OpenMesh is building the infrastructure layer for the next generation of AI systems. We believe the future of AI will not be defined only by larger models or higher benchmark scores. It will be defined by how intelligently those models are used in production: how tasks are routed, how workflows are decomposed, how outputs are evaluated, how failures are detected, and how systems adapt over time. Most teams today still rely on a single model for every task. That approach is simple, but it is...]]></description><link>https://www.openmesh.ai/news/openmesh-building-the-infrastructure-layer-for-production-ai-systems</link><guid isPermaLink="false">69dc66694760d48e5e91405f</guid><pubDate>Mon, 13 Apr 2026 03:55:46 GMT</pubDate><enclosure url="https://static.wixstatic.com/media/604407_6f529de924b64a14b613b7015965084d~mv2.png/v1/fit/w_1000,h_1000,al_c,q_80/file.png" length="0" type="image/png"/><dc:creator>meganwwl123</dc:creator></item></channel></rss>