Research
Findings we make public
May 27, 2026 · White paper
Five structural gaps in error monitoring
Evidence, causes, and a path forward for grouping, prioritization, configuration decay, alert noise, and AI-generated fixes.
White paper · Error Monitoring
01 Clustering & duplicates
02 Error prioritization
03 Configuration decay
04 Alert noise
05 AI-generated fixes
May 11, 2026 · Publication
Measuring which tool calls help agents debug
Marginal tool utility and tool efficiency measure whether individual tool calls improve an agent’s probability of solving the task. Removing noisy tools preserved accuracy while doubling efficiency.
Finding · Tool Efficiency
April 4, 2026 · Benchmark
Measuring root cause accuracy from telemetry
A benchmark for the question every debugging agent should answer: what caused the production failure? The evaluation targets root cause analysis from telemetry, not log summarization.
Benchmark · Root Cause Accuracy