agentsPublished: June 23, 2026

SHERLOC: Structured Diagnostic Localization for Code Repair Agents

By Hovhannes Tamoyan, Sean Narenthiran, Erik Arakelyan, Mira Mezini, Boris Ginsburg

Research TL;DR

"Training-free framework using reasoning LLM and repository tools for structured code fault localization, achieving SOTA on SWE-Bench and boosting repair agents while cutting tokens."

Abstract

LLM agents solve repository-level coding tasks through multi-turn tool use, but utilize half their budget on locating faults before editing. Dedicated localization frameworks have emerged, yet are still evaluated as file retrieval rather than actionable diagnosis, producing locations without the diagnostic context a repair agent needs. We introduce SHERLOC (Structured Hypothesis-driven Exploration and Reasoning for Localization), a training-free framework pairing a reasoning LLM with compact repository tools and self-recovery, without fine-tuning or multi-agent orchestration. SHERLOC reaches state-of-the-art localization across model scales: 84.33% accuracy@1 on SWE-Bench Lite and 81.27% recall@1 on SWE-Bench Verified; at ~30B parameters, it matches or outperforms other agentic methods. Injecting our locations and diagnostic findings into repair agents yields, on average, +5.95 pp resolve rate on SWE-Bench Verified while cutting localization and total tokens by 36.7% and 23.1%.

Read full paper on arXiv →

Related Research

Jun 2026

Accelerate your workflow with Feedalyze

AI churn detection for SaaS. Know which customers will leave before they do.

Free plan available · Connects to HubSpot, Intercom, Zendesk

Detect churn before it happens →

SHERLOC: Structured Diagnostic Localization for Code Repair Agents

Abstract

Related Research

InSight: Self-Guided Skill Acquisition via Steerable VLAs

OpenThoughts-Agent: Data Recipes for Agentic Models

World Models in Pieces: Structural Certification for General Agents

Accelerate your workflow with Feedalyze