LogoAgentHunter
  • Submit
  • Industries
  • Categories
  • Agency
Logo
LogoAgentHunter

Discover, Compare, and Leverage the Best AI Agents

Featured On

Featured on yo.directory
yo.directory
Featured on yo.directory
Featured on Startup Fame
Startup Fame
Featured on Startup Fame
AIStage
Listed on AIStage
Sprunkid
Featured on Sprunkid
Featured on Twelve Tools
Twelve Tools
Featured on Twelve Tools
Listed on Turbo0
Turbo0
Listed on Turbo0
Featured on Product Hunt
Product Hunt
Featured on Product Hunt
Game Sprunki
Featured on Game Sprunki
AI Toolz Dir
Featured on AI Toolz Dir
Featured on Microlaunch
Microlaunch
Featured on Microlaunch
Featured on Fazier
Fazier
Featured on Fazier
Featured on Techbase Directory
Techbase Directory
Featured on Techbase Directory
backlinkdirs
Featured on Backlink Dirs
Featured on SideProjectors
SideProjectors
Featured on SideProjectors
Submit AI Tools
Featured on Submit AI Tools
AI Hunt
Featured on AI Hunt
Featured on Dang.ai
Dang.ai
Featured on Dang.ai
Featured on AI Finder
AI Finder
Featured on AI Finder
Featured on LaunchIgniter
LaunchIgniter
Featured on LaunchIgniter
Imglab
Featured on Imglab
AI138
Featured on AI138
600.tools
Featured on 600.tools
Featured Tool
Featured on Featured Tool
Dirs.cc
Featured on Dirs.cc
Ant Directory
Featured on Ant Directory
Featured on MagicBox.tools
MagicBox.tools
Featured on MagicBox.tools
Featured on Code.market
Code.market
Featured on Code.market
Featured on LaunchBoard
LaunchBoard
Featured on LaunchBoard
Genify
Featured on Genify
Copyright © 2025 All Rights Reserved.
Product
  • AI Agents Directory
  • AI Agent Glossary
  • Industries
  • Categories
Resources
  • AI Agentic Workflows
  • Blog
  • News
  • Submit
  • Coummunity
  • Ebooks
Company
  • About Us
  • Privacy Policy
  • Terms of Service
  • Sitemap
Friend Links
  • AI Music API
  • ImaginePro AI
  • Dog Names
  • Readdit Analytics
Back to News List

Debug-gym Can AI agents lighten developers debugging load

April 10, 2025•Brenda Potts•Original Link•2 minutes
AI
Debugging
SoftwareDevelopment

Developers spend a lot of time debugging code. Learn how debug-gym can equip AI agents to help, enabling them to set breakpoints, navigate the codebase, and print runtime variable values on demand, so they better understand the code and its execution flow.

A graphic with a gradient background transitioning from blue on the left to pink on the right. The graphic features a white outline of a computer monitor with code brackets on the screen, an arrow pointing downwards into the monitor, and another arrow curving around to point upwards towards a magnifying glass with a bug icon inside it.

The Growing Role of AI in Coding

With the rise of AI coding tools like GitHub Copilot, developers are increasingly relying on AI to generate code. GitHub CEO Thomas Dohmke predicted that "sooner than later, 80% of the code is going to be written by Copilot." This trend is evident in startups, where 95% of code for a quarter of Y Combinator’s latest batch was written by large language models (LLMs).

However, most developers spend the majority of their time debugging code, not writing it. This raises an important question: Can AI tools also assist in debugging?

Introducing Debug-gym

Microsoft Research has developed debug-gym, an environment that equips AI agents with interactive debugging tools. Unlike traditional AI coding tools, debug-gym allows agents to:

  • Set breakpoints
  • Navigate codebases
  • Print runtime variable values
  • Create test functions

Figure 1: Diagram demonstrating the code-repairing process in outline. Left: conventional code-repairing system; right: additional tools enabled by debug-gym.

Key Features of Debug-gym

  • Repository-level information: Agents can navigate and edit files across the entire repository.
  • Robust and safe: Code runs in sandboxed Docker containers to prevent harmful actions.
  • Extensible: Easily add new tools to the environment.
  • Text-based: Compatible with modern LLM-based agents.

Early Results and Future Work

Initial experiments show promising results. While current AI tools struggle with complex debugging tasks, debug-gym-enabled agents show significant improvement. For example, on the SWE-bench Lite benchmark, agents with debugging tools performed better than those without.

Figure 2: The success rate represents the percentage of the 300 SWE-bench Lite issues resolved, comparing between agents with and without debugging tools.

Next Steps

Microsoft plans to:

  1. Fine-tune LLMs for interactive debugging.
  2. Develop specialized data for training debugging agents.
  3. Open-source debug-gym to encourage community collaboration.

Conclusion

Debug-gym represents a significant step forward in AI-assisted debugging. By enabling AI agents to interactively seek information and propose fixes, it has the potential to drastically reduce developers' debugging workload. For more details, check out the technical report and GitHub repository.

Related News

August 18, 2025•Kaydence Shum

Lenovo Wins Frost Sullivan 2025 Asia-Pacific AI Services Leadership Award

Lenovo earns Frost Sullivan's 2025 Asia-Pacific AI Services Customer Value Leadership Recognition for its value-driven innovation and real-world AI impact.

AI
Lenovo
Asia-Pacific
August 18, 2025•Unknown

Baidu Wenku GenFlow 2.0 Revolutionizes AI Agents with Multi-Agent Architecture

Baidu Wenku's GenFlow 2.0 introduces a multi-agent system for parallel task processing, integrating with Cangzhou OS to enhance efficiency and redefine AI workflows.

AI
MultiAgent
Baidu

About the Author

David Chen

David Chen

AI Startup Analyst

Senior analyst focusing on AI startup ecosystem with 11 years of venture capital and startup analysis experience. Former member of Sequoia Capital AI investment team, now independent analyst writing AI startup and investment analysis articles for Forbes, Harvard Business Review and other publications.

Expertise

Startup Analysis
Venture Capital
Market Research
Business Models
Experience
11 years
Publications
200+
Credentials
2
LinkedInTwitter

Agent Newsletter

Get Agentic Newsletter Today

Subscribe to our newsletter for the latest news and updates