SWE-bench & SWE-agent | Data Brew | Episode 44

2025/4/17

In this episode, Kilian Lieret, Research Software Engineer, and Carlos Jimenez, Computer Science PhD Candidate at Princeton University, discuss SWE-bench and SWE-agent, two groundbreaking tools for evaluating and enhancing AI in software engineering.

Highlights include:
- SWE-bench: A benchmark for assessing AI models on real-world coding tasks.
- Addressing data leakage concerns in GitHub-sourced benchmarks.
- SWE-agent: An AI-driven system for navigating and solving coding challenges.
- Overcoming agent limitations, such as getting stuck in loops.
- The future of AI-powered code reviews and automation in software engineering.

Data Brew by Databricks

Duration: 36:22