Princeton's NLP Lab's SWE-agent Outperforms Devin with 12.3% Resolve Rate on SWE-bench

SWE-Agent is a fully autonomous coding agent WITH TOOLS. Simply drop a GitHub issue URL and it'll replicate the issue, attempt to solve it, and prepare a fix. Here's a full review and tutorial 🎥👇 https://t.co/WBCsbnapcn

Alex Yanko 🇺🇦@LeopolisDream

3 mo

SWE-agent turns LMs (e.g. GPT-4) into software engineering agents that can fix bugs and issues in real GitHub repositories. https://t.co/ABINx7387H https://t.co/oXfykxyZ6E

AshutoshShrivastava@ai_for_success

3 mo

SWE-agent, from Princeton’s NLP lab, is the best open-source alternative to Devin. Demo, easy setup guide, and everything you need to know in this 🧵 https://t.co/bcWSjYFHKB

Karthik Narasimhan@karthik_r_n

3 mo

SWE-agent is finally out. A few highlights: 1. Agent-Computer Interface (ACI) design will be critical for the success of AI agents, much like HCI is critical for how effective humans are with computers. 2. You can use SWE-agent out of the box on any github issue. (1/2) https://t.co/Cbh7qUR6Ei https://t.co/5LdbsVkbye

Rohan Paul@rohanpaul_ai

3 mo

Another nice project - SWE-agent ✨ 📌 SWE-agent turns LMs (e.g. GPT-4) into software engineering agents that can fix bugs and issues in real GitHub repositories. 📌 There are two steps to the SWE-agent pipeline. First SWE-agent takes an input GitHub issue and returns a pull… https://t.co/p7reEDHogM

Harry Tormey 🇮🇪 | 🇺🇸| 🇺🇦@htormey

3 mo

Seems like the creators of SWEBench (benchmark used by Devin), just updated their leaderboard and released an open source competitor to Devin, SWE-Agent, that has a 12.3% resolve rate. Devin's was 13.86% and is not public or open source. They also published a paper to go with… https://t.co/S5Q8W0nTfm

TheAIGRID@TheAiGrid

3 mo

New OPEN SOURCE Software ENGINEER Agent Outperforms ALL! (New SWE AGENT!) https://t.co/hxmerRj8AA

Alex Kolicich@AlexKolicich

3 mo

Impressive results from @jyangballin at Princeton releasing an OSS SWE agent at near-parity with Devin (reported). Not only is Devin closed source but you can't even use it due to the waitlist! SWE Agent is open-source and free for everyone. https://t.co/JtN6LTZKSS https://t.co/JTPJ5R4nB5

Shunyu Yao@ShunyuYao12

3 mo

SWE-agent led by amazing @jyangballin @_carlosejimenez, first authors of SWE-bench. Besides the code base, also check out our Discord https://t.co/BogeKNYmjP https://t.co/tcktoanGch

Shunyu Yao@ShunyuYao12

3 mo

Extremely excited to open-source our SWE-agent that achieves SoTA on SWE-bench😃 Turns out ReAct + Agent-Computer Interface (ACI) can go a long way, very excited about the implications for SWE and beyond! https://t.co/wnlk2HhYo2

Andrew Curran@AndrewCurran_

3 mo

Open source Devin, with very impressive numbers. From the thread: 'letting SWE-agent only view 100 lines at a time was better than letting it view 200 or 300 lines and much better than letting it view the entire file'. https://t.co/mQsplpewWw

Carlos@_carlosejimenez

3 mo

SWE-Agent is an open-source software engineering agent with a 12.3% resolve rate on SWE-Bench! Check out SWE-agent in action at https://t.co/1NNL526gMy Repo: https://t.co/LsgeVvD1UC https://t.co/KZDmFw67l3

John Yang @ ICLR 🇦🇹@jyangballin

3 mo

SWE-agent is our new system for autonomously solving issues in GitHub repos. It gets similar accuracy to Devin on SWE-bench, takes 93 seconds on avg + it's open source! We designed a new agent-computer interface to make it easy for GPT-4 to edit+run code https://t.co/CTzMxDiouH https://t.co/VW9FuZGIUf
