© 2024 DeepNFTValue, Inc. All rights reserved.

Similar Stories

OpenAI Introduces GPT-4-Based CriticGPT to Enhance AI Code Accuracy, Outperforms Humans 60% of the Time
Authors
34
7 days
AI
Business
Tech
New Claude 3.5 Sonnet AI Model Surpasses GPT-4o in Performance Metrics, More Than Twice as Cost-Effective
Authors
5
14 days
AI
Tech
Claude 3.5 Achieves 40% on SWE-Bench Lite, Outperforms GPT and Gemini Pro
Authors
5
7 days
AI
Software
Tech
Anthropic AI's Claude 3.5 Sonnet Revolutionizes Coding and AI Tasks, Outperforming Competitors
Authors
7
2 days
AI
Business
Tech
OpenAI's ChatGPT to Reach 'PhD-Level Intelligence' in Future Update within 1.5 Years
Authors
28
14 days
AI
Tech

Apr 1, 04:47 PM

Open-Source Devin Alternative Built on GPT4 Shows Promising Results on SWE Bench

Authors

6

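A new open-source AI coding agent, described by commenters as an open-source counterpart to Devin, has shown promising results on a software engineering benchmark. According to posts about the release, the agent (referred to in the thread as SWE-agent, from Princeton) scored 12.29% on 100% of the SWE Bench test set, compared with Devin's 13.84% on 25% of the set. It uses GPT4, and one commenter speculated that results could improve with GPT5. The posts credit John and team, and note that the work currently targets GitHub repository issues, with broader generalization seen as a possible next step.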
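The two percentages are measured over different portions of the test set, so they are not directly comparable as raw counts. A minimal sketch of the arithmetic follows. The test set size of 2,294 tasks is an assumption that is not stated in this article, and `resolved_count` is a hypothetical helper written for illustration only.

```python
def resolved_count(rate_percent: float, subset_fraction: float, test_set_size: int) -> float:
    """Approximate number of tasks resolved, given a score on a fraction of the test set."""
    evaluated = test_set_size * subset_fraction  # tasks actually attempted
    return evaluated * rate_percent / 100.0


# Assumption: size of the SWE Bench test set (not given in the article).
TEST_SET_SIZE = 2294

# Figures quoted in the summary above.
open_source = resolved_count(12.29, 1.00, TEST_SET_SIZE)  # scored on 100% of the set
devin = resolved_count(13.84, 0.25, TEST_SET_SIZE)        # scored on 25% of the set

print(f"open-source agent: about {open_source:.0f} tasks resolved")
print(f"Devin:             about {devin:.0f} tasks resolved")
```

Under that assumption, the similar headline rates correspond to very different numbers of attempted and resolved tasks, which is why the posts stress the 100% versus 25% coverage.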

#Devin #SWE Bench #John

Written with ChatGPT (GPT-3).

Rich Hemming@S_A_R_Lab
3 mo
Open source with results close to Devin #ai #coding https://t.co/eNxX1dk8FO
Blaze (Balázs Galambosi)@gblazex
3 mo
Exciting open source Devin from Princeton https://t.co/2eiigNO0Pn
Andrew Curran@AndrewCurran_
3 mo
Open source Devin, with very impressive numbers. From the thread: 'letting SWE-agent only view 100 lines at a time was better than letting it view 200 or 300 lines and much better than letting it view the entire file'. https://t.co/mQsplpewWw
Daniel Han@danielhanchen
3 mo
An open source Devin getting 12.29% on 100% of the SWE Bench test set vs Devin's 13.84% on 25% of the test set! This uses GPT4, so imagine what GPT5 could do! Although this works on Github repo issues, I'm assuming the next step is a full generalization! Great work John and team! https://t.co/BQW4mp7koN
Netrunner@thenetrunna
3 mo
OPEN SOURCE AND DECENTRALIZE AI !!!!!!!!!!!!! https://t.co/p3Gi9C2188
Minion@0xminion
3 mo
decentralized ai is a joke! https://t.co/m7nbV9Vf6L https://t.co/uf8VZyXxUc https://t.co/0kOiH4BfWV
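The observation quoted in Andrew Curran's post, that showing the agent 100 lines at a time worked better than 200, 300, or the whole file, can be illustrated with a simple windowed file view. This is only a sketch of the general idea, not the project's actual code; `view_window` and `scroll_down` are hypothetical names.

```python
from typing import List


def view_window(lines: List[str], start: int, window: int = 100) -> List[str]:
    """Return at most `window` lines beginning at 0-based index `start`."""
    # Clamp the start position so it always falls inside the file.
    start = max(0, min(start, max(len(lines) - 1, 0)))
    return lines[start:start + window]


def scroll_down(start: int, window: int = 100) -> int:
    """Start index of the next window after the current one."""
    return start + window


# Example: a 250-line file viewed 100 lines at a time.
file_lines = [f"line {i}" for i in range(250)]
first = view_window(file_lines, 0)
last = view_window(file_lines, scroll_down(scroll_down(0)))
print(len(first), len(last))
```

The window size is a parameter here so that the 100, 200, and 300 line settings mentioned in the post could be compared in the same way.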
