AI Model Bard Leads ELO Rankings Amid Controversy Over

If Bard is higher in the @lmsysorg Arena benchmark because of web RAG then obviously it’s a flawed comparison. 🍎 to 🍊 https://t.co/vll8auovbM

Anjney Midha@AnjneyMidha

5 mo

Bard is the only model in this list with Internet access, which the table should make clearer As @appenz aptly put it, open book vs closed book exams should be graded differently https://t.co/cvKJGZOOSW

Bindu Reddy@bindureddy

5 mo

Mystery solved - Bard has access to the web while the rest of the services don’t! This gives it a inherent advantage https://t.co/UanRrCKmiB

Hrishi@hrishioa

5 mo

Bard is definitely winning in the ELO rankings because it can do web searches, and multiple ones at the same time. It's at least contributing. Can't seem to access this functionality via the API. @lmsysorg how are you guys doing it? https://t.co/xkPA0aQR7G

Igor Babuschkin@ibab_ml

5 mo

This is a misleading result. Bard uses RAG (access to Google search) here while the other top competitors like GPT-4, Mistral Medium and Claude are not using search. Access to recent information through search is an advantage that needs to be controlled for or at least pointed… https://t.co/ugQ1x7LMfI

Similar Stories

AI Model Bard Leads ELO Rankings Amid Controversy Over Web Search Capability

Similar Stories

Sources

AI Model Bard Leads ELO Rankings Amid Controversy Over Web Search Capability