Feb 26, 05:11 PM

Hugging Face Releases New AI Datasets Including OpenHermesPreferences 1M, Textual Human Feedback Dataset, and Analysis of Spaces from 20K Code Files

Hugging Face, a popular platform for AI datasets and models, has recently seen the release of several new datasets. These include a large open AI preference dataset called OpenHermesPreferences 1M, a public textual human feedback dataset, and an analysis of spaces dataset created from 20K code files. The datasets aim to provide valuable resources for the AI community and showcase the power of open science and collaboration.

#Hugging Face #OpenHermesPreferences 1M

Written with ChatGPT (GPT-3).

Sources

Blaze (Balázs Galambosi)@gblazex
4 mo
When @argilla_io & @huggingface embraces we get a great new public textual human feedback dataset https://t.co/fBBPlNIs4L
Daniel Vila Suero@dvilasuero
4 mo
So much fun to build this dataset together with our friends @huggingface Open science, software and collaboration are unstoppable 🚀 https://t.co/2RH4KBu4CK
Philipp Schmid@_philschmid
4 mo
Introducing OpenHermesPreferences 1M! 🦋 We just released the largest open AI preference dataset on Hugging Face! 🤯 Together with @argilla_io, we extended the OpenHermes (@Teknium1) dataset into a pair-wise comparison dataset for RLHF and DPO. 🧠 Dataset: 📦 Size: ~1 million AI… https://t.co/O5iucdR62Z
Weyaxi@Weyaxi
4 mo
🎉 New blogpost in @huggingface 🌌 Analysis of Spaces in Hugging Face I scraped 20K spaces' code files and combined them into one dataset, showcasing meaningful statistics 📶 📝 Blogpost: https://t.co/5emy8wJEWj 📊 Dataset: https://t.co/0rpIQhGd94 https://t.co/cPiqKJRVif
Qiao Jin, MD@DrQiaoJin
4 mo
Thank you @katieelink! This project benefits a lot from the open-source communities building biomedical datasets and models on @huggingface. https://t.co/QlRkxvFPQp

Hugging Face Releases New AI Datasets Including OpenHermesPreferences 1M, Textual Human Feedback Dataset, and Analysis of Spaces from 20K Code Files

Similar Stories

Sources

Hugging Face Releases New AI Datasets Including OpenHermesPreferences 1M, Textual Human Feedback Dataset, and Analysis of Spaces from 20K Code Files