EleutherAI has released a new T5 model, Pile-T5, trained on 2 trillion tokens from the Pile using the Llama tokenizer. The release includes intermediate checkpoints and delivers a significant boost in benchmark performance. Development was led by Aran Komatsuzaki and Lintang Sutawika, advised by Colin Raffel, and the project, described as a long labor of love, also benefited from compute resources contributed by @EMostaque. The model is noted for its reproducibility and its potential utility in both natural language and code applications. The announcements for this fully open release also acknowledge @ShayneRedford's work on FLAN and TeraflopAI's efforts to provide the open-source community with permissively licensed, commercially usable datasets.
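For readers who want to experiment with the release, the following is a minimal sketch of loading such a checkpoint with Hugging Face transformers. The hub ID "EleutherAI/pile-t5-base" and the available model sizes are assumptions not stated in the posts below; check EleutherAI's Hugging Face page for the exact checkpoint names.

```python
# Minimal sketch: load a Pile-T5 checkpoint via Hugging Face transformers.
# The repository ID below is an assumption based on the announcement; the
# actual hub IDs and sizes should be confirmed on EleutherAI's model page.
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

model_id = "EleutherAI/pile-t5-base"  # assumed ID; larger variants may exist

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForSeq2SeqLM.from_pretrained(model_id)

# The base checkpoint is a span-corruption (pretrained) model, so raw
# generations are illustrative only; downstream use normally involves
# finetuning, as with the original T5.
inputs = tokenizer("The Pile is a large, diverse dataset for", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=20)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```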
Data is what makes the model. We at @TeraflopAI are working hard to provide the open-source community with permissible commercially licensed datasets for training. Congrats to @arankomatsuzaki, @lintangsutawika, and @colinraffel. And thanks to @ShayneRedford for his work on FLAN. https://t.co/DheOvHTeil
Glad to see our very own @arankomatsuzaki pushing the boundaries of open-source research with a new T5 release using our data. Congrats to @lintangsutawika and @colinraffel. And @ShayneRedford for his great efforts on FLAN. https://t.co/0oZeOZhZhs
A long labor of love by the team for a new high quality T5 model. Happy to have contributed compute resources for this, will be useful for research & more as a fully open release https://t.co/pEoshSkPWq
Having teased this a couple times, I'm excited to share that @lintangsutawika and @arankomatsuzaki, advised by @colinraffel, have retrained T5 using a more modern dataset and tokenizer, and for longer. This produces a better general model for both NL and code applications. https://t.co/tLMZ18xPRz
Great release by @lintangsutawika, @arankomatsuzaki , and @colinraffel! Finally a fully-reproducible T5 model: https://t.co/wzLtG6p8JS
Introducing Pile-T5! We (EleutherAI) are thrilled to open-source our latest T5 model trained on 2T tokens from the Pile using the Llama tokenizer. ✨ Featuring intermediate checkpoints and a significant boost in benchmark performance. Work done by @lintangsutawika, me… https://t.co/qvoSWyAVjb