Google has announced RecurrentGemma 9B, a new open-weights language model scaled to 9 billion parameters. It matches the quality of the transformer-based Gemma while delivering more than 25% lower latency and 6-7x higher tokens per second, and it ships in both base and instruct-tuned versions. The base model posts strong scores: 60.5 on MMLU, 73.2 on CommonSenseQA, and 39.3 on AGIEval. RecurrentGemma is built on the Griffin architecture, which combines linear recurrence with local attention and yields faster inference, particularly for long sequences or large batch sizes. Trained on 2 trillion tokens, the model is now available on platforms such as Hugging Face and Kaggle; for more details, refer to the Griffin paper.
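For readers who want to try the release, here is a minimal sketch of loading it with Hugging Face transformers. It assumes a transformers version recent enough to include RecurrentGemma support (added around v4.40) and that the 9B checkpoint IDs follow the naming of the earlier 2B release (`google/recurrentgemma-9b` and `google/recurrentgemma-9b-it`); check the Hub pages linked in the tweets below for the exact names.

```python
# Minimal sketch: loading RecurrentGemma 9B with Hugging Face transformers.
# Assumes transformers >= 4.40 (first release with RecurrentGemma support)
# and that the 9B checkpoint IDs mirror the 2B naming; verify on the Hub.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "google/recurrentgemma-9b-it"  # assumed instruct checkpoint; "-9b" for the base model

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # bf16 keeps the 9B weights at roughly 18 GB
    device_map="auto",           # requires the accelerate package
)

inputs = tokenizer("Explain linear recurrence in one sentence.", return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```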
🔥 Introducing our 9B language model, trained on 2 trillion tokens! 🚀 Based on Griffin (https://t.co/kL5TeAbmVV), it delivers: 💪 Powerful performance ⚡️ Lightning-fast inference Pretrained and instruction-tuned models are now available on HF & Kaggle! Start building today! 🏗️ https://t.co/s8sjsO51Vi
RecurrentGemma-9B is out! https://t.co/rSTQn2SlhR https://t.co/Il5UudfpZk - Uses Griffin architecture, combining linear recurrence with local attention - Downstream evals comparable to Mistral and Gemma - Faster inference, especially for long sequences or large batch sizes 1/n https://t.co/l3MLNebAzq
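To make the "linear recurrence" part of that tweet concrete, below is a toy NumPy sketch of a gated linear recurrence in the spirit of Griffin's RG-LRU layer. The gate structure and the sqrt(1 - a²) input scaling follow the Griffin paper, but the shapes, random weights, and the helper name `rg_lru` are illustrative assumptions, not the released implementation.

```python
# Toy sketch of a gated linear recurrence in the spirit of Griffin's RG-LRU.
# Illustrative only: the real layer is learned, vectorized, and hardware-fused.
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def rg_lru(x, W_a, W_x, a_log, c=8.0):
    """x: (seq_len, dim) inputs; returns (seq_len, dim) hidden states."""
    h = np.zeros(x.shape[1])
    a_base = sigmoid(a_log)                # per-channel decay in (0, 1)
    out = []
    for x_t in x:                          # one recurrence step per token
        r_t = sigmoid(x_t @ W_a)           # recurrence gate
        i_t = sigmoid(x_t @ W_x)           # input gate
        a_t = a_base ** (c * r_t)          # input-dependent effective decay
        h = a_t * h + np.sqrt(1.0 - a_t**2) * (i_t * x_t)
        out.append(h)
    return np.stack(out)

rng = np.random.default_rng(0)
seq, dim = 16, 4
x = rng.normal(size=(seq, dim))
states = rg_lru(x,
                0.1 * rng.normal(size=(dim, dim)),
                0.1 * rng.normal(size=(dim, dim)),
                a_log=rng.normal(size=dim))
print(states.shape)  # (16, 4)
```

Because the recurrent state `h` is a fixed-size vector rather than a KV cache that grows with context, per-token generation cost stays constant in sequence length, which is where the long-sequence throughput and latency gains come from.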
RecurrentGemma 9B by Google is out 🔥 ⚡️Super fast for long sequences: Good throughput+latency 👀Base and instruct tuned versions 🏆Similar quality as Gemma Check the y-axis below 🤯 Models: https://t.co/Py7CImb6el Griffin paper: https://t.co/KhWcw0euaY https://t.co/4FNQePDTbb
Welcome RecurrentGemma 9B 🔥 > Same performance as Gemma with more than 25% lower latency and 6-7x higher tokens/sec ⚡ > Base (9B) and Instruct (9B-IT) models released. > MMLU 60.5, CommonSenseQA 73.2, AGIEval 39.3 - pretty strong base model to fine-tune further. > Based on… https://t.co/J3ctP4OSlU
📣 🧠 Exciting news for researchers pushing the boundaries of efficient deep learning! We've scaled RecurrentGemma to 9 billion parameters. 🧵↓