The TinyLlama project has trained a 1.1B-parameter Llama model on 3 trillion tokens using 16 A100-40G GPUs, a run it aimed to complete within 90 days. The model follows the Llama 2 architecture and is compatible with 🤗 Transformers.js; its small size lets it run fast with low memory and compute requirements. It can now chat and hold a conversation, showing potential for fine-tuning on specific tasks. The latest version of mlx_llm supports TinyLLaMA and Phi2 models, enabling them to run on an 8GB MacBook with Apple Silicon.
Just released a new version of mlx_llm with support for TinyLLaMA and Phi2 models. They can run on an 8GB MacBook with Apple Silicon, so you can have a chat with them without a Pro model. Another step towards local computing! https://t.co/2UPzWFnQsK #mlx #apple #llm
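The tweet doesn't include the mlx_llm calls themselves. As a rough sketch of the same local-inference workflow, Apple's mlx-lm package (a related but distinct library, so the API here is not mlx_llm's) loads a converted TinyLlama in a couple of lines; the mlx-community repo name is an assumption:

```python
# Sketch with Apple's mlx-lm package, not mlx_llm itself -- the APIs differ.
# The mlx-community model repo below is an assumed conversion of TinyLlama chat.
from mlx_lm import load, generate

model, tokenizer = load("mlx-community/TinyLlama-1.1B-Chat-v1.0")
response = generate(model, tokenizer, prompt="What is Apple Silicon?", max_tokens=100)
print(response)
```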
TinyLlama is amazing! I have been waiting for a <3B permissive model to come out. Fine-tuning small LLMs to do very specific tasks has so much potential. I loaded it up in my prompt upsampler and it works shockingly well. 🧵 https://t.co/Oj5gKPcACu
tinyllama is a 1.1B parameter model trained on 3T tokens. it now knows how to chat and can hold a conversation. model links and more... https://t.co/EaCkfI4Bqv
Just using MLX to fine-tune TinyLlama with LoRA locally on an 8 GB Mac Mini. Code: https://t.co/BCQZAWHCTA That's the 1.1B-parameter TinyLlama, which just finished training on 3T tokens. Happy new year! Looking forward to more local LLMs in 2024. https://t.co/kACMRZ6Suw
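The linked code is the LoRA fine-tuning example from Apple's mlx-examples; the core idea is to freeze the pretrained weights and train only a low-rank update on top of them. A minimal sketch of such a LoRA layer in MLX (the rank, scale, and initialization below are the common recipe, not necessarily the exact values that script uses):

```python
import math

import mlx.core as mx
import mlx.nn as nn


class LoRALinear(nn.Module):
    """A frozen linear layer plus a trainable low-rank update: y = Wx + scale * (x A) B."""

    def __init__(self, linear: nn.Linear, rank: int = 8, scale: float = 2.0):
        super().__init__()
        self.linear = linear  # base weights, kept frozen during fine-tuning
        out_dims, in_dims = linear.weight.shape
        self.scale = scale
        # A starts small and random, B starts at zero, so training begins
        # exactly at the base model's behavior.
        bound = 1.0 / math.sqrt(in_dims)
        self.lora_a = mx.random.uniform(low=-bound, high=bound, shape=(in_dims, rank))
        self.lora_b = mx.zeros((rank, out_dims))

    def __call__(self, x):
        y = self.linear(x)
        z = (x @ self.lora_a) @ self.lora_b
        return y + self.scale * z
```

Typically only a handful of attention projections get wrapped this way, which is what keeps the trainable parameter count and optimizer state small enough for 8 GB of unified memory.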
TinyLlama is a 1.1B model with the Llama 2 architecture, trained on 3 trillion tokens. Its small size means it can run fast with low memory and compute requirements. https://t.co/yuyAYhZMMh
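For a back-of-envelope sense of that size (not from the tweet): 1.1B parameters come to about 2.2 GB of weights at fp16 and roughly 0.6 GB at 4-bit quantization, which is why the model fits comfortably on the 8 GB machines mentioned above.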
TinyLlama is finally here: a 1.1B Llama model trained on 3 trillion tokens! It's also compatible with 🤗 Transformers.js (see code below)! What a way to end the year! https://t.co/ILbuqqaIGV https://t.co/KnC23RUUVD
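The Transformers.js snippet that tweet points to isn't reproduced in this digest. As a stand-in, here is the equivalent in the Python 🤗 Transformers library (the model id is the public TinyLlama/TinyLlama-1.1B-Chat-v1.0 checkpoint; the prompt and sampling settings are illustrative):

```python
from transformers import pipeline

# Text-generation pipeline around the 1.1B chat checkpoint.
pipe = pipeline("text-generation", model="TinyLlama/TinyLlama-1.1B-Chat-v1.0")

# Format a single user turn with the model's built-in chat template.
messages = [{"role": "user", "content": "Explain what TinyLlama is in one sentence."}]
prompt = pipe.tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)

out = pipe(prompt, max_new_tokens=64, do_sample=True, temperature=0.7)
print(out[0]["generated_text"])
```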
The TinyLlama project could be a game-changer: it's currently pretraining a 1.1B Llama model on 3 trillion tokens, and the team aims to achieve this within a span of "just" 90 days using 16 A100-40G GPUs. Training started on 2023-09-01. Now, overall, if a model… https://t.co/0Isdk5SIBZ
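A quick sanity check on that schedule: 3×10¹² tokens over 90 days on 16 GPUs works out to 3e12 / (90 × 86,400 s × 16) ≈ 24,000 tokens per second per A100, an aggressive but plausible throughput for a 1.1B-parameter model.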