MistralAI has announced Mistral 7B v0.2, a new base model with three headline changes: the context window grows from 8K to 32K tokens, the RoPE theta is raised to 1e6, and sliding-window attention is dropped. The announcement, made at SHACK15sf, has sparked excitement across the tech community, with collaborations and integrations already underway, including with llama_index and on platforms such as argilla_io. Performance is improving in parallel: the MLX 0.7 → 0.8 update lifts tokens per second on the M2 Ultra from 64.3 to 74 for 4-bit Starcoder 7B and from 73.1 to 83.2 for 4-bit Mistral 7B, thanks to Nanobind (which removes some Python overheads) and faster RMS and layer norms, with 90+ tokens per second promised in future updates. In addition, an ORPO fine-tune of Mistral 7B v0.1 trained on the argilla_io DPO Mix 7K dataset is now available on the Hub. Together, these developments underscore MistralAI's position at the forefront of AI model innovation.
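To see why raising the RoPE theta matters for long context, here is a minimal sketch of the standard rotary-embedding frequency computation. The head dimension of 128 and the v0.1 theta of 1e4 are assumptions for illustration, not values stated in the announcement:

```python
import math

def rope_inv_freq(dim: int, theta: float) -> list:
    """Standard RoPE inverse frequencies: theta^(-2i/dim) for i in [0, dim/2)."""
    return [theta ** (-2.0 * i / dim) for i in range(dim // 2)]

# Assumed head dim of 128; v0.2 sets theta = 1e6 (v0.1's 1e4 is an assumption).
old = rope_inv_freq(128, 1e4)
new = rope_inv_freq(128, 1e6)

# A larger theta stretches the longest rotation period, so positional phases
# stay distinguishable over many more tokens -- one way to support a 32K
# context without a sliding window.
longest_period_old = 2 * math.pi / old[-1]
longest_period_new = 2 * math.pi / new[-1]
```

The slowest-rotating dimension's period grows with theta, which is why long-context variants of RoPE-based models typically ship with a larger base.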
4-bit Mistral 7B 90+ toks-per-sec coming soon to MLX LM (on M2 Ultra) https://t.co/PRpafO0zQ8 https://t.co/ybzcVLd5I1
This is huge news. Mistral 7B was already the best model in its size class, and these improvements are a huge step up. I’ll be re-training many of my current fine-tunes over this model ASAP. https://t.co/NXqcKjQzHP
B R E A K I N G Mistral just announced Mistral-7B-v0.2 - New base Model - 32K context window (instead of 8k) - Theta (RoPE) = 1e6 - No sliding window https://t.co/tEnZOD3xtL
yo @MistralAI dropping a new model today!! https://t.co/KHYnwASqNM
Mistral just announced at @SHACK15sf that they will release a new model today: Mistral 7B v0.2 Base Model - 32k instead of 8k context window - Rope Theta = 1e6 - No sliding window https://t.co/iAuEUEOw5K
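The changes listed in the tweet above can be summarized as a config diff. This is illustrative only: the field names follow the Hugging Face `MistralConfig` convention, and the v0.1 values marked as assumptions are not part of the announcement:

```python
# Illustrative sketch, not the official release config.
mistral_7b_v01 = {
    "max_position_embeddings": 8 * 1024,   # 8K context, per the announcement
    "rope_theta": 1e4,                     # assumed v0.1 default
    "sliding_window": 4096,                # assumed v0.1 window size
}
mistral_7b_v02 = {
    "max_position_embeddings": 32 * 1024,  # 32K context
    "rope_theta": 1e6,                     # as announced
    "sliding_window": None,                # no sliding window
}
```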
Mistral casually dropping a new model at the @cerebral_valley hackathon https://t.co/UI2ypNmfdl
Mistral-7B-v0.2 just dropped with a full house 🔥 https://t.co/dT09nvTfra
Excited for @MistralAI + @llama_index collabs (and Colabs) 🦙🔥 Thanks @sophiamyang for dropping by! https://t.co/YxjtI3xMbN
⚡️ ORPO fine-tune of Mistral 7B v0.1 from @MistralAI using @argilla_io DPO Mix 7K! 🤗 Available in the Hub at https://t.co/yzAMAypjuW https://t.co/dGNmeuKliS
MLX 0.7 → 0.8 Tokens per second on M2 Ultra - 4-bit Starcoder 7B, 64.3 → 74 - 4-bit Mistral 7B, 73.1 → 83.2 Thanks to - Nanobind got rid of some Python overheads - Fast RMS norm and layer norm Look at that Mistral go: https://t.co/wOT8id8aiz
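For context, the MLX benchmark numbers quoted above work out to roughly 15% and 14% relative speedups. A quick back-of-the-envelope calculation (not from the announcement):

```python
def pct_speedup(before: float, after: float) -> float:
    """Relative throughput gain in percent."""
    return (after - before) / before * 100

# MLX 0.7 -> 0.8 tokens/sec on M2 Ultra, from the numbers quoted above
starcoder_gain = pct_speedup(64.3, 74.0)   # ~15%
mistral_gain = pct_speedup(73.1, 83.2)     # ~14%
```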