Groq, a tech company, has introduced a groundbreaking Language Processing Unit (LPU) that significantly outpaces traditional processors, achieving up to 483 tokens per second. The development has been met with widespread acclaim from users and experts in the field. Developed by former members of Google's TPU team, the LPU delivers speeds that surpass those of the GPUs and CPUs traditionally used for inference, and does so at low cost. This is particularly relevant for applications built on large language models (LLMs), where low latency and high throughput are crucial. Unlike GPUs, Groq's LPU does not rely on high-bandwidth memory (HBM); it uses SRAM instead, which is considerably faster. Reported results include running Mixtral 8x7B-32k at 422.73 tokens/s and 116 tokens/s on Llama 2 70B in another test, setting a new benchmark for speed and efficiency in the AI and digital processing landscape.
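The tokens-per-second figures quoted throughout are easy to measure yourself. A minimal sketch, assuming a streaming client that yields one token per item (the generator here is a stand-in, not a real API call):

```python
import time

def measure_throughput(stream):
    """Count tokens from a streaming iterable and return tokens/second.

    `stream` is any iterable yielding one token per item (a hypothetical
    client interface; substitute your provider's streaming API).
    """
    start = time.perf_counter()
    n_tokens = sum(1 for _ in stream)
    elapsed = time.perf_counter() - start
    return n_tokens / elapsed if elapsed > 0 else float("inf")

# Example with a stand-in generator instead of a live API call:
fake_stream = (f"tok{i}" for i in range(1000))
tps = measure_throughput(fake_stream)
print(f"{tps:.0f} tokens/s")
```

In practice you would time from the request start (or from the first token, to separate time-to-first-token from decode throughput) rather than around a local generator.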
Groq is the fastest LLM platform, with a processing speed of 500 tokens per second - faster than ChatGPT-4 and Google Gemini. Here's a complete breakdown + live comparison with ChatGPT-4: 🧵 https://t.co/9hjLwf6fgM
Just tested @GroqInc's first-ever LPU (language processing unit) on Llama 2 and Mixtral. Wow, wow, super fast! A total game changer! https://t.co/dRmIFM9sbH https://t.co/VhWGP8YN9Y
Thanks to the @GroqInc team for getting me onto their platform rapidly! I tested Groq and it is impressively fast - unbelievably so. I ran inference on Llama 2 70B and it returned 76 tokens in 0.65 s - 116 tok/s, which is wild. But I then ran inference with max_tokens = 1 and it returned in…
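The 116 tok/s figure in this tweet follows directly from the two quoted numbers; a quick check:

```python
# The quoted measurements: 76 tokens returned in 0.65 seconds.
tokens, elapsed = 76, 0.65
tps = tokens / elapsed
print(f"{tps:.1f} tokens/s")  # 76 / 0.65 ≈ 116.9, matching the quoted 116 tok/s
```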
How fast is the new LPU technology of Groq? This fast: https://t.co/SumUm9kLyM https://t.co/geDSwDbv7u
Groq's LPU is faster than GPUs, handling requests and responding more quickly. Groq's LPUs don't depend on the high-bandwidth memory (HBM) that Nvidia GPUs rely on; they use on-chip SRAM, which is roughly 20 times faster. Since inference runs use… https://t.co/f0jMQCkNTv
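The bandwidth argument in this tweet can be sketched with a simple memory-bound model of autoregressive decoding: generating each token requires streaming the full set of weights through memory, so single-stream speed is capped at bandwidth divided by weight bytes. All numbers below are illustrative assumptions, not published Groq or Nvidia specifications:

```python
def tokens_per_s(weight_bytes: float, bandwidth_bytes_per_s: float) -> float:
    """Upper bound on single-stream decode speed when memory-bound:
    every generated token forces one full pass over the weights."""
    return bandwidth_bytes_per_s / weight_bytes

# Illustrative: a 70B-parameter model at 2 bytes/param (fp16) = 140 GB of weights.
weights = 70e9 * 2
hbm = 3.35e12   # ~3.35 TB/s, in the range of a modern HBM GPU (assumption)
sram = 80e12    # aggregate on-chip SRAM bandwidth across many chips (assumption)

print(f"HBM-bound:  {tokens_per_s(weights, hbm):.0f} tok/s")
print(f"SRAM-bound: {tokens_per_s(weights, sram):.0f} tok/s")
```

This is only a first-order bound; batching, tensor parallelism, and KV-cache traffic all shift the real numbers, but it shows why raw memory bandwidth dominates single-stream decode latency.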
Groq, a new competitor to ChatGPT, has rolled out a Language Processing Unit (LPU). This cutting-edge technology boasts speeds close to 500 tokens per second, setting a new benchmark for speed and efficiency in digital processing. https://t.co/c87F7RkU5F
I was completely blown away by @GroqInc demo running Mixtral 8x7B-32k at 422.73 tokens/s 🔥🚀 “How can groq be so fast?” This video is normal speed! 🤯 https://t.co/pGkXgzXd2r
Compare the speed here with @SindarinTech (https://t.co/t1ZcmUdiRN). Groq's inference speed is incredible, but the fact that @SindarinTech's voice-to-voice latency is so much lower (without Groq!) is a case in point that voice-to-voice is more than the sum of its parts. https://t.co/pSYYL8itbY
Inference speed on @GroqInc is insane - it just blows everyone else out of the water. When GPT-4 came out, everything generated by GPT-3 immediately started feeling primitive in comparison. Now everything producing fewer tokens/s than Groq feels viscerally slow. https://t.co/LkWCgBMN4R
Groq is just Karpathy typing really fast.
A really good insight by @mattshumer_ on the spectacular new Groq system. It is a lightning-fast AI answers engine. It uses LPUs (Language Processing Units) and is faster than GPUs for serving many LLMs. It is a quantum leap in speed. https://t.co/E3p57CGIiD
Today I am exclusively using @GroqInc side-by-side with @ChatGPTapp. Wild how much faster Groq is.
After personally verifying @GroqInc's performance, I'm astounded. Groq's breakthroughs in inference speed and token context length mark a leap in LLM evolution, setting new efficiency and processing benchmarks with their LPU outshining traditional GPUs and CPUs. Approx. 500 tokens/s 👏 https://t.co/JV7xSYMuJJ
Groq (not to be confused with @elonmusk's Grok) is faster than any other LLM provider, according to some demos. This race is just getting started. I won't be surprised if $GOOGL, @openAI, $MSFT, $META, and @X make LLMs a total commodity https://t.co/J9NlJVD2aQ
The inference speed on the @GroqInc examples didn't look real. So I tested it myself and I don't even know what to say about this. Need to take a closer look at the technical papers. For now, all I can think about is the complex use cases this, and the support of millions of… https://t.co/4ACMHDtiP1
Groq has shaken up the AI world with a new LPU processor developed by former members of Google's TPU team! At the time of this writing, it is super fast and super cheap! I think this is a massive development for the DSPy and Weaviate story, so wanted to quickly share the state…
482 tokens per second. Blown away by the speed! Love it @GroqInc! Also, I am a power user of ChatGPT and Perplexity AI but none of them gave me results as good as Groq! Wild 🤯 https://t.co/RiOBr93ITa
Just wanted to confirm: the hype around Groq is absolutely real! It's incredibly fast. Experience the speed of the world's first Language Processing Unit – meet @GroqInc. See how low latency can go... https://t.co/NYwgk5VO5w
Groq is serving the fastest responses I've ever seen. We're talking almost 500 tokens/s! I did some research on how they're able to do it. Turns out they developed their own hardware that utilizes LPUs instead of GPUs. Here's the skinny: Groq created a novel processing unit known as… https://t.co/mgGK2YGeFp
Latency with #LLMs is a problem? Enter @GroqInc: an incredible 483 tokens per second 😮 with Mixtral. https://t.co/DFyR2buzXQ
The #LPU ™ Inference Engine delivers “Wow!” performance and precision at scale for LLMs and other generative AI language solutions. Stop by booth #74 at #ADOD24 this week to see for yourself. #BetterOnGroq https://t.co/WMYUWyKG3D