Apple has made a significant advance in artificial intelligence by unveiling its work on Multimodal Large Language Models (MLLMs), including a model named MM1 designed to compete with GPT-4 and, potentially, to enhance Siri's capabilities. The work positions Apple as a serious competitor in AI, alongside major announcements from other tech giants such as Google DeepMind and OpenAI. In the same week, Elon Musk's xAI released Grok-1, a 314-billion-parameter Mixture-of-Experts (MoE) model, under the Apache 2.0 license; it was built on JAX and Rust and finished pretraining in October 2023. Grok-1's open release, and benchmark results that lag despite the model being roughly twice the size of GPT-3.5, have sparked discussion about its efficiency and potential applications. Together, these developments underscore the rapid pace of AI research and the growing investment by tech companies in multimodal AI.
AI NEWS: Apple just quietly unveiled MM1, a new LLM that competes with GPT-4 and Gemini. Plus, more developments from Elon Musk/Grok, Google DeepMind, Cognition Devin, India, Bernie Sanders, and Maisa. Here's everything going on in AI right now:
Grok-1 is now the highest-quality open-source LLM. Grok's declared MMLU score of 73% beats Llama 2 70B's 68.9% and Mixtral 8x7B's 70.6%. At 314 billion parameters, xAI's Grok-1 is also significantly larger than today's leading open-source models. @xai's Grok-1 is a Mixture-of-Experts… https://t.co/Dh6JJ1xKNC
Elon Musk's Grok LLM was just open-sourced! 🔥 It's uncensored, MASSIVE, and completely open-source. Spite is a powerful motivator lol. Here's what you need to know 👇🎥 https://t.co/hiKosjhMKr
Grok-1 feels like a YOLO model that was simply good, not great. Awesome to see it released
Grok is available on hugging face. You guys are so quick. https://t.co/jz3HHEdCZp
Musk's Grok AI goes open source https://t.co/3d6fX5LG1Q
Elon Musk’s xAI releases Grok-1 architecture, while Apple advances multimodal AI research https://t.co/H4KHKNgSlg
Grok is now available for testing. It's a 314B MoE, so you won't be able to run it 🙃 It's amazing that a 314B model gives you worse performance than open-source 13B LLMs https://t.co/UU9AjwD7Cv
Yep, thanks to @elonmusk and xAI team for open-sourcing the base model for Grok. We will fine-tune it for conversational search and optimize the inference, and bring it up for all Pro users! https://t.co/CGn6cIoivT
xAI (Twitter) does an open release of their Grok AI https://t.co/sNl6aMqEt0
Grok-1 is out on Hugging Face https://t.co/6GBcgnD1ZI
Musk's Grok AI was just released as open source, in a way that is more open than most other open models (it has open weights) but less open than what is needed to reproduce it (there is no information on training data). It won't change much; there are stronger open-source models out there. https://t.co/eFouYFVRmN
xAI releases Grok-1; the community re-releases Grok-1 on @huggingface 👀 https://t.co/iKgU7MdyIH
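For anyone who wants the raw checkpoint rather than the torrent, here is a minimal download sketch. The repo id "xai-org/grok-1" and the "ckpt-0/" weight directory are assumptions based on how the release has been widely described, not details confirmed in this thread, and the checkpoint is on the order of 300 GB, so verify both before running.

```python
# Minimal sketch: pull the Grok-1 release files from Hugging Face.
# ASSUMPTIONS to verify: repo id "xai-org/grok-1", weight dir "ckpt-0/",
# and ~300 GB of free disk for the full checkpoint.
from huggingface_hub import snapshot_download

local_dir = snapshot_download(
    repo_id="xai-org/grok-1",
    allow_patterns=["ckpt-0/*", "*.py", "*.json", "*.md"],
)
print("Grok-1 files downloaded to:", local_dir)
```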
The Dawn of Grok-1: A Leap Forward in AI Accessibility: Today marks the open release of Grok-1, a behemoth in the landscape of AI, wielding a staggering 314 billion parameters. This Mixture-of-Experts model, which emerged from the fervent efforts of xAI’s dedicated team,…
Grok-1: 314B-parameter mixture of experts (MoE) - 8 experts, top-2 selection. - The MoE implementation is different from Mixtral 8x7B: Mixtral 8x7B applies softmax over the top-2 experts, while Grok-1 applies top-2 over the softmax of all 8 experts. - Written in JAX - Training code… https://t.co/MlNTlgqO0s
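The routing difference called out above is easiest to see side by side. Below is a minimal JAX-flavored sketch with invented function names and a toy 8-expert router; it illustrates the two gating orders, not code taken from either repository.

```python
import jax
import jax.numpy as jnp

def mixtral_style_gates(router_logits, k=2):
    # Mixtral 8x7B order: take the top-k logits first, then softmax over just
    # those k, so the selected gate weights are renormalized and sum to 1.
    top_vals, top_idx = jax.lax.top_k(router_logits, k)
    return jax.nn.softmax(top_vals), top_idx

def grok_style_gates(router_logits, k=2):
    # Grok-1 order: softmax over all experts first, then keep the top-k
    # probabilities, so the selected gate weights generally do not sum to 1.
    probs = jax.nn.softmax(router_logits)
    return jax.lax.top_k(probs, k)

# Toy router scores for one token over 8 experts.
logits = jnp.array([1.2, -0.3, 0.7, 2.1, 0.0, -1.0, 0.4, 1.5])
print(mixtral_style_gates(logits))
print(grok_style_gates(logits))
```

The practical difference is the weight given to the selected experts: renormalizing over two experts (Mixtral) makes the gates sum to 1, while slicing the full 8-way distribution (Grok-1) leaves out whatever mass the router assigned to the unselected experts.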
Given Grok-1 is a 314B MoE (~80B active params), I'm genuinely surprised it scored so poorly 😑 https://t.co/tRh55WZA9e
Grok, OpenAI and Musk entered the chat!!! https://t.co/r2oAQ30xJi
Congrats @xai and @elonmusk on the open source release 🫡 Grok-1 cheat sheet 📝 - MoE architecture for 314B total params - 8 experts with 2 active - Apache 2.0 license - Trained on JAX and Rust - Finished training October 2023 - Base model, no task-specific fine-tune - No… https://t.co/ioKun85fm6
Grok is 2x the size of GPT-3.5? And still this dumb?
Grok-1 code is released. It's 8 experts, 2 selected at a time. The trend of large vocabs (131k) continues. The attention output multiplier is interesting. Overall, it's a much larger model than I'd thought, so I'm a bit surprised by the lag in performance versus other models. https://t.co/o6ZAUGLXiP https://t.co/EMTfp0vNeA
Cooling down a bit: Grok is an 8x33B model with roughly Mixtral 8x7B's performance. Still, huge respect for releasing it. To the best of my knowledge, this is the largest decoder-only model ever released. (The 1.6T-param Switch Transformer on Huggingface is an encoder-decoder.)
Elon Musk kept his word and released Grok-1 🤯 Grok-1 is a 314B Mixture-of-Experts (MoE) transformer. 🧐 What we know so far: 🧠 Base model, not fine-tuned ⚖️ Apache 2.0 license 🧮 314B MoE with 25% active per token 📊 According to the initial announcement: 73% on MMLU,… https://t.co/Ep6i8uoKDY
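The "25% active per token" figure above is just the expert ratio; here is a quick back-of-the-envelope check. It is approximate and ignores that attention and embedding weights are shared rather than per-expert, which pushes the true active count somewhat above this naive estimate.

```python
# Naive active-parameter estimate for a top-2-of-8 MoE with 314B total params.
total_params = 314e9
n_experts, active_experts = 8, 2

active_fraction = active_experts / n_experts     # 0.25 -> the "25% active" figure
naive_active = total_params * active_fraction    # ~78.5B, in line with the ~80B cited earlier
print(f"{active_fraction:.0%} active, ~{naive_active / 1e9:.0f}B params used per token")
```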
can someone who knows more about model pretraining than I do explain how grok, at roughly 7 times the size, is worse than mixtral
Grok-1 is MoE. There's a good chance fine-tunes of it will be buggy, as with Mixtral, and the benchmarks won't improve much.
Grok seems about as powerful as LLaMa 2, less powerful than GPT3.5, so probably not a big deal that the weights are open now. https://t.co/RS0C1tUjP9
HOLY SH*T @grok IS 314 BILLION PARAMETERS Mixture of 8 Experts, not RLHF'd/moralized THIS IS HUGE https://t.co/kjsvEWp5O8 https://t.co/8CGAtNpbwT
Apple researchers have hit on a new multi-modal method of quickly training large language models (LLMs) that can enable more flexible and powerful machine-learning and "AI" type systems. https://t.co/jPCwURQLFq
AI and Robotics are moving fast. So, I share the most important research every week. Here's everything you need to know and how to make sense of it:
New AI bombshells are dropping daily. Massive Developments in AI from this week: - Google just dropped SIMA - Devin, the First AI Software Engineer - RunwayML adds Lip Sync - OpenAI Sora data revelation - ChatGPT got a body. Here are 5 things you don't want to miss
The weekly AI overview is here to keep you informed on the latest news and top stories in AI. #AI #News #Overview #Week11 #AISpace (1/11) https://t.co/NKcgvRvaU4
“openai/grok” No, this is not Grok from https://t.co/KthS0SlWM4. It's a bit of humor from some folks at OpenAI. I'll say more soon. Link: https://t.co/74g6F0QZs3
This week's #AI updates: ➡️#AIAct sets global standards ➡️China’s railways get smarter with AI ➡️#ChatGPT bot now with robotic arm ➡️AI advances in surgery and boosts crime data analysis #ShareForSuccess #SwissCognitive https://t.co/FQzrWRSELu
The openai/grok repository is the code for a paper on "Grokking" by: Alethea Power, Yuri Burda, Harri Edwards, Igor Babuschkin (OpenAI) Vedant Misra (Google) Submitted From: Alethea Power on Thursday 6 Jan 2022 18:43:37 UTC https://t.co/vuGX4nw5re
Apple announces a new series of multimodal LLMs https://t.co/uEzCLH3GxO
Apple paper on developing an LLM: "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training" https://t.co/XyYtnQk3rL
Apple's strides in multimodal AI research signal a turning point in the tech giant's investment strategy #AI #Apple #artificialintelligence #Investment #investments #llm #machinelearning #MM1 #model #multimodal #research #Techgiants https://t.co/w3NLw1BxkQ https://t.co/W3PANNQ2nR
“Apple researchers achieve breakthroughs in multimodal AI as company ramps up investments” Article: https://t.co/tEP39Ay4m0
The world of AI x Robotics is progressing fast. Just this month, we got massive announcements from Google, OpenAI, Midjourney, Nvidia, PIKA, Figure 1, and more. 1/ Nvidia just released Chat with RTX https://t.co/9SMy6wWjVG
➡️ Apple introduces MM1, a powerful multimodal AI system that competes with ChatGPT and sets new industry standards. Discover more here! https://t.co/SQlMnSDheV
AI Weekly Rundown (March 9 to March 15) Major AI announcements from Meta, OpenAI, DeepMind, Apple, Cohere, and more https://t.co/5lCTfgPxhi
Apple continues to make moves in AI --unveils Multimodal Large Language Models (MLLMs) #Apple #MLLMs https://t.co/0yej6TGQxJ
AI is moving at an incredible pace. 🤯 Massive announcements from this week: - Devin - OpenAI Sora - Google Deepmind - Figure Here's what you need to know:
➡️ Apple researchers make significant strides in multimodal AI, unlocking new possibilities for innovation. Don't miss out on the breakthroughs! https://t.co/bNcyefwBss
Apple is in the LLM game! Benchmarks seem pretty comparable to Gemini and a bit behind GPT-4? Seems promising! C'mon, plz do something cool with Siri https://t.co/lxlV5X9t6A https://t.co/b8GvzwTuUz