In recent AI-sector developments, Meta's Llama 3 and Microsoft's Phi-3 have been making significant strides. Llama 3, released a week ago, has already seen community enhancements such as context windows extended from 8K to 128K tokens and inference speeds above 800 tokens per second. Microsoft's Phi-3 stands out as a small yet highly capable model, with commentators already sharing wild example use cases. Apple is also advancing in AI, focusing on lightweight models that can run on its devices, which may be integrated into the upcoming iOS 18. Additionally, Apple has released several small open-source language models and is in talks with OpenAI to incorporate its technology into new iPhone features.
Llama 3 is the latest Meta open-source LLM to arrive on the scene, so Eivind Kjosbakken decided to explore its power and offer a clear workflow for anyone who'd like to run the model locally. https://t.co/SMSK2Y6tze
ChatGPT Could Power the iPhone's AI Chatbot: Report https://t.co/0Yk8mPKrxV https://t.co/oWRdP1ULzT
Apple’s new flagship smartphone could be filled with AI features https://t.co/eTnegTICI2
iPhone 16 Pro Max To Launch With Generative AI Features: Apple Intensifies Talks With OpenAI https://t.co/5hwQjvTrmF
Apple is weighing a big decision impacting the next iPhone release: OpenAI or Google's Gemini https://t.co/6pytwsXflL
Apple's upcoming iOS 18 might revolutionize #AI interaction with on-device processing, promising more personal and secure AI features.🍏🧠 #ArtificialIntelligence #Tech #AINews https://t.co/DkLxHZyZNt
Apple is working on smaller local models, but they also want a large model to run under iOS 18 on the iPhone. They have had discussions with Anthropic, Google, and OpenAI. They have now reentered negotiations with OpenAI. Massive deal for whoever lands it. WWDC is on June 10th. https://t.co/GQl3UsUU5E
Apple shipping 2019-era AI capability in 2024. Aptly captures the company's apparent lack of AI urgency as the revolution in language-based UI portends a new hardware-software era https://t.co/EMB6mOXAlF https://t.co/OvDkC7cUXE
Apple’s iOS 18 features they will highlight at WWDC are based on an in-house model. The talks with OpenAI and Google are for a chatbot/search component. https://t.co/preidMTAPR
NEWS: Apple in talks with OpenAI to use its technology to power some new features coming to the iPhone this year, according to Bloomberg But… why not Grok? @elonmusk https://t.co/SEGowI3iqL
Apple Releases 8 Small AI Language Models To Compete With Microsoft’s Phi-3 ► https://t.co/XSPyveKMPR
Apple Offers Peek at Its AI Language Model as iOS 18 Looms - CNET https://t.co/Xq15EzFAtJ
Big week for open-source AI. In just a few days, we've seen two major models launch. Meta Llama 3 and Microsoft Phi-3, each claiming to outperform the other. So I conducted complex coding tests: 🧵 https://t.co/0MU7uCPC1t
Colossal-Inference now supports Llama 3 inference acceleration. They report a ~20% enhancement in training efficiency for Llama 3 8B and 70B and outperforming alternative inference solutions such as vLLM. This is why open-source AI matters. There are all kinds of innovations… https://t.co/BDlR2iGKo2 https://t.co/2r9WCBKQBP
Apple has submitted eight new AI models to the Hugging Face Hub, an online platform for open-source AI. They are similar in kind to models like OpenAI's GPT-4, but far smaller: Apple is clearly building models designed to run on our devices. https://t.co/JHVvnxdwCB
LLaMA3 and Phi3 made a splash in the LLM Arena this week. But how strong is their visual understanding? ⚡We release LLaMA3-Vision and Phi3-Vision, models that beat larger LLM competitors. Github: https://t.co/o4AVEn0AYF HF: https://t.co/AujQeYLMlG https://t.co/NJqNT9K8he
Top stories in AI today: -Chinese AI model bests GPT-4 Turbo -Sanctuary releases next-gen Phoenix -Access Llama 3 on your Phone -Synthesia unveils ‘Expressive Avatars’ -6 new AI tools & 4 new AI jobs Read more: https://t.co/82bSB4PgMk https://t.co/HMt0uEcAbg
Llama 3 extended to almost 100,000-token context! ✅ By combining PoSE and continued pre-training on the Llama 3 8B base for 300M tokens, the community (@winglian) managed to extend the context from 8K to 64K. 🚀 Applying RoPE scaling afterward led to a supported context window of… https://t.co/3eICXS95eA
#FPTech: iPhone’s AI: Apple launches OpenELM, open-source model that runs locally on device, without cloud https://t.co/bVzNAJpA8X
This model extends LLama-3 8B's context length from 8k to > 160K, developed by @Gradient_AI_ , sponsored by compute from @CrusoeEnergy . Demonstrates that SOTA LLMs can learn to operate on long context with minimal training (< 200M tokens) by appropriately adjusting RoPE theta.… https://t.co/2e8NnarhfS
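The two posts above both attribute long-context Llama 3 variants to adjusting RoPE theta. A minimal sketch of why raising the base theta helps (the head dimension and theta values are illustrative choices, not figures from these posts): larger theta stretches every rotary wavelength, so longer position ranges stay inside the band of angles the model saw during training.

```python
import math

def rope_inv_freqs(head_dim: int, theta: float) -> list[float]:
    # Inverse frequencies for rotary position embeddings (RoPE):
    # each pair of dimensions rotates at its own rate.
    return [theta ** (-2.0 * i / head_dim) for i in range(head_dim // 2)]

def longest_wavelength(head_dim: int, theta: float) -> float:
    # Wavelength (in token positions) of the slowest-rotating pair.
    # Positions far beyond this produce angles the model never trained on.
    slowest = rope_inv_freqs(head_dim, theta)[-1]
    return 2 * math.pi / slowest

# Illustrative: 128-dim heads, comparing a base theta to an 8x-scaled one.
base = longest_wavelength(128, 500_000.0)
scaled = longest_wavelength(128, 4_000_000.0)
assert scaled > base  # scaling theta up stretches all wavelengths
```

This is the intuition behind "appropriately adjusting RoPE theta" in the Gradient AI post; the actual recipes also involve continued pre-training on long sequences.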
How to Run Llama 3 Locally? Let's Have a Look! #DL #AI #ML #DeepLearning #ArtificialIntelligence #MachineLearning #ComputerVision #AutonomousVehicles #NeuroMorphic #Robotics https://t.co/iLHp4Pojf0
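Several posts above are about running Llama 3 locally. One common route is Ollama, which serves pulled models behind a local HTTP API. A minimal sketch, assuming Ollama is installed, `llama3` has been pulled, and the server is listening on its default port 11434:

```python
import json
import urllib.request

OLLAMA_URL = "http://localhost:11434/api/generate"  # Ollama's default local endpoint

def build_request(prompt: str, model: str = "llama3") -> dict:
    # "stream": False asks for a single JSON response
    # instead of a stream of partial chunks.
    return {"model": model, "prompt": prompt, "stream": False}

def generate(prompt: str) -> str:
    payload = json.dumps(build_request(prompt)).encode("utf-8")
    req = urllib.request.Request(
        OLLAMA_URL, data=payload,
        headers={"Content-Type": "application/json"})
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["response"]

# Usage (requires a running Ollama server):
#   print(generate("Why is the sky blue? Answer briefly."))
```

Ollama handles downloading and quantizing the weights; alternatives like llama.cpp work similarly but load the model file directly.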
Ahead of WWDC, Apple has released its open-source efficient language models, hinting at its ongoing progress with AI. #Apple #WWDC #AI #LLM https://t.co/ukKOEz1KhI
Run an AI Town locally, powered by llama3 🎉 No cloud signups needed. Make your own world, and then talk to it :) Runs the open-source @convex_dev backend locally. Use @ollama locally or @togethercompute for cloud LLM. @realaitown https://t.co/7YmK6wBVIS
Open-source Llama 3 is moving fast: Llama 3 8B-Instruct with a 160K context window, achieved via progressive training on augmented generations of increasing context length from SlimPajama https://t.co/M8jIGOr62I
Quantized matmuls in the latest MLX are up to 40% faster thanks to @angeloskath and @DiganiJagrit QLoRA fine-tuning Llama 3 70B on a single M2 Ultra stats: - Batch size 4 with 16 LoRA layers - 95 toks/sec - Peak mem 41GB - Avg power 120 W https://t.co/dLuMAGrpzd
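A back-of-envelope check on the ~41 GB peak memory reported above: 4-bit quantized weights for a 70B-parameter model alone account for roughly 35 GB, leaving a few GB for the LoRA adapters, activations, and KV cache. (The helper below is an illustrative estimate, not MLX's actual memory accounting.)

```python
def quantized_weight_gb(n_params: float, bits_per_weight: float) -> float:
    # Approximate weight memory for an n_params-parameter model quantized
    # to bits_per_weight; ignores quantization scales/zero-points.
    return n_params * bits_per_weight / 8 / 1e9

weights = quantized_weight_gb(70e9, 4)  # 70B params at 4 bits each
assert 30 < weights < 41                # ~35 GB, consistent with a 41 GB peak
```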
Apple’s most likely strategy: developing artificial intelligence software that’s small and lightweight enough to run on its devices. https://t.co/l1xrk432GA AI Agenda by @KalleyHuang
It's been a week since LLaMA 3 dropped. In that time, we've: - extended context from 8K -> 128K - trained multiple ridiculously performant fine-tunes - got inference working at 800+ tokens/second If Meta keeps releasing OSS models, closed providers won't be able to compete.
Llama 3 surprised everyone less than a week ago, but Microsoft just dropped Phi-3, and it's an incredibly capable small AI model. We may soon see 7B models that can beat GPT-4. People are already coming up with incredible use cases. 10 wild examples: