OpenAI should put Mixtral 8x7b in their API just to flex their low prices
Now offering Mixtral inference for $0.000000/1B tokens. Burn all our VC money, idgaf!
Mixtral > Gemini Pro on the Chatbot Arena Leaderboard And that's just `mistral-small` - Mistral also has a mystery `mistral-medium` available on their API https://t.co/hAvw62blhn
.@OctoML is now serving the lowest-cost Mixtral MoE inference I've seen. Input: $0.20/million tokens Output: $0.50/million tokens https://t.co/S5Rg5QoUdH
Last week @MistralAI launched pricing for the Mixtral MoE: ~$2.00 / 1M tokens. Hours later @togethercompute took the weights and dropped pricing by 70% to $0.60 / 1M. Days later @abacusai cut 50% deeper to $0.30 / 1M. Yesterday @DeepInfra went to $0.27 / 1M. Who’s next ??? 📉
OctoAI has crazy fast #Mixtral 8x7B for $0.20 / 1M input tokens, and $0.50 / 1M output tokens 😎 You can try it out here (login first) https://t.co/7YYFCTsZT9
Several providers, including MistralAI, TogetherCompute, AbacusAI, DeepInfra, and OctoAI (OctoML), have been racing to offer the cheapest Mixtral MoE inference. MistralAI launched at roughly $2.00 per million tokens; within hours TogetherCompute cut that by 70% to $0.60 per million, AbacusAI then went 50% deeper to $0.30, and DeepInfra followed at $0.27. OctoAI currently offers the lowest pricing seen: $0.20 per million input tokens and $0.50 per million output tokens. (The "$0.000000 per billion tokens" figure above is a joke suggesting OpenAI serve Mixtral for free just to flex its low prices, not a real offer.) The race to the bottom continues, with speculation about which provider will cut next.
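To make the price gap concrete, here is a minimal sketch comparing what a hypothetical workload would cost at each quoted rate. The per-million-token prices come from the posts above; the workload size is an illustrative assumption, and for providers that quoted a single flat rate we assume the same price applies to input and output tokens (the posts don't specify).

```python
# Prices quoted in the posts above, as (input $/1M tokens, output $/1M tokens).
# Flat-rate providers are assumed to charge the same for input and output.
PRICES = {
    "MistralAI (launch)": (2.00, 2.00),
    "TogetherCompute":    (0.60, 0.60),
    "AbacusAI":           (0.30, 0.30),
    "DeepInfra":          (0.27, 0.27),
    "OctoAI":             (0.20, 0.50),  # split input/output pricing
}

def cost(input_tokens: int, output_tokens: int,
         in_price: float, out_price: float) -> float:
    """Dollar cost of a workload at the given per-million-token rates."""
    return input_tokens / 1e6 * in_price + output_tokens / 1e6 * out_price

# Illustrative workload: 10M input tokens, 2M output tokens.
for name, (inp, outp) in PRICES.items():
    print(f"{name:20s} ${cost(10_000_000, 2_000_000, inp, outp):.2f}")
```

At this assumed workload, the launch pricing comes to $24.00 while OctoAI's split pricing comes to $3.00 — an ~8x difference in a matter of days.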