Developers are reporting significant speed-ups with Apple's MLX framework for machine learning on Apple Silicon. Benchmarks below include nearly 4x faster CLIP inference on an M1 MacBook Pro compared to a T4 GPU, and MLX CLIP on M1 running in 3 ms versus 16 ms on a V100. Memory efficiency remains a concern: one Mamba port uses 37 GB of RAM for a 2.8B-parameter model.
I benchmarked CLIP on a V100 -> 16 ms. MLX CLIP on my M1 -> 3 ms. MLX CLIP has some improvements (e.g. to GELU), but I didn't expect this big a difference. Can someone else benchmark as well to confirm? https://t.co/1AekQvmzlD
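For anyone who wants to reproduce or sanity-check these timings, a minimal, framework-agnostic harness (warmup runs plus median; the `benchmark` helper and the stand-in workload are hypothetical, not from the original posts) might look like this. Note that for lazy frameworks like MLX you must force evaluation (e.g. `mx.eval(out)`) inside the timed function, or you will time graph construction rather than the actual compute:

```python
import time
import statistics

def benchmark(fn, warmup=3, runs=20):
    """Time fn(): warm up first (caches, JIT/compile effects),
    then return the median wall time of several runs in milliseconds."""
    for _ in range(warmup):
        fn()
    samples = []
    for _ in range(runs):
        t0 = time.perf_counter()
        fn()
        samples.append((time.perf_counter() - t0) * 1e3)
    return statistics.median(samples)

# Stand-in workload; swap in the CLIP forward pass (with evaluation
# forced, e.g. mx.eval) to reproduce the 3 ms vs 16 ms comparison.
ms = benchmark(lambda: sum(i * i for i in range(10_000)))
print(f"{ms:.2f} ms (median of 20 runs)")
```

Using the median rather than the mean keeps one-off scheduler hiccups from skewing the result.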
Found a Mamba port for MLX that works on Apple Silicon: https://t.co/mti0fMpc9t It seems memory-inefficient (uses 37 GB of RAM for the 2.8B model), but it works.
Happy Valentine's Day, MLX! Functions compiled with "mx.compile" are now integrated in mlx-benchmark. Here are the results on an M1 Pro. It would be nice to integrate benchmarks from other M chips @awnihannun @ivanfioravanti @digitalix https://t.co/S7Opsn9t4o
Apple MLX compile! mx.compile improvements, mlx 0.2.0, MLX vs MPS:
- PReLU vs compiled_PReLU: +21% vs +54%
- SeLU vs compiled_SeLU: -43% vs +89%
Thanks @awnihannun for the hint! https://t.co/9t5YnIDToX https://t.co/nL0wUz1FmB
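The gains above come from wrapping a function in `mx.compile`, which traces it once and fuses its elementwise operations into fewer kernels. A minimal sketch, assuming MLX >= 0.2 on Apple Silicon (the code falls back gracefully elsewhere, and the scalar `selu_ref` is only a CPU sanity check, not part of MLX):

```python
import math

# SELU constants (Klambauer et al.), used only for a scalar reference.
ALPHA = 1.6732632423543772
SCALE = 1.0507009873554805

def selu_ref(x):
    """Scalar SELU, handy for sanity-checking an array result."""
    return SCALE * (x if x > 0 else ALPHA * (math.exp(x) - 1.0))

try:
    import mlx.core as mx
    import mlx.nn as nn

    # mx.compile fuses the elementwise ops of SELU; this fusion is
    # where the compiled_SeLU speed-up reported above comes from.
    compiled_selu = mx.compile(nn.selu)

    x = mx.random.normal((1024, 1024))
    y = compiled_selu(x)
    mx.eval(y)  # MLX is lazy: force evaluation before timing/reading
except ImportError:
    pass  # MLX requires Apple Silicon; the reference above still runs
```

Timing `nn.selu` against `compiled_selu` with a warmup-then-median harness should show the compiled version ahead, since SELU's exp, scale, and select steps get fused.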
I converted CLIP checkpoints to MLX and they're quite fast. Find them here:
- https://t.co/eItPFFm4v7
- https://t.co/T9EHIT7xF8
- https://t.co/ltdyxs04ev
Benchmarks are appreciated.
I converted CLIP checkpoints to MLX and they're quite fast. MBP M1 MLX vs T4 results in nearly a 4x speed-up; I think I'll just develop on local. Find them here: https://t.co/T9EHIT7xF8 https://t.co/eItPFFm4v7 https://t.co/ltdyxs04ev