Recent developments in natural language processing (NLP) and machine learning (ML) include several new models and library updates. LLaVa-NeXT (also known as LLaVa-1.6) has been integrated into Hugging Face Transformers, improving on its predecessor LLaVa-1.5 with higher image resolutions, better OCR and reasoning capabilities, and support for additional LLM backbones such as Mistral-7B and Yi-34B by NousResearch. The latest MLX LM adds (Q)LoRA fine-tuning with OpenAI-format chat and completions data, as well as the ability to fuse LoRA fine-tuned models and export them to GGUF (fp16 only). Meanwhile, the open-source LLM community is pushing to match proprietary LLMs in human evaluation.
Free LLM Improvements? 🤔 Model Merging allows us to blend/stack multiple open LLMs into one—bigger or the same size—without extra training to extend skills and performance!🌱 @arcee_ai just released the paper for their open-source library mergekit. Let's take a look 👀 How to… https://t.co/TRZiwF2e36
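For readers curious what a mergekit merge looks like in practice, here is a minimal sketch of a SLERP merge config. The model names, layer ranges, and interpolation factor are illustrative assumptions, not taken from the tweet; check mergekit's README for the options your version supports.

```yaml
# Illustrative mergekit config: SLERP-merge two Mistral-7B-based models.
# Model names and parameter values are examples only.
slices:
  - sources:
      - model: mistralai/Mistral-7B-v0.1
        layer_range: [0, 32]
      - model: HuggingFaceH4/zephyr-7b-beta
        layer_range: [0, 32]
merge_method: slerp
base_model: mistralai/Mistral-7B-v0.1
parameters:
  t: 0.5          # interpolation factor between the two models
dtype: float16
```

The config is then typically run with mergekit's CLI, e.g. `mergekit-yaml config.yml ./merged-model`, producing a single merged checkpoint without any training.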
Transformers 4.39 is out, and it's packed with exciting updates! 🚀 New models: Mamba, Command-R, LLaVA-NeXT, MusicGen Melody, StarCoder2, SegGPT, ... ⚡️GaLore optimizer for accessible pre-training 🤏Quanto integration and Exllama+AWQ 🍎MLX support https://t.co/AsqciYOi2e
Super excited for our next open-source LLM drop!! We will be pushing up the SOTA for open-source models significantly. This time, the focus is purely on usable LLMs that match proprietary LLMs in human eval, i.e., the benchmark that really matters... https://t.co/gYBDUt5qRx
Latest MLX LM has a couple of nice additions: - (Q)LoRA fine-tuning works with OpenAI format chat and completions data (h/t @Madroidmaq) - Fuse and export LoRA fine-tuned models to GGUF *fp16 only* (h/t @LiMzba) pip install -U mlx-lm
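A sketch of how these additions fit together on the command line. The subcommands (`mlx_lm.lora`, `mlx_lm.fuse`) and the `--export-gguf` flag are from recent mlx-lm versions; the model name and data path here are illustrative assumptions, and mlx-lm itself runs on Apple silicon, so check each command's `--help` for your installed version.

```shell
# Illustrative mlx-lm workflow; model name and data path are examples.
pip install -U mlx-lm

# (Q)LoRA fine-tune on OpenAI-format chat/completions JSONL data
python -m mlx_lm.lora --model mistralai/Mistral-7B-Instruct-v0.2 \
    --train --data ./data

# Fuse the LoRA adapters into the base weights and export to GGUF (fp16 only)
python -m mlx_lm.fuse --model mistralai/Mistral-7B-Instruct-v0.2 \
    --export-gguf
```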
Excited to share that LLaVa-NeXT (also known as LLaVa-1.6) is now in @huggingface Transformers! It improves upon its predecessor, LLaVa-1.5, by incorporating * higher image resolutions * better OCR and reasoning capabilities * various LLMs (Mistral-7B, Yi-34B by @NousResearch) 1/2 https://t.co/BBpUnnfcTg