Researchers have recently introduced several advances in the field of Large Language Models (LLMs). Fudan University researchers have developed SpeechGPT-Gen, an 8 billion-parameter Speech Large Language Model (SLLM) that specializes in semantic and perceptual information modeling. H2O-Danube-1.8B, a 1.8 billion-parameter language model, was trained on 1 trillion tokens following core principles from Llama 2 and Mistral, and shows competitive metrics in the sub-2B model space. A new paper revisits the problem of extreme LLM compression, targeting very low bit counts per parameter and proposing an algorithm based on additive quantization. Microsoft's 'SliceGPT' proposes compressing large language models by deleting rows and columns of weight matrices, removing up to 25% of model parameters while maintaining 99%, 99%, and 90% of performance for LLAMA2-70B, OPT 66B, and Phi-2, respectively.
Great overview of compression algorithms for LLMs, covering pruning, quantization, knowledge distillation, low-rank approximation, parameter sharing, and efficient architecture design. This space is moving so fast. This is just a nice overview… https://t.co/CQxMgw0Wih
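To make one of those families concrete, here is a minimal sketch of magnitude pruning: weights whose absolute value falls below a percentile threshold are zeroed out. The `magnitude_prune` helper and the random weight matrix are illustrative stand-ins, not code from the linked overview.

```python
# Minimal sketch of magnitude pruning, one of the compression families
# the overview covers. Weights below a percentile threshold are zeroed.
# Plain NumPy; the array is random stand-in data, not a real model.
import numpy as np

def magnitude_prune(weights: np.ndarray, sparsity: float) -> np.ndarray:
    """Zero out the smallest-magnitude weights until `sparsity` fraction is zero."""
    threshold = np.quantile(np.abs(weights), sparsity)
    return np.where(np.abs(weights) >= threshold, weights, 0.0)

w = np.random.randn(512, 512)
w_pruned = magnitude_prune(w, sparsity=0.5)
print(f"sparsity achieved: {np.mean(w_pruned == 0.0):.2%}")
```

In practice pruning is usually followed by a short fine-tuning pass to recover accuracy, and structured variants remove whole rows, columns, or heads rather than individual weights.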
📌 This paper revisits the problem of "extreme" LLM compression, defined as targeting extremely low bit counts, such as 2 to 3 bits per parameter. 🔥 "Extreme Compression of Large Language Models via Additive Quantization" 📌 The resulting algorithm advances the… https://t.co/6IeXaYT0Dq
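As a rough intuition for the additive-quantization idea: each small group of weights is stored as a sum of codewords drawn from a few small codebooks, so the storage cost per weight drops to a handful of code-index bits. The toy below uses random codebooks and a greedy residual-style encoder, whereas the paper learns the codebooks and uses a more sophisticated search; it is a sketch of the concept, not the paper's algorithm.

```python
# Toy sketch of additive quantization: approximate each weight group as a
# SUM of codewords, one from each of M codebooks. Codebooks here are random
# stand-ins; encoding is greedy over the residual.
import numpy as np

rng = np.random.default_rng(0)
d, M, K = 8, 2, 256                          # group size, codebooks, codewords each
codebooks = rng.standard_normal((M, K, d))   # stand-in "learned" codebooks

def encode(group: np.ndarray) -> list[int]:
    """Greedily pick one codeword per codebook to approximate `group`."""
    residual, codes = group.copy(), []
    for m in range(M):
        idx = int(np.argmin(np.linalg.norm(codebooks[m] - residual, axis=1)))
        codes.append(idx)
        residual -= codebooks[m][idx]
    return codes

def decode(codes: list[int]) -> np.ndarray:
    """Reconstruct the group as the sum of the selected codewords."""
    return sum(codebooks[m][c] for m, c in enumerate(codes))

group = rng.standard_normal(d)
codes = encode(group)                        # two 8-bit indices cover 8 weights
print("bits per weight:", M * np.log2(K) / d)  # 2 * 8 bits / 8 weights = 2.0
print("reconstruction error:", np.linalg.norm(group - decode(codes)))
```

With 2 codebooks of 256 codewords over groups of 8 weights, the index cost works out to exactly 2 bits per parameter, which is the regime the paper calls "extreme".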
Very nice proposal in this paper from Microsoft - "SliceGPT: Compress Large Language Models by Deleting Rows and Columns" 🔥 SliceGPT can remove up to 25% of the model parameters (including embeddings) for LLAMA2-70B, OPT 66B and Phi-2 models while maintaining 99%, 99% and 90%… https://t.co/6D10HxnNpw
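A hedged sketch of the SliceGPT intuition: rotate a layer's weights into a PCA basis computed from calibration activations, then delete the rows and columns corresponding to the least-important directions. Everything below is random stand-in data, and the paper's actual method additionally rewires adjacent layers so the rotations cancel out (its "computational invariance"); this is only the core slicing step.

```python
# Sketch of row/column deletion after a PCA rotation, in the spirit of
# SliceGPT. Random matrices stand in for calibration activations and a
# weight matrix; real usage operates on transformer layers.
import numpy as np

rng = np.random.default_rng(0)
d, d_small = 64, 48                  # hidden size; sliced size (25% removed)
X = rng.standard_normal((1000, d))   # stand-in calibration activations
W = rng.standard_normal((d, d))      # stand-in square weight matrix

# PCA basis of the activations: directions ordered by explained variance.
_, _, Vt = np.linalg.svd(X, full_matrices=False)
Q = Vt.T                             # orthogonal rotation, d x d

# Rotate the weights into that basis, then keep only the top directions.
W_sliced = (Q.T @ W @ Q)[:d_small, :d_small]
print(W.shape, "->", W_sliced.shape)  # (64, 64) -> (48, 48)
```

The appeal is that the sliced matrices stay dense, so the smaller model runs on standard hardware with no special sparse kernels.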
Happy to share our first efforts in foundation modeling: H2O-Danube-1.8B, a small 1.8B model based on the Llama/Mistral architecture, trained on only 1T natural-language tokens and showing competitive metrics across benchmarks in the <2B model space. We particularly hope for the model to…
H2O-Danube-1.8B paper page: https://t.co/pl8Zg3VfmE We present H2O-Danube-1.8B, a 1.8B language model trained on 1T tokens following the core principles of LLama 2 and Mistral. We leverage and refine various techniques for pre-training large language models. Although our model is… https://t.co/M25nLQ4Iqo
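For readers who want to try a sub-2B model like this one, a minimal usage sketch with the Hugging Face transformers API follows. The repo id below is an assumption on my part; check the paper page or H2O.ai's Hugging Face organization for the actual checkpoint name.

```python
# Hedged usage sketch: loading a small causal LM with transformers.
# The model id is an ASSUMED placeholder, not confirmed by the tweet.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "h2oai/h2o-danube-1.8b-base"  # assumed repo id
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.bfloat16)

inputs = tokenizer("The Danube is", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```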
Fudan University Researchers Introduce SpeechGPT-Gen: An 8B-Parameter Speech Large Language Model (SLLM) Efficient in Semantic and Perceptual Information Modeling Quick read: https://t.co/XAjkEKiUfE Paper: https://t.co/EmKuzqqz3h Github: https://t.co/RgIAseerS1 #artificial… https://t.co/gsxI0h5S2Z