A new research paper, 'Perplexed by Perplexity: Perplexity-Based Data Pruning With Small Reference Models' (2024), explores using small language models (LMs) to prune pretraining data for larger LMs. The study, conducted by researchers from Databricks, MIT, and DatologyAI, including Z Ankner, C Blakeney, K Sreenivasan, and M Marion, finds that small LMs can effectively prune data for models up to 30 times larger, and that this pruning method works in both the overtrained and data-constrained regimes. The paper highlights the potential of small LMs to improve the efficiency and performance of larger LMs by selecting high-quality subsets of large-scale text datasets. The study also examines the marginal contribution of a data point to a model's loss.
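As a sketch of the core idea above: perplexity-based pruning scores each document with a small reference model and keeps only the documents whose perplexity falls in a chosen part of the distribution (e.g. the low end). The helper names, the use of precomputed token log-probabilities, and the toy data below are illustrative assumptions, not the paper's actual pipeline.

```python
import math
from typing import List, Sequence

def perplexity(token_log_probs: List[float]) -> float:
    """Perplexity of one document: exp of the mean negative log-likelihood
    of its tokens (log-probs assumed to come from a small reference LM)."""
    nll = -sum(token_log_probs) / len(token_log_probs)
    return math.exp(nll)

def prune_by_perplexity(docs: Sequence[str],
                        doc_log_probs: Sequence[List[float]],
                        keep_fraction: float = 0.5,
                        keep: str = "low") -> List[str]:
    """Keep the fraction of documents at the chosen end of the
    reference-model perplexity distribution ('low' or 'high')."""
    scored = sorted(
        zip(docs, (perplexity(lp) for lp in doc_log_probs)),
        key=lambda pair: pair[1],
        reverse=(keep == "high"),
    )
    n_keep = max(1, int(len(scored) * keep_fraction))
    return [doc for doc, _ in scored[:n_keep]]

# Toy example: two "clean" documents (high token log-probs under the
# reference model) and two "noisy" ones (low token log-probs).
docs = ["clean-a", "clean-b", "noisy-a", "noisy-b"]
log_probs = [
    [-0.5, -0.4, -0.6],   # low perplexity
    [-0.2, -0.3, -0.4],   # low perplexity
    [-3.0, -2.5, -2.8],   # high perplexity
    [-2.9, -3.1, -2.6],   # high perplexity
]
kept = prune_by_perplexity(docs, log_probs, keep_fraction=0.5, keep="low")
# → keeps the two low-perplexity documents
```

In the paper's setting, the reference model is far smaller than the model trained on the pruned data; which end of the perplexity distribution to keep is an empirical question the paper studies, so `keep` is left as a parameter here.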
[CL] Perplexed by Perplexity: Perplexity-Based Data Pruning With Small Reference Models Z Ankner, C Blakeney, K Sreenivasan, M Marion... [Databricks & MIT & DatologyAI] (2024) https://t.co/8TngcEoRZW - Perplexity-based data pruning, where a dataset is pruned to subsets with low,… https://t.co/qN8kQ570Mb
[CL] Perplexed by Perplexity: Perplexity-Based Data Pruning With Small Reference Models Z Ankner, C Blakeney, K Sreenivasan, M Marion... [Databricks & MIT & DatologyAI] (2024) https://t.co/8TngcEoRZW - The marginal contribution of a data point to a model's loss, defined as the… https://t.co/O6xZwL6Qg6
Finally, a pruning paper that gets me excited. Small LLMs are helpful for choosing the data for larger LLMs! https://t.co/Bsfkq1ZMsD
New paper where we explore using a small LM’s perplexity to prune the pretraining data for larger LMs. We find that small LMs can prune data for up to 30x larger LMs, data pruning works in the overtrained and data-constrained regimes, and more! https://t.co/XYbI0Ijois
Perplexed by Perplexity: Perplexity-Based Data Pruning With Small Reference Models - In this work, we investigate whether small language models can determine high-quality subsets of large-scale text datasets that improve the performance of larger language… https://t.co/9hejOpCiVJ