Distillation Scaling Laws · #Large Language Models #Small Language Models #Distillation #Scale #Paper #PDF · arxiv.org · Feb 13, 2025
Scaling Laws for Precision · #Scale #Machine Learning #Paper #PDF · arxiv.org · Nov 19, 2024
Announcing our updated Responsible Scaling Policy \ Anthropic · #Anthropic #Scale #Responsible AI #Paper #PDF · anthropic.com · Oct 16, 2024
When Less is More: Investigating Data Pruning for Pretraining LLMs at Scale · #Large Language Models #Pruning #Scale #Paper #PDF · arxiv.org · Dec 15, 2023
Intriguing Properties of Quantization at Scale · #Machine Learning #Emergence #Cohere #Paper #PDF #Scale · arxiv.org · Jun 1, 2023
Scaling Data-Constrained Language Models · #Large Language Models #Scale #Paper #PDF · arxiv.org · Jun 1, 2023