When Less is More: Investigating Data Pruning for Pretraining LLMs at ScaleDownload PDF#Large Language Models#Pruning#Scale#Paper#PDF·arxiv.org·Dec 15, 2023When Less is More: Investigating Data Pruning for Pretraining LLMs at Scale