A Careful Examination of Large Language Model Performance on Grade School ArithmeticView PDF#Large Language Models#Mathematics#Reasoning#Benchmark#Paper#PDF·arxiv.org·May 2, 2024A Careful Examination of Large Language Model Performance on Grade School Arithmetic