When AI Co-Scientists Fail: SPOT-a Benchmark for Automated...
#Validation #Verification #Literature Review #Automation #Machine Learning #Paper #PDF
arxiv.org · May 23, 2025