SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World...View PDF#Economics#Model#Productivity#Software Engineering#Paper#PDF·arxiv.org·Feb 19, 2025SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World...
How is Google using AI for internal code migrations?View PDF#Software Engineering#Google#Performance#Paper#PDF#Large Language Models·arxiv.org·Jan 18, 2025How is Google using AI for internal code migrations?
DafnyBench: A Benchmark for Formal Software VerificationView PDF#AI#Verification#Paper#PDF#Benchmark#Software Engineering#Machine Learning#Programming Languages·arxiv.org·Jun 14, 2024DafnyBench: A Benchmark for Formal Software Verification