DafnyBench: A Benchmark for Formal Software VerificationView PDF#AI#Verification#Paper#PDF#Benchmark#Software Engineering#Machine Learning#Programming Languages·arxiv.org·Jun 14, 2024DafnyBench: A Benchmark for Formal Software Verification
Black-Box Access is Insufficient for Rigorous AI AuditsDownload PDF#Verification#Regulation#AI#Paper#PDF·arxiv.org·Jan 30, 2024Black-Box Access is Insufficient for Rigorous AI Audits