HardTests: Synthesizing High-Quality Test Cases for LLM Coding#Testing#Large Language Models#Paper#PDF#Coding#Verification·arxiv.org·Jun 3, 2025HardTests: Synthesizing High-Quality Test Cases for LLM Coding
OpenAI Builds AI to Critique AI#OpenAI#ChatGPT#Coding#Verification#Large Language Models·spectrum.ieee.org·Jun 27, 2024OpenAI Builds AI to Critique AI