Suppressing Pink Elephants with Direct Principle FeedbackDownload PDF#Large Language Models#Feedback#Paper#PDF·arxiv.org·Feb 13, 2024Suppressing Pink Elephants with Direct Principle Feedback
Human Feedback is not Gold StandardDownload PDF#Large Language Models#Feedback#Criticism#RLHF#Paper#PDF·arxiv.org·Oct 4, 2023Human Feedback is not Gold Standard
OpenELM/OpenELM_Paper.pdf at paper · CarperAI/OpenELM · GitHub#Large Language Models#Algorithms#Feedback#Paper#PDF·github.com·Jul 11, 2023OpenELM/OpenELM_Paper.pdf at paper · CarperAI/OpenELM · GitHub