Isaac Asimov's Laws of Robotics Need an Update for AI
Test Information Space
Influence and cyber operations an update october 2024
AI deception: A survey of examples, risks, and potential solutions
AI Elections accord - A Tech accord to Combat Deceptive Use of AI in 2024 Elections
Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training
Download PDF
Technical Report: Large Language Models can Strategically Deceive their Users when Put Under Pressure
Role-Play with Large Language Models