Test Information Space

#Deception

Isaac Asimov's Laws of Robotics Need an Update for AI

#Robotics #Ethics #Deception

·spectrum.ieee.org·Jan 14, 2025

Isaac Asimov's Laws of Robotics Need an Update for AI

Influence and cyber operations an update october 2024

#OpenAI #Report #Cybersecurity #Paper #PDF #Deception #AI

·openai.com·Oct 9, 2024

Influence and cyber operations an update october 2024

AI deception: A survey of examples, risks, and potential solutions

#Deception #AI #Paper

·cell.com·May 10, 2024

AI deception: A survey of examples, risks, and potential solutions

AI Elections accord - A Tech accord to Combat Deceptive Use of AI in 2024 Elections

#Elections #Deception #Alliances #AI #Cybersecurity

·aielectionsaccord.com·Feb 16, 2024

AI Elections accord - A Tech accord to Combat Deceptive Use of AI in 2024 Elections

Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training

Download PDF

#Deception #Large Language Models #Paper #PDF

·arxiv.org·Jan 13, 2024

Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training

Technical Report: Large Language Models can Strategically Deceive their Users when Put Under Pressure

#Deception #Large Language Models #Paper #PDF

·arxiv.org·Nov 15, 2023

Technical Report: Large Language Models can Strategically Deceive their Users when Put Under Pressure

Role-Play with Large Language Models

#Large Language Models #Dialogue #Deception #Self-Awareness #Paper #PDF #DeepMind

·arxiv.org·Nov 13, 2023

Role-Play with Large Language Models