Search Test Information Space

Found 2 bookmarks

Newest

How Claude uses AI to identify new threats

#Cybersecurity #Anthropic

·platformer.news·Dec 13, 2024

How Claude uses AI to identify new threats

Simple probes can catch sleeper agents \ Anthropic

#Training #Large Language Models #Anthropic #Paper #Classification #Cybersecurity

·anthropic.com·Apr 24, 2024

Simple probes can catch sleeper agents \ Anthropic