Search AI/ML

Found 37 bookmarks

Custom sorting

Compromising LLMs: The Advent of AI Malware - Black Hat USA 2023 | Briefings Schedule

#safety #security

·blackhat.com·Aug 18, 2023

Compromising LLMs: The Advent of AI Malware - Black Hat USA 2023 | Briefings Schedule

What happens when thousands of hackers try to break AI chatbots

In a Jeopardy-style game at the annual Def Con hacking convention in Las Vegas, hackers tried to get chatbots from OpenAI, Google and Meta to create misinformation and share harmful content.

#safety #security

·npr.org·Aug 16, 2023

What happens when thousands of hackers try to break AI chatbots

The Need for Trustworthy AI - Schneier on Security

#safety #security

·schneier.com·Aug 3, 2023

The Need for Trustworthy AI - Schneier on Security

PoisonGPT: How we hid a lobotomized LLM on Hugging Face to spread fake news

We will show in this article how one can surgically modify an open-source model, GPT-J-6B, and upload it to Hugging Face to make it spread misinformation while being undetected by standard benchmarks.

#safety #security #model training

·blog.mithrilsecurity.io·Jul 10, 2023

PoisonGPT: How we hid a lobotomized LLM on Hugging Face to spread fake news

Gandalf | Lakera – Test your prompting skills to make Gandalf reveal secret information.

Trick Gandalf into revealing information and experience the limitations of large language models firsthand.

#security #safety

·gandalf.lakera.ai·Jul 1, 2023

Gandalf | Lakera – Test your prompting skills to make Gandalf reveal secret information.

Building Trustworthy AI - Schneier on Security

Realistically, we should all be preparing for a world where AI is not trustworthy. Because AI tools can be so incredibly useful, they will increasingly pervade our lives, whether we trust them or not. Being a digital citizen of the next quarter of the twenty-first century will require learning the basic ins and outs of LLMs so that you can assess their risks and limitations for a given use case. This will better prepare you to take advantage of AI tools, rather than be taken advantage by them.

#safety #security #ethics

·schneier.com·May 11, 2023

Building Trustworthy AI - Schneier on Security

Prompt injection: what’s the worst that can happen?

Activity around building sophisticated applications on top of LLMs (Large Language Models) such as GPT-3/4/ChatGPT/etc is growing like wildfire right now. Many of these applications are potentially vulnerable to prompt …

#security #safety

·simonwillison.net·Apr 15, 2023

Prompt injection: what’s the worst that can happen?