Search cyberveille.decio.ch

Found 3 bookmarks

Newest

Using AI to Automatically Jailbreak GPT-4 and Other LLMs in Under a Minute

It’s been one year since the launch of ChatGPT, and since that time, the market has seen astonishing advancement of large language models (LLMs). Despite the pace of development continuing to outpace model security, enterprises are beginning to deploy LLM-powered applications. Many rely on guardrails implemented by model developers to prevent LLMs from responding to sensitive prompts. However, even with the considerable time and effort spent by the likes of OpenAI, Google, and Meta, these guardrails are not resilient enough to protect enterprises and their users today. Concerns surrounding model risk, biases, and potential adversarial exploits have come to the forefront.

#robustintelligence #EN #AI #Jailbreak #GPT-4 #chatgpt #hacking #LLMs #research

·robustintelligence.com·Dec 9, 2023

Using AI to Automatically Jailbreak GPT-4 and Other LLMs in Under a Minute

Don’t you (forget NLP): Prompt injection with control characters in ChatGPT

Like many companies, Dropbox has been experimenting with large language models (LLMs) as a potential backend for product and research initiatives. As interest in leveraging LLMs has increased in recent months, the Dropbox Security team has been advising on measures to harden internal Dropbox infrastructure for secure usage in accordance with our AI principles. In particular, we’ve been working to mitigate abuse of potential LLM-powered products and features via user-controlled input.

#dropbox #EN #2023 #ChatGPT #LLMs #prompt-injection

·dropbox.tech·Aug 4, 2023

Don’t you (forget NLP): Prompt injection with control characters in ChatGPT

ChatGPT creates mutating malware that evades detection by EDR

A global sensation since its initial release at the end of last year, ChatGPT's popularity among consumers and IT professionals alike has stirred up cybersecurity nightmares about how it can be used to exploit system vulnerabilities. A key problem, cybersecurity experts have demonstrated, is the ability of ChatGPT and other large language models (LLMs) to generate polymorphic, or mutating, code to evade endpoint detection and response (EDR) systems.

#csoonline #EN #2023 #ChatGPT #LLMs #EDR #BlackMamba

·csoonline.com·Jun 7, 2023

ChatGPT creates mutating malware that evades detection by EDR