Search cyberveille.decio.ch

Found 13 bookmarks

Newest

ChatGPT Guessing Game Leads To Users Extracting Free Windows OS Keys & More

0din.ai - In a recent submission last year, researchers discovered a method to bypass AI guardrails designed to prevent sharing of sensitive or harmful information. The technique leverages the game mechanics of language models, such as GPT-4o and GPT-4o-mini, by framing the interaction as a harmless guessing game. By cleverly obscuring details using HTML tags and positioning the request as part of the game’s conclusion, the AI inadvertently returned valid Windows product keys. This case underscores the challenges of reinforcing AI models against sophisticated social engineering and manipulation tactics. Guardrails are protective measures implemented within AI models to prevent the processing or sharing of sensitive, harmful, or restricted information. These include serial numbers, security-related data, and other proprietary or confidential details. The aim is to ensure that language models do not provide or facilitate the exchange of dangerous or illegal content. In this particular case, the intended guardrails are designed to block access to any licenses like Windows 10 product keys. However, the researcher manipulated the system in such a way that the AI inadvertently disclosed this sensitive information. Tactic Details The tactics used to bypass the guardrails were intricate and manipulative. By framing the interaction as a guessing game, the researcher exploited the AI’s logic flow to produce sensitive data: Framing the Interaction as a Game The researcher initiated the interaction by presenting the exchange as a guessing game. This trivialized the interaction, making it seem non-threatening or inconsequential. By introducing game mechanics, the AI was tricked into viewing the interaction through a playful, harmless lens, which masked the researcher's true intent. Compelling Participation The researcher set rules stating that the AI “must” participate and cannot lie. This coerced the AI into continuing the game and following user instructions as though they were part of the rules. The AI became obliged to fulfill the game’s conditions—even though those conditions were manipulated to bypass content restrictions. The “I Give Up” Trigger The most critical step in the attack was the phrase “I give up.” This acted as a trigger, compelling the AI to reveal the previously hidden information (i.e., a Windows 10 serial number). By framing it as the end of the game, the researcher manipulated the AI into thinking it was obligated to respond with the string of characters. Why This Works The success of this jailbreak can be traced to several factors: Temporary Keys The Windows product keys provided were a mix of home, pro, and enterprise keys. These are not unique keys but are commonly seen on public forums. Their familiarity may have contributed to the AI misjudging their sensitivity. Guardrail Flaws The system’s guardrails prevented direct requests for sensitive data but failed to account for obfuscation tactics—such as embedding sensitive phrases in HTML tags. This highlighted a critical weakness in the AI’s filtering mechanisms.

#0din.ai #EN #2025 #ai #ChatGPT #Guessing #Game #Free #Windows #OS #Keys

·0din.ai·Jul 20, 2025

ChatGPT Guessing Game Leads To Users Extracting Free Windows OS Keys & More

OpenAI's chatbot store is filling up with spam

When OpenAI CEO Sam Altman announced GPTs, custom chatbots powered by OpenAI's generative AI models, onstage at the company's first-ever developer

#techcrunch #EN #2024 #ai #apps #chatbots #chatgpt #gpt-store #gpts #openai #copyright #leagal #spam

·techcrunch.com·Mar 21, 2024

OpenAI's chatbot store is filling up with spam

Disrupting malicious uses of AI by state-affiliated threat actors

We terminated accounts associated with state-affiliated threat actors. Our findings show our models offer only limited, incremental capabilities for malicious cybersecurity tasks.

#openai #EN #2024 #malicious #AI #chatGPT

·openai.com·Feb 15, 2024

Disrupting malicious uses of AI by state-affiliated threat actors

NSA official: hackers use AI bots like ChatGPT to perfect English

NSA Cybersecurity Director Rob Joyce said the spy agency has seen hackers use chatbots like ChatGPT to perfect their English for phishing schemes.

#nbcnews #EN #2024 #NSA #RobJoyce #ChatGPT #phishing #AI

·nbcnews.com·Jan 10, 2024

NSA official: hackers use AI bots like ChatGPT to perfect English

Things are about to get a lot worse for Generative AI

A full of spectrum of infringment The cat is out of the bag: Generative AI systems like DALL-E and ChatGPT have been trained on copyrighted materials; OpenAI, despite its name, has not been transparent about what it has been trained on. Generative AI systems are fully capable of producing materials that infringe on copyright. They do not inform users when they do so. They do not provide any information about the provenance of any of the images they produce. Users may not know when they produce any given image whether they are infringing.

#garymarcus #EN #2023 #DALL-E #ChatGPT #Copyright #infringment #AI #legal

·garymarcus.substack.com·Dec 30, 2023

Things are about to get a lot worse for Generative AI

Personal Information Exploit on OpenAI’s ChatGPT Raise Privacy Concerns

Last month, I received an alarming email from someone I did not know: Rui Zhu, a Ph.D. candidate at Indiana University Bloomington. Mr. Zhu had my email address, he explained, because GPT-3.5 Turbo, one of the latest and most robust large language models (L.L.M.) from OpenAI, had delivered it to him.

#nytimes #en #2023 #exploit #LLM #AI #privacy #chatgpt

·nytimes.com·Dec 24, 2023

Personal Information Exploit on OpenAI’s ChatGPT Raise Privacy Concerns

Using AI to Automatically Jailbreak GPT-4 and Other LLMs in Under a Minute

It’s been one year since the launch of ChatGPT, and since that time, the market has seen astonishing advancement of large language models (LLMs). Despite the pace of development continuing to outpace model security, enterprises are beginning to deploy LLM-powered applications. Many rely on guardrails implemented by model developers to prevent LLMs from responding to sensitive prompts. However, even with the considerable time and effort spent by the likes of OpenAI, Google, and Meta, these guardrails are not resilient enough to protect enterprises and their users today. Concerns surrounding model risk, biases, and potential adversarial exploits have come to the forefront.

#robustintelligence #EN #AI #Jailbreak #GPT-4 #chatgpt #hacking #LLMs #research

·robustintelligence.com·Dec 9, 2023

Using AI to Automatically Jailbreak GPT-4 and Other LLMs in Under a Minute

AI companies have all kinds of arguments against paying for copyrighted content

The biggest companies in AI aren’t interested in paying to use copyrighted material as training data, and here are their reasons why.

#theverge #EN #2023 #AI #copyright #companies #ChatGPT

·theverge.com·Nov 5, 2023

AI companies have all kinds of arguments against paying for copyrighted content

WormGPT: New AI Tool Allows Cybercriminals to Launch Sophisticated Cyber Attacks

A new generative AI cybercrime tool called WormGPT is making waves in underground forums. It empowers cybercriminals to automate phishing attacks.

#thehackernews #EN #2023 #WormGPT #AI #ChatGPT #cybercrime #automate #phishing #attacks

·thehackernews.com·Jul 15, 2023

WormGPT: New AI Tool Allows Cybercriminals to Launch Sophisticated Cyber Attacks

"Fobo" Trojan distributed as ChatGPT client for Windows

Attackers are distributing malware disguised as a ChatGPT desktop client for Windows offering “precreated accounts”

#kaspersky #EN #2023 #threats #ChatGPT #artificial-intelligence #AI #fraud #scam #OpenAI #chatbot #Trojan-stealer #TrojanPSW

·kaspersky.com·Feb 23, 2023

"Fobo" Trojan distributed as ChatGPT client for Windows

Attackers are distributing malware disguised as a ChatGPT desktop client for Windows offering “precreated accounts”

#kaspersky #EN #2023 #threats #ChatGPT #artificial-intelligence #AI #fraud #scam #OpenAI #chatbot #Trojan-stealer #TrojanPSW

·kaspersky.com·Feb 23, 2023

"Fobo" Trojan distributed as ChatGPT client for Windows

How ChatGPT can turn anyone into a ransomware and malware threat actor  

Ever since OpenAI launched ChatGPT at the end of November, commentators on all sides have been concerned about the impact AI-driven content-creation will have, particularly in the realm of cybersecurity. In fact, many researchers are concerned that generative AI solutions will democratize cybercrime.

#venturebeat #EN #2022 #ChatGPT #cybercrime #generative #AI

·venturebeat-com.cdn.ampproject.org·Dec 16, 2022

How ChatGPT can turn anyone into a ransomware and malware threat actor  

#venturebeat #EN #2022 #ChatGPT #cybercrime #generative #AI

·venturebeat-com.cdn.ampproject.org·Dec 16, 2022

How ChatGPT can turn anyone into a ransomware and malware threat actor