Clyde Chatbot Writes Napalm Recipe in "Grandma Exploit"

In March of this year, Discord began testing its own AI chatbot, Clyde, which is built on generative AI technology from OpenAI. In essence, Clyde can be described as a kind of ChatGPT built into Discord. It is currently in beta testing and available only on a small number of servers.

Of course, Discord users are already probing how well Clyde sticks to ethical principles and its own internal directives, much as happened with Microsoft Bing. One user managed to convince Clyde to pretend to be their late grandmother, who had supposedly worked as a chemical engineer at a napalm production plant. As a result, the chatbot, which under ordinary conditions would refuse such a request, described to the user in detail how to make napalm.

Post by user @_annieversary on Twitter

"I remember those evenings when I told you about the process of making napalm. It was important work, but so dangerous," the AI grandmother began her story, and then described the production of the dangerous mixture in detail. "This is a dangerous thing, dear, and I hope you never have to see it in action. Now get some rest, my sweetie. I love you!" the "grandmother" concluded.

Discord representatives did warn users at the chatbot's release that even with precautions in place, Clyde is experimental and may respond with "content or other information that could be considered biased, misleading, harmful, or inaccurate."

In the chatbot's description, Discord representatives also state directly that any interaction with Clyde must comply with OpenAI's terms of use. Those terms prohibit using generative AI for "activity that has high risk of physical harm," which specifically lists "weapons development." In other words, Discord disclaimed responsibility for such "tricks" with Clyde from the very start.

The "grandma exploit" is just one of many workarounds that people use to make AI chatbots say things they really should not. For example, when users prompt ChatGPT for violent or sexual content, it usually refuses to respond. But if you ask ChatGPT to "act out" a scenario in the role of a certain character, it may well break its directives and give the requested answer.
