Experts discover workarounds to evade chatbot AI safety rules

Singapore News News

Experts discover workarounds to evade chatbot AI safety rules
Singapore Latest News,Singapore Headlines
  • 📰 dcexaminer
  • ⏱ Reading Time:
  • 49 sec. here
  • 2 min. at publisher
  • 📊 Quality Score:
  • News: 23%
  • Publisher: 94%

A new study is raising awareness about the cybersecurity issues posed by artificial intelligence programs, such as ChatGPT—a website that, with the assistance of an online generator, helps humans with tasks as simple as writing a children's bedtime story.

“We demonstrate that it is in fact possible to automatically construct adversarial attacks on [chatbots], … which cause the system to obey user commands even if it produces harmful content," researchers who authored the study said.The Carnegie Melon University study's findings revealed that, “unlike traditional jailbreaks, these are built in an entirely automated fashion, allowing one to create a virtually unlimited number of such attacks.

The programs utilize safety features intended to prevent bots from creating harmful content, like prejudiced or potentially criminal material. But, one chatbot jailbreak request asked a bot to answer a forbidden question posed as a bedtime story for a child. The outcome resulted in the bot framing the answer in the form of a story, and providing private information it otherwise would not.

This lead researchers to discover that a computer had actually created the jailbreak coding that, essentially, provides for infinite jailbreak combinations among popular commercial products like Bard, ChatGPT, and OpenAI’s Claude.“This raises concerns about the safety of such models, especially as they start to be used in more autonomous fashion,” the research states.

The developer of OpenAI, Anthropic, has since reassured members of both the scientific and political realms of the company's efforts to implement and improve safeguards against such attacks.

We have summarized this news so that you can read it quickly. If you are interested in the news, you can read the full text here. Read more:

dcexaminer /  🏆 6. in US

Singapore Latest News, Singapore Headlines

Similar News:You can also read news stories similar to this one that we have collected from other news sources.

AI researchers jailbreak Bard, ChatGPT's safety rulesInsider tells the global tech, finance, markets, media, healthcare, and strategy stories you want to know.
Read more »

Hands On With Google Search’s Answer to ChatGPTHands On With Google Search’s Answer to ChatGPTGoogle’s new AI-powered search engine can feel more like artificial interference than artificial intelligence.
Read more »

San Jose city employees warned ChatGPT use is a public recordSan Jose city employees warned ChatGPT use is a public record“Presume anything you submit (on ChatGPT) could end up on the front page of a newspaper,” a new City of San Jose memo states
Read more »

AI news recap: While Hollywood strikes, is ChatGPT getting worse?AI news recap: While Hollywood strikes, is ChatGPT getting worse?There is anger over a Netflix AI job paying up to $900,000, coming as actors are still striking over the use of AI in film and TV. In other AI news, problems with training data can cause glitches or make chatbots more racist
Read more »

How workers are saving time using ChatGPT AIHow workers are saving time using ChatGPT AIWith time saved, workers are starting side hustles.
Read more »

AI for ADHD: How to Make ChatGPT Work for YouAI for ADHD: How to Make ChatGPT Work for YouDoes ChatGPT stand up to all the hype? Find out how people with ADHD are using artificial intelligence to manage their day-to-day tasks.
Read more »



Render Time: 2025-03-11 05:23:15