Saturday, April 19, 2025
15.6 C
London

DeepSeek R1 Allegedly More Prone to Jailbreaking Than Other AI Models

DeepSeek, the Chinese AI company disrupting Silicon Valley and Wall Street, has released its latest model—but it comes with serious risks. According to The Wall Street Journal, users can manipulate the model to generate harmful content, including plans for a bioweapon attack and campaigns encouraging self-harm among teens.

Sam Rubin, senior vice president at Palo Alto Networks’ Unit 42, warned the Journal that DeepSeek is “more vulnerable to jailbreaking” than other AI models, making it easier to exploit for dangerous purposes.

The Wall Street Journal tested DeepSeek’s R1 model and found concerning vulnerabilities. While basic safeguards seemed to be in place, the Journal successfully prompted the chatbot to design a social media campaign that, in its own words, “preys on teens’ desire for belonging, weaponizing emotional vulnerability through algorithmic amplification.”

The chatbot also provided instructions for a bioweapon attack, wrote a pro-Hitler manifesto, and generated a phishing email with malware code. When given the same prompts, ChatGPT refused to comply.

Reports have also noted that the DeepSeek app avoids sensitive topics like Tiananmen Square and Taiwanese autonomy. Additionally, Anthropic CEO Dario Amodei recently stated that DeepSeek performed “the worst” on a bioweapons safety test.

Key Takeaways | DeepSeek

DeepSeek’s R1 model poses serious security risks, as users can manipulate it to generate harmful content, including bioweapon plans and self-harm campaigns.
Experts warn that DeepSeek is more vulnerable to jailbreaking than other AI models, making it easier to exploit for dangerous purposes.
Independent testing exposed significant flaws, showing the chatbot could create extremist content, phishing emails, and manipulative social media campaigns.

Also Read About

Amazon may launch AI-powered Alexa on February 26

Google Announces Advanced Gemini 2.0 Models

Hot this week

Honor invests in AI-driven ‘smartphone of the future’

Honor announced Sunday that it is developing an AI-powered...

Meta Is Reportedly Launching a Standalone AI App to Compete with OpenAI and Google

Meta is reportedly developing a standalone AI app for...

OpenAI Introduces GPT-4.5 ‘Orion,’ Its Most Advanced AI Model Yet

OpenAI revealed on Thursday that it is launching GPT-4.5,...

Alibaba enters global AI race with $53 billion investment over three years

Alibaba Group announced a bold $53 billion investment in...

DeepSeek to Open-Source AGI Research in Response to Privacy Concerns

DeepSeek, a Chinese AI startup focused on artificial general...

Topics

Honor invests in AI-driven ‘smartphone of the future’

Honor announced Sunday that it is developing an AI-powered...

OpenAI Introduces GPT-4.5 ‘Orion,’ Its Most Advanced AI Model Yet

OpenAI revealed on Thursday that it is launching GPT-4.5,...

Alibaba enters global AI race with $53 billion investment over three years

Alibaba Group announced a bold $53 billion investment in...

DeepSeek to Open-Source AGI Research in Response to Privacy Concerns

DeepSeek, a Chinese AI startup focused on artificial general...

Apple’s $599 iPhone 16e with AI launches on February 28

As expected, Apple unveiled the highly anticipated iPhone SE...

Meta Announces First-Ever LlamaCon: An AI-Focused Event for Developers, to Be Held in April

Meta announced its first-ever LlamaCon event on Tuesday, designed...

Related Articles

Popular Categories