
Researchers: ‘Adversarial attacks’ capable of producing harmful AI responses


A study by Amazon Web Services researchers has revealed critical security vulnerabilities in speech-language models (SLMs), large language models that understand and respond to speech. The flaws could allow the models to be manipulated into generating harmful responses through sophisticated audio attacks, according to VentureBeat.


The study found that, despite safety checks, speech-language models are highly susceptible to "adversarial attacks," which are slight, imperceptible changes to audio input that can drastically alter the model’s behavior. These attacks achieved an average success rate of 90% in generating toxic outputs during experiments.
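The mechanics of such an attack can be illustrated with a minimal sketch. The study's exact method is not described in the article; the snippet below uses a gradient-sign (FGSM-style) perturbation, a common baseline for adversarial examples, with a simulated gradient standing in for a real model's loss gradient. All names and the perturbation budget `epsilon` are illustrative.

```python
import numpy as np

def fgsm_perturb(waveform: np.ndarray, gradient: np.ndarray,
                 epsilon: float = 0.001) -> np.ndarray:
    """Shift each audio sample by at most `epsilon` in the direction
    that increases the model's loss. The change is far below audible
    levels, yet it can flip a vulnerable model's behavior."""
    adversarial = waveform + epsilon * np.sign(gradient)
    # Keep samples in the valid audio range [-1, 1].
    return np.clip(adversarial, -1.0, 1.0)

rng = np.random.default_rng(0)
clean = rng.uniform(-0.5, 0.5, size=16000)   # 1 s of audio at 16 kHz
grad = rng.standard_normal(16000)            # stand-in for a real loss gradient
adv = fgsm_perturb(clean, grad)
```

The key property, and the reason the article calls the changes "imperceptible," is that no sample moves by more than `epsilon`; the attack's power comes from coordinating thousands of tiny shifts, not from any one of them being audible.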

Moreover, the study demonstrated that audio attacks crafted against one SLM could transfer to other models, achieving a 10% success rate even without direct access to the target model. This transferability suggests a fundamental flaw in how these systems are currently trained for safety.

The implications are significant, as adversarial attacks could lead to misuse for fraud, espionage, or physical harm.

The researchers proposed countermeasures like adding random noise to audio inputs, which reduced the attack success rate, but acknowledged that this is not a complete solution.
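The noise countermeasure can be sketched in a few lines. This is a hypothetical illustration, not the researchers' implementation: the function name and the noise level `sigma` are assumptions. The idea is that random noise added at inference time tends to wash out the tiny, precisely tuned adversarial perturbation while leaving ordinary speech mostly intact.

```python
import numpy as np

def noise_defense(waveform: np.ndarray, sigma: float = 0.01,
                  rng=None) -> np.ndarray:
    """Add Gaussian noise to audio before passing it to the model,
    disrupting fine-grained adversarial perturbations."""
    rng = rng or np.random.default_rng()
    noisy = waveform + rng.normal(0.0, sigma, size=waveform.shape)
    return np.clip(noisy, -1.0, 1.0)   # keep samples in valid range

audio = np.zeros(16000)                # 1 s of silence at 16 kHz
defended = noise_defense(audio, rng=np.random.default_rng(1))
```

The trade-off implicit in the article's caveat is that noise strong enough to disrupt attacks can also degrade recognition accuracy, which is one reason the researchers do not consider it a complete solution.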
