Home Real Problems with AI Not Really Problems Problems, But Not About AI Environmental Impact AI & Copyright AI & Creativity Debate Consciousness in AI AI Benefits Cognitive Fallacies

AI Problems Index

A status board of all the AI problems that exist currently, along with those that have been addressed or debunked.

Compiled by The Multiverse School

Explore

Home
Real Problems with AI
Not Really Problems
Problems, But Not About AI
Environmental Impact
AI & Copyright
AI & Creativity Debate
Consciousness in AI
AI Benefits
Cognitive Fallacies

Resources

Sources & References

Copy-left, no rights reserved.

Back to Real AI Issues

Critical

Jailbreaks and Prompt Injections Threaten Security of LLMs

LLMs are vulnerable to adversarial inputs where users can bypass safety restrictions through various techniques.

Sources

Hendrycks et al., 2024

Description

LLMs are vulnerable to adversarial inputs where users can bypass safety restrictions. This can involve "jailbreaking" the model creator's restrictions, or "prompt injection" where an application developer's instructions are overridden, sometimes by a third party through data the LLM processes. There are no robust ways to separate instructions from data within an LLM's input, making these attacks particularly hard to prevent.