Description
After initial pretraining, LLMs are "finetuned" to be more helpful and harmless. However, this finetuning often does not fundamentally change the model's underlying knowledge, and undesirable capabilities can frequently be re-elicited through clever prompting ("jailbreaking") or further finetuning on problematic data.