
AI Problems Index

A status board of all the AI problems that exist currently, along with those that have been addressed or debunked.

Compiled by The Multiverse School


Copy-left, no rights reserved.


Ongoing

Evaluations are Confounded and Biased

It's incredibly difficult to accurately evaluate what LLMs can do and the risks they pose, because prompt sensitivity, test-set contamination, and evaluator bias all confound the results.

Sources

Hendrycks et al., 2024

Description

It's incredibly difficult to accurately evaluate what LLMs can do and the risks they pose. LLM performance is highly sensitive to prompting, so the same model can score differently depending on how a task is worded. Test data might have been part of a model's training data ("test-set contamination"), which leads to overestimated capabilities. Evaluations can also be biased by the evaluators themselves, whether those are human raters or other LLMs used to grade the outputs.
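
Two of these confounders can be made concrete with a small sketch. The code below is illustrative only and is not taken from the cited source: it assumes test-set contamination can be roughly approximated by shared word n-grams between test items and training documents (a common heuristic, with an arbitrary default of n=8), and that prompt sensitivity can be summarized as the gap between the best and worst accuracy across prompt templates. The function names, the sample data, and the template labels are all hypothetical.

```python
def ngrams(text, n):
    """Return the set of lowercase word n-grams in a string."""
    tokens = text.lower().split()
    return {tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)}


def contamination_rate(test_items, training_docs, n=8):
    """Fraction of test items sharing at least one n-gram with training text.

    This is a rough heuristic (an assumption of this sketch): it misses
    paraphrased leaks and can flag common boilerplate phrases.
    """
    train_ngrams = set()
    for doc in training_docs:
        train_ngrams |= ngrams(doc, n)
    flagged = [item for item in test_items if ngrams(item, n) & train_ngrams]
    rate = len(flagged) / len(test_items) if test_items else 0.0
    return rate, flagged


def prompt_spread(results_by_template):
    """Per-template accuracy and the max-min gap across templates.

    results_by_template maps a template label to a list of booleans
    (True = the model answered that item correctly).
    """
    accuracy = {
        name: sum(outcomes) / len(outcomes)
        for name, outcomes in results_by_template.items()
    }
    return accuracy, max(accuracy.values()) - min(accuracy.values())


# Hypothetical data, made up for illustration.
training_docs = [
    "the quick brown fox jumps over the lazy dog near the river bank today"
]
test_items = [
    "Q: the quick brown fox jumps over the lazy dog near what?",
    "What is the capital of France?",
]
rate, flagged = contamination_rate(test_items, training_docs)

# The same four questions graded under two hypothetical prompt templates.
results = {
    "plain": [True, True, False, True],
    "step_by_step": [True, False, False, False],
}
accuracy, spread = prompt_spread(results)

print(rate, len(flagged))
print(accuracy, spread)
```

A large spread means a single reported score mostly reflects the prompt that was chosen, and a nonzero contamination rate means the benchmark score may partly reflect memorization rather than capability. Neither check addresses evaluator bias, which needs its own controls.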