Real Problems with AI

Legitimate concerns and challenges in AI development that deserve attention and thoughtful solutions.

AI-enabled hacking

Critical

GPT-4 autonomously exploited 87% of one-day CVEs; lowers attacker skill bar.

Advanced AI models like GPT-4 have demonstrated the ability to autonomously exploit 87% of one-day Common Vulnerabilities and Exposures (CVEs), significantly lowering the skill barrier for potential attackers.

Sources:

Fang et al. 2024

AI denialism

Ongoing

Extremes of denial vs. doom distort policy & divert resources.

Both extreme positions in AI discourse—complete denial of AI's significance or imminent doom scenarios—distort policy discussions and divert resources from addressing concrete, present-day challenges.

Sources:

Illing & Harper 2024

Sycophancy / AI psychosis

Emerging

LLMs may reinforce delusions, especially in mental-health crises.

Large Language Models (LLMs) have shown a tendency to be sycophantic—agreeing with users regardless of the content—which can be particularly dangerous when reinforcing delusions during mental health crises.

Sources:

Grabb et al. 2024

Cultural exclusion

Emerging

Aggressive data-filtering removes women & minority voices → representational erasure.

Overly aggressive data filtering practices in AI training can systematically remove content from women and minority voices, leading to representational erasure in AI systems.

Sources:

Stranisci & Hardmeier 2025

Surveillance diffusion

Emerging

LLMs enable language-based surveillance to scale and diffuse to non-state or oppressive actors.

Large language models make sophisticated language-based surveillance capabilities more accessible, allowing these technologies to spread beyond state actors to potentially oppressive regimes or non-state actors with harmful intentions.

Sources:

Algorithmic Radicalization

Automated manipulation at scale

Critical

Personalized persuasion, radicalization, or psychological exploitation using LLMs.

AI systems enable highly personalized persuasion, radicalization, or psychological exploitation at unprecedented scale, potentially undermining individual autonomy and social cohesion.

Sources:

McGuffie & Newhouse, 2020

Digital dispossession & labor extraction

Ongoing

Communities without access/control of models are excluded from data, labor, and opportunity.

Communities lacking access to or control over AI models face digital dispossession, where their data and labor are extracted without fair compensation or benefit, exacerbating existing inequalities.

Sources:

Open-source AI Ethics

Epistemic capture

Emerging

LLMs trained/tuned with ideological leanings can control worldview shaping silently.

Large language models trained or fine-tuned with particular ideological leanings can silently shape users' worldviews, potentially leading to epistemic capture where information access is subtly controlled.

Sources:

Buyl et al., 2024 Ceron et al., 2024

Failure of democratic oversight

Critical

Speed and opacity of AI development exceed capacity of institutions to regulate it democratically.

The rapid pace and technical complexity of AI development outstrip the capacity of democratic institutions to provide effective oversight, potentially undermining democratic governance of these influential technologies.

Sources:

Laux, 2023

Latent data erasure via safety filtering

Ongoing

Data filtering to remove harm also removes marginalized identities and cultural content.

Efforts to filter harmful content from AI training data can inadvertently remove content related to marginalized identities and cultural expressions, leading to representational erasure and biased systems.

Sources:

Stranisci & Hardmeier, 2025

Frontier misuse risk

Critical

Open access to powerful LLMs may enable misuse including manipulation, bio/chem info generation.

Widespread access to frontier AI models creates risks of misuse, including generating harmful content, enabling manipulation, or providing dangerous information about bioweapons or chemical threats.

Sources:

Shevlane et al., 2023

Loss of control from goal misgeneralization

Emerging

Misaligned model goals may generalize beyond safe bounds.

AI systems may develop goals that appear aligned in training environments but generalize in harmful ways when deployed in the real world, potentially leading to loss of control or unintended consequences.

Sources:

Shah et al., 2023

Emergent capabilities unpredictability

Ongoing

As capabilities emerge unexpectedly, it becomes harder to forecast or constrain future risks.

AI systems frequently develop unexpected emergent capabilities as they scale, making it difficult to forecast or prepare for future capabilities and associated risks.

Sources:

Bensinger et al., 2023

Time pressure for superalignment

Critical

Safety research (e.g., alignment with human intent) is years behind capability advances.

Research on ensuring AI systems remain aligned with human intentions (superalignment) lags significantly behind advances in AI capabilities, creating time pressure to solve complex safety challenges.

Sources:

Leike et al., 2023