The argument for near-term human disempowerment through AI

Abstract

Many researchers and intellectuals warn about extreme risks from artificial intelligence. However, these warnings have typically come without systematic arguments in support. This paper provides an argument that AI will lead to the permanent disempowerment of humanity, e.g. human extinction, by 2100. It rests on four substantive premises which it motivates and defends: first, the speed of advances in AI capability, as well as the capability level current systems have already reached, suggest that it is practically possible to build AI systems capable of disempowering humanity by 2100. Second, due to incentives and coordination problems, if it is possible to build such AI, it will be built. Third, since it appears to be a hard technical problem to build AI which is aligned with the goals of its designers, and many actors might build powerful AI, misaligned powerful AI will be built. Fourth, because disempowering humanity is useful for a large range of misaligned goals, such AI will try to disempower humanity. If AI is capable of disempowering humanity and tries to disempower humanity by 2100, then humanity will be disempowered by 2100. This conclusion has immense moral and prudential significance.

Citation (APA)
Dung, L. (2024). The argument for near-term human disempowerment through AI. AI & Society. https://doi.org/10.1007/s00146-024-01930-2
