How to stop runaway AI

How can humans retain power over more intelligent AI beings?
Dr. Stuart J. Russell is a Professor of Computer Science at UC Berkeley and has been studying the development of artificial intelligence for decades.
Sign up for the Freethink Weekly newsletter!
A collection of our favorite stories straight to your inbox

While he doesn’t think this latest crop of generative AI tools necessarily presents a significant threat to humanity, he does think it has helped to open the public’s eyes to the potential risks of more intelligent AI that could be coming in the future.

“They’re giving people now, in a very real sense, what would it be like if we had artificial general intelligence on tap available 24/7 to solve any problem that we might have. And they’re also seeing in a very visceral way that could present real risks,” Russell explained in a recent interview with Freethink.

As Russell argues in his book Human Compatible: Artificial Intelligence and the Problem of Control, we need to be appropriately concerned about the future threat of human-level, artificial general intelligence (AGI) which could pose an existential threat to humanity unless we can ensure that these systems remain aligned with human values and goals.

He contends that the standard approach to designing AI systems — in which machines are programmed to maximize some objective function — is fundamentally flawed because these machines don’t actually understand the world around them in any comprehensive way, a flaw that, in his mind, could lead to unintended and catastrophic failures if it can’t sufficiently anticipate the consequences of its own actions. Instead, he proposes a new approach to AI design in which machines are explicitly programmed to defer to humans in matters of value and to operate within a framework of uncertain and incomplete knowledge. By ensuring that AI systems are “human-compatible” in this way, Russell argues that we can harness the enormous potential of AI while minimizing the risk of catastrophic outcomes.

Related
AI chatbots may ease the world’s loneliness (if they don’t make it worse)
AI chatbots may have certain advantages when roleplaying as our friends. They may also come with downsides that make our loneliness worse.
Will AI supercharge hacking — if it hasn’t already?
The future of hacking is coming at us fast, and it isn’t clear yet whether AI will help attackers and defenders more.
No, LLMs still can’t reason like humans. This simple test reveals why.
Most AI models are incredible at taking tests but easily bamboozled by basic reasoning. “Simple Bench” shows us why.
The future of fertility, from artificial wombs to AI-assisted IVF
A look back at the history of infertility treatments and ahead to the tech that could change everything we thought we knew about reproduction.
“Model collapse” threatens to kill progress on generative AIs
Generative AIs start churning out nonsense when trained on synthetic data — a problem that could put a ceiling on their ability to improve.
Subscribe to Freethink for more great stories