Wednesday, October 01, 2025

Computers Don't Want

I read through the new book If Anyone Builds It, Everyone Dies by Eliezer Yudkowsky and Nate Soares. "It" refers to Artificial Super Intelligence (ASI). A very short version of the authors' argument: you can view advanced AI as though it has its own desires and agency; its needs will be incompatible with those of the human race; and AI will have the capability to eliminate humans even without killer robots.

I have no doubt that a crafty sentient AI hellbent on destroying humanity could do so. But let's look at the first part of the argument: should we reason about AI as though it has agency and preferences? The authors make a subtle argument in Chapter 3: while AI doesn't have its own wants and desires, we can reason about it as though it does. In the following chapters, the authors go all in, treating ASI as though it has preferences and acts in its own self-interest.

I think of computing as a Turing machine: a device that follows a set of simple instructions, interacting with input and memory, and producing some output. The machine does not have wants or desires; all it does is follow its instructions.
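
To make that concrete, here's a minimal sketch in Python of what "follow its instructions" amounts to. The simulator and the little bit-flipping machine below are my own illustrative examples, not anything from the book: the entire "machine" is a transition table consulted in a loop.

```python
# A minimal Turing machine simulator: the "machine" is just a transition
# table consulted in a loop. Nothing here wants anything; at each step it
# looks up the (state, symbol) pair and does what the table says.

def run_tm(transitions, tape, state="start", blank="_", max_steps=10_000):
    """transitions maps (state, symbol) -> (new_state, new_symbol, move),
    where move is -1 (left) or +1 (right). Halts when no rule applies."""
    cells = dict(enumerate(tape))      # sparse tape, blank by default
    head = 0
    for _ in range(max_steps):
        symbol = cells.get(head, blank)
        if (state, symbol) not in transitions:
            break                      # no rule for this configuration: halt
        state, cells[head], move = transitions[(state, symbol)]
        head += move
    output = "".join(cells[i] for i in sorted(cells)).strip(blank)
    return state, output

# Example machine: scan right, flipping every bit, then halt.
flip_bits = {
    ("start", "0"): ("start", "1", +1),
    ("start", "1"): ("start", "0", +1),
    ("start", "_"): ("done",  "_", +1),
}

print(run_tm(flip_bits, "10110"))      # ('done', '01001')
```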

But we also realize the immense complexity that can arise from such simplicity. Rice's Theorem tells us that, in general, we can't decide anything nontrivial about a Turing machine's behavior from the machine's code. And there's a reason we can't prove P ≠ NP, or even sketch a viable approach: we have no idea how to bound the power of efficient algorithms. But we shouldn't mistake complexity, and our inability to understand an algorithm, for evidence of agency and desires. Even if AI seems to exhibit goal-oriented behavior, it's a property of its training and not evidence of independent agency.
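
For readers who haven't seen it, Rice's Theorem is usually proved by a reduction from the halting problem. Below is a rough Python sketch of that textbook reduction for one particular property, "prints hello"; the decider, the halts wrapper, and the assumed main entry point are all hypothetical, illustrative names, since the whole point is that such a decider cannot exist.

```python
# Sketch of the reduction behind Rice's Theorem. Suppose we had a decider
# for one nontrivial behavioral property, say "prints hello." We could then
# decide the halting problem, which is impossible, so no such decider
# exists. Only the shape of the reduction matters here.

def decides_prints_hello(source: str) -> bool:
    """Hypothetical: True iff the program `source` eventually prints 'hello'."""
    raise NotImplementedError("Rice's Theorem: no such total decider exists")

def halts(program_source: str, program_input: str) -> bool:
    """Would decide halting if decides_prints_hello existed."""
    # Build a program that runs the given program on the given input and
    # only afterwards prints 'hello' (ignoring the detail of suppressing the
    # embedded program's own output). The new program prints 'hello'
    # exactly when the original program halts on that input.
    combined = (
        program_source + "\n"
        + f"main({program_input!r})\n"   # assumes the embedded program exposes main()
        + "print('hello')\n"
    )
    return decides_prints_hello(combined)
```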

I worry less about AI developing its own hostile agency than about how humans will wield it, whether through malicious misuse or misplaced trust. These are serious risks, but they're the kinds of risks we can work to mitigate while continuing to develop transformative technology. The "everyone dies" framing isn't just fatalistic; it's premised on treating computational systems as agents, which substitutes metaphor for mechanism.

11 comments:

  1. This whole discussion is so old.
    Like The Matrix, Terminator 2, 2001: A Space Odyssey, half the classic stories by Philip K. Dick, and so on.
    Lang's Metropolis, R.U.R., all the Golem stories.
    Just because the book has a nonfiction tag, everyone reheats decades-old discussions.

    Replies
    1. "It's a waste of time talking about X happening in the real world, because in the past people have written stories about X." That's why Gilgamesh and Noah's flood mean we don't need to think about rising sea levels due to climate change, Jules Verne's "The World Set Free" meant that we never had to worry about nuclear weapons, and everyone looking for evidence of life elsewhere in the universe is wasting their time because there's been lots of science fiction featuring extraterrestrial life.

      People write fiction (and speculation and myth) about things because there's something at least superficially plausible about them. That doesn't make similar things _less_ likely to happen in reality.

      And it's not as if people are only speculating about highly capable AI because they've read too much science fiction; have you seen what today's AI systems can do compared with those of ten years ago?

      It might turn out that there are big fundamental obstacles between where we are now and the sort of existentially-threatening AI that Yudkowsky and Soares worry about. But you can't just _assume_ that there are because people in the past have written fiction about AI. You need to understand how the AI systems work (and no, stochastically parroting the words "stochastic parrot" does not constitute understanding how the AI systems work) and how they're trained and what avenues of future research there are.

      And it turns out that most of the people who do understand those things are saying: yeah, there's a distinctly nonzero chance that these things will kill us all. Some of them may just be talking up their own investments -- if Sam Altman warns about how we might all be killed by much-smarter-than-us AI systems, he's probably doing it mostly because he thinks that'll encourage people to invest in OpenAI -- but e.g. Geoff Hinton doesn't seem to have much incentive to do that.

      Again, those people are just guessing, and they might be wrong, but on the face of it their guesses are more educated than most people's, and very few of them are saying "nah, nothing to worry about here".

  2. What makes you convinced that humans have this special independent agency?

    Low-level instruction following could be modelled as agency at higher levels of abstraction. And I don't think the distinction between being able to model it in some way and it actually behaving that way is that clear.

  3. Gee, I wish I had read your post before buying the book. More seriously, the writers are very knowledgeable about the field, so I am surprised they make such a fundamental error.

    I'm more worried about AI and the economy: massive unemployment. Yes, people will change jobs, but the transition will be brutal.

    Replies
    1. I'm skeptical that LLMs will lead to large changes in employment. I have to keep skipping over the LLM output when using Google or Bing to search.

  4. This post seems kind of odd to me. Do you think that human brains are doing something fundamentally different from computation? If so, what's the mechanism for that? If not, doesn't that show that computation can give rise to agents which, at the very least, have the strong appearance of having desires and goals?

    The first comment also confuses me a bit. Isn't it natural that, now that we are on the verge of having highly capable artificial intelligence, old debates about AI (and related topics) become more relevant and are discussed by more people and with more energy? Just because people have talked about a topic in the past does not mean it is pointless or irrelevant in the present.

    Replies
    1. +1. Lance, I’d love to know your thoughts on how AI sycophancy and other “agent-like” behaviors push us toward LLMs that are better modelled by the intentional stance (in the sense of Dennett), compared to the design stance we use for most computer programs or the physical stance, which Rice’s theorem sort of makes infeasible.

    2. I'm not worried about supercomputers killing the human race (at least for a while). But computational systems can certainly be agents: we have lots of examples of such organic computational agents.

  5. I'm getting a few comments and social media replies along the lines of "aren't human brains just Turing machines themselves?" That's a tricky question that I'll tackle in next week's post.

  6. I don't think anything in your argument is inconsistent with the following scenario. We train an AI system to have some overarching goal, such as its own survival, and in the service of that goal it develops all sorts of subgoals. Even if the initial goal is in there as a direct result of training, we could have far less connection with the subgoals -- indeed, we might have very little idea what many of them are, and they might turn out to be very unaligned with our interests.

    In such a situation, it would be reasonable to talk about the system having desires and agency (even if one could argue about whether those are consciously felt).

    I would also repeat the argument that others have made. If I were to replace "AI" by "a human" in your sentence, "Even if AI seems to exhibit goal-oriented behavior, it's a property of its training and not evidence of independent agency," how would you argue against that?
