Just as I was complaining that we haven't seen many surprising breakthroughs in complexity recently, we get an earthquake of a result to start the year, showing that all algorithms can be simulated using considerably less memory than the time of the original algorithm. You can reuse space (memory) but you can't reuse time, and this new result from Ryan Williams in an upcoming STOC paper provides the first stark difference.
DTIME(\(t(n)\)) \(\subseteq\) DSPACE(\(\sqrt{t(n)\log t(n)}\))
This is a vast improvement on the previous best known simulation, the classic 1977 Hopcroft-Paul-Valiant paper showing
DTIME(\(t(n)\)) \(\subseteq\) DSPACE(\(t(n)/\log t(n)\))
only slightly lower than the trivial \(t(n)\) bound. Williams gets a huge, nearly quadratic improvement that will go down as a true classic of complexity theory. Note that the space simulation does not maintain the time bound.
Williams' proof relies on a space-efficient tree evaluation algorithm by James Cook and Ian Mertz from last year's STOC conference. Cook and Mertz's algorithm builds on earlier work on catalytic computing, highlighted in a recent Quanta article.
Let me give a highly simplified view of the combined proof.
A \(t(n)\)-time Turing machine uses at most that much space on its tapes. Split the tapes into \(\sqrt{t(n)}\) segments of size \(\sqrt{t(n)}\). Using the fact that it takes \(\sqrt{t(n)}\) steps to cross an entire segment, Williams, with some clever tricks, models acceptance of the Turing machine as a circuit of bounded degree and depth \(\sqrt{t(n)}\), where the wires carry the contents of the size-\(\sqrt{t(n)}\) segments at various times in the computation.
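To get a feel for the crossing argument, here is a toy sketch of my own (not Williams' construction): since a one-tape head moves one cell per step, any time block of \(b\) steps can touch at most two adjacent tape segments of size \(b\), which is what keeps the dependencies local and the degree bounded. The simulate helper and the little sweep machine below are invented purely for illustration.

```python
# Toy illustration (not Williams' construction): because a one-tape head moves
# one cell per step, a time block of b steps can touch at most two adjacent
# tape segments of size b.

def simulate(transitions, input_str, t_max):
    """Run a one-tape Turing machine for up to t_max steps; return head positions."""
    tape = dict(enumerate(input_str))
    state, head, positions = 'start', 0, []
    for _ in range(t_max):
        positions.append(head)
        symbol = tape.get(head, '_')
        if (state, symbol) not in transitions:
            break                      # no transition: machine halts
        state, write, move = transitions[(state, symbol)]
        tape[head] = write
        head += {'L': -1, 'R': +1, 'S': 0}[move]
    return positions

# A tiny machine that just sweeps right, overwriting 0s with 1s.
transitions = {('start', '0'): ('start', '1', 'R'),
               ('start', '1'): ('start', '1', 'R')}

t = 64
b = int(t ** 0.5)                      # segment and time-block size ~ sqrt(t)
positions = simulate(transitions, '0' * t, t)

for i in range(0, len(positions), b):
    touched = {p // b for p in positions[i:i + b]}   # segments visited in this block
    assert len(touched) <= 2 and max(touched) - min(touched) <= 1
print(f"every {b}-step time block touched at most two adjacent size-{b} segments")
```

This bounded, local dependence is what lets each wire value in the circuit be computed from only a constant number of earlier segment contents.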
Williams then applies the tree evaluation algorithm of Cook and Mertz. Cook and Mertz use finite fields to encode these segments as a combination of registers of size \(\log t(n)\), and they show how to compute the value of each node of the tree using only \(\sqrt{t(n)}\) space for the local computation, while remembering just a constant number of registers and reusing the rest of the space when recursively evaluating the tree. It's pretty magical how they manage to make it all work.
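Here is a back-of-the-envelope accounting (my own, suppressing constants and the exact parameterization in the papers) of where the bound comes from when the tape and time are cut into blocks of size \(b\):

\[
\text{space} \approx \underbrace{O(b)}_{\text{local work on one segment}} \;+\; \underbrace{O\!\left(\frac{t(n)}{b}\,\log t(n)\right)}_{\text{registers kept along the recursion}}
\]

Choosing \(b \approx \sqrt{t(n)\log t(n)}\) balances the two terms, and both become \(O(\sqrt{t(n)\log t(n)})\), matching the theorem.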
It's worth going through the proof yourself. I recommend Section 3.1 and Footnote 6 in Williams' paper (a slightly weaker space bound but much simpler) and Sections 2-4 of the Cook-Mertz paper. Oded Goldreich has an alternative exposition of the Cook-Mertz algorithm and proof.
Williams' theorem works for multitape Turing machines and oblivious random-access machines, where the queries to the memory are fixed in advance. He shows how to use this result to compute the output of a circuit of size \(s\) using nearly \(\sqrt{s}\) space. Fully general random-access machines remain open, as do nondeterministic and other models of computation (randomized, quantum, etc.).
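The arithmetic, roughly and glossing over details: evaluating a size-\(s\) circuit gate by gate takes time about \(s\) (up to polylogarithmic factors) on such an oblivious model, so plugging \(t \approx s\) into the theorem gives space about \(\sqrt{s\log s}\), i.e., nearly \(\sqrt{s}\).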
In 1986 my advisor Mike Sipser gave the first hardness vs randomness result, showing roughly that if there were problems that took time \(2^n\) but could not be solved in space \(2^{.99n}\) on multitape Turing machines then RP = P. Williams' theorem kills this assumption, though we've developed weaker assumptions since.
Moving forward, can we push Williams' result to get a simulation in space \(t(n)^\epsilon\) for \(\epsilon<1/2\)? A simulation for all \(\epsilon>0\) would separate P from PSPACE. Even a slight improvement would have applications for alternating time. Maybe one could use the Cook-Mertz techniques directly in the Turing machine simulation instead of going through computation trees.
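To spell out the separation, via a standard hierarchy argument (not from the papers): by the space hierarchy theorem there is a language \(L\) in DSPACE(\(n\)) that is not in DSPACE(\(n^{1/2}\)). If P = PSPACE then \(L\) is in DTIME(\(n^k\)) for some constant \(k\), and a simulation of time \(t\) in space \(t^\epsilon\) with \(\epsilon < 1/(2k)\) would put \(L\) in DSPACE(\(n^{1/2}\)), a contradiction. So having the simulation for every \(\epsilon>0\) forces P \(\neq\) PSPACE.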
Read Sections 4 and 5 of Williams' paper for further consequences and the challenges to improving these bounds.
Sorry but is there a link to Williams' paper in this post? Am I missing something?
The links to the papers are at the top of the post but I'll put them here: Williams and Cook-Mertz.
Works now. Many thanks.