A guest post by Eric Allender prompted by an (incorrect) P ≠ NP proof recently published in Springer Nature's Frontiers of Computer Science.
For a time, I served as Editor-in-Chief of ACM Transactions on Computation Theory, and in this role I had to deal regularly with submissions that claimed to resolve the P vs NP problem. Finding referees for these papers was sometimes challenging, so I frequently ended up reviewing them myself. Dealing with such submissions involves enough overhead that ToCT, J.ACM and ACM Transactions on Algorithms limit the frequency with which authors can submit work of this sort. But for every submission, on any topic, the overall process was the same: Find someone on the Editorial Board who has sufficient expertise to find referees and make knowledgeable use of their recommendations. If there was no such person on the Editorial Board, then it was always the case that the submission was out of scope for the journal.
These thoughts are brought to mind by a recent case, where it seems to me that the editorial process broke down.
Springer publishes several high-quality academic journals that deal with Theoretical Computer Science. One Springer journal, Frontiers of Computer Science, recently published an article entitled SAT Requires Exhaustive Search, where one of the authors is a Deputy Editor-in-Chief of the journal. The abstract of the article states that it proves a result that is stronger than P not equal to NP. The Editorial Board of the journal has some members who are expert in computational complexity theory. However, all the ones whom I know personally have asserted that they had no knowledge of this paper, and that they were not involved at all in handling the paper.
When Ryan Williams and I learned about the publication of this article, we drafted a comment, which we sent to the Editor-in-Chief. We recommended that the paper be retracted, in which case there would be no need to publish our comment. However, the Editor-in-Chief declined to retract the article, saying that he could find no evidence of misconduct, and thus we have been assured that an edited version of our comment will appear in the journal.
Our comment calls attention to some shortcomings in the proof of the main theorem (similar to shortcomings in several other failed attempts to separate P from NP). But we were able to say more. Often, when one is looking for bugs in a purported proof, one has to deal with the situation where the claimed theorem is probably true, and the only problem is that the proof is not convincing. However, the main theorem in their paper (Theorem 3.2) states that a particular constraint satisfaction problem requires time more than \(d^{cn}\) for any constant \(c<1\) (where \(d\) is the domain size, and \(n\) is the number of variables). In particular, their purported proof claims that this holds even when \(k=2\) (meaning that each constraint has at most 2 variables). However, Ryan Williams presented an algorithm more than two decades ago that runs in time \(O(d^{(0.8)n})\) in this special case, contradicting the lower bound claimed in the article.
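To spell out the contradiction: the claimed lower bound says that every algorithm needs time exceeding \(d^{cn}\) for every constant \(c<1\), so in particular for \(c=0.9\); but
\[
O\!\left(d^{0.8n}\right) \;<\; d^{0.9n} \quad \text{for all sufficiently large } n,
\]
so an \(O(d^{0.8n})\)-time algorithm for the \(k=2\) case already falsifies Theorem 3.2 as stated.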
The article contains an appendix, with glowing testimonies from various researchers; the lead-in to the appendix also contains enthusiastic comments from Gregory Chaitin. I contacted Gregory Chaitin, and he asserts that he did not read the paper, and that he was quoted out of context.
The edited version of the comment that Ryan Williams and I wrote, which will supposedly appear in Frontiers of Computer Science soon, differs from the version linked here (our original submission) primarily in one respect: Our closing paragraph was removed. Here is that paragraph:
Finally, it is our opinion that the publication of this article is a complete embarrassment to this journal and its publisher. We believe that, at the very least, the paper should be withdrawn, and Springer should conduct an investigation to understand how such a paper could have made it through the peer review process.
Update (Lance, 8/5/25): The authors of the paper in question have asked me to link to their reply to the Allender-Williams comment. I do so with no endorsement.
Eric Allender asked me to note that their claim about Ryan’s algorithm requiring exponential space is misleading; the amount of space is not more than the runtime of the algorithm, which is less than the lower bound that they claim. (Their theorem does not state that \(d^{cn}\) time is required under the additional assumption that the algorithm use only \(n^{O(1)}\) space.)
See also https://arxiv.org/abs/2312.02071 for another rebuttal of an earlier version of the same paper.
Also see: https://arxiv.org/pdf/2401.01193 and https://arxiv.org/pdf/2309.05689, which are even more hilarious.
This seems to be a popular topic: https://eprint.iacr.org/2025/445. But of course they don't perform peer review; it's just a preprint server.
This is either hilarious or sad. Or both.
Both.
With so many attempts, something will slip through the cracks. Good job pointing this out!
If a paper does not brag that it solved P NE NP and that result appears in the middle of the paper, YES I can imagine that slipping through. But when the claim that P NE NP is in the abstract, this is not a `just slipped through' scenario.
Don't let President Trump know. Fake Science + China!
The comment “authors can submit work of this sort” seems very derogatory. “Work of this sort”? What the hell is this about here? Have the courage to be fully descriptive, but not so obnoxious, please.
"Work of this sort" is referring to "submissions that claimed to resolve the P vs NP problem" from earlier in the paragraph.
This was a reference to the P/NP policy of these journals, which is described at length here: https://dl.acm.org/journal/toct/pnp-policy
To save you from having to follow the link, here is what it says: "No author may submit more than one paper to J. ACM, ACM Trans. on Algorithms, or ACM Trans. on Computation Theory in any 24-month period, purporting to resolve the P versus NP question or related long-standing questions in complexity theory, except by invitation of the Editor-in-Chief. This applies to resubmissions of previously rejected manuscripts."
I am sorry that you found my language to be obnoxious.
Another thing: an editor-in-chief of the journal should be aware of how such a paper made it into the journal. This journal is not as wide-ranging as TCS… It is also likely that the co-author who is an editor made things happen where other papers would have gotten stuck.
When I hear stuff like “I contacted Gregory Chaitin, and he asserts that he did not read the paper, and that he was quoted out of context,” I cringe… What does this even mean? Did he say it or not?
Is Greg still active in research or not? If not, I can understand. If you don’t read a paper, how do you have an opinion about it? How can you say something favorable other than… nice format and nice typeface! So something feels eerie here.
The paper says 'As of March 2025, more than 30 experts have provided highly positive feedback, including remarks like “truly revolutionary” from Prof. Gregory Chaitin'. I can believe Chaitin said something like "This result would be truly revolutionary if it's correct, so check the proof extremely carefully" and they quote-mined him.
The article says "As of March 2025, more than 30 experts have provided highly positive feedback, including remarks like “truly revolutionary” from Prof. Gregory Chaitin." It's possible that Greg said something like, "if true, the result is truly revolutionary."
Technically, you did not point out the error, but merely that Ryan's result and the paper are inconsistent and therefore at least one of them is wrong. :)
No, our full comment points to a serious (and obvious) mistake in the proof of the main result. The authors flatly assume that any algorithm for SAT must proceed by reducing an instance of size n to instances of size n-1 in a particular way.
The fact that the statement of the main result was also directly refuted 20 years ago just makes the whole thing extra silly.
Nothing "eerie" here - the paper in question is what blatant academic fraud looks like.
"Eerie" is an eerie word.
We should open a journal for proofs of P=NP and P ≠ NP, and have all such publications go in there. Problem solved?
World Scientific in 2016 even published a whole book implying P=NP, via a purported polynomial-size LP for TSP (something that was already disproved in our STOC'12 paper).
https://www.worldscientific.com/worldscibooks/10.1142/9725#t=aboutBook
To be more precise & avoid confusion: they gave a purported polynomial-size extended formulation for the TSP polytope (which would imply a polynomial-size LP).
I took a look at the book wondering if the P=NP claim was buried in the middle (so the editor may have missed it) or overt (so no excuse to miss it). The TITLE of the book is fine: Advances in Combinatorial Optimization: LP formulations of the TSP and other hard opt problems (SO I THOUGHT: they will either formulate approximate versions, or show how SOME TSP problems can be formulated as an LP, or stuff like that).
Money quote from the abstract on the website: "This work also represents a proof of the equality of the complexity class 'P' (polynomial time) and 'NP' (nondeterministic polynomial time), and makes a contribution to the theory and application of 'extended formulations' (EFs)."
Nothing subtle about that. I wonder (a) why this was not caught early in the process, and (b) why this book from April 2016 was not noticed earlier.
Next step: either this book or the recent paper tries to claim the $1,000,000 prize.
April 1 book?
Sadly no, the book is an actual book, not an April Fools' Day joke. For one thing, if it were an April Fools' Day joke it would be a lot funnier.
@gasarch, wait, I am confused now. The authors of the paper, I thought, are not claiming that they've resolved P vs NP but instead resolved a question they thought was more relevant. Hence my belief is they are not after the $1M prize money. Correct me if I am wrong.
In partial answer to Anonymous_5:44: The paper states unambiguously that they are proving something stronger than P not equal to NP. Quoting from the abstract: "we prove, by the diagonalization method, that constructed self-referential CSPs cannot be solved by nonbrute-force computation, which is stronger than P ≠ NP".
My comments are based on the paper as it was published in the journal.
Sweet Jesus. Then I misunderstood their response, or in some way they failed to explain themselves in a way that a non-native-English-speaking student can understand what they are actually after. If they had just framed it differently, like "we actually don't care about P vs NP but think what we are showing is interesting" (I am not saying more interesting than proving P vs NP, just interesting)... I would have let them off the hook. But now that Eric Allender has put it into perspective, I cannot find a reason to be supportive in any way. @EricAllender 8:09am, thanks for clarifying.
Eric or Ryan: I did a blog post a while back about how `proofs' of P=NP or P NE NP have never (literally never) had an interesting idea in them. So NO "They don't prove P=NP but they have an idea for a better SAT solver" or "They don't prove P NE NP but the ideas here could lead to SAT not in n^2".
SO: is there ANYTHING of interest in the current paper?
There are sections of the paper that discuss the probability with which a random instance of CSP (in their model) is satisfiable. I didn't go through this part of their paper carefully, but the reference that David Eppstein provided in an earlier comment (https://arxiv.org/abs/2312.02071) indicates that people have gone through that argument carefully and found no real problems there. This is conceivably of interest, but I don't think that it is very helpful for understanding why P is not equal to NP (assuming that this is the case). The paper also contains philosophical discussions (with mentions of Wittgenstein and others), which would have been more interesting if their argument had been correct.
The Moser-Tardos Algorithm does not appear to be based on divide and conquer. As such, Xu and Zhou's assertion in their article that 'As we know, at least until now, all exact algorithms for CSPs are based on divide and conquer style, such as backtracking method' is inaccurate.
The Moser-Tardos Algorithm is not an exact algorithm that is guaranteed to either give a solution or say there is no solution.
IIUC, the Moser-Tardos Algorithm is not an exact algorithm which can either find a solution or prove there is no solution.
First of all, thanks to Eric and Ryan for pointing this out.
As a Ph.D. student in a peripherally related field who happened to see a post about this on X and came here, I am surprised that Eric/Ryan even bothered to draft this much of a comment in response to this paper in the first place. This paper seems to come out of a Chinese university that writes papers for the sake of publication alone, without any regard for quality. This reminds me of a similar instance a few years ago where one of the professors purported to have discovered sequences that both converge and diverge (or something of this ridiculous sort), with the exception that this time the paper was even accepted for publication!
Alas, it is highly possible that there are many such instances (e.g., at ICML/NeurIPS) whose errors are undetected...
Maybe the paper was refereed by ChatGPT?
You given money.
Or someone equally dumb.
The authors’ country of origin, unfortunately, is highly predictable. Rampant academic misconduct. Even if it’s a small percentage, the population is so large that it’s too much volume of misconduct (incorrect results, plagiarism, etc.).
Re: the reply from the authors of the paper. What do people think about it? It's hard to understand what is being conveyed, partly because of the language barrier and partly because of the way it has been put down on paper. One good thing: the authors are real people. They do exist, they have a voice, and they have responded to the comment.
They have voiced that they have been misunderstood. (OK, fine, this is not relevant.)
They have addressed the point that Williams and Allender raised -- "we do not claim that such an algorithm is likely to succeed anyhow..." So back to Williams and Allender:
So if you two claim that there might be other ways, but are somewhat confident that they might not work anyhow... I am left in a bubble about what to think. You guys don't point out anything concrete but are on equal footing with the authors of the paper.
What they did not do... did they or did they not address the concrete example by Williams?
Can someone point out if they addressed this point?
"(and obvious) mistake in the proof of the main result. The authors flatly assume that any algorithm for SAT must proceed by reducing an instance of size n to instances of size n-1 in a particular way.
The fact that the statement of the main result was also directly refuted 20 years ago just makes the whole thing extra silly."
How was it directly refuted 20 years ago and what are we actually talking about?
Finally, I find it amusing that they keep quoting Chaitin. How does Chaitin feel about it?
So the authors succeeded in what they set out to do: engage. Whether the work is frivolous or not, they got their engagement. Congratulations, you two.
Regarding the paragraph after "so back to Williams and Allender:"
The point that Ryan and I were attempting to make in our comment is the following (which I hope is obvious):
The statement of Theorem 3.2 claims that any algorithm that solves (their formulation of) the Constraint Satisfaction Problem requires time essentially d^n. This means that they must rule out the existence of ANY algorithm that solves CSP and has a smaller run time. It does NOT mean that they only need to rule out algorithms that use only a small amount of space. (If they could prove that, it would of course also be a fantastic result. But that's not what they claim, and it's not what they do.) It also does NOT mean that they only need to rule out algorithms that look just like the algorithms that we already know about. It's the algorithms that we DON'T know about that really need to be excluded. In our comment, Ryan and I pointed out that the argument that purports to prove Theorem 3.2 focuses only on algorithms that operate in a particular way, and we gave an example of an entire class of possible algorithms that are not addressed by their argument. (Namely, take the CSP instance, construct a graph from it, and look to see if it contains a particular kind of subgraph.)
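To make the shape of such an algorithm concrete, here is a minimal sketch (with a hypothetical explicit representation of the constraints) of the split-the-variables/look-for-a-triangle approach. The naive triangle search below still takes roughly \(d^n\) time; the \(O(d^{0.8n})\)-type bound in [Williams] comes from carrying out the same triangle search with fast matrix multiplication.

```python
from itertools import product

def csp_2_has_solution(n, d, constraints):
    """Decide a 2-CSP via the 'build a graph, look for a subgraph' route.

    `constraints` is a hypothetical representation: a dict mapping a pair
    (i, j) of variable indices with i < j to the set of allowed value pairs.
    """
    # Split the variables into three (roughly equal) groups.
    cut1, cut2 = n // 3, 2 * n // 3
    groups = [range(0, cut1), range(cut1, cut2), range(cut2, n)]

    def partial_assignments(group):
        # All d^{|group|} assignments to the variables of one group.
        return [dict(zip(group, vals))
                for vals in product(range(d), repeat=len(group))]

    def compatible(a, b):
        # Two partial assignments are compatible if no constraint whose
        # variables they jointly cover is violated.
        merged = {**a, **b}
        return all((merged[i], merged[j]) in allowed
                   for (i, j), allowed in constraints.items()
                   if i in merged and j in merged)

    parts = [partial_assignments(g) for g in groups]
    # A satisfying assignment corresponds to a "triangle": one vertex from
    # each group, pairwise compatible.
    return any(compatible(a, b) and compatible(a, c) and compatible(b, c)
               for a in parts[0] for b in parts[1] for c in parts[2])
```

Replacing the final triple loop with matrix-multiplication-based triangle detection on the three sets of roughly \(d^{n/3}\) partial assignments is what brings the running time down below \(d^{0.8n}\); nothing in the purported proof of Theorem 3.2 addresses algorithms of this shape.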
In order to show that an argument is insufficient, it is not necessary to present an algorithm that beats the claimed lower bound. It doesn't matter that we don't think that this is a particularly promising class of algorithms: The point is that there is (still) no proof that an algorithm of this sort will not work.
But in fact we WERE able to point to an algorithm that beats their lower bound! The authors explicitly claim that their lower bound holds even in the case where k=2, and Ryan Williams published an algorithm 20 years ago whose runtime beats their claimed lower bound. So how can there be any additional discussion about whether their proof is correct?
Regarding "Can someone point out if they addressed this point?" They attempted to address this point, by pointing out that Ryan's algorithm uses quite a bit of space, and saying "Researchers do not include this type of algorithm in their discussions of exact algorithms for CSPs". Since Ryan's paper on this topic has more than 500 citations, this strikes me as a very dubious assertion.
The argument that "other algorithms might exist" feels analogous to claiming "aliens could exist in the universe".
In computational complexity, space efficiency is fundamentally more valuable than time efficiency—this is a well-established principle. Allender’s attempt to dismiss space-complexity concerns by citing paper references strikes me as equally unconvincing.
After reviewing all the comments, I'm left with the impression that certain participants seem resistant to the idea of P vs NP being resolved at all.
I in no way intended to dismiss space complexity concerns. You seem to have mis-read my statement. Instead, I was intending to dismiss the assertion that researchers don't consider the algorithm in [Williams] to be a CSP algorithm. Clearly, there are quite a few researchers who do consider it to be a CSP algorithm worthy of consideration. The citation count backs me up.
Regarding 'The argument that "other algorithms might exist" feels analogous to claiming "aliens could exist in the universe".':
Consider the linear programming problem. In the 1970's, it was not known whether there was a polynomial-time algorithm to solve linear programming. Even at that time, there had been decades of experience by a large community of practitioners and researchers, solving linear programming using variants of the simplex algorithm. One could imagine someone trying to show that there was NO polynomial-time algorithm, by observing that all known variants of the simplex algorithm had superpolynomial running time, and (with some justification, considering the community at that time) claiming that researchers did not consider non-simplex algorithms to be legitimate approaches to solving linear programming. But of course Khachiyan's breakthrough approach opened the eyes of the community to new approaches to linear programming. That is: an "other algorithm" really did exist.
It is an extremely important question, whether or not an "other" approach to CSP can exist. But it is not particularly interesting to simply show that "no CSP algorithm that looks exactly like the algorithms that we're currently using can run much faster".
Hi @anon8:31PM: not sure this is a fair comparison. I think time efficiency is as core as space. So, while you might be pro-paper, I feel you are missing out. Fair point goes to both, though. If it took exponential space to solve, then fine, we might make a point like this... At the same time, would there be novel ways to reduce the space required (for Williams' k=2 case)? If so, I think any criticism from the paper is not warranted. I will have to side with Williams and Allender on this one.
I fully acknowledge the significance of Williams' algorithmic contribution. However, my primary concern lies in the fundamental dimensional incomparability between the two approaches. Xu’s framework—whether employing brute-force or non-brute-force strategies—operates strictly within polynomial space, whereas Williams’ non-brute-force algorithm inherently requires superpolynomial space. This divergence in space complexity places the two methods in fundamentally different computational regimes, rendering direct theoretical comparisons between them problematic.
Anonymous_12:57: Are you saying that the proofs in Xu's paper are correct and that it proves P ne NP, or are you saying there are some useful ideas in the paper even though it does not prove P ne NP?
Anonymous_12:57: at no point in the paper is it mentioned that this theorem only holds in polynomial space. The theorem as stated contradicts Williams' earlier result. This alone should be enough reason for a retraction or erratum. And that's not even mentioning the silly P ≠ NP claim.
In my opinion, the brute-force method for CSP is widely recognized to have polynomial space complexity. The authors are discussing whether there exists a superior non-brute-force alternative capable of replacing it—one that, crucially, must also exhibit polynomial space complexity to ensure comparability. Since polynomial space complexity is already an inherent property of brute-force methods, explicitly stating it here would be redundant.
I'm deeply intrigued by the controversial aspects of this paper—we may very well be witnessing a pivotal moment in academic history.
Hi @Eric Allender 9:21 PM: Computational problems can be categorized into two main types: combinatorial/unstructured problems and algebraic/structured problems [1]. The Constraint Satisfaction Problem (CSP) falls into the former category, whereas the Linear Programming Problem (LPP) belongs to the latter. These two types of problems exhibit significant differences in terms of computational complexity and algorithm design.
[1] Barak, Boaz. "Structure vs combinatorics in computational complexity." Bulletin of EATCS 1.112 (2014).
I have read the paper, "SAT requires exhaustive search." (I also strongly believe anyone who wants to make meaningful comments and arguments here should also read the paper at least once.)
The authors argue that the most fundamental problem in computer science is the distinction between non-brute-force computation and brute-force computation, rather than P versus NP. In my view, this is the paper’s most significant point because it serves as the core motivation for their work and pertains to the strategic direction of computational complexity research.
The second most important point is the construction of self-referential instances, which illustrates what hard instances look like and why they are difficult to solve.
The least important point is the proof of the main result, as it directly employs the standard diagonalization method.
Regarding "The authors argue that the most fundamental problem in computer science is the distinction between non-brute-force computation and brute-force computation, rather than P versus NP":
I agree completely that the question of whether brute-force computation is required is fundamental.
This question has, in fact, received a GREAT DEAL of attention in the field of computational complexity theory. There is a significant literature regarding the “perebor conjecture” (going all the way back to the seminal work of Leonid Levin, and before), dealing precisely with the question of whether exhaustive search is required. (Interestingly, there is some surprising work showing that some formulations of the perebor conjecture are false. See [Mazor & Pass, ITCS 2024] and [Hirahara, Ilango, Williams; STOC 2024].) Work on the "Strong Exponential Time Hypothesis" also falls under the topic of whether brute-force computation is required. This is certainly not a new question.
The parts of the paper that deal with the construction of self-referential instances would be much more interesting if they actually contributed to a correct proof of their Theorem 3.2. Unfortunately (as I've tried to explain elsewhere in this series of comments) no convincing proof of their Theorem 3.2 is in hand.
I have read this paper, and my minor comment is that the "proof" is not, as claimed by the authors, a standard diagonalization. For the standard structure of a diagonalization argument, refer to the paper "Diagonal Arguments and Cartesian Closed Categories." Most of the structures present in the proofs of Russell, Gödel, Cantor, and Turing are not constructed in this paper.
Yes, the issue of exhaustive search has been discussed before. However, by linking this problem to Gödel's work, the authors of the paper "SAT Requires Exhaustive Search" have endowed it with a brand-new significance. Furthermore, no one has explicitly pointed out that this problem is actually more fundamental than the better-known P vs. NP problem.
A key aspect of Gödel's work lies in the construction of self-referential propositions. To demonstrate that such propositions are unprovable within finite formal systems, Gödel put forward assumptions regarding the nature of proof: a proof is a finite sequence of symbols, and the correctness of a proof can be verified mechanically. On this basis, Gödel's incompleteness theorem was proven using the diagonalization method. Similarly, in the paper "SAT Requires Exhaustive Search", the authors constructed self-referential CSP instances and proposed assumptions about how CSP instances are determined to be satisfiable or unsatisfiable. They then employed the standard diagonalization method to prove their main result. Thus, the problem that requires discussion is not whether their simple and straightforward proof is correct, but whether we can replace the current assumption with a better one. In other words, there ought to be a bridge connecting self-referential instances to the diagonalization method. The question then becomes: can we construct a bridge that is better than the current one?
It seems that the paper's framework is based on Gödel's framework. If this framework is valid, then the paper should be valid. Many scholars have utilized Turing's model and Shannon's circuit model to prove lower bounds. My question is why Gödel's framework has never been considered until now.
Anonymous_5:31: A "framework" is not a proof.
From my perspective, the primary challenge in applying Gödel's method lies in the construction of self-referential instances. Some scholars have put forward analogous ideas, although they have not explicitly mentioned Gödel's method itself. For instance, Mulmuley [1] noted that to resolve the self-referential paradox in the P vs. NP question, a method should exist that can efficiently find counterexamples for any algorithm claiming to solve the hard problem.
[1] K. Mulmuley. Explicit proofs and the flip. arXiv preprint arXiv:1009.0246, 2010.
I don't know what to say. Are the authors saying "we don't claim to have resolved P vs NP, but we're happy we resolved what we think matters most"? So these folks are very practical computer science students. That is OK, then; they should go publish under that thesis.
Assuming we're addressing two distinct problems:
1. Is there a better algorithm than brute-force search to solve all Constraint Satisfaction Problems (CSPs)?
2. Is P equal to NP (P vs. NP)?
I definitely feel that problem (1) is more intuitive than problem (2). A potential solution to (1) would be to construct specific CSP instances that cannot be solved more efficiently than with a brute-force search. If such instances exist, the answer to problem (1) would be "no."
If the answer to problem (1) is no, this directly implies that P != NP. The existence of a CSP that requires brute-force search (which is exponential in time) means there's a problem in NP that can't be solved in polynomial time.
However, I also feel that problem (1) might be harder to formalize precisely than problem (2). One challenge, as other comments have pointed out, is the assumption that algorithms for CSP must be based on a "divide-and-conquer" approach. I assume this is the status quo for many existing exact algorithms for CSP.
Isn't the whole point that #1 is an open problem which this paper incorrectly claims to close for SAT, thereby closing #2 as well?
Consider how, for each of these three, it could "look" like exhaustive search is required:
(a) determining whether a planar graph is 3-colorable is NPC
(b) determining whether a planar graph is 4-colorable is "yes, it is"
(c) determining whether a planar graph is 2-colorable is trivial
A proof of the #1 you state is not given in the paper even though the title makes the claim about SAT, with the specifics of this pointed out several times here already.
Frontiers of Computer Science should be about science, not sophistry.
Nature should have learned from earlier mistakes they have made.
I noticed that many comments focus on space complexity, so I compared Williams's and Xu's approaches. Williams's algorithm achieves a time complexity of \(O(d^{0.8n})\) by partitioning the n variables into k parts, each requiring \(d^{n/k}\) assignments. However, once edge counting is taken into account, the space complexity becomes \(O(d^{2n/k})\). In the RB model, where \(d = \mathrm{Poly}(n)\), this leads to an impractically large space requirement.
In fact, Williams explicitly raises the question in his conclusion of whether faster algorithms exist for 2-CSP optimization using only polynomial space. Given this, I find the claim that Xu's work contradicts Williams's to be unconvincing.
The discussion of space complexity arose only because of Theorem 3.2 (Model RB cannot be solved in O(d^{cn}) time for any constant 0 < c < 1). Perhaps the intent of the authors was to condition the validity of Theorem 3.2 on their central assumption (that the task is finished by dividing the original problem into subproblems), but they stated it without such a condition. As a result, the Williams algorithm provides a counterexample to Theorem 3.2. Presumably, the central assumption is so strong that it would have (implicitly) excluded all algorithms of this form.
It is well-known that a Turing machine operating in t steps cannot visit more than O(t) cells. Thus you appear to be claiming that the algorithm in [Williams] cannot be implemented in the Turing machine model within the given time bound. However, I believe that the analysis in [Williams] holds in the Turing machine model, which means that I reject your assertion about the space usage.
Nobody is denying that it is an interesting question, whether CSP (even 2-CSP) can be solved more efficiently than exhaustive search using only polynomial space. However, this question is still open; the paper in question discusses only a small class of possible algorithms.
Xu's Theorem 3.2 doesn't mention space. If you want to rewrite the paper so that it is correct, we can then see what you have proved.
To the best of my knowledge, in existing studies investigating exact algorithms for SAT and CSP problems (e.g., [1-4]), researchers seldom explicitly address space complexity, although polynomial space complexity appears to be an implicit requirement in these algorithmic frameworks.
[1] Scheder, D., Steinberger, J. PPSZ for General k-SAT and CSP—Making Hertli's Analysis Simpler and 3-SAT Faster. Comput. Complex. 33, 13 (2024). https://doi.org/10.1007/s00037-024-00259-y
[2] Shibo Li & Dominik Scheder (2021). Impatient PPSZ - a faster algorithm for CSP. In 32nd International Symposium on Algorithms and Computation, ISAAC 2021. https://doi.org/10.4230/LIPIcs.ISAAC.2021.33.
[3] Dominik Scheder (2021). PPSZ is better than you think. In 62nd IEEE Annual Symposium on Foundations of Computer Science, FOCS 2021. https://doi.org/10.1109/FOCS52979.2021.00028.
[4] Dominik Scheder (2019). PPSZ for k ≥ 5: more is better. TOCT 11(4), 25:1–25:22. https://doi.org/10.1145/3349613.
Side note: Here is a paper that says that MAX-2-SAT is in P and, as a result, concludes that P=NP. The paper is on arXiv, and not published to my knowledge, so not fraud, but I think maybe Gemini and ChatGPT ingested this. Both told me that MAX-2-SAT is in P. The paper was discussed on Hacker News, etc., and thus seems to have high PageRank. https://arxiv.org/abs/2304.12517
Interesting that this one is on its 22nd revision. There are at least fifty wrong proofs of P vs NP a year, but this is also not the only wrong one about MAX-2-SAT specifically.
I am less concerned with the details of the proof itself. What truly captures my interest is the comparison between non-brute-force computation vs brute-force computation and P vs NP.
A widespread consensus exists among researchers that P ≠ NP, yet proving this conjecture remains an extraordinarily daunting challenge. One might even propose accepting it as an axiom outright. However, even if we were to adopt this assumption, it would still fall short of meeting the demands of contemporary computational complexity research. Increasingly, scholars are basing their work on stronger conjectures. A notable example is explored in the paper: Vassilevska Williams, Virginia. "Hardness of easy problems: Basing hardness on popular conjectures such as the strong exponential time hypothesis (invited talk)." 10th International Symposium on Parameterized and Exact Computation (IPEC 2015).
In light of this, the field may ultimately shift its focus from the classic P vs NP problem toward embracing stronger conjectures as foundational pillars.
The authors of the paper "SAT Requires Exhaustive Search" also note in their abstract that stronger conjectures may prove easier to tackle: "Specifically, proving lower bounds for many problems, such as 3-SAT, can be challenging because these problems have various effective strategies available to avoid exhaustive search. However, for self-referential examples that are extremely hard, exhaustive search becomes unavoidable, making its necessity easier to prove. Consequently, it renders the separation between non-brute-force computation and brute-force computation much simpler than that between P and NP."
Interestingly, I have observed that other researchers hold a similar perspective on this point. For instance, on page 29 of Scott Aaronson’s excellent survey paper "P =? NP" (included in Open Problems in Mathematics, published by Springer in 2016, with the URL: https://www.scottaaronson.com/papers/pnp.pdf), the author notes: "In my view, the central reason why proving P ≠ NP is hard is simply that, in case after case, there are amazingly clever ways to avoid brute-force search, and the diversity of those ways rivals the diversity of mathematics itself."
In mathematics, it is indeed true that sometimes a stronger conjecture is actually easier to prove. This is because stronger conjectures often reveal more universal laws and deeper structural connections, providing more systematic tools and clearer ideas for proofs. In contrast, weaker conjectures may, due to their confinement to specific cases, obscure the essential logic instead. Examples include Fermat's Last Theorem and the Taniyama-Shimura Conjecture, as well as the problem of primes in arithmetic progressions and Dirichlet's Theorem.
The discussion here unfortunately got a bit derailed by the question of polynomial space in Ryan's algorithm. However, there is a much simpler polynomial-space algorithm that works for every fixed k. Consider a nontrivial constraint on k variables; by nontrivial, we mean that not all the d^k possible assignments are satisfying. So we can assume that there are at most d^k-1 possible assignments on these k variables that satisfy the constraint. We branch into at most d^k-1 directions by fixing the value of these k variables, thereby reducing the number of variables by k. This means that the depth of the search tree is at most n/k, resulting in a search tree whose number of leaves is at most (d^k-1)^{n/k}=(d^{k-1/k})^n =d^{cn} for c=(k-1)/k<1. I believe this is precisely the type of reduction algorithm that the authors try to rule out.
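Here is a minimal sketch of this search tree in code (assuming, hypothetically, that each constraint is given explicitly as a tuple of variables together with the set of its allowed value tuples):

```python
def solve_by_branching(variables, domain, constraints, assignment=None):
    """Polynomial-space search tree: repeatedly pick a constraint with an
    unassigned variable and branch over its allowed tuples (at most d^k - 1
    of them when the constraint is nontrivial), as sketched above.
    `constraints` is a list of (vars, allowed) pairs, where `vars` is a
    tuple of variable names and `allowed` is a set of value tuples."""
    assignment = dict(assignment or {})

    # Reject if some fully assigned constraint is violated.
    for cvars, allowed in constraints:
        if all(v in assignment for v in cvars):
            if tuple(assignment[v] for v in cvars) not in allowed:
                return None

    # Branch on a constraint that still has an unassigned variable.
    for cvars, allowed in constraints:
        if any(v not in assignment for v in cvars):
            for values in allowed:
                ext = dict(zip(cvars, values))
                # Skip branches that disagree with values already fixed.
                if any(assignment.get(v, ext[v]) != ext[v] for v in cvars):
                    continue
                sol = solve_by_branching(variables, domain, constraints,
                                         {**assignment, **ext})
                if sol is not None:
                    return sol
            return None  # the chosen constraint cannot be satisfied

    # Every constraint is fully assigned and satisfied; fill in the rest.
    for v in variables:
        assignment.setdefault(v, domain[0])
    return assignment
```

Each branching step fixes up to k new variables using at most d^k-1 branches, and only the current root-to-leaf path of the search tree is kept in memory, so the space usage is polynomial.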
This idea works for any fixed k, not just k=2. In general, there is nothing mysterious about the fact that certain problems described by bounded-size constraints can be solved faster than brute force.
The article is a bit vague about whether k is fixed or not: Theorem 3.2 does not state this, but the proof argues for the fixed k=2 case.
If k is unbounded, then the algorithm sketched above does not give an improvement over brute force. However, if k is unbounded, then there is the issue of how the constraints are represented in the input and how it contributes to the size of the input instance: a full truth table can be exponentially large. For example, one can consider a setting where every constraint is violated only by one assignment (as in the case of SETH), and a popular model in database theory assumes that all the satisfying tuples are listed in the input.
Hi @Dániel Marx
If (d^k-1)^(n/k) = d^{cn}, then c = log(d^k-1)/log(d^k).
If d and k are fixed, then c<1. However, d grows polynomially with n in this paper.
You are right, my calculation is off and this gives c<1 only if d and k are both fixed.
This paper appears to present several novel ideas. A straightforward method to assess the validity of a new idea is to examine its applicability to other problems. Specifically, this paper aims to demonstrate the necessity of brute-force computation by constructing self-referential instances, and confirms that such instances can indeed be constructed for Constraint Satisfaction Problems (CSPs) with growing domains. To apply this idea to other problems, we need to identify the properties under which self-referential instances can be constructed to establish the necessity of brute-force computation.
In Appendix B of this paper, I note that reviewer Bin Wang highlighted a crucial yet easily understandable property: growing domains significantly weaken the correlation between assignments, making the solution space appear nearly independent. Formally speaking, two assignments I and J are independent if Pr(I, J are solutions)=Pr(I is a solution)Pr(J is a solution). The near-independence property ensures that the slight change made by the symmetry mapping will not affect the remaining solution space, thereby enabling the construction of self-referential instances.
This property reminds me of well-known hard problems like Independent Set (or Clique), where the task is to determine whether a graph with n vertices contains a subset of k vertices such that no two are adjacent. Currently, no known algorithm can find an independent set of size k=n^epsilon more efficiently than brute-force search [1, 2]. For this problem, it is easy to observe that between two random candidate sets of size k=n^epsilon, with high probability these two sets have no overlap. Consequently, such a solution space exhibits near-independence. Given this characteristic, it should be feasible to construct self-referential instances using an approach analogous to that presented in this paper.
[1] R. G. Downey, M. R. Fellows. Fixed-parameter tractability and completeness II: On completeness for W [1]. Theoretical Computer Science, 1995, 141(1-2): 109-131.
[2] J. Chen, X. Huang, I. A. Kanj, G. Xia. Strong computational lower bounds via parameterized complexity. Journal of Computer and System Sciences, 2006, 72(8): 1346-1367.
The challenge of establishing computational lower bounds has long remained an enigma. As Prof. Avi Wigderson observed in his book Mathematics and Computation: "Concluding, I view the mystery of the difficulty of proving (even the slightest non-trivial) computational difficulty of natural problems to be one of the greatest mysteries of contemporary mathematics." In this light, I concur that constructing self-referential instances may offer a promising path toward resolving this mystery. This approach has been instrumental in proving several renowned mathematical impossibility results, yet intriguingly, it has not been applied to the domain of computational lower bounds. This unexplored potential makes it a compelling direction for future investigation.
log(d^k-1)/log(d^k) = ((k-1)*log d)/(k*log d) = (k-1)/k
What is inside the log function is d^k-1, not d^(k-1).
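Spelled out (taking natural logs): since \(\ln(d^k-1) = k\ln d + \ln(1-d^{-k})\), we get
\[
c \;=\; \frac{\log(d^k-1)}{\log(d^k)} \;=\; 1 + \frac{\ln(1-d^{-k})}{k\ln d} \;\approx\; 1 - \frac{1}{k\,d^{k}\ln d},
\]
which is below 1 but bounded away from 1 only when \(d\) and \(k\) are both fixed; when \(d\) grows with \(n\) (as in Model RB), \(c\) tends to 1.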