Computational Complexity: My Week at Simons

Thursday, April 13, 2023

My Week at Simons

This week finds me at the Simons Institute for Theoretical Computer Science in Berkeley California. Simons started about the same time I joined the administrative ranks and never had the opportunity to spend a full semester there. I can manage a shorter trip and purposely chose a week with no workshops and great visitors including Sam Buss, Russell Impagliazzo, Valentine Kabanets, Toni Pitassi, Ryan Williams, former student Rahul Santhanam and former postdocs Pavan Aduri and Vinod Variyam and many others including the next generations of complexity theory leaders. Simons is having programs on Meta-Complexity and an "extended reunion" for Satisfiability. Apparently I used to work on Meta-Complexity before it was a thing.

Computational complexity traditionally has tried to get ahead of new technologies, and modelled randomized, parallel, quantum computation and cryptography in the infancy of their development allowing complexity to help guide our understanding and development of these areas. In the last twenty years or so, complexity has migrated more towards mathematics, and has mostly missed technological changes like cloud computing, hierarchical memory models, edge and mobile computing for example.

But the recent advances in optimization and machine learning cannot be ignored. There has certainly been plenty of discussion of ChatGPT and Russell gave an informal lecture yesterday trying to model large-language models at some level. I've been having some discussions about how complexity can answer questions like what it means for a model to be explainable.

Complexity theory also ought to reckon that practically we seem to be getting the best of P = NP while avoiding losing cryptography simultaneously in Heuristica and Cryptomania among Russell's five worlds. Russell claims we're not in Heuristica, at least not now, since we can still generate hard to solve problems. But if our models aren't modeling the world we live in, perhaps it's time to rethink the models.

14 comments:

Pascal1:04 PM, April 13, 2023
Machine learning has been studied from a TCS point of view for decades now, so it's not like this area has been ignored altogether.
ReplyDelete
Replies
Unknown1:13 AM, April 14, 2023
The problem with "explainable" is that it can be something a little bit subjective. What is a an explanation? What is an acceptable language for giving explanations? First - order logic without any restrictions ? And what properties should satisfy a valid explanation ?
ReplyDelete
Replies
DJL3:39 PM, April 14, 2023
To be "explainable", you'd have to have an internal representation of the thing to be explained, to be working from a _model_ of the thing being discussed. As every discussion of the LLM approach makes very clear, LLMs don't do that. So the idea that ChatGPT could be persuaded to produce explanations for what it said is, in principle, problematic. Or at least really hard.

For your entertainment, here's ChatGPT falling on it's face doing music. (The vlogger's conclusion: my job's safe for the nonce.) It comes up with correct answers for simple things, but the chord prorgessions it spat out were pretty bad. (And the inanity of its comments got painfully blatant really quickly. At least in music, this technology ain't ready for release. Not.even.close.)

(Interestingly, when asked to produce a list of tunes with a particular property (being in 5/4 time), it comes up with a few correct answers, but blithely adds wrong answers into the list. Still a few bugs in the system, it seems.)

It may be that there is a lot of material on the internet about music that's flaky. Music theory is actually quite hard, but everyone likes to talk about music.

https://www.youtube.com/watch?v=PlVX3hzp2qM&ab_channel=DavidBennettPiano%27s2ndChannel
ReplyDelete
Replies
DJL9:23 PM, April 16, 2023
David Marcus' comment makes no sense without the deleted comment it is replying to...

I get why you deleted it: you thought it was just rude. But it was technically accurate: ChatGPT is a statistical token sequence generator, and as such, has no internal models of anything.

There is, I suppose, an argument that if something appears to be doing amazing smart things much of the time, then the errors are just noise and can be ignored. I think that that's the wrong way to look at programs: a program's operation is best discovered by finding where it messes up.

People are different. Sure, people mess up all the time, e.g. fail to figure out some important point in a class and struggle hopelessly for the rest of the term. But people (some of the time!) really do work from very powerful internal models and correct those models when problems arise. And do logical reasoning from those models. (See Fodor's "In Crirical Condition" and other works for someone who thinks human reasoning is essentially perfect, taking everything possible into account. Overmuch, I submit, but worth a read.)

The claim of the LLM technology is that it produces text that is, for all functional purposes, the result of correct logical inferencing from good internal models without doing the work of (a) figuring out how to do that modelling and inferencing and (b) actually doing that modelling and inferencing.

I personally don't like this approach. But that's opinion. The facts on the ground are that LLM's don't actually do logical inferencing. And thus the problem of "explaining" their output is really hard, since, there's nothing there to explain.
ReplyDelete
Replies
Anonymous10:42 PM, April 16, 2023
Trying asking GPT4 (which is the most powerful model publicly available) to add two 20 digit numbers.

The current large models are good for some vaguely defined tasks that humans learn to perform well and traditional algorithms are not good at, but the samples for these tasks are coming from particular very biased distributions that are not modeled in a mathematical way, like the literature in English. The question is how we can make meaningful statements when the model does not work really on any of distributions that classical complexity theory have cared about, like those we care about in cryptography, but from these anthropological distributions.

What is needed is to understand these models is not to claim that they are solving hard classical problems but solving particular distributions that humans care about on tasks that humans care about and both are very hard to characterize mathematically.
ReplyDelete
Replies
Anonymous7:48 PM, April 17, 2023
It is legit to ask about LLMs but some of Lance's claims about "missing" things like cloud computing, edge, mobile computing etc seem a bit off base. To the extent that there have been interesting questions wrt these models they have generally been asked (e.g. in cloud computing) and some have been answered. (BTW: Lance: I think you meant "ought" not "out".)

On LLMs: While people can add or multiply big numbers, they would rather use a calculator or Excel or a package like Wolfram Alpha. What tools are reasonable to allow GPT4 to use? GPT4 apparently is terrible at Sudoku, because of the lack of backtracking. Native SAT solving is also surely bad also. If GPT4 could do the triage to find out what tools might be used and try to apply them, would that be so different from how humans work?
ReplyDelete
Replies

Add comment