Most active commenters
  • nybsjytm(5)
  • auggierose(4)

←back to thread

249 points rishicomplex | 15 comments | | HN request time: 1.958s | source | bottom
1. Robotenomics ◴[] No.42166395[source]
“Only 5/509 participants solved P6”
replies(1): >>42166999 #
2. nybsjytm ◴[] No.42166999[source]
This has to come with an asterisk, which is that participants had approximately 90 minutes to work on each problem while AlphaProof computed for three days for each of the ones it solved. Looking at this problem specifically, I think that many participants could have solved P6 without the time limit.

(I think you should be very skeptical of anyone who hypes AlphaProof without mentioning this - which is not to suggest that there's nothing there to hype)

replies(2): >>42167113 #>>42168802 #
3. auggierose ◴[] No.42167113[source]
Certainly an interesting information that AlphaProof needed three days. But does it matter for evaluating the importance of this result? No.
replies(2): >>42167292 #>>42168126 #
4. nybsjytm ◴[] No.42167292{3}[source]
I agree that the result is important regardless. But the tradeoff of computing time/cost with problem complexity is hugely important to think about. Finding a proof in a formal language is trivially solvable in theory since you just have to search through possible proofs until you find one ending with the desired statement. The whole practical question is how much time it takes.

Three days per problem is, by many standards, a 'reasonable' amount of time. However there are still unanswered questions, notably that 'three days' is not really meaningful in and of itself. How parallelized was the computation; what was the hardware capacity? And how optimized is AlphaProof for IMO-type problems (problems which, among other things, all have short solutions using elementary tools)? These are standard kinds of critical questions to ask.

replies(2): >>42167378 #>>42170461 #
5. dash2 ◴[] No.42167378{4}[source]
Though, if you start solving problems that humans can't or haven't solved, then questions of capacity won't matter much. A speedup in the movement of the maths frontier would be worth many power stations.
replies(1): >>42167482 #
6. nybsjytm ◴[] No.42167482{5}[source]
For some time a 'superhuman math AI' could be useful for company advertising and getting the attention of VCs. But eventually it would be pretty clear that innovative math research, with vanishingly few exceptions, isn't very useful for making revenue. (I am a mathematician and this is meant with nothing but respect for math research.)
replies(2): >>42168017 #>>42171919 #
7. airstrike ◴[] No.42168017{6}[source]
"Making revenue" is far from being the only metric by which we deem something worthy.
replies(1): >>42168169 #
8. andrepd ◴[] No.42168126{3}[source]
More or less. Modern theorem provers, even fully automatic ones, can prove incredibly difficult problems if given enough time. With 3 days and terabytes of memory, perhaps they could? Would be interesting to compare Alphaproof with a standard theorem prover that is given similarly astronomical computing resources.
replies(1): >>42168238 #
9. nybsjytm ◴[] No.42168169{7}[source]
As a mathematician, of course I agree. But in a sentence like:

> A speedup in the movement of the maths frontier would be worth many power stations

who is it 'worth' it to? And to what end? I can say with some confidence that many (likely most, albeit certainly not all) mathematicians do not want data centers and power stations to guzzle energy and do their math for them. It's largely a vision imposed from without by Silicon Valley and Google research teams. What do they want it for and why is it (at least for now) "worth" it to them?

Personally, I don't believe for a second that they want it for the good of the mathematical community. Of course, a few of their individual researchers might have their own personal and altruistic motivations; however I don't think this is so relevant.

replies(1): >>42170367 #
10. auggierose ◴[] No.42168238{4}[source]
That is an interesting thought. But I doubt that standard automated theorem proving techniques would have solved this in a 100 years, even with immense resources, as all they do is (basically) brute force search, and it remains an exponential search space. Therefore 3 days does not really matter, indeed, 3 months would still be a result that 10 years ago I would have thought impossible.
11. letitgo12345 ◴[] No.42168802[source]
Think more is made of this asterix than necessary. Quite possible adding 10x more GPUs would have allowed it to solve it in the time limit.
replies(1): >>42169740 #
12. nybsjytm ◴[] No.42169740{3}[source]
Very plausible, but that would also be noteworthy. As I've mentioned in some other comments here, (as far as I know) we outside of DeepMind don't know anything about the computing power required to run alphaproof, and the tradeoff between computing power required and the complexity of problems it can address is really key to understanding how useful it might be.
13. auggierose ◴[] No.42170367{8}[source]
There is something called applied math, and there is a big gulf between applied math and pure math. This new technology has the potential of making use of much more "pure math" for applied math, unifying much of pure math, applied math, and programming. I don't really care about the RH, I care about that.
14. auggierose ◴[] No.42170461{4}[source]
> Finding a proof in a formal language is trivially solvable in theory since you just have to search through possible proofs until you find one ending with the desired statement. The whole practical question is how much time it takes.

No, in my experience the whole practical question is, can it be found automatically, or can it not be found automatically? Because there is an exponential search space that conventional automated methods will not be able to chew through, it either works, or it doesn't. AlphaProof shows that for some difficult IMO problems, it works.

15. empath75 ◴[] No.42171919{6}[source]
The big exception being predicting market movements, and I can't imagine how much money the hedge funds are spending on this right now.