
321 points jhunter1016 | 4 comments
twoodfin ◴[] No.41878632[source]
Stay for the end and the hilarious idea that OpenAI’s board could declare one day that they’ve created AGI simply to weasel out of their contract with Microsoft.
replies(4): >>41878980 #>>41878982 #>>41880653 #>>41880775 #
candiddevmike ◴[] No.41878982[source]
Ask a typical "everyday Joe" and they'll probably tell you OpenAI already has, given how ChatGPT has been reported and hyped. I've spoken with/helped quite a few older folks who are terrified that ChatGPT in its current form is going to kill them.
replies(5): >>41879058 #>>41879151 #>>41880771 #>>41881072 #>>41881131 #
ilrwbwrkhv ◴[] No.41879058[source]
It's crazy to me that anybody thinks these models will end up as AGI. AGI is such a different concept from what is happening right now, which is pure probabilistic sampling of words that anybody with half a brain who doesn't drink the Kool-Aid can easily identify.

I remember all the hype OpenAI drummed up before the release of GPT-2 or something, when they were so afraid, ooh so afraid, to release this stuff, and now it's a non-issue. It's all just marketing gimmicks.

replies(7): >>41879115 #>>41880616 #>>41880738 #>>41880753 #>>41880843 #>>41881009 #>>41881023 #
digging ◴[] No.41881009[source]
> pure probabilistic sampling of words that anybody with half a brain who doesn't drink the Kool-Aid can easily identify.

Your confidence is inspiring!

I'm just a moron, a true dimwit. I can't understand how strictly non-intelligent functions like word prediction can appear to develop a world model, a la the Othello Paper[0]. Obviously, it's not possible that intelligence emerges from non-intelligent processes. Our brains, as we all know, are formed around a kernel of true intelligence.

Could you possibly spare the time to explain this phenomenon to me?

[0] https://thegradient.pub/othello/

replies(3): >>41881076 #>>41881531 #>>41884745 #
1. psb217 ◴[] No.41881531{3}[source]
The Othello paper is annoying and oversold. Yes, the representations in a model M trained to predict y (the set of possible next moves) conditioned on x (the full sequence of prior moves) will contain as much information about y as there is in x. But the fact that this information is present in M's internal representations says nothing about whether M has a world model. E.g., we could train a decoder that looks only at x (not at M's representations) and predicts the same bits of information that, when decoded from M's internal representations, we take as evidence of a world model in M. Does this mean the raw data x has a world model? I guess you could extend your definition of having a world model to say that any data produced by some system contains a model of that system, but then having a world model means nothing.
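
For concreteness, a rough sketch of the control I have in mind, in Python (illustrative only, not the Othello paper's actual setup; the arrays here are random stand-ins and all the names are made up):

    import numpy as np
    from sklearn.linear_model import LogisticRegression
    from sklearn.model_selection import train_test_split

    # Stand-in data: in the real experiment these would be M's activations,
    # a flat encoding of the raw move sequence x, and a per-square board label.
    rng = np.random.default_rng(0)
    n = 5000
    hidden_states = rng.normal(size=(n, 512))              # M's internal representations
    raw_move_onehots = rng.integers(0, 2, size=(n, 256))   # raw input x, encoded directly
    square_labels = rng.integers(0, 3, size=n)             # one square: empty / mine / theirs

    def probe_accuracy(features, labels):
        # Fit a linear probe and report held-out accuracy.
        X_tr, X_te, y_tr, y_te = train_test_split(
            features, labels, test_size=0.2, random_state=0)
        return LogisticRegression(max_iter=1000).fit(X_tr, y_tr).score(X_te, y_te)

    acc_from_M = probe_accuracy(hidden_states, square_labels)
    acc_from_x = probe_accuracy(raw_move_onehots, square_labels)
    print(f"probe on M's representations: {acc_from_M:.3f}")
    print(f"probe on raw input x:         {acc_from_x:.3f}")

If the probe trained directly on x does about as well as the probe trained on M's representations, then "the board state is decodable" doesn't distinguish "M has a world model" from "x already determines the board."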
replies(1): >>41882691 #
2. digging ◴[] No.41882691[source]
Well, I actually read Neel Nanda's writings on it, which acknowledge weaknesses and potential gaps, because I'm not qualified to judge it myself.

But that's hardly the point. The question is whether or not "general intelligence" is an emergent property of stupider processes, and my view is "Yes, almost certainly; isn't that the most likely explanation for our own intelligence?" If it is, and we keep seeing LLMs building more robust approximations of real world models, it's pretty insane to say "No, there is without doubt a wall we're going to hit. It's invisible, but I know it's there."

replies(1): >>41895567 #
3. psb217 ◴[] No.41895567[source]
My point was mainly that this claim, "we keep seeing LLMs building more robust approximations of real world models", is hard to evaluate without a well-formed definition of what it means to have a world model. E.g., a more restrictive definition might include the ability to adapt reasoning to account for changes in the modeled world: by that definition, an LLM with a proper model of chess would be able to quickly adapt to a rule change like "rooks and bishops can't move more than 4 squares at a time".
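
A toy version of that test might look like the following sketch (query_llm is just a placeholder for whatever chat-completion call you'd actually use, and the positions/moves are illustrative):

    # Probe: moves that are legal in standard chess but illegal under a variant
    # rule. A model with an adaptable world model should flip its answer.
    RULE_CHANGE = "Variant rule: rooks and bishops may move at most 4 squares per move."

    PROBES = [
        ("R6K/8/8/8/8/8/8/7k w - - 0 1", "Ra8-a1"),  # rook slides 7 squares
        ("B6K/8/8/8/8/8/8/k7 w - - 0 1", "Ba8-h1"),  # bishop slides 7 squares
    ]

    def query_llm(prompt: str) -> str:
        # Placeholder: swap in a real model call here.
        return "yes"

    def judge(fen: str, move: str, with_variant: bool) -> str:
        rules = RULE_CHANGE + "\n" if with_variant else ""
        prompt = (f"{rules}Position (FEN): {fen}\n"
                  f"Is the move {move} legal in this game? Answer yes or no.")
        return query_llm(prompt)

    for fen, move in PROBES:
        baseline = judge(fen, move, with_variant=False)  # standard rules: expect "yes"
        adapted = judge(fen, move, with_variant=True)    # with the variant: expect "no"
        print(fen, move, baseline, adapted)

It's only a behavioral probe, not a definition, but it's the kind of adaptation test I have in mind.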

I don't think there are any major walls either, but I think there are at least a few more plateaus we'll hit and spend time wandering around before finding the right direction for continued progress. Meanwhile, businesses/society/etc can work to catch up with the rapid progress made on the way to the current plateau.

replies(1): >>41905922 #
4. digging ◴[] No.41905922{3}[source]
I think we're largely in agreement then, actually. I see "world models" as a spectrum; world models aren't even consistent among adult humans. I claim LLMs are moving up that ladder, and whether or not they've crossed a threshold into "real" world models I don't actually claim to know. Of course, I also agree that it's very possible, maybe even likely, that LLMs aren't able to cross that threshold.

> this claim ... is hard to evaluate without a well-formed definition of what it means to have a world model

Absolutely yes, but that only makes it more imperative that we analyze things critically, rigorously, and honestly. Again, you and I may be on the same side here. Mainly, my point was that asserting the intrinsic non-intelligence of LLMs is a very bad take: it's not supported by evidence and, if anything, it contradicts some (admittedly very difficult to parse) evidence we do have that LLMs might be able to develop a general capability for constructing mental models of the world.