
GPT-5.2 (openai.com)
1019 points by atgctg | 1 comment
SkyPuncher (No.46235977)
Given the price increase and the speculation that GPT-5 is a MoE model, I'm wondering if they're simply "turning up the good stuff" without making significant changes under the hood.
replies(2): >>46235986, >>46236012
minimaxir (No.46236012)
I'm not sure why being a MoE model would allow OpenAI to "turn up the good stuff". You can't just increase the number of active experts at inference time without having trained the model that way.
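For concreteness, here is a minimal, hypothetical sketch of top-k expert routing in plain PyTorch (a toy illustration, not anything from OpenAI). The point is that the router and experts are trained against a fixed k, so naively raising k at inference feeds each expert token mixtures it never saw during training:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class TopKMoE(nn.Module):
    """Toy MoE layer: each token is routed to its top-k experts."""
    def __init__(self, d_model: int, n_experts: int, k: int):
        super().__init__()
        self.k = k  # number of active experts; baked in at training time
        self.router = nn.Linear(d_model, n_experts)
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(d_model, 4 * d_model),
                          nn.GELU(),
                          nn.Linear(4 * d_model, d_model))
            for _ in range(n_experts)
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, d_model). Score all experts, keep the top k.
        scores = self.router(x)                          # (batch, n_experts)
        topk_scores, topk_idx = scores.topk(self.k, dim=-1)
        # Renormalize over the chosen experts only; this distribution
        # is what the experts are trained to expect for this k.
        weights = F.softmax(topk_scores, dim=-1)         # (batch, k)
        out = torch.zeros_like(x)
        for slot in range(self.k):
            for e in range(len(self.experts)):
                mask = topk_idx[:, slot] == e
                if mask.any():
                    out[mask] += weights[mask, slot, None] * self.experts[e](x[mask])
        return out
```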
replies(2): >>46236953, >>46236981
yberreby (No.46236953)
Based on what works elsewhere in deep learning, I see no reason why you couldn't train once with a randomized number of active experts per token, then set that number at inference based on your desired compute-accuracy tradeoff. I would expect that this has already been done in the literature.
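A hedged sketch of what that could look like, reusing the toy TopKMoE layer from the sketch above (illustrative only; this reflects no known GPT-5 training recipe): sample k each training step so the experts see mixtures of every size, then fix k at deployment to set the compute budget.

```python
import random
import torch

def forward_with_random_k(moe: "TopKMoE", x: torch.Tensor,
                          k_min: int = 1, k_max: int = 4,
                          inference_k: int | None = None) -> torch.Tensor:
    # Training: randomize k per step so every expert count stays
    # in distribution. Inference: the caller pins k explicitly.
    moe.k = inference_k if inference_k is not None else random.randint(k_min, k_max)
    return moe(x)

# Usage: call forward_with_random_k(moe, x) during training; at
# deployment, pass inference_k=2 for a cheaper pass or inference_k=4
# to "turn up the good stuff" at higher cost.
```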