←back to thread

Google is winning on every AI front

(www.thealgorithmicbridge.com)
993 points vinhnx | 2 comments | | HN request time: 0.48s | source
Show context
paradite ◴[] No.43662612[source]
The author mentioned AlphaGo and Alpha Zero without mentioning OpenAI gym and OpenAI Five.

Those products show OpenAI was innovating and leading in RL at that stage around 2017 to 2019.

https://github.com/openai/gym

https://en.wikipedia.org/wiki/OpenAI_Five

replies(1): >>43675355 #
1. bitpush ◴[] No.43675355[source]
This is the first I'm hearing about it.
replies(1): >>43676937 #
2. paradite ◴[] No.43676937[source]
I forgot to mention that OpenAI also invented PPO, which is the default algorithm that everyone uses for RL since 2017: https://en.wikipedia.org/wiki/Proximal_policy_optimization

DeepSeek's GRPO is also just a minor variant of PPO.