Figure 03, our 3rd generation humanoid robot

(www.figure.ai)

392 points lairv | 2 comments | 09 Oct 25 13:27 UTC | HN request time: 1.155s | source

Show context

HAL3000 ◴[09 Oct 25 14:59 UTC] No.45528648[source]▶

All of the examples in videos are cherry picked. Go ask anyone working on humanoid robots today, almost everything you see here, if repeated 10 times, will enter failure mode because the happy path is so narrow. There should really be benchmarks where you invite robots from different companies, ask them beforehand about their capabilities, and then create an environment that is within those capabilities but was not used in the training data, and you will see the real failure rate. These things are not ready for anything besides tech demos currently. Most of the training is done in simulations that approximate physics, and the rest is done manually by humans using joysticks (almost everything they do with hands). Failure rates are staggering.

replies(17): >>45529270 #>>45529335 #>>45529542 #>>45529760 #>>45529839 #>>45529903 #>>45529962 #>>45530530 #>>45531634 #>>45532178 #>>45532431 #>>45532651 #>>45533534 #>>45533814 #>>45534991 #>>45539498 #>>45542410 #

ipnon ◴[09 Oct 25 15:41 UTC] No.45529270[source]▶

>>45528648 #

Now the question is if this is GPT-2 and we’re a decade away from autonomous androids given some scaling and tweaks, or if autonomous androids is just an extremely hard problem.

replies(4): >>45529367 #>>45529610 #>>45529686 #>>45532243 #

1. jcims ◴[09 Oct 25 16:10 UTC] No.45529686[source]▶

>>45529270 #

I don't know if I caught your comment in my peripheral vision or what but GPT-2 is exactly where I conceptually placed this.

Neural networks for motion control is very clearly resulting in some incredible capability in a relatively short amount of time vs. the more traditional control hierarchies used in something like Boston Dynamics. Look at Unitree's G1

https://www.youtube.com/shorts/mP3Exb1YC8o

https://www.youtube.com/watch?v=bPSLMX_V38E

It's like an agile idiot, very physically capable but no purpose.

The next domain is going to be incorporating goals and intent and short/long term chains of causality into the model, and for that it seems we're presently missing quite a bit usable training data. That will clearly evolve over time, as will the fidelity of simulations that can be used to train the model and the learned experience of deployed robots.

replies(1): >>45531038 #

2. robots0only ◴[09 Oct 25 18:04 UTC] No.45531038[source]▶

>>45529686 (TP) #

Locomotion and manipulation are pretty different. The former we know how to do well -- this is what you see in unitree videos. Manipulation still not so much. This is not at all like GPT-2 because we still don't know what to scale (and even the data to scale is not there).

↑