I hacked it together using MPPI, and it only works on the cartpole model so I wouldn't have to dwell in JavaScript too long; just click the 'MPPI Controller' button, and you can perturb the model and watch it recover.
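For anyone curious what the MPPI loop looks like: here's a minimal toy sketch (my own simplified version on a 1-D double integrator, not the demo's actual cartpole/JavaScript code). Sample noisy control sequences, roll them out through the dynamics, then softmin-average the noise by trajectory cost. All the dynamics, costs, and parameters here are made up for illustration.

```python
import numpy as np

def dynamics(state, u, dt=0.1):
    # toy 1-D double integrator: u is acceleration
    x, v = state
    return np.array([x + v * dt, v + u * dt])

def cost(state, u):
    x, v = state
    return x**2 + 0.1 * v**2 + 0.01 * u**2

def mppi_step(state, nominal_u, samples=256, sigma=1.0, lam=1.0, seed=0):
    horizon = len(nominal_u)
    rng = np.random.default_rng(seed)
    noise = rng.normal(0.0, sigma, size=(samples, horizon))
    costs = np.zeros(samples)
    for k in range(samples):
        s = state.copy()
        for t in range(horizon):
            u = nominal_u[t] + noise[k, t]
            costs[k] += cost(s, u)
            s = dynamics(s, u)
    # softmin weights over trajectory costs, then importance-weighted update
    w = np.exp(-(costs - costs.min()) / lam)
    w /= w.sum()
    return nominal_u + w @ noise

# receding-horizon loop from a perturbed start
state = np.array([1.0, 0.0])
u_nom = np.zeros(20)
for _ in range(150):
    u_nom = mppi_step(state, u_nom)
    state = dynamics(state, u_nom[0])
    u_nom = np.roll(u_nom, -1)
    u_nom[-1] = 0.0
```

Real implementations add a terminal cost, control limits, and warm-start tricks, but the core really is just this sample-rollout-reweight loop.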
Also, Steve Brunton does a lot on the interface between control theory and ML on his channel: https://www.youtube.com/channel/UCm5mt-A4w61lknZ9lCsZtBw/pla...
Some keywords to search for recent hot research would be "world model", "decision transformer", "active inference", "control as inference", "model-based RL".
I have been trying to figure something out for a while, but maybe haven't found the right paper for it to click just yet: how would you mix this with video feedback on a real robot? Do you forward-predict the position and then have some means of telling whether your simulated image and reality overlap?
I've tried grounding models like CogVLM and YOLO, but often the bounding box is just barely useful for going to face something, not for actually reaching out and picking it up.
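One concrete (and admittedly naive) version of the overlap check I'm imagining: take the bounding box the forward model predicts and the box a detector like YOLO returns on the real frame, and compare them with IoU. The boxes and threshold below are hypothetical, just to show the shape of the check.

```python
def iou(box_a, box_b):
    # boxes as (x1, y1, x2, y2) in pixel coordinates
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b
    ix1, iy1 = max(ax1, bx1), max(ay1, by1)
    ix2, iy2 = min(ax2, bx2), min(ay2, by2)
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    union = (ax2 - ax1) * (ay2 - ay1) + (bx2 - bx1) * (by2 - by1) - inter
    return inter / union if union > 0 else 0.0

predicted = (100, 100, 200, 200)   # from the forward-simulated model
detected = (110, 90, 210, 190)     # from the detector on the real frame
aligned = iou(predicted, detected) > 0.5
```

Of course this only tells you the prediction and reality roughly agree in image space; it says nothing about depth or grasp pose, which is exactly the part I'm unclear on.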
There are grasping datasets, but I think you'd still have to train a new model for each object+gripper pair, so I'm not clear on where the MPC part comes in.
So I guess I'm just asking for any hints/papers that might make it easier for a beginner to grasp.
thanks :-)