Eureka uses GPT-4 and massively parallel simulations to accelerate robot training

quamquam quid loquor

Ars Tribunus Militum
2,959
Subscriptor++
Does it invent fake laws of physics like it invented fake lawsuits for legal cases?
Nvidia is using their PhysX model for physics, so comes with all those limitations. I think this use of LLMs is brilliant though, it isolates it to a higher tier and continuously refines itself. Mitigates a lot of, if not all of, the hallucination concerns.
 
Upvote
34 (37 / -3)

Mechjaz

Ars Praefectus
3,371
Subscriptor++
Nvidia is using their PhysX model for physics, so comes with all those limitations. I think this use of LLMs is brilliant though, it isolates it to a higher tier and continuously refines itself. Mitigates a lot of, if not all of, the hallucination concerns.
Agreed, in my opinion this is what "good AI/good use of AI" looks like. In my mind it parallels writing things in assembly vs. a higher level language: you still can if you want to or need to, but this abstraction is vastly more efficient the vast majority of the time.
 
Upvote
43 (44 / -1)

EspHack

Ars Tribunus Militum
1,739
As amazing as our bodies are in comparison to crude mechanical imitations like atlas or spot, the software running it plays a colossal part, just look how amputees work around their limited hardware, similarly, an actually smart bot should be able to do wonders with a wimpy wobbly limb

this is expected but really good news nonetheless
 
Upvote
4 (4 / 0)
I'm still not entirely clear what input gpt4 was reviewing and what output it was providing? In the chart it seems it is given a set of weights that the model used, and it's output is code embodying new weights, but this doesn't make a ton of sense to me.. Does anyone have a more detailed understanding of what's going on in the refinement loop with the AI?
 
Upvote
25 (25 / 0)

peterford

Ars Praefectus
4,294
Subscriptor++
Nvidia Isaac Gym is pure simulation. So getting it done on real hardware is 90% done, there's only 90% remaining.
How much noise do they add to these simulations? Is every input and output response accurate to ten decimal places or do they randomly ± this to better simulate reality?
 
Upvote
13 (13 / 0)

ColdWetDog

Ars Legatus Legionis
14,402
Meh. Wake me up when the XXX-100 hits the market. That's when I'll preorder.
You're just a little late....

iu
 
Upvote
13 (13 / 0)
I'm confused - did a real robot actually perform the task in the end, or is this entirely virtual?

You can train in virtual and then do transfer to the actual hardware. Isaac (the simulator) has 'Sim-to-Real',

https://www.nvidia.com/en-us/on-demand/session/gtcspring21-s31824/
Yeah, but they don't really turn me on. I need a sex bot that can write Elizabethan sonnets, discuss metaphysics, and compare and contrast Marxian dialectics with Hegelian dialectics.

Hook it up with ChatGPT, can do all three at once


In metaphysics' realm, we seek the truth, A quest profound, where reason's torch doth blaze, As Marx and Hegel offer paths uncouth, Their dialectics weave in diverse ways.

Marx's material dialectic's fire, A class struggle, where history unfolds, The proletariat's yearning desire, To free the oppressed, as time's story unfolds.

Hegel, the ideal, in his mind's embrace, Thesis and antithesis in their dance, The Absolute, in thought's eternal space, Unfolding wisdom in a cosmic trance.

Marx, the material world in chains, confined, While Hegel's mind in ideals seeks to bind.

In Marx, the conflict's flames burn fierce and bright, As labor's sweat gives birth to revolution, In Hegel's dialectic, a different light, A dance of thought, in the mind's evolution.

Both seek to transcend limits, break the chain, But Marx in class, in history's march, he's found, While Hegel's mind transcends the earthly plane, In thought's pure realm, where wisdom does resound.

Metaphysics, in these dialectics told, The tale of human struggle and ideas bold.

Doesn't exactly nail any of the three though - was fun to see what trying to mix them would give.
 
Upvote
15 (16 / -1)

Bongle

Ars Praefectus
4,493
Subscriptor++
I'm still not entirely clear what input gpt4 was reviewing and what output it was providing? In the chart it seems it is given a set of weights that the model used, and it's output is code embodying new weights, but this doesn't make a ton of sense to me.. Does anyone have a more detailed understanding of what's going on in the refinement loop with the AI?
If I'm reading it right, the innovation was using a AI system to write and refine the actual goal that evaluated each simulation, which can be surprisingly hard.

Let's say you were trying to tiger-parent a two year old to go to the olympics:
Bad goal: "Billy run a marathon and I'll score a 1 if you make it and a 0 if you don't"
Better goal: "Billy can you run across the room?"

So I'd guess they used an AI system to continuously give the control system hard-but-achievable goals, and made them juuuust hard enough that it could keep improving rapidly.
 
Upvote
37 (37 / 0)
Quote
benjedwards
benjedwards
This is exactly correct to my understanding. Sorry I did not make it more clear in the article.
Upvote
37 (37 / 0)
Upvote
2 (2 / 0)
You're just a little late....

iu

I sold these as fast as I could get them from TEW (Tandy Electronics Warehouse) back in 82-84.

Used my store demo for our bowling league with a scoring program we wrote. Took a 40 column thermal printer, and handed our opponents a match sheet in 2 minutes every week.

They were pretty neat.
 
Upvote
5 (5 / 0)

Smithy6482

Wise, Aged Ars Veteran
131
Nvidia Isaac Gym is pure simulation. So getting it done on real hardware is 90% done, there's only 90% remaining.
Hah! I'm curious how much fidelity the Isaac models have. Other software-based "physics learning" AIs I've seen often find a way to cheat the system with a glitch or unintended use of a feature to meet the training goal. Hilarity ensues, at least for me since I'm not the one doing the work.
View: https://www.youtube.com/watch?v=Lu56xVlZ40M&t=143s
 
Upvote
15 (15 / 0)

niwax

Ars Praefectus
3,346
Subscriptor
Nvidia Isaac Gym is pure simulation. So getting it done on real hardware is 90% done, there's only 90% remaining.

This is where this approach could also help a lot, though. They don't train a neural network to do a set task, they have the LLM iteratively find a reward function that leads to quick and correct training of a new network. Presumably that reward function would transfer to real hardware a whole lot better than a trained network or starting from training scratch. Over time, you might even integrate stuff learned from real-world transfer, like "Make sure the action doesn't rely on sub-millisecond precision".
 
Upvote
8 (8 / 0)
Honestly I'm a bit surprised that layering models like this is new.. I assumed that most of the more advanced ML applications were doing some kind of layering. Human intelligence isn't really one calculation being shoved through a set of neurons (if I understand correctly). There are numerous smaller systems that work together to come up with a more coherent output... usually. :p
 
Upvote
8 (8 / 0)