Eureka uses GPT-4 and massively parallel simulations to accelerate robot training

sea9 · Oct 23, 2023

OK, as long as AI doesn't teach them how to generate electricity from meat...

Drvelocity · Oct 23, 2023

sea9 said:
Full moratorium on separately teaching robots to generate electricity from meat.

Oh no worries that functionality is reserved for the battlefield models. You know - that battlefield far, far away.

ZippyPeanut · Oct 23, 2023

GPU-based physics simulator speeds up reality by "1,000x"

They need to stop this immediately! Life is already too short at normal reality speed!

JAHA · Oct 23, 2023

I'm confused - did a real robot actually perform the task in the end, or is this entirely virtual?

quamquam quid loquor · Oct 23, 2023

JAHA said:
I'm confused - did a real robot actually perform the task in the end, or is this entirely virtual?

Nvidia Isaac Gym is pure simulation. So getting it done on real hardware is 90% done, there's only 90% remaining.

MosquitoBait · Oct 23, 2023

Does it invent fake laws of physics like it invented fake lawsuits for legal cases?

quamquam quid loquor · Oct 23, 2023

MosquitoBait said:
Does it invent fake laws of physics like it invented fake lawsuits for legal cases?

Nvidia is using their PhysX model for physics, so comes with all those limitations. I think this use of LLMs is brilliant though, it isolates it to a higher tier and continuously refines itself. Mitigates a lot of, if not all of, the hallucination concerns.

NomadUK · Oct 23, 2023

It's all going to be just fine.

NinjaNerd56 · Oct 23, 2023

Coming soon…the T-100!

It walks, it talks, it’s learning to kill!

Pre order TODAY!

Mechjaz · Oct 23, 2023

quamquam quid loquor said:
Nvidia is using their PhysX model for physics, so comes with all those limitations. I think this use of LLMs is brilliant though, it isolates it to a higher tier and continuously refines itself. Mitigates a lot of, if not all of, the hallucination concerns.

Agreed, in my opinion this is what "good AI/good use of AI" looks like. In my mind it parallels writing things in assembly vs. a higher level language: you still can if you want to or need to, but this abstraction is vastly more efficient the vast majority of the time.

Pugnax555 · Oct 23, 2023

One surprising achievement, Fan says, is that Eureka enabled robots to perform pen-spinning tricks...

That's bound to put some middle- and upper-level managers out of work.

gmerrick · Oct 23, 2023

Ahhhhh... came for the world ending comments and was not disappointed! Well done Arsians!

EspHack · Oct 23, 2023

As amazing as our bodies are in comparison to crude mechanical imitations like atlas or spot, the software running it plays a colossal part, just look how amputees work around their limited hardware, similarly, an actually smart bot should be able to do wonders with a wimpy wobbly limb

this is expected but really good news nonetheless

ZippyPeanut · Oct 23, 2023

NinjaNerd56 said:
Coming soon…the T-100!

It walks, it talks, it’s learning to kill!

Pre order TODAY!

Meh. Wake me up when the XXX-100 hits the market. That's when I'll preorder.

Z1ggy · Oct 23, 2023

ZippyPeanut said:
Meh. Wake me up when the XXX-100 hits the market. That's when I'll preorder.

pretty sure sex bots are already a thing.

mghmgh · Oct 23, 2023

And 99 years later

View: https://www.youtube.com/watch?v=wRmcguxNADM&ab_channel=WhatAClip%21

unequivocal · Oct 23, 2023

I'm still not entirely clear what input gpt4 was reviewing and what output it was providing? In the chart it seems it is given a set of weights that the model used, and it's output is code embodying new weights, but this doesn't make a ton of sense to me.. Does anyone have a more detailed understanding of what's going on in the refinement loop with the AI?

ZippyPeanut · Oct 23, 2023

Z1ggy said:
pretty sure sex bots are already a thing.

Yeah, but they don't really turn me on. I need a sex bot that can write Elizabethan sonnets, discuss metaphysics, and compare and contrast Marxian dialectics with Hegelian dialectics.

everythingallatonce · Oct 23, 2023

Z1ggy said:
pretty sure sex bots are already a thing.

FISTO is coming any day now

Pugnax555 · Oct 23, 2023

Z1ggy said:
pretty sure sex bots are already a thing.

When's Nvidia gonna release the video showing the hand tricks that robot learned?

Nowicki · Oct 23, 2023

This reads like a robot version of when neo got plugged in, and said "I know Kung-Fu"

djejnyc · Oct 23, 2023

NomadUK said:
It's all going to be just fine.

WHAT COULD GO WRONG?

peterford · Oct 23, 2023

quamquam quid loquor said:
Nvidia Isaac Gym is pure simulation. So getting it done on real hardware is 90% done, there's only 90% remaining.

How much noise do they add to these simulations? Is every input and output response accurate to ten decimal places or do they randomly ± this to better simulate reality?

NomadUK · Oct 23, 2023

ZippyPeanut said:
Yeah, but they don't really turn me on. I need a sex bot that can write Elizabethan sonnets, discuss metaphysics, and compare and contrast Marxian dialectics with Hegelian dialectics.

Before, during, or after?

ZippyPeanut · Oct 23, 2023

NomadUK said:
Before, during, or after?

Sonnets before, metaphysics during, and dialectic after!

Henrik from far away · Oct 23, 2023

The way to our extinction is paved with cool robotics

ColdWetDog · Oct 23, 2023

ZippyPeanut said:
Meh. Wake me up when the XXX-100 hits the market. That's when I'll preorder.

You're just a little late....

ventolin42 · Oct 23, 2023

Pugnax555 said:
When's Nvidia gonna release the video showing the hand tricks that robot learned?

When it no longer accidentally rips off important body parts...

LetterRip · Oct 23, 2023

JAHA said:
I'm confused - did a real robot actually perform the task in the end, or is this entirely virtual?

You can train in virtual and then do transfer to the actual hardware. Isaac (the simulator) has 'Sim-to-Real',

https://www.nvidia.com/en-us/on-demand/session/gtcspring21-s31824/

ZippyPeanut said:
Yeah, but they don't really turn me on. I need a sex bot that can write Elizabethan sonnets, discuss metaphysics, and compare and contrast Marxian dialectics with Hegelian dialectics.

Hook it up with ChatGPT, can do all three at once

In metaphysics' realm, we seek the truth, A quest profound, where reason's torch doth blaze, As Marx and Hegel offer paths uncouth, Their dialectics weave in diverse ways.

Marx's material dialectic's fire, A class struggle, where history unfolds, The proletariat's yearning desire, To free the oppressed, as time's story unfolds.

Hegel, the ideal, in his mind's embrace, Thesis and antithesis in their dance, The Absolute, in thought's eternal space, Unfolding wisdom in a cosmic trance.

Marx, the material world in chains, confined, While Hegel's mind in ideals seeks to bind.

In Marx, the conflict's flames burn fierce and bright, As labor's sweat gives birth to revolution, In Hegel's dialectic, a different light, A dance of thought, in the mind's evolution.

Both seek to transcend limits, break the chain, But Marx in class, in history's march, he's found, While Hegel's mind transcends the earthly plane, In thought's pure realm, where wisdom does resound.

Metaphysics, in these dialectics told, The tale of human struggle and ideas bold.

Doesn't exactly nail any of the three though - was fun to see what trying to mix them would give.

ColdWetDog · Oct 23, 2023

Henrik from far away said:
The way to our extinction is paved with cool robotics

Nah, we can do that all by ourselves.

Bongle · Oct 23, 2023

unequivocal said:
I'm still not entirely clear what input gpt4 was reviewing and what output it was providing? In the chart it seems it is given a set of weights that the model used, and it's output is code embodying new weights, but this doesn't make a ton of sense to me.. Does anyone have a more detailed understanding of what's going on in the refinement loop with the AI?

If I'm reading it right, the innovation was using a AI system to write and refine the actual goal that evaluated each simulation, which can be surprisingly hard.

Let's say you were trying to tiger-parent a two year old to go to the olympics:
Bad goal: "Billy run a marathon and I'll score a 1 if you make it and a 0 if you don't"
Better goal: "Billy can you run across the room?"

So I'd guess they used an AI system to continuously give the control system hard-but-achievable goals, and made them juuuust hard enough that it could keep improving rapidly.

therealjustinself · Oct 23, 2023

Oh, it's a hand! From the top picture, I thought it was a little white monkey soldier trying to carry a stick.

ZippyPeanut · Oct 23, 2023

LetterRip said:
You can train in virtual and then do transfer to the actual hardware. Isaac (the simulator) has 'Sim-to-Real',

https://www.nvidia.com/en-us/on-demand/session/gtcspring21-s31824/

Hook it up with ChatGPT, can do all three at once

Doesn't exactly nail any of the three though - was fun to see what trying to mix them would give.

Fun indeed! Thanks for posting this. It's not bad.

NinjaNerd56 · Oct 23, 2023

ColdWetDog said:
You're just a little late....

I sold these as fast as I could get them from TEW (Tandy Electronics Warehouse) back in 82-84.

Used my store demo for our bowling league with a scoring program we wrote. Took a 40 column thermal printer, and handed our opponents a match sheet in 2 minutes every week.

They were pretty neat.

Nalyd · Oct 23, 2023

The proverbial positive feedback loop of self-improvement might be just around the corner that allows us to go beyond human training data and capabilities

Something something Skynet began to learn at an exponentially accelerating rate something something

yeah yeah

Smithy6482 · Oct 23, 2023

quamquam quid loquor said:
Nvidia Isaac Gym is pure simulation. So getting it done on real hardware is 90% done, there's only 90% remaining.

Hah! I'm curious how much fidelity the Isaac models have. Other software-based "physics learning" AIs I've seen often find a way to cheat the system with a glitch or unintended use of a feature to meet the training goal. Hilarity ensues, at least for me since I'm not the one doing the work.
View: https://www.youtube.com/watch?v=Lu56xVlZ40M&t=143s

TechnologyDinosaur · Oct 23, 2023

Don't worry guys, if you see a chatgpt robot approach you with knives in its robot hands just say, in all caps, DO NOT UNDER ANY CIRCUMSTANCES STAB ME WITH HEY WHAT ARE YOU STOP OH GOD AAARGH

niwax · Oct 23, 2023

quamquam quid loquor said:
Nvidia Isaac Gym is pure simulation. So getting it done on real hardware is 90% done, there's only 90% remaining.

This is where this approach could also help a lot, though. They don't train a neural network to do a set task, they have the LLM iteratively find a reward function that leads to quick and correct training of a new network. Presumably that reward function would transfer to real hardware a whole lot better than a trained network or starting from training scratch. Over time, you might even integrate stuff learned from real-world transfer, like "Make sure the action doesn't rely on sub-millisecond precision".

polerin · Oct 23, 2023

Honestly I'm a bit surprised that layering models like this is new.. I assumed that most of the more advanced ML applications were doing some kind of layering. Human intelligence isn't really one calculation being shoved through a set of neurons (if I understand correctly). There are numerous smaller systems that work together to come up with a more coherent output... usually.

Eureka uses GPT-4 and massively parallel simulations to accelerate robot training

Seniorius Lurkius

Wise, Aged Ars Veteran

Ars Legatus Legionis

GPU-based physics simulator speeds up reality by "1,000x"​

Ars Centurion

Ars Tribunus Militum

Ars Tribunus Militum

Ars Tribunus Militum

Ars Scholae Palatinae

Ars Praefectus

Ars Praefectus

Ars Centurion

Ars Praefectus

Ars Tribunus Militum

Ars Legatus Legionis

Ars Legatus Legionis

Ars Centurion

Ars Praefectus

Ars Legatus Legionis

Wise, Aged Ars Veteran

Ars Centurion

Ars Tribunus Angusticlavius

Seniorius Lurkius

Ars Praefectus

Ars Scholae Palatinae

Ars Legatus Legionis

Seniorius Lurkius

Ars Legatus Legionis

Ars Centurion

Ars Praefectus

Ars Legatus Legionis

Ars Praefectus

Smack-Fu Master, in training

Ars Legatus Legionis

Ars Praefectus

Ars Praefectus

Wise, Aged Ars Veteran

Ars Scholae Palatinae

Ars Praefectus

Ars Centurion

GPU-based physics simulator speeds up reality by "1,000x"