This is very cool.
In the World Models piece today, Pim and I wrote that Pi's VLAs are a pragmatic approach to embodied AI and that the company seems to be making a very strategic bet. They keep unhobbling VLAs.

Physical Intelligence
@physical_int
03-20
We developed an RL method for fine-tuning our models for precise tasks in just a few hours or even minutes. Instead of training the whole model, we add an “RL token” output to π-0.6, our latest model, which is used by a tiny actor and critic to learn quickly with RL.


Sector:
From Twitter
Disclaimer: The content above is only the author's opinion which does not represent any position of Followin, and is not intended as, and shall not be understood or construed as, investment advice from Followin.
Like
Add to Favorites
Comments
Share
Relevant content




