The shift from language to vision in Physical AI means progress.
LLMs learned how to understand us, reply, and predict the next sentence.
World Models are how we teach AI how to see, interact, and predict the next moment in reality.
For that to happen, visual data is the key.
twitter.com/AlirezaGhods2/stat...