PANews, February 10 news, the video generation experimental model "VideoWorld" was proposed by the Douban Large Model Team, Beijing Jiaotong University, and the University of Science and Technology of China. Unlike mainstream multimodal models such as Sora, DALL-E, and Midjourney, VideoWorld is the first in the industry to achieve cognition of the world without relying on language models. Currently, the project code and model have been open-sourced.
Doubao: VideoWorld, a video generation model that can perceive the world based on vision alone, is now open source
This article is machine translated
Show original
Source
Disclaimer: The content above is only the author's opinion which does not represent any position of Followin, and is not intended as, and shall not be understood or construed as, investment advice from Followin.
Like
Add to Favorites
Comments
Share



