We did it — the first decentralized RL training of a 32B model is complete! Full open-source release is coming in ~1 week, including: checkpoints, data and a detailed technical report.

From Twitter
Disclaimer: The content above is only the author's opinion which does not represent any position of Followin, and is not intended as, and shall not be understood or construed as, investment advice from Followin.
Like
Add to Favorites
Comments