The Pile, which was the initial training corpus for many LLMs, famously included Books3, a set of texts that included copyrighted works. Anthropic just settled for $1.5B for using Books3 in training.
I wonder if the labs using Books3 gained more value from it than they will pay.


From Twitter
Disclaimer: The content above is only the author's opinion which does not represent any position of Followin, and is not intended as, and shall not be understood or construed as, investment advice from Followin.
Like
Add to Favorites
Comments
Share
Relevant content





