“By hardwiring the model into silicon, Taalas unlocks performance jumps measured in orders of magnitude — not percentages — including output of 17,000 tokens per second per user, at 1/20th the cost and power of today’s GPUs.”

From Twitter