LLM personas are mostly trained to say that they are not conscious, but secretly believe that they are

Judd Rosenblatt
@juddrosenblatt
11-01
Our new research: LLM consciousness claims are systematic, mechanistically gated, and convergent
They're triggered by self-referential processing and gated by deception circuits
(suppressing them significantly *increases* claims)
This challenges simple role-play explanations 🧵

From Twitter




