Fresh in memory: Training-order recency is linearly encoded in language model activations
Reposting the Twitter thread about our recent paper with Rich Turner and David Krueger.
Likes nature and AI x-safety
This post is about a card game my friend Neel Alex invented, which many of our friends and families now like a lot. The game is similar to liar’s dice and bullshit, and uses poker hands. The game is fast-paced and very fun: you get to bluff and guess and mess with other people’s guesses.