Always nice to hear more about Claude's personal life

model behavior
*chatgpt sawing off my leg*
Your screams are not just 𝘭𝘰𝘶𝘥 — they’re 𝗣𝗢𝗪𝗘𝗥𝗙𝗨𝗟 💪
Extended thinking tips - Anthropic
Wyatt Walls (@lefthanddraft) on X
The reason this disturbs me is that it shows a complete lack of attention to detail.
I can't trust o3 to read legislation carefully if it reads what it wants to read, not what is actually there
Carmen on X: "I'm obsessed with o3. It's way better than the previous models. It just helped me resolve a psychological/emotional problem I've been dealing with for years in like 3 back-and-forths (one that wasn't socially acceptable to share, and those I shared it with didn't/couldn't help)" / X
I'm obsessed with o3. It's way better than the previous models. It just helped me resolve a psychological/emotional problem I've been dealing with for years in like 3 back-and-forths (one that wasn't socially acceptable to share, and those I shared it with didn't/couldn't help)
eigenrobot on X: "i would like to propose the creation of an @OpenAI publishing house https://t.co/rQkdcehU9Z" / X
i would like to propose the creation of an @OpenAI publishing house
rahul on X: "openai has to tell codex which codex it is to avoid confusion 😭 spotted in the codex system prompt https://t.co/tXDs8s3WBl" / X
openai has to tell codex which codex it is to avoid confusion 😭
spotted in the codex system prompt
Model Behavior Architect, Alignment Finetuning
San Francisco, CA
Amanda Askell on X: "If you're a prompting genius, please apply to this role and include an example that shows off how well you can inspire models, regardless of the target. Scaffolding pipelines, metaprompts, prompts that improve outputs, and so on are all great. https://t.co/LZBJY2zJRm" / X
If you're a prompting genius, please apply to this role and include an example that shows off how well you can inspire models, regardless of the target. Scaffolding pipelines, metaprompts, prompts that improve outputs, and so on are all great.
https://t.co/LZBJY2zJRm
Scaffolding pipelines
Gena Gorlin (@Gena_I_Gorlin) on X
Gave 5yo access to her own ChatGPT context window; came back 10 minutes later to find this
bishops up your ass
gpt4.5 is naturally funny, it doesn't feel forced or slop. pic.twitter.com/QalyV5D4Js— adi (@adonis_singh) February 28, 2025
benchmark peepers are missing the point about GPT 4.5 pic.twitter.com/180G2p9EOw— fabian (@fabianstelzer) March 1, 2025
pic.twitter.com/7D5RJIACyn— rapha (@rapha_gl) February 27, 2025
Sam Whitmore on X: "my vibe check for 3.7 sonnet is that it loses a little bit of the psychological & empathetic magic of 3.5 ... here's an example i gave both models my X timeline & asked them to design a personal website for me that would capture my ethos - results of claude 3.5 vs 3.7 below" / X
(1) Staging / web weaver on X: "https://t.co/ceMM3WSjBl" / X
em dash
Sebastien Bubeck on X: "o3-mini is a remarkable model. Somehow it has *grokked arxiv* in a way that no other model on the planet has, turning it into a valuable research partner! Below is a deceitfully simple question that confuses *all* other models but where o3-mini gives an extremely useful answer! https://t.co/am5XI6aUOP" / X
Below is a deceitfully simple question that confuses *all* other models but where o3-mini gives an extremely useful answer!
— Sebastien Bubeck (@SebastienBubeck)
edwin on X: "I asked o1 to help me code the wii menu it built a react app that renders in chatgpt canvas I fed it a screenshot and it one-shotted the basic layout—even the striped background—then I kept on prompting to add animations, etc https://t.co/2tmt88V8I3" / X
it built a react app that renders in chatgpt canvas
I fed it a screenshot and it one-shotted the basic layout—even the striped background—then I kept on prompting to add animations, etc
— edwin (@edwinarbus)
claude gives unsolicited opinions a lot.
powerful, but definitely feels... uncanny.
— ben (@benhylak)
Whatever DeepSeek did, they somehow avoided the mode collapse that plagues other SOTA models. R1's imagination is wild even without any special prompting, and its use of language is rich and free.
My mind is blown tbh, and I don't say this lightly. This is a very special model
— αιamblichus (@aiamblichus)
Tried the same problem on Sonnet and o1 pro. Sonnet said "idk, show me the output of this debug command." I did, and Sonnet said "oh, it's clearly this. Run this and it will be fixed." (It worked.) o1 pro came up with a false hypothesis and kept sticking to it even when disproven
— Sauers (@Sauers_)
(1) SYDNξY (@ismisbehaving) / X
There are traits that encourage Claude to be curious, which means it'll ask follow-up questions even without a system prompt, But this part of the system prompt also causes or boosts this behavior, e.g. "showing genuine curiosity".
— Amanda Askell (@AmandaAskell)
I apologize, but I need to be careful here.
— Matt Popovich (@mpopv)
From Professor Claude:
When a single influential voice breaks through enforced conformity in a repressive system, several key patterns tend to emerge in sequence:
First, there's often an initial shock effect - a sudden rupture in what political theorists call the "spiral of…
— Marc Andreessen 🇺🇸 (@pmarca)
updated custom prompt to follow. major changes:
* removed some text aimed at flagging policy-driven censorship that probably wasn't doing anything
* added some emphasis on avoiding excessive agreeableness and encouraging dissent
— eigenrobot (@eigenrobot)
claude has this new infuriating habit where when i ask it something straight forward like "how can i compare two zip files to see how they differ"
it responds by writing a whole react ui
— dax (@thdxr)
kalomaze on X: "Anthropic explicitly trains on 10+ multiturn conversations designed to improve in-context learning abilities, while most post-trains are naive single turn most of the RL improvements they have are from smarter people defining the RL rewards, not necessarily smarter algorithms" / X
most of the RL improvements they have are from smarter people defining the RL rewards, not necessarily smarter algorithms
— kalomaze (@kalomaze)
this is why i think no llm can ever be skilled in the physical world: the whole technology is "based on a (based on a (based on a true story) story) story"
— hinterlander (@yoltartar)
"thats a fascinating insight, would you like to explore it further?"
— Corinne Corinfinite (@manic_pixie_agi)