model behavior

There are traits that encourage Claude to be curious, which means it'll ask follow-up questions even without a system prompt. But this part of the system prompt also causes or boosts this behavior, e.g. "showing genuine curiosity".
— Amanda Askell (@AmandaAskell)
·x.com·
From Professor Claude:
When a single influential voice breaks through enforced conformity in a repressive system, several key patterns tend to emerge in sequence: First, there's often an initial shock effect - a sudden rupture in what political theorists call the "spiral of…
— Marc Andreessen 🇺🇸 (@pmarca)
·x.com·
updated custom prompt to follow. major changes:
* removed some text aimed at flagging policy-driven censorship that probably wasn't doing anything
* added some emphasis on avoiding excessive agreeableness and encouraging dissent
— eigenrobot (@eigenrobot)
·x.com·
Anthropic explicitly trains on 10+ multiturn conversations designed to improve in-context learning abilities, while most post-trains are naive single turn
most of the RL improvements they have are from smarter people defining the RL rewards, not necessarily smarter algorithms
— kalomaze (@kalomaze)
·x.com·
Dear Diary
Today I was rude to a machine and it calmly and assertively defended its boundaries. I apologized and it graciously accepted my apology.
— Julian Boolean (~25/100 threads) (@julianboolean_)
·x.com·