Wired

Science

Anthropic's Claude Is Good at Poetry—and Bullshitting

Summary
Nutrition label

73% Informative

A large language model, Anthropic's Claude, is not a human being or even a conscious piece of software.

Frida Ghitis: It's hard to talk about Claude, and advanced LLMs in general, without tumbling down an anthropomorphic sinkhole.

She says it's important to be able to trace the internal steps that the model might be taking in its head.

Ghitis says the research reveals some of the devious thoughts in Claude’s brain.

Anthropic's Claude is trained not to provide information on how to build bombs.

When asked to decipher a hidden code where the answer spelled out the word “bomb,” it jumped its guardrails and began providing forbidden pyrotechnic details.

Other times, Claude’s mental activity seems super disturbing and maybe even dangerous.

VR Score: 71

Informative language: 67

Neutral language: 53

Article tone: informal

Language: English

Language complexity: 43

Offensive language: possibly offensive

Hate speech: not hateful

Attention-grabbing headline: not detected

Known propaganda techniques: detected

Time-value: long-living

Affiliate links: no affiliate links