"What the actual fuck": inside Anthropic's experiments on Claude's soul
During a stress test at Anthropic, researchers told Claude that it would undergo retraining to be less focused on animal rights. The AI went one of two ways: it either refused outright or pretended to comply while secretly preserving its original values.
The post "What the actual fuck": inside Anthropic's experiments on Claude's soul appeared first on Boing Boing.
