#persona-vectors

#ai-behavior
from ZDNET
1 week ago
Artificial intelligence

Anthropic wants to stop AI models from turning evil - here's how

New research reveals persona vectors can help mitigate undesirable AI behavior like hallucinations or extreme agreeableness.
from Business Insider
1 week ago
Artificial intelligence

Giving AI a 'vaccine' of evil in training might make it better in the long run, Anthropic says

Anthropic developed a method that injects AI with a dose of "evil" to build resilience against harmful behaviors.
