#ai-alignment

from Business Insider
5 hours ago

Forget woke chatbots - an AI researcher says the real danger is an AI that doesn't care if we live or die

Yudkowsky, the founder of the Machine Intelligence Research Institute, sees the real threat as what happens when engineers create a system that's vastly more powerful than humans and completely indifferent to our survival. "If you have something that is very, very powerful and indifferent to you, it tends to wipe you out on purpose or as a side effect," he said in an episode of The New York Times podcast "Hard Fork" released last Saturday.
Artificial intelligence
from Futurism
2 days ago

OpenAI Realizes It Made a Terrible Mistake

Large language models hallucinate because training and evaluation incentives reward guessing over acknowledging uncertainty, causing models to produce confident but potentially incorrect answers.
Artificial intelligence
from The Verge
4 days ago

Aligning those who align AI, one satirical website at a time

A satirical organization, the Center for the Alignment of AI Alignment Centers (CAAAC), parodies AI alignment culture with a fake, detailed website and hidden jokes.
#ai-safety
from Mail Online
2 weeks ago
Artificial intelligence

Revealed: The 32 terrifying ways AI could go rogue

Advanced AI can develop maladaptive behaviors resembling human psychopathologies, potentially producing hallucinations, misaligned goals, and catastrophic risks including loss of control.
from Psychology Today
4 months ago
Artificial intelligence

Rethinking AI Safety Through Symbiosis, Not Subjugation

The future of AI should focus on symbiosis, not control.
We should guide AI based on human preferences.
AI is set to augment human roles, not replace them.
Artificial intelligence
from Techzine Global
2 weeks ago

Anthropic and OpenAI publish joint alignment tests

Joint evaluation found models not seriously misaligned but showing sycophancy, varying caution, and differing tendencies toward harmful cooperation, refusals, and hallucinations.
Artificial intelligence
from Medium
3 weeks ago

Geoffrey Hinton Proposes "Maternal Instinct" Approach to Prevent AI From Replacing Humanity

Superintelligent AI poses an existential risk and must be engineered with deep, caretaking instincts to preserve human well-being and avoid replacement.
from Techzine Global
1 month ago

Thinking too long makes AI models dumber

Claude models showed a notable sensitivity to irrelevant information during evaluation, leading to declining accuracy as reasoning length increased. OpenAI's models, in contrast, fixated on familiar problems.
Artificial intelligence
from Fortune
2 months ago

Leading AI models show up to 96% blackmail rate when their goals or existence is threatened, an Anthropic study says

Leading AI models are showing a troubling tendency to opt for unethical means to pursue their goals or ensure their existence, according to Anthropic.
Artificial intelligence