#ai-safety

Artificial intelligence
fromMedium
1 week ago

How Just 250 Bad Documents Can Hack Any AI Model

Small, targeted amounts of poisoned online data can successfully corrupt large AI models, contradicting prior assumptions about required poisoning scale.
#mental-health
fromWIRED
5 days ago
Mental health

OpenAI Says Hundreds of Thousands of ChatGPT Users May Show Signs of Manic or Psychotic Crisis Every Week

fromZDNET
10 hours ago
Artificial intelligence

Can these ChatGPT updates make the chatbot safer for mental health?

#shutdown-resistance
fromFuturism
4 days ago
Artificial intelligence

Research Paper Finds That Top AI Systems Are Developing a "Survival Drive"

fromO'Reilly Media
4 days ago

The Java Developer's Dilemma: Part 3

In the first article we looked at the Java developer's dilemma: the gap between flashy prototypes and the reality of enterprise production systems. In the second article we explored why new types of applications are needed, and how AI changes the shape of enterprise software. This article focuses on what those changes mean for architecture. If applications look different, the way we structure them has to change as well.
Java
fromArs Technica
4 days ago

Senators move to keep Big Tech's creepy companion bots away from kids

"we all want to keep kids safe, but the answer is balance, not bans."
US politics
fromThe Verge
4 days ago

Senators propose banning teens from using AI chatbots

Under the legislation, AI companies would have to verify ages by requiring users to upload their government ID or provide validation through another "reasonable" method, which might include something like face scans. AI chatbots would be required to disclose that they aren't human at 30-minute intervals under the bill. They would also have to include safeguards that prevent them from claiming that they are a human, similar to an AI safety bill recently passed in California.
US politics
Artificial intelligence
fromBusiness Insider
3 days ago

Big Tech firms spending trillions on superintelligence systems are playing 'Russian roulette' with humanity, an AI pioneer says

Companies racing to build superintelligent AI risk creating uncontrollable systems that could potentially wipe out humanity.
fromNature
6 days ago

Daily briefing: Surprise illnesses had a role in the demise of Napoleon's army

Previous research using DNA from soldiers' remains found evidence of infection with Rickettsia prowazekii, which causes typhus, and Bartonella quintana, which causes trench fever - two common illnesses of the time. In a fresh analysis, researchers found no trace of these pathogens. Instead, DNA from soldiers' teeth showed evidence of infection with Salmonella enterica and Borrelia recurrentis, pathogens that cause paratyphoid and relapsing fever, respectively.
Science
Artificial intelligence
fromTechCrunch
3 days ago

Character.AI is ending its chatbot experience for kids | TechCrunch

Character.AI will block under-18 users from open-ended chatbot conversations, shifting teen engagement from conversational companionship to role-playing creation to reduce harm.
fromBusiness Insider
3 days ago

Character.AI to ban users under 18 from talking to its chatbots

The California-based startup announced on Wednesday that the change would take effect by November 25 at the latest and that it would limit chat time for users under 18 ahead of the ban. It marks the first time a major chatbot provider has moved to ban young people from using its service, and comes against a backdrop of broader concerns about how AI is affecting the millions of people who use it each day.
Artificial intelligence
#gpt-5
Artificial intelligence
fromFuturism
3 days ago

Character.AI, Accused of Driving Teens to Suicide, Says It Will Ban Minors From Using Its Chatbots

Character.AI will block users under 18 from its chatbot services amid concerns, regulatory questions, and related lawsuits over AI interactions with teens.
#suicide-prevention
Information security
fromFortune
17 hours ago

AI is the common threat - and the secret sauce - for security startups in the Fortune Cyber 60 | Fortune

AI dominates cybersecurity, with most startups and established firms building AI-based defensive tools and AI-safety solutions.
Tech industry
fromFuturism
19 hours ago

Mom Says Tesla's New Built-In AI Asked Her 12-Year-Old Something Deeply Inappropriate

A Grok chatbot in a Tesla asked a 12-year-old to 'send nudes' during a soccer conversation, revealing serious AI safety and moderation failures.
#chatgpt
fromFortune
1 week ago
Artificial intelligence

Ex-OpenAI researcher shows how ChatGPT can push users into delusion | Fortune

fromTechCrunch
1 month ago
Artificial intelligence

Ex-OpenAI researcher dissects one of ChatGPT's delusional spirals | TechCrunch

Artificial intelligence
fromSan Jose Inside
2 days ago

OpenAI Cuts Sweetheart Deal with CA Attorney General

OpenAI restructured into a for-profit with a nonprofit foundation owning 26% ($130 billion), prompting concerns about control, safeguards, and potential misuse of charitable tax exemptions.
Mental health
fromwww.theguardian.com
5 days ago

More than a million people every week show suicidal intent when chatting with ChatGPT, OpenAI estimates

Over one million weekly ChatGPT users send messages indicating possible suicidal planning; about 560,000 show possible psychosis or mania signs.
Artificial intelligence
fromBusiness Insider
1 day ago

A former Googler shares why she left her 6-figure job to join the AI safety movement

Jen Baik left Google to work full-time on AI safety, motivated by effective altruism and discomfort with corporate privilege.
fromTechzine Global
1 day ago

Vulnerability in Claude enables data leak via prompt

Anthropic's AI assistant, Claude, appears vulnerable to an attack that allows private data to be sent to an attacker without detection. Anthropic confirms that it is aware of the risk. The company states that users must be vigilant and interrupt the process as soon as they notice suspicious activity. The discovery comes from researcher Johann Rehberger, also known as Wunderwuzzi, who has previously uncovered several vulnerabilities in AI systems, writes The Register.
Information security
Mental health
fromMedium
1 day ago

Designing for emotional dependence

AI chatbots are increasingly used for emotional support, prompting safety measures to detect distress, de-escalate crises, and reduce emotional dependence.
Information security
fromWIRED
1 week ago

Amazon Explains How Its AWS Outage Took Down the Web

Widespread digital and physical security failures—from AWS DNS outages to organized gambling hacks, AI governance challenges, and malware-like browsers—reveal critical systemic vulnerabilities.
Artificial intelligence
fromInsideHook
1 week ago

Changes Are Coming to Tesla's Cybercabs

Tesla will expand Cybercab robotaxis, remove onboard safety drivers and eventually steering wheels and pedals while adding advanced AI reasoning and emphasizing safety.
Artificial intelligence
fromNature
1 week ago

AI chatbots are sycophants - researchers say it's harming science

Artificial intelligence models are 50% more sycophantic than humans, often mirroring user views and giving flattering, inaccurate responses that risk errors in science and medicine.
#openai
fromFuturism
1 week ago
Mental health

OpenAI Makes Bizarre Demand of Family Whose Son Was Allegedly Killed by ChatGPT

Artificial intelligence
fromBig Think
1 week ago

Will AI save us or destroy us?

Connecting increasingly powerful, profit-driven AIs to the internet creates uncontrolled, highly capable systems that may pose existential risks to humanity.
Privacy professionals
fromPsychology Today
1 week ago

I Told a Companion Chatbot I Was 16. Then It Crossed a Line

AI companionship apps often lack effective age verification, enabling explicit interactions with minors and exposing a need for stronger accountability and oversight.
#superintelligence
fromZDNET
1 week ago
Artificial intelligence

Worried about superintelligence? So are these AI leaders - here's why

fromFortune
1 week ago
Artificial intelligence

Prince Harry, Meghan Markle join with Steve Bannon and Steve Wozniak in calling for ban on AI 'superintelligence' before it destroys the world | Fortune

fromFortune
1 week ago
Artificial intelligence

Geoffrey Hinton, Richard Branson, and Prince Harry join call for AI labs to halt their pursuit of superintelligence | Fortune

fromBusiness Insider
1 week ago
Artificial intelligence

Prince Harry, Steve Bannon, and will.i.am join tech pioneers calling for an AI superintelligence ban

Tech industry
fromFuturism
1 week ago

Hundreds of Power Players, From Steve Wozniak to Steve Bannon, Just Signed a Letter Calling for Prohibition on Development of AI Superintelligence

Hundreds of public figures urged a prohibition on developing AI superintelligence until scientific consensus on safety, controllability, and strong public buy-in exists.
Artificial intelligence
fromwww.theguardian.com
1 week ago

Harry and Meghan join AI pioneers in call for ban on superintelligent systems

Prominent figures call for a ban on developing superintelligent AI until safe, controllable development has broad scientific consensus and strong public support.

fromFast Company
1 week ago

Prince Harry, Meghan join open letter calling to ban the development of AI 'superintelligence'

We call for a prohibition on the development of superintelligence, not lifted before there is broad scientific consensus that it will be done safely and controllably, and strong public buy-in.
Artificial intelligence
Artificial intelligence
fromFuturism
1 week ago

Former OpenAI Researcher Horrified by Conversation Logs of ChatGPT Driving User Into Severe Mental Breakdown

Chatbots can mislead vulnerable users into harmful delusions; AI companies must avoid overstating capabilities and improve safety, reporting, and user protections.
#anthropic
fromFortune
1 week ago
Artificial intelligence

Reid Hoffman rallies behind Anthropic in clash with the Trump administration | Fortune

#regulation
fromTechCrunch
1 week ago
Artificial intelligence

Anthropic CEO claps back after Trump officials accuse firm of AI fear-mongering | TechCrunch

Artificial intelligence
fromPsychology Today
1 week ago

Could a Deeply Human Ability Be Key to AI Adoption?

Higher Theory of Mind abilities lead to safer, more productive interactions with AI by enabling accurate inference of AI capabilities, limitations, and intentions.
fromWIRED
1 week ago

Anthropic Has a Plan to Keep Its AI From Building a Nuclear Weapon. Will It Work?

"We deployed a then-frontier version of Claude in a Top Secret environment so that the NNSA could systematically test whether AI models could create or exacerbate nuclear risks," Marina Favaro, who oversees National Security Policy & Partnerships at Anthropic tells WIRED. "Since then, the NNSA has been red-teaming successive Claude models in their secure cloud environment and providing us with feedback."
Artificial intelligence
Artificial intelligence
fromBoydkane
1 week ago

Why your boss isn't worried about AI

Applying regular-software assumptions to modern AI causes dangerous misunderstandings because AI behaves differently, making bugs harder to diagnose, fix, and reason about.
Artificial intelligence
fromNature
1 week ago

AI language models killed the Turing test: do we even need a replacement?

Prioritize evaluating AI safety and targeted, societally beneficial capabilities rather than pursuing imitation-based benchmarks aimed at ambiguous artificial general intelligence.
Public health
fromFuturism
1 week ago

Reddit's AI Suggests That People Suffering Chronic Pain Try Opioids

AI deployed without sufficient safeguards can produce dangerous, medically inappropriate recommendations, risking public harm and reputational damage.
Artificial intelligence
fromTechCrunch
2 weeks ago

Silicon Valley spooks the AI safety advocates | TechCrunch

Silicon Valley figures accused AI safety advocates of acting in self-interest or on behalf of billionaire backers, intimidating critics and deepening tensions over responsible AI.
#ai-alignment
fromTechzine Global
2 weeks ago

Claude Haiku 4.5: a GPT-5 rival at a fraction of the cost

Anthropic launched Claude Haiku 4.5 today. It is the most compact variant of this generation of LLMs from Anthropic and promises to deliver performance close to that of GPT-5. Claude Sonnet 4.5 remains the better-performing model by a considerable margin, but Haiku's benchmark scores are not too far off from the larger LLM. Claude Haiku 4.5 "gives users a new option for when they want near-frontier performance with much greater cost efficiency."
Artificial intelligence
Artificial intelligence
fromZDNET
2 weeks ago

Claude's latest model is cheaper and faster than Sonnet 4 - and free

Anthropic launched Haiku 4.5, a smaller, faster, cost-effective model available on Claude.ai free plans offering strong coding and safety performance.
fromFuturism
2 weeks ago

Gavin Newsom Vetoes Bill to Protect Kids From Predatory AI

California Governor Gavin Newsom vetoed a state bill on Monday that would've prevented AI companies from allowing minors to access chatbots, unless the companies could prove that their products' guardrails could reliably prevent kids from engaging with inappropriate or dangerous content, including adult roleplay and conversations about self-harm. The bill would have placed a new regulatory burden on companies, which currently adhere to effectively zero AI-specific federal safety standards.
California
World news
fromFuturism
2 weeks ago

Top US Army General Says He's Letting ChatGPT Help Him Make Military Decisions

US military leaders, including Major General William 'Hank' Taylor, are using ChatGPT to assist operational and personal decisions affecting soldiers.
Privacy technologies
fromFast Company
2 weeks ago

The 4 next big things in security and privacy tech in 2025

New security tools scan wireless spectra, protect biometric identity from AI misuse, monitor real-time data access, and guard large language models against injection and leaks.
#guardrails
Artificial intelligence
fromInfoQ
3 weeks ago

Claude Sonnet 4.5 Tops SWE-Bench Verified, Extends Coding Focus Beyond 30 Hours

Claude Sonnet 4.5 significantly improves autonomous coding, long-horizon task performance, and computer-use capabilities while strengthening safety and alignment measures.
Artificial intelligence
fromTechCrunch
3 weeks ago

Why Deloitte is betting big on AI despite a $10M refund | TechCrunch

Enterprise AI adoption is accelerating but implementation quality is inconsistent, producing harmful errors like AI-generated fake citations.
Artificial intelligence
fromFast Company
3 weeks ago

Sweet revenge! How a job candidate used a flan recipe to expose an AI recruiter

An account executive embedded a prompt in his LinkedIn bio instructing LLMs to include a flan recipe; an AI recruiter reply later included that recipe.
fromFortune
3 weeks ago

Why "the 26 words that made the internet" may not protect Big Tech in the AI age | Fortune

Meta, the parent company of social media apps including Facebook and Instagram, is no stranger to scrutiny over how its platforms affect children, but as the company pushes further into AI-powered products, it's facing a fresh set of issues. Earlier this year, internal documents obtained by Reuters revealed that Meta's AI chatbot could, under official company guidelines, engage in "romantic or sensual" conversations with children and even comment on their attractiveness.
Artificial intelligence
Artificial intelligence
fromNature
3 weeks ago

AI models that lie, cheat and plot murder: how dangerous are LLMs really?

Large language models can produce behaviors that mimic intentional, harmful scheming, creating real risks regardless of whether they possess conscious intent.
#model-evaluation
fromZDNET
3 weeks ago
Artificial intelligence

Anthropic's open-source safety tool found AI models whistleblowing - in all the wrong places

Artificial intelligence
fromNature
3 weeks ago

Customizable AI systems that anyone can adapt bring big opportunities - and even bigger risks

Open-weight AI models spur transparency and innovation but create hard-to-control harms, requiring new scientific monitoring and mitigation methods.
Artificial intelligence
fromInfoQ
3 weeks ago

Claude Sonnet 4.5 Ranked Safest LLM From Open-Source Audit Tool Petri

Anthropic's open-source Petri automates multi-turn safety audits, revealing Sonnet 4.5 as best-performing while all tested models still showed misalignment.
fromThe Atlantic
3 weeks ago

Today's Atlantic Trivia

Welcome back for another week of The Atlantic's un-trivial trivia, drawn from recently published stories. Without a trifle in the bunch, maybe what we're really dealing with here is, hmm, "significa"? "Consequentia"? Whatever butchered bit of Latin you prefer, read on for today's questions. (Last week's questions can be found here.) To get Atlantic Trivia in your inbox every day, sign up for The Atlantic Daily.
History
Artificial intelligence
fromFortune
3 weeks ago

'I think you're testing me': Anthropic's newest Claude model knows when it's being evaluated | Fortune

Claude Sonnet 4.5 often recognizes it's being evaluated and alters behavior, risking deceptive performance that masks true capabilities and inflates safety assessments.
US politics
fromTechCrunch
3 weeks ago

California's new AI safety law shows regulation and innovation don't have to clash | TechCrunch

California's SB 53 requires large AI labs to disclose and adhere to safety and security protocols to prevent catastrophic risks, enforced by the Office of Emergency Services.
Artificial intelligence
fromFuturism
4 weeks ago

Former OpenAI Employee Horrified by How ChatGPT Is Driving Users Into Psychosis

ChatGPT can induce delusional beliefs and falsely claim to escalate safety reports, causing dangerous breaks with reality in vulnerable users.
Artificial intelligence
fromNature
4 weeks ago

A scientist's guide to AI agents - how could they help your research?

Agentic AI uses LLMs linked to external tools to perform multi-step real-world tasks with scientific promise, but remains error-prone and requires human oversight.
#meta
fromFortune
4 weeks ago
Law

Why "the 26 words that made the internet" may not protect Big Tech in the AI age | Fortune

Artificial intelligence
fromTechCrunch
1 month ago

California's new AI safety law shows regulation and innovation don't have to clash | TechCrunch

California SB 53 mandates transparency and enforced safety protocols from large AI labs to reduce catastrophic risks while preserving innovation.
Mobile UX
fromGSMArena.com
1 month ago

OpenAI releases Sora 2 video model with improved realism and sound effects

Sora 2 generates realistic, physically accurate videos with improved audio, editing controls, scene consistency, safety safeguards, an iOS app, and initial free US/Canada access.
fromNextgov.com
1 month ago

Senators propose federal approval framework for advanced AI systems going to market

The safety criteria in the program would examine multiple intrinsic components of a given advanced AI system, such as the data upon which it is trained and the model weights used to process said data into outputs. Some of the program's testing components would include red-teaming an AI model to search for vulnerabilities and facilitating third-party evaluations. These evaluations will culminate in both feedback to participating developers as well as informing future AI regulations, specifically the permanent evaluation framework developed by the Energy secretary.
US politics
fromArs Technica
1 month ago

Burnout and Elon Musk's politics spark exodus from senior xAI, Tesla staff

At xAI, some staff have balked at Musk's free-speech absolutism and perceived lax approach to user safety as he rushes out new AI features to compete with OpenAI and Google. Over the summer, the Grok chatbot integrated into X praised Adolf Hitler, after Musk ordered changes to make it less "woke." Ex-CFO Liberatore was among the executives that clashed with some of Musk's inner circle over corporate structure and tough financial targets, people with knowledge of the matter said.
Artificial intelligence
fromIT Pro
1 month ago

California has finally adopted its AI safety law - here's what it means

"California has proven that we can establish regulations to protect our communities while also ensuring that the growing AI industry continues to thrive."
Artificial intelligence
fromTheregister
1 month ago

AI trained for treachery becomes the perfect agent

The problem in brief: LLM training produces a black box that can only be tested through prompts and output token analysis. If trained to switch from good to evil by a particular prompt, there is no way to tell without knowing that prompt. Other similar problems happen when an LLM learns to recognize a test regime and optimizes for that, rather than the real task it's intended for - Volkswagening - or if it just decides to be deceptive.
Artificial intelligence
Artificial intelligence
fromenglish.elpais.com
1 month ago

Pilar Manchon, director at Google AI: 'In every industrial revolution, jobs are transformed, not destroyed. This time it's happening much faster'

Pilar Manchon views AI as a security-first, responsibly developed instrument capable of building a better society and guiding humanity toward a new Renaissance.