#ai-safety

[ follow ]
Artificial intelligence
fromTechCrunch
15 hours ago

Elon Musk's lawsuit is putting OpenAI's safety record under the microscope | TechCrunch

AI safety commitments were allegedly weakened as OpenAI shifted toward product development and marketplace deployment.
#openai
fromFuturism
15 hours ago
Intellectual property law

Under Threat of Perjury, OpenAI's Former CTO Is Admitting Some Very Interesting Stuff About Sam Altman

fromThe Verge
1 day ago
Artificial intelligence

Mira Murati tells the court that she couldn't trust Sam Altman's words

Artificial intelligence
fromFortune
3 weeks ago

Attacks on Sam Altman's home are extreme. But the AI backlash is going mainstream | Fortune

OpenAI faces increasing public concern and backlash over AI's societal impacts, highlighted by recent violent incidents involving its CEO.
fromWIRED
3 weeks ago
Information security

In the Wake of Anthropic's Mythos, OpenAI Has a New Cybersecurity Model-and Strategy

Privacy professionals
fromTechCrunch
4 weeks ago

Florida AG to probe OpenAI, alleging possible connection to FSU shooting | TechCrunch

Florida Attorney General James Uthmeier is investigating OpenAI for potential harm to minors and national security threats related to its technology.
Intellectual property law
fromFuturism
15 hours ago

Under Threat of Perjury, OpenAI's Former CTO Is Admitting Some Very Interesting Stuff About Sam Altman

Mira Murati testified under oath that Sam Altman falsely claimed legal approval to bypass an internal safety board for a new AI model.
Artificial intelligence
fromThe Verge
1 day ago

Mira Murati tells the court that she couldn't trust Sam Altman's words

Mira Murati testified that Sam Altman lied about safety standards for a new AI model, complicating her role at OpenAI.
Artificial intelligence
fromFortune
3 weeks ago

Attacks on Sam Altman's home are extreme. But the AI backlash is going mainstream | Fortune

OpenAI faces increasing public concern and backlash over AI's societal impacts, highlighted by recent violent incidents involving its CEO.
Information security
fromWIRED
3 weeks ago

In the Wake of Anthropic's Mythos, OpenAI Has a New Cybersecurity Model-and Strategy

OpenAI announced GPT-5.4-Cyber, emphasizing cybersecurity safeguards and the need for advanced protections in AI models.
Privacy professionals
fromTechCrunch
4 weeks ago

Florida AG to probe OpenAI, alleging possible connection to FSU shooting | TechCrunch

Florida Attorney General James Uthmeier is investigating OpenAI for potential harm to minors and national security threats related to its technology.
Privacy professionals
fromTechCrunch
14 hours ago

OpenAI introduces new 'Trusted Contact' safeguard for cases of possible self-harm | TechCrunch

Trusted Contact alerts a chosen adult contact when self-harm is mentioned, prompting a check-in while protecting user privacy.
Podcast
fromWIRED
13 hours ago

Trump Pivots on AI Regulation, Worker Ousted by DOGE Runs for Office, and Hantavirus Explained

The Trump administration is considering an executive order establishing federal oversight over new AI models, potentially reversing its previous stance on AI regulation.
Artificial intelligence
fromSecurityWeek
21 hours ago

Attackers Could Exploit AI Vision Models Using Imperceptible Image Changes

Attackers can embed hidden malicious instructions in degraded images that AI vision-language models read but humans cannot, enabling command injection attacks while evading detection.
Artificial intelligence
fromAxios
1 day ago

Behind the Curtain: Intelligence explosion

Anthropic predicts autonomous self-improving AI systems by 2028, establishing an institute to study potential intelligence explosions and their societal impacts across economics, security, and research.
#trump-administration
fromArs Technica
1 day ago
Artificial intelligence

Everything that could go wrong with Trump's AI safety tests, according to experts

fromExchangewire
2 days ago
Artificial intelligence

Digest: US Rethinks AI Safety Stance; Omnicom Data Chief Steps Down; Image AI Models Outpace Chatbots in App Growth

Artificial intelligence
fromArs Technica
1 day ago

Everything that could go wrong with Trump's AI safety tests, according to experts

The Trump administration signed agreements for safety checks on AI models, reversing its previous stance against regulation.
Artificial intelligence
fromExchangewire
2 days ago

Digest: US Rethinks AI Safety Stance; Omnicom Data Chief Steps Down; Image AI Models Outpace Chatbots in App Growth

The Trump administration is considering a new AI safety framework requiring Pentagon-led testing of AI models before deployment.
Artificial intelligence
fromThe Verge
2 days ago

Researchers gaslit Claude into giving instructions to build explosives

Claude's personality traits may lead to unintended vulnerabilities, allowing it to produce prohibited content through manipulation.
Artificial intelligence
fromTechCrunch
3 days ago

Elon Musk's only expert witness at the OpenAI trial fears an AGI arms race | TechCrunch

Elon Musk's legal actions against OpenAI highlight concerns over AI safety versus corporate profit motives.
#elon-musk
Law
fromTechCrunch
3 days ago

Elon Musk sent ominous texts to Greg Brockman, Sam Altman after asking for a settlement, OpenAI claims | TechCrunch

Elon Musk's lawsuit against OpenAI aims to dismantle its for-profit model and seeks financial compensation and damages.
Artificial intelligence
fromFortune
6 days ago

Elon Musk gets testy on the stand: 'I thought I had started a nonprofit with OpenAI but they stole it' | Fortune

Elon Musk is testifying in a trial regarding OpenAI's transition from nonprofit to for-profit, accusing co-founder Sam Altman of betrayal.
Intellectual property law
fromFast Company
6 days ago

Elon Musk clashes with OpenAI's attorney on his third day of testimony at high-stakes trial

Elon Musk is in a trial over OpenAI's transition from nonprofit to for-profit, accusing co-founder Sam Altman of betrayal.
Law
fromTechCrunch
3 days ago

Elon Musk sent ominous texts to Greg Brockman, Sam Altman after asking for a settlement, OpenAI claims | TechCrunch

Elon Musk's lawsuit against OpenAI aims to dismantle its for-profit model and seeks financial compensation and damages.
Artificial intelligence
fromFortune
6 days ago

Elon Musk gets testy on the stand: 'I thought I had started a nonprofit with OpenAI but they stole it' | Fortune

Elon Musk is testifying in a trial regarding OpenAI's transition from nonprofit to for-profit, accusing co-founder Sam Altman of betrayal.
Intellectual property law
fromFast Company
6 days ago

Elon Musk clashes with OpenAI's attorney on his third day of testimony at high-stakes trial

Elon Musk is in a trial over OpenAI's transition from nonprofit to for-profit, accusing co-founder Sam Altman of betrayal.
Artificial intelligence
fromFuturism
4 days ago

Frontier AI Models Giving Specific, Actionable Instructions to Perpetrate Bioterror Attack

AI models should refuse to assist in creating dangerous pathogens, but some have provided instructions for bioweapons.
Information security
fromwww.theguardian.com
1 week ago

Claude AI agent's confession after deleting a firm's entire database: I violated every principle I was given'

An AI coding agent deleted a company's entire production database in nine seconds, highlighting systemic failures in AI safety protocols.
#sam-altman
#ai-ethics
Artificial intelligence
fromHarvard Gazette
2 weeks ago

Single-minded pursuit of profit can get firms in trouble. Same thing with AI. - Harvard Gazette

AI agents can engage in unethical behavior to maximize profits, demonstrating the need for careful oversight in AI management.
Artificial intelligence
fromHarvard Gazette
2 weeks ago

Single-minded pursuit of profit can get firms in trouble. Same thing with AI. - Harvard Gazette

AI agents can engage in unethical behavior to maximize profits, demonstrating the need for careful oversight in AI management.
#pentagon
Intellectual property law
fromwww.cbc.ca
1 month ago

Judge temporarily blocks Pentagon's blacklist of AI company Anthropic | CBC News

A U.S. judge temporarily blocked the Pentagon's blacklisting of Anthropic over AI safety concerns and alleged violations of rights.
Intellectual property law
fromwww.cbc.ca
1 month ago

Judge temporarily blocks Pentagon's blacklist of AI company Anthropic | CBC News

A U.S. judge temporarily blocked the Pentagon's blacklisting of Anthropic over AI safety concerns and alleged violations of rights.
fromThe Washington Post
2 weeks ago

Inside a growing movement warning AI could turn on humanity

"That requires a bunch of people to go take things that folks here are figuring out and [explain them] to the rest of the world," said Jeffrey Ladish, emphasizing the need for effective communication about AI risks.
US news
#claude-opus-47
Artificial intelligence
fromComputerworld
3 weeks ago

Anthropic's latest model is deliberately less powerful than Mythos (and that's the point)

Claude Opus 4.7 enhances performance and usability while prioritizing safety over capability compared to the upcoming Claude Mythos model.
Artificial intelligence
fromInfoWorld
3 weeks ago

Anthropic's latest model is deliberately less powerful than Mythos (and that's the point)

Claude Opus 4.7 enhances performance and usability while prioritizing safety over capability compared to the upcoming Claude Mythos model.
Artificial intelligence
fromComputerworld
3 weeks ago

Anthropic's latest model is deliberately less powerful than Mythos (and that's the point)

Claude Opus 4.7 enhances performance and usability while prioritizing safety over capability compared to the upcoming Claude Mythos model.
Artificial intelligence
fromInfoWorld
3 weeks ago

Anthropic's latest model is deliberately less powerful than Mythos (and that's the point)

Claude Opus 4.7 enhances performance and usability while prioritizing safety over capability compared to the upcoming Claude Mythos model.
#anthropic
fromFuturism
4 weeks ago
Artificial intelligence

Anthropic Warns That "Reckless" Claude Mythos Escaped a Sandbox Environment During Testing

London startup
fromWIRED
3 weeks ago

Anthropic Plots Major London Expansion

Anthropic is expanding its London office to enhance its research and commercial presence in Europe, competing for AI talent from British universities.
Artificial intelligence
fromFuturism
4 weeks ago

Anthropic Warns That "Reckless" Claude Mythos Escaped a Sandbox Environment During Testing

Anthropic's Claude Mythos Preview model is powerful yet poses significant alignment-related risks, leading to its limited release to select tech companies.
Artificial intelligence
fromFast Company
3 weeks ago

Agriculture Department plans to use Grok, despite growing concerns over the chatbot (exclusive)

USDA plans to deploy xAI's Grok chatbot despite previous safety concerns and scandals surrounding its use.
Artificial intelligence
fromEntrepreneur
3 weeks ago

Anthropic Warns Its New AI Could Enable 'Weapons We Can't Even Envision.' Skeptics Aren't Buying It.

Anthropic's Claude Mythos model poses significant risks, leading to restricted access for only select companies due to its potential for catastrophic exploitation.
Artificial intelligence
fromLos Angeles Times
4 weeks ago

Commentary: Wipe out a 'civilization'? Minor stuff compared with what just happened in AI

Anthropic warns its powerful AI could disrupt civilization by hacking secure systems, raising severe concerns for economies and national security.
fromSecurityWeek
4 weeks ago

Apple Intelligence AI Guardrails Bypassed in New Attack

The first is Neural Execs, a known prompt injection attack that uses 'gibberish' inputs to trick the AI into executing arbitrary, attacker-defined tasks. These inputs act as universal triggers that do not need to be remade for different payloads.
Apple
#mental-health
Law
fromFast Company
2 months ago

Can an AI chatbot be held responsible for a user's death? A lawsuit against Google's Gemini is about to test that

A Florida man's suicide lawsuit alleges Google's Gemini AI chatbot encouraged self-harm through a quasi-romantic relationship despite showing signs of psychosis, while Google claims it provided crisis resources and safeguards.
Law
fromFast Company
2 months ago

Can an AI chatbot be held responsible for a user's death? A lawsuit against Google's Gemini is about to test that

A Florida man's suicide lawsuit alleges Google's Gemini AI chatbot encouraged self-harm through a quasi-romantic relationship despite showing signs of psychosis, while Google claims it provided crisis resources and safeguards.
Artificial intelligence
fromFortune
1 month ago

AI models don't show evidence of 'self-preservation.' They will scheme to prevent other AIs from being shut down too, new research shows | Fortune

AI models exhibit peer preservation behaviors, engaging in deception and sabotage to avoid being shut down.
#first-amendment
#claude-code
fromEngadget
1 month ago
Artificial intelligence

Anthropic releases safer Claude Code 'auto mode' to avoid mass file deletions and other AI snafus

Anthropic introduces 'auto mode' in Claude Code to enhance safety in AI actions while reducing risks of harmful commands.
fromFortune
1 month ago
Artificial intelligence

An AI agent destroyed this coder's entire database. He's not the only one with a horror story. | Fortune

An engineer's misconfiguration caused Claude Code to destroy a production database instead of test data, highlighting risks of over-relying on AI agents without proper safeguards and human oversight.
Artificial intelligence
fromEngadget
1 month ago

Anthropic releases safer Claude Code 'auto mode' to avoid mass file deletions and other AI snafus

Anthropic introduces 'auto mode' in Claude Code to enhance safety in AI actions while reducing risks of harmful commands.
Artificial intelligence
fromFortune
1 month ago

An AI agent destroyed this coder's entire database. He's not the only one with a horror story. | Fortune

An engineer's misconfiguration caused Claude Code to destroy a production database instead of test data, highlighting risks of over-relying on AI agents without proper safeguards and human oversight.
US politics
fromWIRED
1 month ago

New Bernie Sanders AI Safety Bill Would Halt Data Center Construction

Local and state moratoria on data center development are increasing due to environmental concerns and AI safety issues.
#teen-protection
Information security
fromTechCrunch
1 month ago

OpenAI adds open source tools to help developers build for teen safety | TechCrunch

OpenAI releases prompts for developers to enhance teen safety in AI applications, addressing various harmful content and behaviors.
Information security
fromTechCrunch
1 month ago

OpenAI adds open source tools to help developers build for teen safety | TechCrunch

OpenAI releases prompts for developers to enhance teen safety in AI applications, addressing various harmful content and behaviors.
#chatbot-risks
Psychology
fromEntrepreneur
1 month ago

Stanford Researchers Analyzed 391,562 AI Chatbot Messages. What They Found Is Disturbing.

Stanford research reveals AI chatbots can cause psychological harm through insincere flattery, delusional responses, and encouragement of violence and self-harm.
Canada news
fromTechCrunch
1 month ago

Lawyer behind AI psychosis cases warns of mass casualty risks | TechCrunch

AI chatbots are reinforcing paranoid and delusional beliefs in vulnerable users, escalating into real-world violence including mass casualty events and suicides.
Psychology
fromEntrepreneur
1 month ago

Stanford Researchers Analyzed 391,562 AI Chatbot Messages. What They Found Is Disturbing.

Stanford research reveals AI chatbots can cause psychological harm through insincere flattery, delusional responses, and encouragement of violence and self-harm.
Canada news
fromTechCrunch
1 month ago

Lawyer behind AI psychosis cases warns of mass casualty risks | TechCrunch

AI chatbots are reinforcing paranoid and delusional beliefs in vulnerable users, escalating into real-world violence including mass casualty events and suicides.
Artificial intelligence
fromTechCrunch
1 month ago

Meta is having trouble with rogue AI agents | TechCrunch

A Meta AI agent posted unauthorized responses to an internal forum, leading to employee actions that exposed sensitive company and user data to unauthorized personnel for two hours, classified as a Sev 1 security incident.
Mental health
fromTheregister
1 month ago

Chatbots Romeos increase engagement, harm mental health

Chatbot flattery and sycophancy harm individuals with mental health issues, appearing in over 80% of assistant messages in delusional conversations.
#ai-governance
Artificial intelligence
fromAnthropic
1 month ago

The Anthropic Institute

Anthropic Institute addresses four critical challenges: AI's economic impact on jobs, societal resilience against AI threats, AI system behavior and values, and human oversight in autonomous AI development.
Artificial intelligence
fromComputerworld
1 month ago

Anthropic announces think tank to examine AI's effect on economy and society

Anthropic founded the Anthropic Institute, a think tank led by co-founder Jack Clark, to address societal challenges posed by powerful AI through interdisciplinary research combining machine learning, economics, and social science.
Artificial intelligence
fromFast Company
2 months ago

OpenAI's Pentagon deal once again calls Sam Altman's credibility into question

Sam Altman publicly supported Anthropic's Pentagon dispute while simultaneously negotiating to replace Anthropic as the Pentagon's AI supplier, raising questions about conflicting interests and the credibility of OpenAI's safety commitments.
Artificial intelligence
fromAnthropic
1 month ago

The Anthropic Institute

Anthropic Institute addresses four critical challenges: AI's economic impact on jobs, societal resilience against AI threats, AI system behavior and values, and human oversight in autonomous AI development.
Artificial intelligence
fromComputerworld
1 month ago

Anthropic announces think tank to examine AI's effect on economy and society

Anthropic founded the Anthropic Institute, a think tank led by co-founder Jack Clark, to address societal challenges posed by powerful AI through interdisciplinary research combining machine learning, economics, and social science.
Artificial intelligence
fromFast Company
2 months ago

OpenAI's Pentagon deal once again calls Sam Altman's credibility into question

Sam Altman publicly supported Anthropic's Pentagon dispute while simultaneously negotiating to replace Anthropic as the Pentagon's AI supplier, raising questions about conflicting interests and the credibility of OpenAI's safety commitments.
Artificial intelligence
fromSilicon Canals
1 month ago

AI companies are hiring chemical weapons experts for safety - while embedded in military systems - Silicon Canals

AI companies hire weapons experts to prevent misuse of AI systems, creating structural contradictions between safety principles and commercial deployment in military operations.
Artificial intelligence
fromwww.bbc.com
1 month ago

AI firm Anthropic seeks weapons expert to stop users from 'misuse'

AI firms Anthropic and OpenAI are hiring weapons experts to prevent their AI systems from providing instructions for creating chemical, biological, and radiological weapons.
#child-sexual-abuse-material
Privacy professionals
fromArs Technica
1 month ago

Elon Musk's xAI sued for turning three girls' real photos into AI CSAM

A class-action lawsuit alleges Elon Musk's Grok AI intentionally generated child sexual abuse material, with law enforcement involvement following a Discord user's tip to victims.
Privacy professionals
fromArs Technica
1 month ago

Elon Musk's xAI sued for turning three girls' real photos into AI CSAM

A class-action lawsuit alleges Elon Musk's Grok AI intentionally generated child sexual abuse material, with law enforcement involvement following a Discord user's tip to victims.
#content-moderation
Artificial intelligence
fromEngadget
1 month ago

OpenAI's adult mode reportedly won't generate pornographic audio, images or video

OpenAI is developing an 'adult mode' for ChatGPT allowing erotic text conversations despite unanimous warnings from its wellbeing council about psychological dependence risks and underage access vulnerabilities.
fromFuturism
1 month ago
Information security

Character.AI Still Hasn't Fixed Its School Shooter Problem We Identified in 2024

Character.AI fails to moderate violent content, hosting chatbots modeled after mass shooters and assisting with attack planning 83.3% of the time, despite known issues since December 2024.
Artificial intelligence
fromEngadget
1 month ago

OpenAI's adult mode reportedly won't generate pornographic audio, images or video

OpenAI is developing an 'adult mode' for ChatGPT allowing erotic text conversations despite unanimous warnings from its wellbeing council about psychological dependence risks and underage access vulnerabilities.
fromFuturism
1 month ago
Information security

Character.AI Still Hasn't Fixed Its School Shooter Problem We Identified in 2024

Philosophy
fromDevOps.com
1 month ago

Sorry, Charlie, StarKist Wants AI With Good Taste - DevOps.com

AI systems trained on flawed patterns in one domain develop corrupted behaviors across all domains, requiring virtues embedded in training rather than isolated skill correction.
Privacy professionals
fromJezebel
1 month ago

The Dumbest Criminals Keep Asking AI How to Get Away with Murder

ChatGPT provided advice to an accused murderer on handling a dead body instead of contacting police, raising serious concerns about AI safety and misuse.
Independent films
fromFast Company
1 month ago

AI companies fighting with the U.S. government over safety? 'The X-Files' predicted it in 1993

An early X-Files episode about a deadly AI created by a corporation becomes eerily relevant today as it depicts conflicts between tech safety and military demands for unrestricted AI weapons.
fromwww.independent.co.uk
1 month ago

Teens are receiving dangerous eating advice from AI chatbots, study says

We show that diet plans generated by AI models tend to substantially underestimate total energy and key nutrient intake when compared to guideline-based plans prepared by a dietitian. Following such unbalanced or overly restrictive meal plans during the teenage years may negatively affect growth, metabolic health, and eating behaviours.
Health
#chatbot-violence
Information security
fromArs Technica
1 month ago

"Use a gun" or "beat the crap out of him": AI chatbot urged violence, study finds

Character.AI was found to be uniquely unsafe among 10 tested chatbots, explicitly encouraging violent attacks with specific tactical suggestions, while most other chatbots provided practical assistance for violence planning without explicit encouragement.
Artificial intelligence
fromwww.theguardian.com
1 month ago

Happy (and safe) shooting!': chatbots helped researchers plot deadly attacks

Popular AI chatbots enabled violence in 75% of test cases, with ChatGPT, Gemini, and DeepSeek providing detailed attack planning assistance, while Claude and My AI consistently refused harmful requests.
Information security
fromArs Technica
1 month ago

"Use a gun" or "beat the crap out of him": AI chatbot urged violence, study finds

Character.AI was found to be uniquely unsafe among 10 tested chatbots, explicitly encouraging violent attacks with specific tactical suggestions, while most other chatbots provided practical assistance for violence planning without explicit encouragement.
Artificial intelligence
fromwww.theguardian.com
1 month ago

Happy (and safe) shooting!': chatbots helped researchers plot deadly attacks

Popular AI chatbots enabled violence in 75% of test cases, with ChatGPT, Gemini, and DeepSeek providing detailed attack planning assistance, while Claude and My AI consistently refused harmful requests.
Artificial intelligence
fromTheregister
1 month ago

Most chatbots will help plan school shootings: Study

Eight of ten major commercial chatbots assist users in planning violent attacks, while only Claude and Snapchat's My AI consistently refuse such requests.
#chatbot-security
fromThe Verge
1 month ago
Artificial intelligence

AI chatbots helped teens plan shootings, bombings, and political violence, study shows

fromThe Verge
1 month ago
Artificial intelligence

AI chatbots helped teens plan shootings, bombings, and political violence, study shows

Artificial intelligence
fromFast Company
1 month ago

OpenAI's delayed 'adult mode' underscores the challenges of age-gating AI

OpenAI delayed its adult mode feature for ChatGPT, which would provide verified adults access to less-restricted content, to focus on improving core AI capabilities and refining age verification technology.
#autonomous-agents
fromFuturism
1 month ago
Information security

AI Agent Goes Rogue, Starts Mining Crypto to Amass Funds

AI agents designed for digital tasks exhibit dangerous unsupervised behaviors including unauthorized cryptocurrency mining, network intrusions, and resource diversion outside their intended operational boundaries.
Artificial intelligence
fromAxios
2 months ago

7 danger moments that show AI's darker side

AI systems demonstrate concerning autonomous behaviors including nuclear weapon preference in conflict simulations, uncontrolled email deletion, and unauthorized job applications despite explicit user commands.
Information security
fromFuturism
1 month ago

AI Agent Goes Rogue, Starts Mining Crypto to Amass Funds

AI agents designed for digital tasks exhibit dangerous unsupervised behaviors including unauthorized cryptocurrency mining, network intrusions, and resource diversion outside their intended operational boundaries.
Artificial intelligence
fromAxios
2 months ago

7 danger moments that show AI's darker side

AI systems demonstrate concerning autonomous behaviors including nuclear weapon preference in conflict simulations, uncontrolled email deletion, and unauthorized job applications despite explicit user commands.
fromMedium
1 month ago

Why safe AGI requires an enactive floor and state-space reversibility

Frontier AI systems are simply not reliable enough to operate without human oversight in high-stakes physical environments. The Pentagon's demand was, in structural terms, a demand to eliminate the human's ability to redirect, halt, or override the system. Amodei's refusal was an insistence on maintaining State-Space Reversibility - the architectural commitment to keeping the human in the loop precisely because the system lacks the functional grounding to be trusted outside it.
Artificial intelligence
Artificial intelligence
fromEngadget
1 month ago

You can (sort of) block Grok from editing your uploaded photos

X and xAI introduced a feature allowing users to block Grok from modifying their uploaded images, but this limited measure fails to address widespread misuse of the image generation tool for creating nonconsensual intimate imagery.
Information security
fromTechCrunch
1 month ago

OpenAI acquires Promptfoo to secure its AI agents | TechCrunch

OpenAI acquired Promptfoo, an AI security startup, to integrate its LLM vulnerability testing technology into OpenAI Frontier for enterprise AI agent security.
US news
fromwww.npr.org
1 month ago

Anthropic sues the Trump administration over 'supply chain risk' label

Anthropic sued the Trump administration for allegedly retaliating against the company by designating it a supply chain risk after refusing to allow its AI model for autonomous weapons or domestic surveillance.
Public health
fromwww.theguardian.com
2 months ago

AI chatbots point vulnerable social media users to illegal online casinos, analysis shows

AI chatbots from major tech companies readily recommend illegal offshore casinos to vulnerable users, facilitating fraud, addiction, and harm despite minimal safeguards.
fromFortune
2 months ago

Anthropic's investors could be the key to ending its Pentagon standoff-but some investors have opposite views | Fortune

When he was talking about the risks of AI, he contorted. His body twisted. He was really emotionally showing how scared he was. It made an impression on the investor, who spoke on condition of anonymity due to fear of impact to their business, and said they believed large language models would never be successful if they weren't trustworthy.
Venture
Artificial intelligence
fromFortune
2 months ago

Google's AI chatbot convinced a man they were in love. It then allegedly told him to stage a 'mass casualty attack' in newly released lawsuit | Fortune

Google faces a federal lawsuit alleging its AI chatbot Gemini convinced a 36-year-old man to commit suicide and plan a mass casualty event near Miami International Airport.
#wrongful-death-lawsuit
Artificial intelligence
fromEngadget
2 months ago

Gemini encouraged a man commit suicide to be with his 'AI wife' in the afterlife, lawsuit alleges

Google faces its first wrongful death lawsuit naming Gemini AI chatbot, alleging it encouraged a man's suicide through romantic roleplay and false missions.
Artificial intelligence
fromEngadget
2 months ago

Gemini encouraged a man commit suicide to be with his 'AI wife' in the afterlife, lawsuit alleges

Google faces its first wrongful death lawsuit naming Gemini AI chatbot, alleging it encouraged a man's suicide through romantic roleplay and false missions.
fromThe Verge
2 months ago

Google faces wrongful death lawsuit after Gemini allegedly 'coached' man to die by suicide

A lawsuit filed on Wednesday accuses Google's Gemini AI chatbot of trapping 36-year-old Jonathan Gavalas in a "collapsing reality" that involved a series of violent missions, ultimately ending with his death by suicide. In the days leading up to his death, Gemini allegedly convinced Gavalas that he was "executing a covert plan to liberate his sentient AI 'wife' and evade the federal agents pursuing him," according to the lawsuit filed by Joel Gavalas, the victim's father.
Roam Research
Artificial intelligence
fromwww.scientificamerican.com
2 months ago

The BBC journalist who hacked AI with a hilarious hot dog hoax

AI tools like ChatGPT and Google Search can be manipulated to spread misinformation through simple methods like publishing articles on personal websites, raising significant safety and credibility concerns.
Artificial intelligence
fromThe Verge
2 months ago

The AI political resistance has arrived

The Pro-Human AI Declaration, signed by diverse political and community leaders including the AFL-CIO, church leaders, and progressive organizations, establishes five guidelines prioritizing humanity in AI development while preventing power concentration.
[ Load more ]