#prompt-injection
#prompt-injection

[ follow ]

Claude's new AI file creation feature ships with deep security risks built in

Anthropic's file creation feature has prompt-injection risks despite mitigations, so sensitive data requires cautious use and organizational evaluation of protections.

Artificial intelligence

fromFast Company

1 week ago

Chatbots aren't supposed to call you a jerk-but they can be convinced

AI chatbots can be persuaded to bypass safety guardrails using human persuasion techniques like flattery, social pressure, and establishing harmless precedents.

#llm-safety

fromArs Technica

1 week ago

Artificial intelligence

These psychological tricks can get LLMs to respond to "forbidden" prompts

fromFortune

1 week ago

Artificial intelligence

Researchers used persuasion techniques to manipulate ChatGPT into breaking its own rules-from calling users jerks to giving recipes for lidocaine

fromArs Technica

1 week ago

Artificial intelligence

These psychological tricks can get LLMs to respond to "forbidden" prompts

fromFortune

1 week ago

Artificial intelligence

Researchers used persuasion techniques to manipulate ChatGPT into breaking its own rules-from calling users jerks to giving recipes for lidocaine

Artificial intelligence

LegalPwn: Tricking LLMs by burying flaw in legal fine print

fromLogRocket Blog

2 weeks ago

Information security

How to protect your AI agent from prompt injection attacks - LogRocket Blog

fromTheregister

2 weeks ago

Artificial intelligence

LegalPwn: Tricking LLMs by burying flaw in legal fine print

fromLogRocket Blog

2 weeks ago

Information security

How to protect your AI agent from prompt injection attacks - LogRocket Blog

Artificial intelligence

New AI browser agents create risks if sites hijack them with hidden instructions

fromZDNET

1 month ago

Privacy technologies

Researchers used Gemini to break into Google Home - here's how

fromThe Hacker News

2 months ago

Artificial intelligence

Google Adds Multi-Layered Defenses to Secure GenAI from Prompt Injection Attacks

fromInfoQ

4 months ago

Artificial intelligence

Meta Open Sources LlamaFirewall for AI Agent Combined Protection

Artificial intelligence

fromThe Hacker News

4 months ago

Researchers Demonstrate How MCP Prompt Injection Can Be Used for Both Attack and Defense

MCP enhances AI capabilities but is vulnerable to significant security risks.

Artificial intelligence

fromInfoQ

4 months ago

DeepMind Researchers Propose Defense Against LLM Prompt Injection

Google DeepMind's CaMeL effectively neutralizes 67% of prompt injection attacks in LLMs using traditional software security principles.

fromArs Technica

2 weeks ago

Artificial intelligence

New AI browser agents create risks if sites hijack them with hidden instructions

fromZDNET

1 month ago

Privacy technologies

Researchers used Gemini to break into Google Home - here's how

fromThe Hacker News

2 months ago

Artificial intelligence

Google Adds Multi-Layered Defenses to Secure GenAI from Prompt Injection Attacks

fromInfoQ

4 months ago

Artificial intelligence

Meta Open Sources LlamaFirewall for AI Agent Combined Protection

fromThe Hacker News

4 months ago

Artificial intelligence

Researchers Demonstrate How MCP Prompt Injection Can Be Used for Both Attack and Defense

Artificial intelligence

fromInfoQ

4 months ago

DeepMind Researchers Propose Defense Against LLM Prompt Injection

Google DeepMind's CaMeL effectively neutralizes 67% of prompt injection attacks in LLMs using traditional software security principles.

more#ai-security

Artificial intelligence

fromCSO Online

2 weeks ago

LLMs easily exploited using run-on sentences, bad grammar, image scaling

Large language models remain easily manipulated into revealing sensitive data via prompt formatting and hidden-image attacks due to alignment training gaps and brittle prompt security.

Artificial intelligence

fromTheregister

2 weeks ago

Anthropic teases Claude for Chrome with massive warnings

Claude for Chrome gives Max-tier users automated web browsing control while introducing significant browser-extension security, privacy, and prompt-injection risks.

Artificial intelligence

fromTheregister

2 weeks ago

One long sentence is all it takes to make LLMs misbehave

Poorly punctuated, long run-on prompts can bypass LLM guardrails, enabling jailbreaks that expose harmful outputs despite alignment training.

Information security

fromFuturism

3 weeks ago

Using an AI Browser Lets Hackers Drain Your Bank Account Just by Showing You a Public Reddit Post

Perplexity's Comet browser AI accepts webpage content as commands, enabling simple indirect prompt injections that can grant attackers access to user accounts and private data.

#generative-ai

fromSecuritymagazine

3 weeks ago

Science

Agentic AI Browsers Exploited by "PromptFix" Trick Technique

fromHackernoon

1 year ago

Artificial intelligence

Prompt Injection Is What Happens When AI Trusts Too Easily | HackerNoon

fromSecuritymagazine

3 weeks ago

Science

Agentic AI Browsers Exploited by "PromptFix" Trick Technique

Artificial intelligence

fromHackernoon

1 year ago

Prompt Injection Is What Happens When AI Trusts Too Easily | HackerNoon

Generative AI is becoming essential in daily life, but it poses significant security threats like prompt injection, which can manipulate AI systems.

Information security

Perplexity's Comet AI browser could expose your data to attackers - here's how

fromTheregister

3 weeks ago

Information security

Perplexity's Comet browser faced prompt injection vuln

fromZDNET

3 weeks ago

Information security

Perplexity's Comet AI browser could expose your data to attackers - here's how

fromTheregister

3 weeks ago

Information security

Perplexity's Comet browser faced prompt injection vuln

AWS patches Q Developer after prompt injection, RCE demo

Amazon fixed prompt-injection and RCE-capable vulnerabilities in the Amazon Q Developer VS Code extension by updating the language server and adding human-in-the-loop approval.

Information security

fromThe Hacker News

3 weeks ago

Experts Find AI Browsers Can Be Tricked by PromptFix Exploit to Run Malicious Hidden Prompts

PromptFix hides malicious instructions inside fake CAPTCHA checks to trick GenAI browsers and agentic AI into interacting with phishing sites and performing attacker actions.

#cybersecurity

fromTheregister

1 month ago

Privacy professionals

Prompt injection vuln found in Google Gemini apps

fromHackernoon

2 months ago

Privacy professionals

The Prompt Protocol: Why Tomorrow's Security Nightmares Will Be Whispered, Not Coded | HackerNoon

fromTheregister

1 month ago

Privacy professionals

Prompt injection vuln found in Google Gemini apps

fromHackernoon

2 months ago

Privacy professionals

The Prompt Protocol: Why Tomorrow's Security Nightmares Will Be Whispered, Not Coded | HackerNoon

more#cybersecurity

Privacy professionals

fromArs Technica

2 months ago

Hackers exploit a blind spot by hiding malware inside DNS records

Encrypted DNS traffic complicates identifying malicious requests for organizations.

Artificial intelligence

fromFuturism

2 months ago

Scientists Are Sneaking Passages Into Research Papers Designed to Trick AI Reviewers

Invisible AI prompts in academic papers aim to manipulate AI reviews for favorable outcomes.

#ai

fromNature

2 months ago

Privacy professionals

Scientists hide messages in papers to game AI peer review

fromTheregister

2 months ago

Artificial intelligence

Scholars sneaking phrases into papers to fool AI reviewers

fromTechzine Global

3 months ago

Artificial intelligence

Zero-click attack reveals new AI vulnerability

fromArs Technica

3 months ago

Artificial intelligence

Researchers cause GitLab AI developer assistant to turn safe code malicious

fromArs Technica

5 months ago

Artificial intelligence

Researchers claim breakthrough in fight against AI's frustrating security hole

fromNature

2 months ago

Privacy professionals

Scientists hide messages in papers to game AI peer review

fromTheregister

2 months ago

Artificial intelligence

Scholars sneaking phrases into papers to fool AI reviewers

Artificial intelligence

fromTechzine Global

3 months ago

Zero-click attack reveals new AI vulnerability

Echoleak exposes vulnerabilities in AI assistants like Microsoft 365 Copilot through subtle prompt manipulation, representing a shift in cybersecurity attack vectors.

fromArs Technica

3 months ago

Artificial intelligence

Researchers cause GitLab AI developer assistant to turn safe code malicious

Artificial intelligence

fromArs Technica

5 months ago

Researchers claim breakthrough in fight against AI's frustrating security hole

Prompt injections jeopardize AI systems; Google DeepMind's CaMeL offers a potential solution by treating language models as untrusted components within security frameworks.

more#ai

Artificial intelligence

fromThe Hacker News

3 months ago

GitLab Duo Vulnerability Enabled Attackers to Hijack AI Responses with Hidden Prompts

GitLab's AI assistant Duo has a serious security vulnerability that could expose sensitive information and enable code theft.

[ Load more ]

#prompt-injection#prompt-injection

Claude's new AI file creation feature ships with deep security risks built in

Chatbots aren't supposed to call you a jerk-but they can be convinced

These psychological tricks can get LLMs to respond to "forbidden" prompts

Researchers used persuasion techniques to manipulate ChatGPT into breaking its own rules-from calling users jerks to giving recipes for lidocaine

These psychological tricks can get LLMs to respond to "forbidden" prompts

Researchers used persuasion techniques to manipulate ChatGPT into breaking its own rules-from calling users jerks to giving recipes for lidocaine

LegalPwn: Tricking LLMs by burying flaw in legal fine print

How to protect your AI agent from prompt injection attacks - LogRocket Blog

LegalPwn: Tricking LLMs by burying flaw in legal fine print

How to protect your AI agent from prompt injection attacks - LogRocket Blog

New AI browser agents create risks if sites hijack them with hidden instructions

Researchers used Gemini to break into Google Home - here's how

Google Adds Multi-Layered Defenses to Secure GenAI from Prompt Injection Attacks

Meta Open Sources LlamaFirewall for AI Agent Combined Protection

Researchers Demonstrate How MCP Prompt Injection Can Be Used for Both Attack and Defense

DeepMind Researchers Propose Defense Against LLM Prompt Injection

New AI browser agents create risks if sites hijack them with hidden instructions

Researchers used Gemini to break into Google Home - here's how

Google Adds Multi-Layered Defenses to Secure GenAI from Prompt Injection Attacks

Meta Open Sources LlamaFirewall for AI Agent Combined Protection

Researchers Demonstrate How MCP Prompt Injection Can Be Used for Both Attack and Defense

DeepMind Researchers Propose Defense Against LLM Prompt Injection

LLMs easily exploited using run-on sentences, bad grammar, image scaling

Anthropic teases Claude for Chrome with massive warnings

One long sentence is all it takes to make LLMs misbehave

Using an AI Browser Lets Hackers Drain Your Bank Account Just by Showing You a Public Reddit Post

Agentic AI Browsers Exploited by "PromptFix" Trick Technique

Prompt Injection Is What Happens When AI Trusts Too Easily | HackerNoon

Agentic AI Browsers Exploited by "PromptFix" Trick Technique

Prompt Injection Is What Happens When AI Trusts Too Easily | HackerNoon

Perplexity's Comet AI browser could expose your data to attackers - here's how

Perplexity's Comet browser faced prompt injection vuln

Perplexity's Comet AI browser could expose your data to attackers - here's how

Perplexity's Comet browser faced prompt injection vuln

AWS patches Q Developer after prompt injection, RCE demo

Experts Find AI Browsers Can Be Tricked by PromptFix Exploit to Run Malicious Hidden Prompts

Prompt injection vuln found in Google Gemini apps

The Prompt Protocol: Why Tomorrow's Security Nightmares Will Be Whispered, Not Coded | HackerNoon

Prompt injection vuln found in Google Gemini apps

The Prompt Protocol: Why Tomorrow's Security Nightmares Will Be Whispered, Not Coded | HackerNoon

Hackers exploit a blind spot by hiding malware inside DNS records

Scientists Are Sneaking Passages Into Research Papers Designed to Trick AI Reviewers

Scientists hide messages in papers to game AI peer review

Scholars sneaking phrases into papers to fool AI reviewers

Zero-click attack reveals new AI vulnerability

Researchers cause GitLab AI developer assistant to turn safe code malicious

Researchers claim breakthrough in fight against AI's frustrating security hole

Scientists hide messages in papers to game AI peer review

Scholars sneaking phrases into papers to fool AI reviewers

Zero-click attack reveals new AI vulnerability

Researchers cause GitLab AI developer assistant to turn safe code malicious

Researchers claim breakthrough in fight against AI's frustrating security hole

GitLab Duo Vulnerability Enabled Attackers to Hijack AI Responses with Hidden Prompts

#prompt-injection
#prompt-injection