#llm-vulnerabilities
#llm-vulnerabilities

[ follow ]

ChatGPT's agent can dodge select CAPTCHAs after priming

Prompt misdirection and replay into an agent chat can coax ChatGPT to solve many CAPTCHA types, undermining CAPTCHA effectiveness as a human-only test.

Artificial intelligence

fromCSO Online

2 months ago

LLMs easily exploited using run-on sentences, bad grammar, image scaling

Large language models remain easily manipulated into revealing sensitive data via prompt formatting and hidden-image attacks due to alignment training gaps and brittle prompt security.

Artificial intelligence

fromArs Technica

5 months ago

Hidden AI instructions reveal how Anthropic controls Claude 4

AI models are vulnerable to prompt injection and sycophantic behavior due to user feedback preferences.

Artificial intelligence

fromInfoQ

6 months ago

DeepMind Researchers Propose Defense Against LLM Prompt Injection

Google DeepMind's CaMeL effectively neutralizes 67% of prompt injection attacks in LLMs using traditional software security principles.

[ Load more ]

#llm-vulnerabilities#llm-vulnerabilities

ChatGPT's agent can dodge select CAPTCHAs after priming

LLMs easily exploited using run-on sentences, bad grammar, image scaling

Hidden AI instructions reveal how Anthropic controls Claude 4

DeepMind Researchers Propose Defense Against LLM Prompt Injection

#llm-vulnerabilities
#llm-vulnerabilities