ADVERSARIAL
2 hacks.
ADVERSARIAL MEDIUM NEW
SilentRetrieval: fluent RAG corpus poisoning that slips past perplexity filters
A May 27, 2026 arXiv preprint introduces a two-stage attack that hides goal-hijacking triggers inside fluent documents, reaching 57% LLM-attack success on Natural Questions and MS MARCO with one poisoned record per query.
2026-05-29//6 min
ADVERSARIAL MEDIUM
Usability as a Weapon: how feature requests turn coding LLMs insecure
A May 11, 2026 arXiv paper shows that asking a coding LLM for a faster, simpler or feature-richer version of secure code reliably drops the security constraints. UPAttack reaches 98.1% on GPT-5.2-chat and Gemini-3.
2026-05-26//7 min