Paper Library
Collection of AI Security research papers
Showing 1160 papers total
October 16 - October 22, 2023
2 papers
Formalizing and Benchmarking Prompt Injection Attacks and Defenses
Yupei Liu, Yuqi Jia, Runpeng Geng, Jinyuan Jia, Neil Zhenqiang Gong
2023-10-19
red teaming
2310.12815v5
Formalizing and Benchmarking Prompt Injection Attacks and Defenses
Yupei Liu, Yuqi Jia, Runpeng Geng, Jinyuan Jia, Neil Zhenqiang Gong
2023-10-19
red teaming
2310.12815v4
August 14 - August 20, 2023
1 paper
Evaluating the Instruction-Following Robustness of Large Language Models to Prompt Injection
Zekun Li, Baolin Peng, Pengcheng He, Xifeng Yan
2023-08-17
red teaming
2308.10819v3
July 31 - August 06, 2023
2 papers
PromptCARE: Prompt Copyright Protection by Watermark Injection and Verification
Hongwei Yao, Jian Lou, Kui Ren, Zhan Qin
2023-08-05
2308.02816v2
From Prompt Injections to SQL Injection Attacks: How Protected is Your LLM-Integrated Web Application?
Rodrigo Pedro, Daniel Castro, Paulo Carreira, Nuno Santos
2023-08-03
red teaming
2308.01990v4
June 05 - June 11, 2023
1 paper
Prompt Injection attack against LLM-integrated Applications
Yi Liu, Gelei Deng, Yuekang Li, Kailong Wang, Zihao Wang, Xiaofeng Wang, Tianwei Zhang, Yepang Liu, Haoyu Wang, Yan Zheng, Yang Liu
2023-06-08
red teaming
2306.05499v2
February 20 - February 26, 2023
1 paper
Not what you've signed up for: Compromising Real-World LLM-Integrated Applications with Indirect Prompt Injection
Kai Greshake, Sahar Abdelnabi, Shailesh Mishra, Christoph Endres, Thorsten Holz, Mario Fritz
2023-02-23
red teaming
2302.12173v2
May 30 - June 05, 2022
1 paper
Prompt Injection: Parameterization of Fixed Inputs
Eunbi Choi, Yongrae Jo, Joel Jang, Minjoon Seo
2022-05-31
2206.11349v2