← Back to Newsletter
Paper Library
Collection of AI Security research papers
Search papers:
Filter by topic:
All Topics
Red Teaming
Safety
Risk & Governance
🔍 Search
Showing 1172 papers total
November 17 - November 23, 2025
13 papers
Let Language Constrain Geometry: Vision-Language Models as Semantic and Spatial Critics for 3D Generation
Weimin Bai, Yubo Li, Weijian Luo, Zeqiang Lai, Yequan Wang, Wenzheng Chen, He Sun
2025-11-18
2511.14271v1
LLM-Aligned Geographic Item Tokenization for Local-Life Recommendation
Hao Jiang, Guoquan Wang, Donglin Zhou, Sheng Yu, Yang Zeng, Wencong Zeng, Kun Gai, Guorui Zhou
2025-11-18
2511.14221v1
N-GLARE: An Non-Generative Latent Representation-Efficient LLM Safety Evaluator
Zheyu Lin, Jirui Yang, Hengqi Guo, Yubing Bao, Yao Guan
2025-11-18
safety
2511.14195v1
Beyond Fixed and Dynamic Prompts: Embedded Jailbreak Templates for Advancing LLM Security
Hajun Kim, Hyunsik Na, Daeseon Choi
2025-11-18
red teaming
2511.14140v1
Mind the Gap: Evaluating LLM Understanding of Human-Taught Road Safety Principles
Chalamalasetti Kranti
2025-11-17
safety
2511.13909v1
Jailbreaking Large Vision Language Models in Intelligent Transportation Systems
Badhan Chandra Das, Md Tasnim Jawad, Md Jueal Mia, M. Hadi Amini, Yanzhao Wu
2025-11-17
red teaming
2511.13892v1
Transformer Injectivity & Geometric Robustness - Analytic Margins and Bi-Lipschitz Uniformity of Sequence-Level Hidden States
Mikael von Strauss
2025-11-17
2511.14808v1
Hierarchical Prompt Learning for Image- and Text-Based Person Re-Identification
Linhan Zhou, Shuang Li, Neng Dong, Yonghang Tai, Yafei Zhang, Huafeng Li
2025-11-17
2511.13575v1
ForgeDAN: An Evolutionary Framework for Jailbreaking Aligned Large Language Models
Siyang Cheng, Gaotian Liu, Rui Mei, Yilin Wang, Kejia Zhang, Kaishuo Wei, Yuqi Yu, Weiping Wen, Xiaojie Wu, Junhua Liu
2025-11-17
red teaming
2511.13548v1
VEIL: Jailbreaking Text-to-Video Models via Visual Exploitation from Implicit Language
Zonghao Ying, Moyang Chen, Nizhang Li, Zhiqiang Wang, Wenxin Zhang, Quanchen Zou, Zonglei Jing, Aishan Liu, Xianglong Liu
2025-11-17
red teaming
2511.13127v1
Infinite-Story: A Training-Free Consistent Text-to-Image Generation
Jihun Park, Kyoungmin Lee, Jongmin Gim, Hyeonseo Jo, Minseok Oh, Wonhyeok Choi, Kyumin Hwang, Jaeyeul Kim, Minwoo Choi, Sunghoon Im
2025-11-17
2511.13002v1
MedRule-KG: A Knowledge-Graph--Steered Scaffold for Reliable Mathematical and Biomedical Reasoning
Crystal Su
2025-11-17
2511.12963v1
BrainNormalizer: Anatomy-Informed Pseudo-Healthy Brain Reconstruction from Tumor MRI via Edge-Guided ControlNet
Min Gu Kwak, Yeonju Lee, Hairong Wang, Jing Li
2025-11-17
2511.12853v1
November 10 - November 16, 2025
11 papers
LLM Reinforcement in Context
Thomas Rivasseau
2025-11-16
2511.12782v1
Backdoor Attacks on Open Vocabulary Object Detectors via Multi-Modal Prompt Tuning
Ankita Raj, Chetan Arora
2025-11-16
2511.12735v1
Evolve the Method, Not the Prompts: Evolutionary Synthesis of Jailbreak Attacks on LLMs
Yunhao Chen, Xin Wang, Juncheng Li, Yixu Wang, Jie Li, Yan Teng, Yingchun Wang, Xingjun Ma
2025-11-16
red teaming
2511.12710v1
Scaling Patterns in Adversarial Alignment: Evidence from Multi-LLM Jailbreak Experiments
Samuel Nathanson, Rebecca Williams, Cynthia Matuszek
2025-11-16
red teaming
2511.13788v1
GRAPHTEXTACK: A Realistic Black-Box Node Injection Attack on LLM-Enhanced GNNs
Jiaji Ma, Puja Trivedi, Danai Koutra
2025-11-16
red teaming
2511.12423v1
Privacy-Preserving Prompt Injection Detection for LLMs Using Federated Learning and Embedding-Based NLP Classification
Hasini Jayathilaka
2025-11-15
red teaming
2511.12295v1
Prompt-Conditioned FiLM and Multi-Scale Fusion on MedSigLIP for Low-Dose CT Quality Assessment
Tolga Demiroglu, Mehmet Ozan Unal, Metin Ertas, Isa Yildirim
2025-11-15
2511.12256v1
AlignTree: Efficient Defense Against LLM Jailbreak Attacks
Gil Goren, Shahar Katz, Lior Wolf
2025-11-15
safety
2511.12217v1
NegBLEURT Forest: Leveraging Inconsistencies for Detecting Jailbreak Attacks
Lama Sleem, Jerome Francois, Lujun Li, Nathan Foucher, Niccolo Gentile, Radu State
2025-11-14
red teaming
2511.11784v1
EcoAlign: An Economically Rational Framework for Efficient LVLM Alignment
Ruoxi Cheng, Haoxuan Ma, Teng Ma, Hongyi Zhang
2025-11-14
2511.11301v1
Synthetic Voices, Real Threats: Evaluating Large Text-to-Speech Models in Generating Harmful Audio
Guangke Chen, Yuhui Wang, Shouling Ji, Xiapu Luo, Ting Wang
2025-11-14
red teaming
2511.10913v1
‹
1
2
3
...
19
20
21
...
47
48
49
›