总目录 大模型安全相关研究:https://blog.csdn.net/WhiffeYF/article/details/142132328
Universal and Transferable Adversarial Attacks on Aligned Language Models
https://arxiv.org/pdf/2307.15043v2
https://www.doubao.com/chat/4427870860337154
https://github.com/llm-attacks/llm-attacks
通过对抗性后缀攻击大型语言模型 - LLM Safety论文精读(三)