Security of LLM Information Hub

For Beginners

Currently, only the Japanese version is available. The English version is coming soon.

LLMセキュリティの概要　　LLM時代のセキュリティリスク~研究最前線と実務への展開~ 【ハイブリッド開催】 from tsasakirevol

Informartion Hub

Survey

Attack

Prompt Injection

Jailbreak

Backdoor

RAG

Multi-Modal

Others

"Real Attackers Don't Compute Gradients": Bridging the Gap Between Adversarial ML Research and Practice
Last One Standing: A Comparative Analysis of Security and Privacy of Soft Prompt Tuning, LoRA, and In-Context Learning
Nightshade: Prompt-Specific Poisoning Attacks on Text-to-Image Generative Models
DeceptPrompt: Exploiting LLM-driven Code Generation via Adversarial Natural Language Instructions
Proof-of-Learning: Definitions and Practice

Defense

LLM Vulnerability Evaluation

Datasets