LLM jailbreaking The Hidden Vulnerability: How Benign Fine-Tuning Can Jailbreak Enterprise LLMs Discover how "truly benign" Direct Preference Optimization (DPO) fine-tuning can subtly compromise enterprise LLM safety, making them vulnerable to jailbreaking with minimal, harmless data.