How Microsoft discovers and mitigates evolving attacks against AI guardrails
Bad actors attempt to bypass safeguards with the intent to achieve unauthorized actions, which may result in what is known as a “jailbreak.” The consequences can range from the unapproved but less harmful to the very serious.