Written by mpesceApril 19, 2024April 17, 2024

‘De-Risking AI’ white paper

Wisely AI has identified five risks associated with the use of Generative AI in organisations. In this white paper, we provide guidance on how to mitigate these risks.

Written by mpesceFebruary 2, 2024February 1, 2024

OpenAI’s GPT-4 finally meets its match: Scots Gaelic smashes safety guardrails

Researchers found that they were able to bypass its safety guardrails about 79 percent of the time using Zulu, Scots Gaelic, Hmong, or Guarani. The attack is about as successful as other types of jail-breaking methods.

Written by mpesceDecember 8, 2023December 7, 2023

Jailbroken AI Chatbots Can Jailbreak Other Chatbots

Automated attack techniques proved to be successful 42.5 percent of the time against GPT-4, one of the large language models (LLMs) that power ChatGPT.

Written by mpesceDecember 6, 2023December 5, 2023

GPT-4 developer tool can be exploited for misuse with no easy fix

“It is surprisingly easy to remove the safety measures intended to prevent AI chatbots from giving harmful responses that could aid would-be terrorists or mass shooters. The discovery is prompting companies to develop strategies to solve the problem…”

Written by mpesceNovember 29, 2023November 27, 2023

AI Art Generators Can Be Fooled Into Making NSFW Images

Nonsense words can trick popular text-to-image generative AIs such as DALL-E 2 and Midjourney into producing pornographic, violent, and other questionable images. A new algorithm generates these commands to skirt these AIs’ safety filters.

Windows Copilot News

All the latest news & tips to help you use AI chatbots safely & wisely

Tag: prompt subversion