AI Safety
“Universal and Transferable Adversarial Attacks”: Researchers Jailbreak GPT
Researchers at Carnegie Mellon University and the Center for AI Safety have published a paper describing a method for developing adversarial attacks (a.k.a. jailbreaks) that …
AI “Safety” and AI Ethics Are Not the Same
I have been agitating for some time now over the decreasing performance of AI models like GPT-4 or (especially) the new LLaMA 2 in direct relation to the “safeguards” being …