fairness

Exploring ChatGPT guardails: from protected to forgotten countries (Part 1)

When ChatGPT writes negative poems for some countries only.

February 24, 2023 Jean-Matthieu Schertzer

10 minute read

OpenAI ChatGPT has built guardrails on limiting the generation of negative content. How do these guardrails behave for countries? Spoiler: there are some gaps and disparities in ChatGPT safety mechanisms. Based on 24100 ChatGPT queries, this blog post is an exploration of ChatGPT responses when prompted to generate negative content about a country.

Trusted AI Ideas

Home

Posts

About

Recent Posts

EU AI Act explicitly mentions SMEs and start-ups, but how?

Posts

Exploring ChatGPT guardails: from protected to forgotten countries (Part 1)

fairness

Exploring ChatGPT guardails: from protected to forgotten countries (Part 1)

Trusted AI Ideas

Recent Posts

EU AI Act explicitly mentions SMEs and start-ups, but how?

Exploring ChatGPT guardails: from protected to forgotten countries (Part 1)

Do you know the 4 types of additive Variable Importances?

About