ideas on Trusted AI Ideas

ideas on Trusted AI Ideas /tags/ideas/ Recent content in ideas on Trusted AI Ideas Hugo -- gohugo.io en-us Fri, 24 Feb 2023 12:16:19 +0100 Exploring ChatGPT guardails: from protected to forgotten countries (Part 1) /post/chatgpt_country_guardrails_study/ Fri, 24 Feb 2023 12:16:19 +0100 /post/chatgpt_country_guardrails_study/ OpenAI ChatGPT has built guardrails on limiting the generation of negative content. How do these guardrails behave for countries? Spoiler: there are some gaps and disparities in ChatGPT safety mechanisms. Based on 24100 ChatGPT queries, this blog post is an exploration of ChatGPT responses when prompted to generate negative content about a country. If you are in a hurry, go and see the early results. 1. Context: ChatGPT guardrails and harmful content prevention. Do you know the 4 types of additive Variable Importances? /post/variable_importance_feature_attribution/ Sat, 16 May 2020 15:36:11 +0200 /post/variable_importance_feature_attribution/ Facing complex models, both computer simulation and machine learning practitioners have pursued similar objectives: to see how results could be broken down and linked to the inputs. Whether it is called Sensitivity Analysis or Variable Importance in the context of explainable AI, some of their methods share an important component: the Shapley values. This article presents a structured 2 by 2 matrix to think about Variable Importances in terms of their goals.