Enhancing Bias Assessment for Complex Term Groups in Language Embedding Models: Quantitative Comparison of Methods

Abstract BackgroundArtificial intelligence (AI) is rapidly being adopted to build products and aid in the decision-making process across industries. However, AI systems have been shown to exhibit and even amplify biases, causing a growing concern among people worldwide. Thus,...

Full description

Saved in:

Bibliographic Details
Main Authors:	Magnus Gray, Mariofanna Milanova, Leihong Wu
Format:	Article
Language:	English
Published:	JMIR Publications 2024-11-01
Series:	JMIR Medical Informatics
Online Access:	https://medinform.jmir.org/2024/1/e60272
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1846160026823032832
author	Magnus Gray Mariofanna Milanova Leihong Wu
author_facet	Magnus Gray Mariofanna Milanova Leihong Wu
author_sort	Magnus Gray
collection	DOAJ
description	Abstract BackgroundArtificial intelligence (AI) is rapidly being adopted to build products and aid in the decision-making process across industries. However, AI systems have been shown to exhibit and even amplify biases, causing a growing concern among people worldwide. Thus, investigating methods of measuring and mitigating bias within these AI-powered tools is necessary. ObjectiveIn natural language processing applications, the word embedding association test (WEAT) is a popular method of measuring bias in input embeddings, a common area of measure bias in AI. However, certain limitations of the WEAT have been identified (ie, their nonrobust measure of bias and their reliance on predefined and limited groups of words or sentences), which may lead to inadequate measurements and evaluations of bias. Thus, this study takes a new approach at modifying this popular measure of bias, with a focus on making it more robust and applicable in other domains. MethodsIn this study, we introduce the SD-WEAT, which is a modified version of the WEAT that uses the SD of multiple permutations of the WEATs to calculate bias in input embeddings. With the SD-WEAT, we evaluated the biases and stability of several language embedding models, including Global Vectors for Word Representation (GloVe), Word2Vec, and bidirectional encoder representations from transformers (BERT). ResultsThis method produces results comparable to those of the WEAT, with strong correlations between the methods’ bias scores or effect sizes (rPr ConclusionsThus, the SD-WEAT shows promise for robustly measuring bias in the input embeddings fed to AI language models.
format	Article
id	doaj-art-e8f9317ca64643c9abf1824c2ca28d6a
institution	Kabale University
issn	2291-9694
language	English
publishDate	2024-11-01
publisher	JMIR Publications
record_format	Article
series	JMIR Medical Informatics
spelling	doaj-art-e8f9317ca64643c9abf1824c2ca28d6a2024-11-22T15:01:15ZengJMIR PublicationsJMIR Medical Informatics2291-96942024-11-0112e60272e6027210.2196/60272Enhancing Bias Assessment for Complex Term Groups in Language Embedding Models: Quantitative Comparison of MethodsMagnus Grayhttp://orcid.org/0009-0009-6355-7627Mariofanna Milanovahttp://orcid.org/0000-0003-0995-1921Leihong Wuhttp://orcid.org/0000-0002-4093-3708 Abstract BackgroundArtificial intelligence (AI) is rapidly being adopted to build products and aid in the decision-making process across industries. However, AI systems have been shown to exhibit and even amplify biases, causing a growing concern among people worldwide. Thus, investigating methods of measuring and mitigating bias within these AI-powered tools is necessary. ObjectiveIn natural language processing applications, the word embedding association test (WEAT) is a popular method of measuring bias in input embeddings, a common area of measure bias in AI. However, certain limitations of the WEAT have been identified (ie, their nonrobust measure of bias and their reliance on predefined and limited groups of words or sentences), which may lead to inadequate measurements and evaluations of bias. Thus, this study takes a new approach at modifying this popular measure of bias, with a focus on making it more robust and applicable in other domains. MethodsIn this study, we introduce the SD-WEAT, which is a modified version of the WEAT that uses the SD of multiple permutations of the WEATs to calculate bias in input embeddings. With the SD-WEAT, we evaluated the biases and stability of several language embedding models, including Global Vectors for Word Representation (GloVe), Word2Vec, and bidirectional encoder representations from transformers (BERT). ResultsThis method produces results comparable to those of the WEAT, with strong correlations between the methods’ bias scores or effect sizes (rPr ConclusionsThus, the SD-WEAT shows promise for robustly measuring bias in the input embeddings fed to AI language models.https://medinform.jmir.org/2024/1/e60272
spellingShingle	Magnus Gray Mariofanna Milanova Leihong Wu Enhancing Bias Assessment for Complex Term Groups in Language Embedding Models: Quantitative Comparison of Methods JMIR Medical Informatics
title	Enhancing Bias Assessment for Complex Term Groups in Language Embedding Models: Quantitative Comparison of Methods
title_full	Enhancing Bias Assessment for Complex Term Groups in Language Embedding Models: Quantitative Comparison of Methods
title_fullStr	Enhancing Bias Assessment for Complex Term Groups in Language Embedding Models: Quantitative Comparison of Methods
title_full_unstemmed	Enhancing Bias Assessment for Complex Term Groups in Language Embedding Models: Quantitative Comparison of Methods
title_short	Enhancing Bias Assessment for Complex Term Groups in Language Embedding Models: Quantitative Comparison of Methods
title_sort	enhancing bias assessment for complex term groups in language embedding models quantitative comparison of methods
url	https://medinform.jmir.org/2024/1/e60272
work_keys_str_mv	AT magnusgray enhancingbiasassessmentforcomplextermgroupsinlanguageembeddingmodelsquantitativecomparisonofmethods AT mariofannamilanova enhancingbiasassessmentforcomplextermgroupsinlanguageembeddingmodelsquantitativecomparisonofmethods AT leihongwu enhancingbiasassessmentforcomplextermgroupsinlanguageembeddingmodelsquantitativecomparisonofmethods

Enhancing Bias Assessment for Complex Term Groups in Language Embedding Models: Quantitative Comparison of Methods

Similar Items