Mitigating Unfair Regression in Machine Learning Model Updates
Irene Buselli, Davide Anguita, Fabio Roli, Luca Oneto
2024-01-01
Abstract
Machine learning systems often require updates for various reasons, such as the availability of new data or models and the need to optimize different technical or ethical metrics. Typically, these metrics reflect average performance rather than sample-wise behavior. Indeed, improvements in metrics like accuracy can introduce negative flips, where the updated model makes errors that the previous model did not make. In certain applications, these negative flips can be perceived by developers or users as a regression in performance, contributing to the hidden technical debt of machine learning systems. Moreover, if the distribution of negative flips is biased with respect to some sensitive attribute (e.g., gender or race), it may be perceived as discrimination, a phenomenon we term unfair regression. In this paper, we show, for the first time, the existence of unfair regression and propose several ethical metrics to measure it. Additionally, we propose two mitigation strategies to address this issue: one that modifies the learning algorithm and one that modifies the tuning phase. Our results on real-world datasets confirm the existence of the unfair regression phenomenon and demonstrate the effectiveness of the proposed mitigation strategies.
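The abstract describes negative flips and their group-wise distribution only in prose. The short Python sketch below illustrates, under our own assumptions, how one might compute an overall negative-flip rate and its gap across groups defined by a sensitive attribute; the function names (negative_flip_rate, groupwise_nfr_gap) and the toy arrays are hypothetical and are not the metrics or notation used in the paper.

    import numpy as np

    def negative_flip_rate(y_true, pred_old, pred_new):
        # Fraction of samples the old model classified correctly
        # but the updated model gets wrong (a "negative flip").
        flips = (pred_old == y_true) & (pred_new != y_true)
        return flips.mean()

    def groupwise_nfr_gap(y_true, pred_old, pred_new, sensitive):
        # Largest difference in negative-flip rate across the groups
        # induced by the sensitive attribute (gender, race, ...).
        rates = [
            negative_flip_rate(y_true[sensitive == g],
                               pred_old[sensitive == g],
                               pred_new[sensitive == g])
            for g in np.unique(sensitive)
        ]
        return max(rates) - min(rates)

    # Toy example: the updated model concentrates its new errors in group 1.
    y_true   = np.array([0, 1, 1, 0, 1, 0, 1, 1])
    pred_old = np.array([0, 1, 1, 0, 0, 0, 1, 1])   # previous model
    pred_new = np.array([0, 1, 1, 0, 1, 0, 0, 0])   # updated model
    group    = np.array([0, 0, 0, 0, 1, 1, 1, 1])   # sensitive attribute

    print(negative_flip_rate(y_true, pred_old, pred_new))       # 0.25 overall
    print(groupwise_nfr_gap(y_true, pred_old, pred_new, group))  # 0.5 gap

In this toy run both negative flips fall in group 1, so even though the overall negative-flip rate is modest, the gap between groups is large; this is the kind of disparity the abstract refers to as unfair regression.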



