Bicocca Open Archive

In this paper, we study human–AI collaboration protocols, a design-oriented construct aimed at establishing and evaluating how humans and AI can collaborate in cognitive tasks. We applied this construct in two user studies involving 12 specialist radiologists (the knee MRI study) and 44 ECG readers of varying expertise (the ECG study), who evaluated 240 and 20 cases, respectively, in different collaboration configurations. We confirm the utility of AI support but find that XAI can be associated with a “white-box paradox”, producing a null or detrimental effect. We also find that the order of presentation matters: AI-first protocols are associated with higher diagnostic accuracy than human-first protocols, and with higher accuracy than both humans and AI alone. Our findings identify the best conditions for AI to augment human diagnostic skills, rather than trigger dysfunctional responses and cognitive biases that can undermine decision effectiveness.

Cabitza, F., Campagner, A., Ronzio, L., Cameli, M., Mandoli, G., Pastore, M., et al. (2023). Rams, hounds and white boxes: Investigating human–AI collaboration protocols in medical diagnosis. ARTIFICIAL INTELLIGENCE IN MEDICINE, 138(April 2023) [10.1016/j.artmed.2023.102506].

Rams, hounds and white boxes: Investigating human–AI collaboration protocols in medical diagnosis

Cabitza F.;Campagner A.;Ronzio L.;Cameli M.;Mandoli G. E.;Pastore M. C.;Sconfienza L. M.;Folgado D.;Barandas M.;Gamboa H.

2023

Abstract

In this paper, we study human–AI collaboration protocols, a design-oriented construct aimed at establishing and evaluating how humans and AI can collaborate in cognitive tasks. We applied this construct in two user studies involving 12 specialist radiologists (the knee MRI study) and 44 ECG readers of varying expertise (the ECG study), who evaluated 240 and 20 cases, respectively, in different collaboration configurations. We confirm the utility of AI support but find that XAI can be associated with a “white-box paradox”, producing a null or detrimental effect. We also find that the order of presentation matters: AI-first protocols are associated with higher diagnostic accuracy than human-first protocols, and with higher accuracy than both humans and AI alone. Our findings identify the best conditions for AI to augment human diagnostic skills, rather than trigger dysfunctional responses and cognitive biases that can undermine decision effectiveness.

Scheda breve

Scheda completa

Scheda completa (DC)

	Sottotipologia
	
				Articolo in rivista - Articolo scientifico
			
	Parole chiave
	
				Artificial intelligence; Automation bias; Cognitive biases; Explainable AI; Human–AI collaboration protocols;
			
	Lingua del contenuto
	
				English
			
	Data ahead of print o Data prima pubblicazione Online
	
				8-feb-2023
			
	Data di pubblicazione
	
				2023
			
	Rivista
	
				ARTIFICIAL INTELLIGENCE IN MEDICINE
			
	Numero del volume
	
				138
			
	Fascicolo
	
				April 2023
			
	Article number
	
				102506
			
	DOI dell'articolo
	
				https://dx.doi.org/10.1016/j.artmed.2023.102506
			
	Fulltext
	
				open
			
	Citazione
	
				Cabitza, F., Campagner, A., Ronzio, L., Cameli, M., Mandoli, G., Pastore, M., et al. (2023). Rams, hounds and white boxes: Investigating human–AI collaboration protocols in medical diagnosis. ARTIFICIAL INTELLIGENCE IN MEDICINE, 138(April 2023) [10.1016/j.artmed.2023.102506].
			
	Appare nelle tipologie:
	
				01 - Articolo su rivista

File in questo prodotto:

File	Dimensione	Formato
Cabitza-2023-Art Intell Med-preprint.pdf accesso aperto Descrizione: Research Article Tipologia di allegato: Submitted Version (Pre-print) Licenza: Altro Dimensione 2.83 MB Formato Adobe PDF Visualizza/Apri	2.83 MB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/416376

Citazioni

50

34

Social impact