Bicocca Open Archive

Huang, Z., Bianchi, F., Yuksekgonul, M., Montine, T., Zou, J. (2023). A visual–language foundation model for pathology image analysis using medical Twitter. NATURE MEDICINE, 29(9), 2307-2316 [10.1038/s41591-023-02504-3].

A visual–language foundation model for pathology image analysis using medical Twitter

Huang Z.; Bianchi F.; Yuksekgonul M.; Montine T. J.; Zou J.

2023

Abstract

The lack of annotated publicly available medical images is a major barrier for computational research and education innovations. At the same time, many de-identified images and much knowledge are shared by clinicians on public forums such as medical Twitter. Here we harness these crowd platforms to curate OpenPath, a large dataset of 208,414 pathology images paired with natural language descriptions. We demonstrate the value of this resource by developing pathology language–image pretraining (PLIP), a multimodal artificial intelligence with both image and text understanding, which is trained on OpenPath. PLIP achieves state-of-the-art performance for classifying new pathology images across four external datasets: for zero-shot classification, PLIP achieves F1 scores of 0.565–0.832 compared to F1 scores of 0.030–0.481 for a previous contrastive language–image pretrained model. Training a simple supervised classifier on top of PLIP embeddings also achieves a 2.5% improvement in F1 scores compared to using other supervised model embeddings. Moreover, PLIP enables users to retrieve similar cases by either image or natural language search, greatly facilitating knowledge sharing. Our approach demonstrates that publicly shared medical information is a tremendous resource that can be harnessed to develop medical artificial intelligence for enhancing diagnosis, knowledge sharing and education.
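
In contrastive language–image models, zero-shot classification is commonly done by comparing an image embedding with the embeddings of candidate label prompts and choosing the closest one. The following is a minimal sketch under that assumption. It does not load or call PLIP; the function names, the two labels, and the toy three-dimensional vectors are hypothetical placeholders for real encoder outputs.

```python
import math


def l2_normalize(vec):
    """Scale a vector to unit length so a dot product equals cosine similarity."""
    norm = math.sqrt(sum(x * x for x in vec))
    if norm == 0.0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in vec]


def cosine_scores(image_emb, text_embs):
    """Cosine similarity between one image embedding and each text embedding."""
    img = l2_normalize(image_emb)
    return [sum(a * b for a, b in zip(img, l2_normalize(t))) for t in text_embs]


def zero_shot_classify(image_emb, text_embs, labels):
    """Return the label whose prompt embedding is most similar to the image."""
    scores = cosine_scores(image_emb, text_embs)
    best = max(range(len(labels)), key=lambda i: scores[i])
    return labels[best], scores


# Toy vectors standing in for encoder outputs (hypothetical values,
# not produced by PLIP or any real model).
labels = ["benign", "malignant"]
text_embs = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]
image_emb = [0.2, 0.9, 0.1]

predicted, scores = zero_shot_classify(image_emb, text_embs, labels)
print(predicted)
```

The same similarity scores could also rank a collection of stored embeddings against a query embedding, which is one plausible way to implement the image or text search described in the abstract.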

Subtype: Journal article - Scientific article
Keywords: Artificial Intelligence; Humans; Image Processing, Computer-Assisted; Knowledge; Language; Social Media
Content language: English
Ahead-of-print or first online publication date: 17 Aug 2023
Publication date: 2023
Journal: NATURE MEDICINE
Volume: 29
Issue: 9
First page: 2307
Last page: 2316
Article DOI: https://dx.doi.org/10.1038/s41591-023-02504-3
Full text: none
Citation: Huang, Z., Bianchi, F., Yuksekgonul, M., Montine, T., Zou, J. (2023). A visual–language foundation model for pathology image analysis using medical Twitter. NATURE MEDICINE, 29(9), 2307-2316 [10.1038/s41591-023-02504-3].
Appears in types: 01 - Journal article

Files in this item:

There are no files associated with this item.

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/10281/528020

Citations

197

151
