This study investigates the effect that various patient-related information extracted from unstructured clinical notes has on two different tasks, i.e., patient allocation in clinical trials and medical literature retrieval. Specifically, we combine standard and transformer-based methods to extract entities (e.g., drugs, medical problems), disambiguate their meaning (e.g., family history, negations), or expand them with related medical concepts to synthesize diverse query representations. The empirical evaluation showed that certain query representations positively affect retrieval effectiveness for patient allocation in clinical trials, but no statistically significant improvements have been identified in medical literature retrieval. Across the queries, it has been found that removing negated entities using a domain-specific pre-trained transformer model has been more effective than a standard rule-based approach. In addition, our experiments have shown that removing information related to family history can further improve patient allocation in clinical trials.

Peikos, G., Alexander, D., Pasi, G., de Vries, A. (2023). Investigating the Impact of Query Representation on Medical Information Retrieval. In Advances in Information Retrieval 45th European Conference on Information Retrieval, ECIR 2023, Dublin, Ireland, April 2–6, 2023, Proceedings, Part II (pp.512-521). Springer Science and Business Media Deutschland GmbH [10.1007/978-3-031-28238-6_42].

Investigating the Impact of Query Representation on Medical Information Retrieval

Peikos G.;Pasi G.;
2023

Abstract

This study investigates the effect that various patient-related information extracted from unstructured clinical notes has on two different tasks, i.e., patient allocation in clinical trials and medical literature retrieval. Specifically, we combine standard and transformer-based methods to extract entities (e.g., drugs, medical problems), disambiguate their meaning (e.g., family history, negations), or expand them with related medical concepts to synthesize diverse query representations. The empirical evaluation showed that certain query representations positively affect retrieval effectiveness for patient allocation in clinical trials, but no statistically significant improvements have been identified in medical literature retrieval. Across the queries, it has been found that removing negated entities using a domain-specific pre-trained transformer model has been more effective than a standard rule-based approach. In addition, our experiments have shown that removing information related to family history can further improve patient allocation in clinical trials.
paper
Information extraction; Medical information retrieval; Query reformulation;
English
45th European Conference on Information Retrieval, ECIR 2023 - 2 April 2023 through 6 April 2023
2023
Kamps, J; Goeuriot, L; Crestani, F; Maistro, M; Joho, H; Davis, B; Gurrin, C; Caputo, A; Kruschwitz, U
Advances in Information Retrieval 45th European Conference on Information Retrieval, ECIR 2023, Dublin, Ireland, April 2–6, 2023, Proceedings, Part II
9783031282379
2023
13981
512
521
none
Peikos, G., Alexander, D., Pasi, G., de Vries, A. (2023). Investigating the Impact of Query Representation on Medical Information Retrieval. In Advances in Information Retrieval 45th European Conference on Information Retrieval, ECIR 2023, Dublin, Ireland, April 2–6, 2023, Proceedings, Part II (pp.512-521). Springer Science and Business Media Deutschland GmbH [10.1007/978-3-031-28238-6_42].
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/454389
Citazioni
  • Scopus 3
  • ???jsp.display-item.citation.isi??? 1
Social impact