Semantic Query Labeling is the task of locating the constituent parts of a query and assigning domain-specific semantic labels to each of them. It allows unfolding the relations between the query terms and the documents’ structure while leaving unaltered the keyword-based query formulation. In this paper, we investigate the pre-training of a semantic query-tagger with synthetic data generated by leveraging the documents’ structure. By simulating a dynamic environment, we also evaluate the consistency of performance improvements brought by pre-training as real-world training data becomes available. The results of our experiments suggest both the utility of pre-training with synthetic data and its improvements’ consistency over time.

Bassani, E., Pasi, G. (2022). Evaluating the Use of Synthetic Queries for Pre-training a Semantic Query Tagger. In Advances in Information Retrieval. ECIR 2022 (pp.39-46). GEWERBESTRASSE 11, CHAM, CH-6330, SWITZERLAND : Springer Science and Business Media Deutschland GmbH [10.1007/978-3-030-99739-7_5].

Evaluating the Use of Synthetic Queries for Pre-training a Semantic Query Tagger

Bassani E.
Primo
;
Pasi G.
Ultimo
2022

Abstract

Semantic Query Labeling is the task of locating the constituent parts of a query and assigning domain-specific semantic labels to each of them. It allows unfolding the relations between the query terms and the documents’ structure while leaving unaltered the keyword-based query formulation. In this paper, we investigate the pre-training of a semantic query-tagger with synthetic data generated by leveraging the documents’ structure. By simulating a dynamic environment, we also evaluate the consistency of performance improvements brought by pre-training as real-world training data becomes available. The results of our experiments suggest both the utility of pre-training with synthetic data and its improvements’ consistency over time.
paper
Query generation; Semantic query labeling; Vertical search;
English
44th European Conference on Information Retrieval, ECIR 2022 - 10 April 2022 through 14 April 2022
2022
Hagen, M; Verberne, S; Macdonald, C; Seifert, C; Balog, K; Nørvåg, K; Setty, L
Advances in Information Retrieval. ECIR 2022
978-3-030-99738-0
2022
13186 LNCS
39
46
none
Bassani, E., Pasi, G. (2022). Evaluating the Use of Synthetic Queries for Pre-training a Semantic Query Tagger. In Advances in Information Retrieval. ECIR 2022 (pp.39-46). GEWERBESTRASSE 11, CHAM, CH-6330, SWITZERLAND : Springer Science and Business Media Deutschland GmbH [10.1007/978-3-030-99739-7_5].
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/392846
Citazioni
  • Scopus 1
  • ???jsp.display-item.citation.isi??? 0
Social impact