Due to the growing interest in speech recognition technologies, several datasets of speech acquired under uncontrolled conditions have been proposed in recent years. The majority of the datasets available to the community are in English, which reduces the possibility of developing and evaluating recognition technologies in languages other than English. In this paper we try to reduce this language-related gap by proposing a dataset for Arabic language speech recognition. The dataset is made available to the community and contains 100 speakers of both genders. Experiments with some of the latest speaker recognition approaches have been performed both with and without a suitable training on the Arabic language. Results suggest that, to effectively develop recognition technologies in other languages, suitable data for that language are necessary to allow at least a transfer learning approach. In particular, such data is crucial when short utterances are considered.

Bianco, S., Celona, L., Khalifa, I., Napoletano, P., Petrovsky, A., Piccoli, F., et al. (2022). ArabCeleb: Speaker Recognition in Arabic. In AIxIA 2021 – Advances in Artificial Intelligence - 20th International Conference of the Italian Association for Artificial Intelligence, Virtual Event, December 1–3, 2021, Revised Selected Papers (pp.338-347). Cham : Springer International [10.1007/978-3-031-08421-8_23].

ArabCeleb: Speaker Recognition in Arabic

Bianco, Simone;Celona, Luigi
;
Khalifa, Intissar;Napoletano, Paolo;Piccoli, Flavio;Schettini, Raimondo;
2022

Abstract

Due to the growing interest in speech recognition technologies, several datasets of speech acquired under uncontrolled conditions have been proposed in recent years. The majority of the datasets available to the community are in English, which reduces the possibility of developing and evaluating recognition technologies in languages other than English. In this paper we try to reduce this language-related gap by proposing a dataset for Arabic language speech recognition. The dataset is made available to the community and contains 100 speakers of both genders. Experiments with some of the latest speaker recognition approaches have been performed both with and without a suitable training on the Arabic language. Results suggest that, to effectively develop recognition technologies in other languages, suitable data for that language are necessary to allow at least a transfer learning approach. In particular, such data is crucial when short utterances are considered.
slide + paper
Arabic language; Dataset; Speaker recognition;
English
AIxIA 2021 - 20th International Conference Italian Association for Artificial Intelligence - 1 December 2021 through 3 December 2021
2021
Bandini, S; Gasparini, F; Mascardi, V; Palmonari , M; Vizzari , G
AIxIA 2021 – Advances in Artificial Intelligence - 20th International Conference of the Italian Association for Artificial Intelligence, Virtual Event, December 1–3, 2021, Revised Selected Papers
978-3-031-08420-1
19-lug-2022
2022
13196
338
347
none
Bianco, S., Celona, L., Khalifa, I., Napoletano, P., Petrovsky, A., Piccoli, F., et al. (2022). ArabCeleb: Speaker Recognition in Arabic. In AIxIA 2021 – Advances in Artificial Intelligence - 20th International Conference of the Italian Association for Artificial Intelligence, Virtual Event, December 1–3, 2021, Revised Selected Papers (pp.338-347). Cham : Springer International [10.1007/978-3-031-08421-8_23].
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/388166
Citazioni
  • Scopus 1
  • ???jsp.display-item.citation.isi??? 0
Social impact