In this paper, we investigate brain activity associated with complex visual tasks, showing that electroencephalography (EEG) data can help computer vision in reliably recognizing actions from video footage that is used to stimulate human observers. Notably, we consider not only typical “explicit” video action benchmarks, but also more complex data sequences in which action concepts are only referred to, implicitly. To this end, we consider a challenging action recognition benchmark dataset—Moments in Time—whose video sequences do not explicitly visualize actions, but only implicitly refer to them (e.g., fireworks in the sky as an extreme example of “flying”). We employ such videos as stimuli and involve a large sample of subjects to collect a high-definition, multi-modal EEG and video data, designed for understanding action concepts. We discover an agreement among brain activities of different subjects stimulated by the same video footage. We name it as subjects consensus, and we design a computational pipeline to transfer knowledge from EEG to video, sharply boosting the recognition performance.

Cavazza, J., Ahmed, W., Volpi, R., Morerio, P., Bossi, F., Willemse, C., et al. (2022). Understanding action concepts from videos and brain activity through subjects’ consensus. SCIENTIFIC REPORTS, 12(1) [10.1038/s41598-022-23067-2].

Understanding action concepts from videos and brain activity through subjects’ consensus

Bossi F.;
2022

Abstract

In this paper, we investigate brain activity associated with complex visual tasks, showing that electroencephalography (EEG) data can help computer vision in reliably recognizing actions from video footage that is used to stimulate human observers. Notably, we consider not only typical “explicit” video action benchmarks, but also more complex data sequences in which action concepts are only referred to, implicitly. To this end, we consider a challenging action recognition benchmark dataset—Moments in Time—whose video sequences do not explicitly visualize actions, but only implicitly refer to them (e.g., fireworks in the sky as an extreme example of “flying”). We employ such videos as stimuli and involve a large sample of subjects to collect a high-definition, multi-modal EEG and video data, designed for understanding action concepts. We discover an agreement among brain activities of different subjects stimulated by the same video footage. We name it as subjects consensus, and we design a computational pipeline to transfer knowledge from EEG to video, sharply boosting the recognition performance.
Articolo in rivista - Articolo scientifico
Brain; Consensus; Electroencephalography; Humans; Recognition, Psychology
English
9-nov-2022
2022
12
1
19073
none
Cavazza, J., Ahmed, W., Volpi, R., Morerio, P., Bossi, F., Willemse, C., et al. (2022). Understanding action concepts from videos and brain activity through subjects’ consensus. SCIENTIFIC REPORTS, 12(1) [10.1038/s41598-022-23067-2].
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/528968
Citazioni
  • Scopus 2
  • ???jsp.display-item.citation.isi??? 2
Social impact