The paper explores the possibility to employ the source code of corporate websites as an information source for research in innovation studies. Research in this area is generally based on studies that collect data on patents or official data sources. Our paper links the standard economic information of the firm with web-based data and joins the ongoing debate with a threefold contribution. First, whereas the majority of the literature focused on the linguistic content of web-pages, we mostly use HTML tags. Second, we propose a method to assess the quality of the linkage of Web data to firm-level information. Third, we show that the data retrieved from corporate websites can aid to identify 'innovative SMEs'.
Bottai, C., Crosato, L., Domenech, J., Guerzoni, M., Liberati, C. (2022). Unconventional data for policy: Using Big Data for detecting Italian innovative SMEs. In 2nd ACM Conference on Information Technology for Social Good, GoodIT 2022 (pp.338-344). Association for Computing Machinery [10.1145/3524458.3547246].
Unconventional data for policy: Using Big Data for detecting Italian innovative SMEs
Bottai, Carlo;Guerzoni, Marco;Liberati, Caterina
2022
Abstract
The paper explores the possibility to employ the source code of corporate websites as an information source for research in innovation studies. Research in this area is generally based on studies that collect data on patents or official data sources. Our paper links the standard economic information of the firm with web-based data and joins the ongoing debate with a threefold contribution. First, whereas the majority of the literature focused on the linguistic content of web-pages, we mostly use HTML tags. Second, we propose a method to assess the quality of the linkage of Web data to firm-level information. Third, we show that the data retrieved from corporate websites can aid to identify 'innovative SMEs'.File | Dimensione | Formato | |
---|---|---|---|
Bottai-2022-GoodIT '22-AAM.pdf
Solo gestori archivio
Descrizione: Research Article
Tipologia di allegato:
Publisher’s Version (Version of Record, VoR)
Dimensione
655.58 kB
Formato
Adobe PDF
|
655.58 kB | Adobe PDF | Visualizza/Apri Richiedi una copia |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.