Dementia is a set of mental diseases affecting millions of people worldwide. As with other mental health conditions, it is often difficult to forecast the course of the disease in affected patients. In this context, data on patients with mental health conditions are usually collected through questionnaires and psychological and cognitive tests over several time points. Such longitudinal data can help identify disease trajectories and allow medical doctors to anticipate specific treatments. In this study, we analyze an open, unrestricted dataset of electronic health records (EHRs) of patients suffering from dementia, called OASIS-2, through several unsupervised machine learning methods (K-means, Hierarchical Clustering, Gaussian Mixture Model, and Spectral Clustering). This dataset contains demographic data and psychological test data collected over five independent visits, with 142 patients at the first visit and ten features. Our goal is to identify patient clusters that remain stable over the first four visits (we discarded the data of the fifth visit because of its small size), and then to characterize these clusters by studying their variables. We also measure the performance of the clustering methods through conventional metrics for internal and external validation. Our preliminary results show that unsupervised techniques can identify significant clusters of patients with mental health issues in this dataset and that Hierarchical Clustering outperforms the other algorithms to this end.
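The abstract does not state which metrics were used to judge whether clusters stay stable across visits. As a minimal sketch only, the Rand index, one conventional external-validation measure, can compare the cluster labels the same patients receive at two visits. The function name `rand_index` and the label lists below are hypothetical illustrations, not data or code from the study.

```python
from itertools import combinations


def rand_index(labels_a, labels_b):
    """Fraction of patient pairs on which two clusterings agree.

    A pair counts as an agreement when both clusterings put the two
    patients together, or both put them apart.
    """
    if len(labels_a) != len(labels_b):
        raise ValueError("both labelings must cover the same patients")
    pairs = list(combinations(range(len(labels_a)), 2))
    if not pairs:
        raise ValueError("need at least two patients")
    agreements = 0
    for i, j in pairs:
        same_a = labels_a[i] == labels_a[j]
        same_b = labels_b[i] == labels_b[j]
        if same_a == same_b:
            agreements += 1
    return agreements / len(pairs)


# Hypothetical cluster labels for the same six patients at two visits.
# Cluster ids are arbitrary and need not match across visits.
visit_1 = [0, 0, 0, 1, 1, 1]
visit_2 = [1, 1, 0, 0, 0, 0]

stability = rand_index(visit_1, visit_2)
print(f"Rand index between visits: {stability:.2f}")
```

A value of 1 means the two visits group the patients identically up to a relabeling of the clusters. Lower values mean more pairs of patients changed from grouped together to apart, or the reverse.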
Ribino, P., Di Napoli, C., Paragliola, G., Serino, L., Gasparini, F., Chicco, D. (2023). Exploratory analysis of longitudinal data of patients with dementia through unsupervised techniques. In Proceedings of the 4th Italian Workshop on Artificial Intelligence for an Ageing Society co-located with 22nd International Conference of the Italian Association for Artificial Intelligence (AIxIA 2023) (pp. 67-87). CEUR-WS.
Exploratory analysis of longitudinal data of patients with dementia through unsupervised techniques
Gasparini F.; Chicco D.
2023
File | Access | Attachment type | License | Size | Format
---|---|---|---|---|---
Ribino-2023-CEUR Workshop Proceedings-VoR.pdf | Open access | Publisher’s Version (Version of Record, VoR) | Creative Commons | 485.52 kB | Adobe PDF
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.