Obiettivo
The growing worldwide adoption of Electronic Health Records (EHR) enables new research opportunities to analyse massive amounts of medical information, motivated by the promise of improving health systems while providing significant budget savings. Biomedical research increasingly uses machine learning methods as a data-driven approach to learn complex comorbidity patterns of diseases, study drug interactions, and form predictions. The analysis of EHRs may not only lead to knowledge discovery, but it also facilitates personalised medical treatment and early diagnosis of the diseases through the design of clinical support systems.
However, current approaches for the analysis of EHRs are still in their early stages. The two main technical challenges that need to be addressed are integration of heterogeneous data and scalability to massive datasets. Most of the existing methods are tailored to homogeneous data and, therefore, to a single source of information, and hence they cannot handle EHR datasets. Scalability also represents a difficulty for most of the current machine learning techniques, which are limited to the analysis to moderate-sized datasets.
In this project, we will develop novel tools for the analysis of heterogeneous EHR data. Our approach will be based on probabilistic modelling techniques, since they are an effective approach for understanding real-world data in many areas of science. We will make use of Bayesian nonparametric modelling techniques, coupled with stochastic variational inference to allow for scalable inference. Probabilistic models, including BNPs, are amenable to both descriptive and predictive analysis at the same time. We will collaborate with the Department of Biomedical Informatics, who will provide their knowledge about the problem, allowing for good model formulations and results analysis.
Campo scientifico
CORDIS classifica i progetti con EuroSciVoc, una tassonomia multilingue dei campi scientifici, attraverso un processo semi-automatico basato su tecniche NLP.
CORDIS classifica i progetti con EuroSciVoc, una tassonomia multilingue dei campi scientifici, attraverso un processo semi-automatico basato su tecniche NLP.
- engineering and technologymaterials engineeringcolors
- natural sciencesmathematicsapplied mathematicsstatistics and probabilitybayesian statistics
- social scienceseconomics and businesseconomicsproduction economicsproductivity
- medical and health scienceshealth sciencespersonalized medicine
- natural sciencescomputer and information sciencesartificial intelligencemachine learning
Programma(i)
Meccanismo di finanziamento
MSCA-IF-GF - Global FellowshipsCoordinatore
CB2 1TN Cambridge
Regno Unito