Probabilistic modeling of disease progression

Elahe Dorani1 Zahra Narimani2 Holger Froehlich3

1) Iran, Institute for Advanced Studies in Basic Sciences, Email:
2) Iran, Institute for Advanced Studies in Basic Sciences, Email:
3) Germany, University of Bonn, Email:

Publication : Second National Conference on Biomathematics (biomathtabrizu.ir)
Abstract :
Disease progression modeling is the use of mathematical and statistical approaches for tracking the disease severity in a special period of timecite{cook2016disease}. One of the major problems in medicine is detecting the chronic diseases in the first stages of progression. Chronic disease like Alzheimer s start from mild symptoms and progress slowly to the severe stages during a long time. Early detection of symptoms leads to more effective therapies, detect different subtypes of disease, and designing more effective drugs. par In previous research, they have used limited well-known features for modeling progression of Alzheimer s disease and algorithms such as Random Forestscite{dimitriadis2018random, moore2019random}, SVMscite{kloppel2008automatic, zhang2011multimodal}, and Linear Mixed-Effects modelscite{sabuncu2011dynamics} are hired to distinguish the healthy elderly, patients with mild or sever impairments and AD.par In this work we used an automatic feature selection method on high dimensional data set and controlling the complexity of the model as well. We focused on probabilistic modeling of Alzheimer’s disease progression using TADPOLE data set. This data set contains different groups of features, such as demographics, cognitive scores, genetic bio-markers, MRI, DTI and PET scans. We applied Bayesian approach using variational inference and sparse prior.par We compared our model with the reference model proposed in cite{Zhu2018APD} using normal prior, the Random Forests proposed in cite{moore2019random} and Linear Mixed-Effects regression. The target variables that we were interested in are Ventricles, Hippocampus, MMSE, ADAS13 and CDRSB as the three well-known cognitive tests. In the first experiment, comparing the results of proposed Bayesian model using sparse prior (Laplace distribution) with the reference model using normal priorcite{Zhu2018APD}, showed us that that it was more accurate for the Ventricles, Hippocampus, ADAS13 and CDRSB than the reference model. In the second experiment we used the Random Forest both as the features selection method for the proposed Bayesian model and benchmark. As another comparison, this version of the Bayesian model with the reference model showed us that on it was more accurate on Ventricles, Hippocampus, ADAS13 and CDRSB than the reference model. But the LME model and Random Forests regression algorithm were better than the reference and proposed models.
Keywords : Disease progression modeling; Bayesian model; Variational inference; Sparse prior; Bayesian feature selection