Published on in Vol 23, No 2 (2021): February

Preprints (earlier versions) of this paper are available at, first published .
A Risk Prediction Model Based on Machine Learning for Cognitive Impairment Among Chinese Community-Dwelling Elderly People With Normal Cognition: Development and Validation Study

A Risk Prediction Model Based on Machine Learning for Cognitive Impairment Among Chinese Community-Dwelling Elderly People With Normal Cognition: Development and Validation Study

A Risk Prediction Model Based on Machine Learning for Cognitive Impairment Among Chinese Community-Dwelling Elderly People With Normal Cognition: Development and Validation Study

Original Paper

1Xiangya Nursing School, Central South University, Changsha, China

2Henan Cancer Hospital Province, Zhengzhou University, Zhengzhou, China

3Department of Biomedical Engineering, School of Basic Medical Science, Central South University, Changsha, China

4Xiangya School of Public Health, Central South University, Changsha, China

5Department of Nursing Science, University of Turku, Helsinki, Finland

6Oceanwide Health Management Institute, Central South University, Changsha, China

7National Clinical Research Center for Geriatric Disorders, Xiangya Hospital, Central South University, Changsha, China

*these authors contributed equally

Corresponding Author:

Hui Feng, PhD

Xiangya Nursing School

Central South University

Yuelu District

172 Tongzipo Road



Phone: 86 15173121969


Background: Identifying cognitive impairment early enough could support timely intervention that may hinder or delay the trajectory of cognitive impairment, thus increasing the chances for successful cognitive aging.

Objective: We aimed to build a prediction model based on machine learning for cognitive impairment among Chinese community-dwelling elderly people with normal cognition.

Methods: A prospective cohort of 6718 older people from the Chinese Longitudinal Healthy Longevity Survey (CLHLS) register, followed between 2008 and 2011, was used to develop and validate the prediction model. Participants were included if they were aged 60 years or above, were community-dwelling elderly people, and had a cognitive Mini-Mental State Examination (MMSE) score ≥18. They were excluded if they were diagnosed with a severe disease (eg, cancer and dementia) or were living in institutions. Cognitive impairment was identified using the Chinese version of the MMSE. Several machine learning algorithms (random forest, XGBoost, naïve Bayes, and logistic regression) were used to assess the 3-year risk of developing cognitive impairment. Optimal cutoffs and adjusted parameters were explored in validation data, and the model was further evaluated in test data. A nomogram was established to vividly present the prediction model.

Results: The mean age of the participants was 80.4 years (SD 10.3 years), and 50.85% (3416/6718) were female. During a 3-year follow-up, 991 (14.8%) participants were identified with cognitive impairment. Among 45 features, the following four features were finally selected to develop the model: age, instrumental activities of daily living, marital status, and baseline cognitive function. The concordance index of the model constructed by logistic regression was 0.814 (95% CI 0.781-0.846). Older people with normal cognitive functioning having a nomogram score of less than 170 were considered to have a low 3-year risk of cognitive impairment, and those with a score of 170 or greater were considered to have a high 3-year risk of cognitive impairment.

Conclusions: This simple and feasible cognitive impairment prediction model could identify community-dwelling elderly people at the greatest 3-year risk for cognitive impairment, which could help community nurses in the early identification of dementia.

J Med Internet Res 2021;23(2):e20298



Dementia constitutes a major health care burden nationally and worldwide [1]. Approximately every 3 seconds, a person somewhere in the world is diagnosed with dementia, and the current annual cost of dementia is estimated to be US $1 trillion, which is set to double by 2030 [2]. China has the largest population of patients with dementia in the world (9.5 million) followed by the United States (4.2 million) [1]. Evidence suggests that delaying the onset of dementia by 1 year is likely to reduce its prevalence by 11% by 2050, while delaying it by 5 years could halve the number of people living with dementia by 2050 [3]. Given that dementia is incurable, it is of high importance to detect cognitive impairment in its early stages [4].

Good evidence already exists that specific risk factors can contribute to increased dementia risk at different life stages. The risk factors are education in early life, hypertension and obesity in midlife, and smoking and depression in later life [5]. Prediction models concerning risk factors for cognitive impairment have already been published. However, the variables included in the models vary, and they mostly focus on laboratory markers only [6-8]. A systematic review by Hou et al included 61 studies of the prediction models of dementia. They found that age, sex, education, cognition assessment scales, BMI, alcohol intake, and genetic variables were the most common predictors included in the models [8]. Questionnaire-based data have also been used to explore the clinical variables with promising predictive values in the transition to cognitive impairment (demographic characteristics and neuropsychiatric symptoms). Other studies have used data based on medical imaging (brain atrophy), genes (apolipoprotein Eε4), or biomarkers (amyloid-β, tau, etc) [5,9,10]. One study used the C-Pittsburgh compound B (C-PiB) medial temporal standard uptake value ratio with the Mini-Mental State Examination (MMSE) for the prediction of a person going from mild cognitive impairment to dementia, and the area under the curve was 0.92 [6]. Kivipelto et al used big data to develop a prediction model of the risk of late-life dementia in middle-aged people, and the model included age, education, hypertension, hypercholesterolemia, and obesity as variables, with an average area under the curve of 0.77 [7]. However, these prediction models are complex, less accurate, and difficult to implement in practice for nursing staff who are working with elderly patients. Therefore, especially for use in community environments, simpler, more accurate, and feasible models are needed [8].

Machine learning has recently been used to produce a prediction model for practice. Machine learning can help in modeling information based on causal and/or statistical data, potentially revealing hidden dependencies between factors and diseases in a big data environment [11]. Published studies show how machine learning algorithms, such as naïve Bayes (NB), AdaBoost, and random forest (RF), have been used to predict or detect cognitive impairment [12-15].

We systematically searched PubMed ([“cognitive impairment” OR “cognitive decline” OR “dementia” OR “alzheimer*”] AND [“machine learning” OR “data mining” OR “big data”] AND “prediction”) and found four studies in which machine learning was used to identify risk factors for dementia among people with normal cognition at baseline. One study [16] used unsupervised machine learning to develop a dementia prediction model that could identify people at a high risk of developing dementia. Another study [13] used the medical records of 93,120 patients to develop a model for exploring undetected dementia using a machine learning approach (with an area under the curve of 0.74). One study [17] developed a model for predicting the risk of developing dementia within the next 2 years among older people (aged 85 years or above) without dementia (with an area under the curve of 0.73). The study showed that the predictors differed between the youngest and oldest individuals in the population. Further, another study used supervised machine learning to develop a dementia prediction model (area under the curve values of 0.75 and 0.79) and found that the Disease State Index is useful for identifying individuals who are most at risk [18].

However, a variety of difficulties have been identified in implementing the results of machine learning in clinical practice, as the data have been collected at one time point only, meaning that the causality of the data can be questioned [13]. Some prediction models have been too complicated, and there have been problems with accuracy in the prediction [16-18]. In addition, although the results seem to be acceptable from a statistical point of view, understanding the interpretation of the unsupervised machine learning result and its implementation into practice is demanding [16]. There is still room for the improvement of prediction models for forecasting risks for dementia. In addition, more studies are needed to develop and translate the results into clinical practice, especially for community environments [19]. We therefore aimed to develop an algorithm to be used in a prediction model to identify risk factors for cognitive impairment among Chinese community-dwelling elderly people with normal cognition. The study results are important, as an approach to stratify the individual risks for cognitive impairment is needed in community settings for both national and international purposes [20].

Design and Participants

This study strictly followed the Transparent Reporting of a Multivariable Prediction Model for Individual Prognosis or Diagnosis (TRIPOD): The TRIPOD Statement [21]. In this machine learning approach, the national prospective longitudinal results of the Chinese Longitudinal Healthy Longevity Survey (CLHLS) were used [22]. The CLHLS is one of the largest national longitudinal studies for investigating the health of older Chinese adults. Launched in 1998, the CLHLS implemented follow-up surveys in 2000, 2002, 2005, 2008-09, 2011-12, and 2014. A total of 22 Chinese provinces were randomly recruited, and the sampling frame covered about 85% of the total population of China. The survey results in the national database are freely accessible and available online [23]. The 2008-09 survey included a total of 16,954 participants.

We included 11,788 participants from the 2008-09 wave, and 6718 participants were eligible for model development and internal validation. Participants were included if they were (1) aged 60 years or above; (2) community-dwelling elderly people; and (3) normally cognitive (MMSE score ≥18). They were excluded if they (1) were diagnosed with a severe disease (eg, cancer and dementia) or (2) lived in an institution. A detailed flow chart of participant selection is shown in Multimedia Appendix 1. Among the remaining participants, in the 2011-12 wave, a total of 1913 participants were lost in the follow-up and 2879 died. Those who were excluded from analyses owing to nonparticipation or death were on average older (P<.001) and had lower physical function scores (P<.001) and lower baseline cognition scores (P<.001). The two groups were not significantly different in terms of sex (P=.45).

Outcome Variables and Predictors

Cognitive impairment was defined by the Chinese version of the Mini-Mental State Examination (CMMSE) [24], which was culturally translated from the international standard of the MMSE questionnaire. The CMMSE contains 24 items within six dimensions (five items for orientation, three for registration, one for naming, five for attention and calculation, three for recall, and seven for language). The score of the Chinese MMSE ranges from 0 to 30 points, with higher scores indicating better cognition. The CMMSE has been validated among the Chinese elderly population, and a score below 18 points has been defined as cognitive impairment [24].

Predictors related to cognitive impairment were assessed a priori based on clinical importance, scientific knowledge, and predictors identified in previously published studies [25]. We therefore selected 45 factors related to demographic characteristics, which included lifestyle, mental health, leisure activities, sleep, chronic diseases, physical function, anthropometric index, and baseline cognitive function (Multimedia Appendix 2).

Statistical Analysis

Categorical variables have been reported as numbers and proportions, and compared using a chi-square test or Fisher exact test. Continuous variables have been expressed as medians with IQRs and compared using the Wilcoxon test when data were not normally distributed. Detailed information is presented in Multimedia Appendix 2. Some covariates contained missing values. The proportion of missing values was less than 5% for all variables. Thus, we performed imputations, using multivariable regression methods via the R package mice. Feature selection was performed using recursive feature elimination (RFE) combined with RF. During the process of elimination, a 10-fold cross-validation was implemented to optimize the variable selection. In addition, the RFE method with the NB method was used to extract variables, and the result was compared with RFE combined with RF. According to the results of RF and NB, the final feature selection was based on the number of features included and accuracy.

We divided the original data into a 2/3 training set, 1/6 validation set, and 1/6 test set [26]. The training set was used for model development. The validation set was used to adjust parameters of the model and explore optimal cutoffs after training was finished. The test set was used to estimate the generalization of the model. Regarding the algorithm used in the development of prediction models, we chose four machine learning algorithms, including RF, XGBoost, NB, and logistic regression, to construct models based on the results of the feature selection. We chose these four learning algorithms because they are recommended by “Guidelines for Developing and Reporting Machine Learning Predictive Models in Biomedical Research: A Multidisciplinary View” [27]. The performances of the four prediction models were compared with each other using areas under the curve, specificity, sensitivity, accuracy, and specificity/sensitivity.

In addition, if logistic regression performs well compared with the other three methods, we will formulate a nomogram based on the result of logistic regression for practical use. The nomogram works by proportionally converting each regression coefficient into a 0 to 100-point scale, with 100 points being the highest β coefficient. The points across each independent variable are added to derive total points, which are translated to predicted probabilities [28]. All analyses were conducted using R, version 3.6.0 [29]. A P value <0.05 was considered to indicate statistical significance.

Population Demographics

A total of 6718 participants were involved. Forty-five explanatory variables were selected, and these variables included nine items of demographic characteristics, five items of lifestyle, 10 items of mental health, five items of leisure activities, two items of sleep, seven items of chronic diseases, two items of physical function, four items of the anthropometric index, and one item of baseline cognitive function. Six variables of demographic characteristics (age, sex, ethnicity, years of education, occupation, and marital status), four variables of lifestyle (fruit, smoking, drinking, and exercise), eight variables of mental health (self-reported quality of life, being positive, hygiene, anxiety, loneliness, decision making, feeling useless, and feeling happy), five variables of leisure activity (garden work, reading, raising pets, playing cards or mah-jongg, and social activities), two variables of sleep (sleep quality and sleep duration), five variables of chronic diseases (hypertension, diabetes, heart disease, cataract, and arthritis), two variables of physical function (activities of daily living and instrumental activities of daily living), one variable of anthropometric measurement (BMI), and baseline MMSE were significantly associated with a 3-year risk of cognitive impairment (P<.001). Detailed information is presented in Multimedia Appendix 2.

Feature Selection

NB combined with RFE showed that accuracy (0.8342) was the highest with four features included in the model (age, instrumental activities of daily living, baseline MMSE, and marital status). RF combined with RFE showed that the model involving 45 variables had the highest accuracy (0.8502), while the model including four variables had an accuracy of 0.8304 (Table 1). Considering the simplicity and accuracy of the prediction model, we finally chose the following four features to develop the model: age, instrumental activities of daily living, baseline MMSE, and marital status.

Table 1. Feature selection using naïve Bayes combined with recursive feature elimination and random forest combined with recursive feature elimination.
MethodNumber of featuresAccuracyKappaAccuracy SDKappa SD
NBa combined with RFEb40.83420.32580.0078010.02452
NB combined with RFE80.82290.35430.0073400.02408
NB combined with RFE160.81360.34210.0127240.02540
NB combined with RFE450.83150.32200.0075670.02639
RFc combined with RFE40.83040.15450.0083560.05486
RF combined with RFE80.84750.15940.0085450.05569
RF combined with RFE160.84710.17890.0078150.03612
RF combined with RFE450.85020.12140.0055720.04800

aNB: naïve Bayes.

bRFE: recursive feature elimination.

cRF: random forest.

Model Evaluation and Comparison

The training, validation, and test sets involved 4514, 1100, and 1104 points of data, respectively. We tried to use several widely applied machine learning algorithms (RF, NB, XGBoost, and logistic regression) for the construction of the prediction models in the training set.

We used a receiver operating characteristic (ROC) curve, specificity, sensitivity, accuracy, and specificity/sensitivity to evaluate the prediction model in both validation and test data. Before the evaluation, optimal cutoffs were determined by maximizing the Youden index (ie, sensitivity + specificity − 1) by the ROC curve in the validation set. In the test set, ROC curves revealed that logistic regression and NB had better predictive performances, with an area under the curve of 0.814. The area under the curve of XGBoost (0.811) was less than that of logistic regression and NB. RF underperformed, with an area under the curve of 0.780 (Figure 1).

The model of NB performed well in terms of specificity, with a value of 0.776. The specificities of the models of logistic regression (0.770), RF (0.645), and XGBoost (0.738) were lower than that of NB. The model of RF performed well in terms of sensitivity, with a value of 0.793. The sensitivities of the models of logistic regression (0.701), NB (0.672), and XGBoost (0.724) were lower than that of RF. The accuracy of the NB model (0.760) was higher than the accuracies of the other three models. All details about the parameters of the models developed with different algorithms are shown in Table 2.

Figure 1. Receiver operating characteristic curve performance of four models on the test set. AUC: area under the curve.
View this figure
Table 2. Evaluation of the performance of the four algorithms.
AlgorithmData setArea under the curveOptimal cutoffSpecificitySensitivityAccuracySpecificity/sensitivity
Logistic regressionValidation0.8120.1160.7850.6820.7681.151
Logistic regressionTest0.8140.1160.7700.7010.7591.098
Random forestValidation0.7730.0400.6540.7840.6750.834
Random forestTest0.7800.0400.6450.7930.6690.813
Naïve BayesValidation0.8040.2140.7960.6880.7781.157
Naïve BayesTest0.8140.2140.7760.6720.7601.155

Development of the Nomogram

As the prediction developed by logistic regression performed well, a nomogram (Figure 2) was used to present the data and predicted probabilities vividly. The optimal cutoff value of the total nomogram scores was determined to be 170 in the test set. In Figure 2, a box has been used to represent the proportion of the sample in category variables. For instance, for marital status, most of the participants were grouped into “married or partnered” and “widowed.” Density exhibited the distribution of the sample in continuous variables. For example, for baseline MMSE, most individuals scored over 28. The total points corresponded to the predicted probability. For example, an individual score of 200 points in total, with an age score of 85, a marital status of widowed, an instrumental activities of daily living score of 6, and a baseline MMSE score of 22 corresponded to a predicted probability of cognitive impairment of 29.3%.

Figure 2. Developed nomogram with age, marital status, instrumental activities of daily living (IADL), and baseline Mini-Mental State Examination (MMSE) incorporated.
View this figure

Main Findings

In this study, we developed a prediction model for forecasting the 3-year risk of cognitive impairment among 6718 community-dwelling elderly individuals aged 60 years or older with normal cognitive function via machine learning algorithms. The model performed comparably to the best available biomarkers, such as apolipoprotein genotype [13,17], but is less expensive, easier to implement, and validated internally with reasonable results.

Feature or variable selection is central in the development of a prediction model [30]. Out of the original 45 variables, the following four variables used showed the highest accuracy of the model: age, marital status, instrumental activities of daily living, and baseline MMSE. Our findings support the previous literature regarding age [5,31], marital status [32,33], activities of daily living [34,35], and cognitive status [8,36], indicating that the predictors we selected were usable and reliable.

In this study, age was one of the predictors for the risk of cognitive impairment. It has already been estimated that, in 2020, 83% of people aged 75 years or above live with dementia in the US [37]. Those aged 65 to 74 years account for 17% of cases, while those younger than 65 years could develop dementia, but it is much less common and the prevalence is uncertain [37]. It is indicated that people aged 75 years or above should be screened for the risk of cognitive impairment. However, screening all older people for dementia is not recommended, as the benefits for that are still unclear [5]. Using large existing registers and databases can offer opportunities to explore existing information to predict the health status [38].

We found that marital status was a strong predictor for the risk of cognitive impairment. A recent meta-analysis involving 812,047 participants evaluated the association between marriage and dementia. The results showed that life-long single and widowed older people were respectively 1.42 and 1.20 times more likely to be diagnosed with dementia compared with married older people [33]. Another 10-year longitudinal population-based study including 2,288,489 individuals explored the influence of marital status and concluded that the risk of dementia in nonmarried individuals was around 1.7 times greater than that in married individuals [32]. The results might be explained by the fact that married individuals are more likely to have healthy lifestyles and participate in social activities, which might be conducive to cognitive reserve and reduced dementia risk over a lifespan [39]. On the contrary, those in widowhood might be more likely to experience a higher risk of cognitive impairment than divorced people because of the detrimental effect of stress from bereavement on hippocampal neurons or cognition. Further, as marriage has been considered a social norm, people with difficulties in communication and smaller cognitive reserves across life may be less likely to marry [33]. In today’s society, staying unmarried has become more common, and this phenomenon deserves more attention. Social factors like marital status should be taken seriously as risk determinants for cognitive impairment.

An association between physical function and cognitive capacity among older people has been found in previous studies [34,40]. Our study only included instrumental activities of daily living as a predictor for constructing the model, and it did not consider activities of daily living that represent functional ability. The reason might be that there is a natural hierarchy of functional loss associated with cognitive decline among older people [34]. Older persons with progressive cognitive decline lose the ability to perform tasks, often in the order of bathing, dressing, toileting, transferring, continence, and feeding. Therefore, older people who are not able to feed themselves might not be able to perform other tasks independently [35]. Similarly, a study by Njegovan et al found that among the 14 items of activities of daily living and instrumental activities of daily living, a hierarchy of functional items existed, with instrumental activities of daily living (such as shopping, banking, etc) being lost at higher cognitive scores than basic activities of daily living (such as eating, dressing, etc), which were lost later [34]. Our results confirmed that there was a tendency for instrumental activities of daily living to be a stronger predictor compared with activities of daily living. However, since there was overlap, subdomains of these two tools might be more meaningful for developing a prediction model. For nurses and caregivers, this information can help anticipate the need for intervention in people with declining cognition showing subtle declines in instrumental activities of daily living, which could improve the quality of life of these people and their caregivers and play an important part in health care planning [34].

Baseline cognitive function affected the degree to which cognitive scores changed over time and had a profound effect on further cognitive impairment. In one study using UK biobank data to assess the effect of baseline cognitive performance on a prediction model for 3 to 8-year risk of dementia, the results showed that cognitive performance added up to 5% (from 0.78 to 0.83) to the discriminative accuracy of the ROC model developed with the variables of age, sex, education, family history, and depression [36]. The MMSE has been the most common cognitive variable for developing a dementia prediction model [8]. However, variables of specific cognitive domains, such as memory and executive function, might be more feasible and useful predictors in constructing cognitive impairment prediction models. The total MMSE score was associated less strongly with dementia and Alzheimer disease than the episodic memory subset [8]. Therefore, future studies could consider more specific cognitive domain variables.


Our study has limitations, which should be considered in the interpretation of the study results. First, retained cohort members were younger and had on average better cognitive and physical functioning than those who dropped out, which can lead to studies being severely underpowered and biased toward the healthier part of the aging population. As we used a nationally representative database, the ascertainment bias could, to an extent, be limited. Second, we utilized a cross-validation approach to model development and assessment. The results still need to be validated in an independent cohort. Third, we used cohort studies with insufficient details on the duration of marriage, widowhood, and divorce to allow the exploration of a dose-response effect. A future study could take the dose-response effect of marital status on cognitive function into consideration. Fourth, the baseline MMSE was used as one of the predictors. However, other specific cognitive domains, such as memory and executive function, could also be used as features, and they might perform better than whole cognitive function. Fifth, our models were based on a prospective cohort that may have some level of bias. A prospective external validation cohort is needed for further confirmation in future research. Lastly, some predictors used in our study were measured by self-reporting, resulting in information bias. Nevertheless, self-reported data are more feasibly collected in primary health care settings, and the results can be generalized to wider communities. Despite these limitations, we believe that the results are usable in terms of cognitive impairment prevention and further intervention globally.


This is one of the first studies where a machine learning approach has been used in a nursing context. The study showed that machine learning can be used more widely in nursing science in different contexts and various functions. The prediction models exert implications in the three-grade prevention system of diseases [38]. In the primary prevention of diseases, a cognitive impairment prediction model could provide quantitative risk value (probability) of cognitive deficiency in the next 3 years, based on the current health status, offering a more intuitive and powerful scientific tool for health education and behavioral intervention. In the secondary prevention of diseases, using noninvasive, low-cost, and easy-to-acquire variables to develop a prediction model is more practicable for staff, particularly general practitioners in community health, to bring about “early detection, early diagnosis, and early treatment,” which have large influences on medical costs for dementia. In the tertiary prevention of diseases, the prediction model could be used to predict recurrence, reducing mortality and disability [38]. A simple and feasible prediction model would also help nurses to be aware of the progression of diseases over time. Therefore, nurses could be better aware of triggers that might alarm them about any hidden problems. In addition, a precise prediction model with predictors that are more available in clinical environments could help clinical nurses understand the prognostic factors of diseases. Based on this information, nurses could offer tailored preventive interventions to patients before any signs of cognition deficits occur.

This study provides guidance for future research as well. First, the use of several algorithms to construct prediction models in specific diseases offers more opportunity to find a more suitable model with a high area under the curve and accuracy. Second, selecting the most suitable predictors is important for developing a prediction model to use in clinical practice. Easy-to-acquire, noninvasive, and low-cost variables are welcome in clinical nursing, and invasive biomarkers could improve the prediction. The former is more suitable for community health care and any clinical environment because of large populations and insufficient staff and funds, while the latter is more applicable in more specific clinical environments for people with high risk of diseases. Lastly, we included Chinese elderly people aged 60 years or above and developed a cognitive impairment prediction model. Further studies could develop cognitive impairment prediction models for middle-aged people as the World Health Organization has suggested to increase the cognitive reserve in mid-life and early aging (45-70 years) [41].

In the future, the results of this study could be used in countries and areas with less human resources, such as low- and middle-income countries, to identify elderly people with a high risk of developing cognitive deficiency in the next 3 years (ie, age, marital status, physical function, and cognitive function). Simple, relevant, and easy-to-detect risk factors would save time and resources in health care and would especially help nursing staff identify those people who are at high risk of developing cognitive impairment. As family members living with elderly people do not always recognize the early signs of dementia [42], the knowledge obtained from this study could be used to educate family members as well.


We appreciate the support from the Special Funding for the Construction of Innovative Provinces in Hunan (grant number: 2019SK2141), China Scholarship Council (grant number: Liumeijin 2020-1509), and Central South University-Flinders University Seed Project (grant number: 2020xyhlzzjj002).

Authors' Contributions

MYH and XYW conceived of the presented idea. MYH and XHS performed the analytical calculations and took the lead in writing the manuscript. GY verified the analytical methods. VM investigated and supervised the findings of this work and helped in the language edit. HF supervised the whole project. In addition, VM and HF contributed equally to this manuscript.

Conflicts of Interest

None declared.

Multimedia Appendix 1

Flow chart of participant selection.

DOCX File , 25 KB

Multimedia Appendix 2

Detailed information about 45 features among the participants (grouped by cognitive impairment).

DOCX File , 35 KB


  1. World Alzheimer Report 2015: The global impact of dementia: an analysis of prevalence, incidence, cost and trends. Alzheimer's Disease International. 2015.   URL: [accessed 2021-02-12]
  2. World Alzheimer Report 2019: Attitudes to dementia. Alzheimer's Disease International. 2019.   URL: [accessed 2021-01-12]
  3. Frankish H, Horton R. Prevention and management of dementia: a priority for public health. The Lancet 2017 Dec;390(10113):2614-2615. [CrossRef]
  4. World Alzheimer Report 2018-The state of the art of dementia research: New frontiers. Alzheimer's Disease International. 2018.   URL: [accessed 2021-01-12]
  5. Livingston G, Sommerlad A, Orgeta V, Costafreda SG, Huntley J, Ames D, et al. Dementia prevention, intervention, and care. The Lancet 2017 Dec;390(10113):2673-2734. [CrossRef]
  6. Chen X, Zhou Y, Wang R, Cao H, Reid S, Gao R, Alzheimer’s Disease Neuroimaging Initiative. Potential Clinical Value of Multiparametric PET in the Prediction of Alzheimer's Disease Progression. PLoS One 2016 May 16;11(5):e0154406 [FREE Full text] [CrossRef] [Medline]
  7. Kivipelto M, Ngandu T, Laatikainen T, Winblad B, Soininen H, Tuomilehto J. Risk score for the prediction of dementia risk in 20 years among middle aged people: a longitudinal, population-based study. The Lancet Neurology 2006 Sep;5(9):735-741. [CrossRef]
  8. Hou X, Feng L, Zhang C, Cao X, Tan L, Yu J. Models for predicting risk of dementia: a systematic review. J Neurol Neurosurg Psychiatry 2019 Apr 28;90(4):373-379. [CrossRef] [Medline]
  9. Pase MP, Beiser AS, Himali JJ, Satizabal CL, Aparicio HJ, DeCarli C, et al. Assessment of Plasma Total Tau Level as a Predictive Biomarker for Dementia and Related Endophenotypes. JAMA Neurol 2019 May 01;76(5):598-606 [FREE Full text] [CrossRef] [Medline]
  10. Ding D, Zhao Q, Guo Q, Liang X, Luo J, Yu L, Shanghai Aging Study (SAS). Progression and predictors of mild cognitive impairment in Chinese elderly: A prospective follow-up in the Shanghai Aging Study. Alzheimers Dement (Amst) 2016;4:28-36 [FREE Full text] [CrossRef] [Medline]
  11. Bratić B, Kurbalija V, Ivanović M, Oder I, Bosnić Z. Machine Learning for Predicting Cognitive Diseases: Methods, Data Sources and Risk Factors. J Med Syst 2018 Oct 27;42(12):243. [CrossRef] [Medline]
  12. Joshi S, Deepa Shenoy P, Simha GG V, Venugopal KR, Patnaik L. Classification of Neurodegenerative Disorders Based on Major Risk Factors Employing Machine Learning Techniques. IJET 2010;2(4):350-355. [CrossRef]
  13. Ford E, Rooney P, Oliver S, Hoile R, Hurley P, Banerjee S, et al. Identifying undetected dementia in UK primary care patients: a retrospective case-control study comparing machine-learning and standard epidemiological approaches. BMC Med Inform Decis Mak 2019 Dec 02;19(1):248 [FREE Full text] [CrossRef] [Medline]
  14. Li R. Data Mining and Machine Learning Methods for Dementia Research. In: Perneczky R, editor. Biomarkers for Alzheimer’s Disease Drug Development. Methods in Molecular Biology, vol 1750. New York, NY: Humana Press; 2018:363-370.
  15. Chekroud AM, Zotti RJ, Shehzad Z, Gueorguieva R, Johnson MK, Trivedi MH, et al. Cross-trial prediction of treatment outcome in depression: a machine learning approach. The Lancet Psychiatry 2016 Mar;3(3):243-250. [CrossRef]
  16. Cleret de Langavant L, Bayen E, Yaffe K. Unsupervised Machine Learning to Identify High Likelihood of Dementia in Population-Based Surveys: Development and Validation Study. J Med Internet Res 2018 Jul 09;20(7):e10493 [FREE Full text] [CrossRef] [Medline]
  17. Hall A, Pekkala T, Polvikoski T, van Gils M, Kivipelto M, Lötjönen J, et al. Prediction models for dementia and neuropathology in the oldest old: the Vantaa 85+ cohort study. Alzheimers Res Ther 2019 Jan 22;11(1):11 [FREE Full text] [CrossRef] [Medline]
  18. Pekkala T, Hall A, Lötjönen J, Mattila J, Soininen H, Ngandu T, et al. Development of a Late-Life Dementia Prediction Index with Supervised Machine Learning in the Population-Based CAIDE Study. JAD 2016 Dec 06;55(3):1055-1067. [CrossRef]
  19. Xie Y, Yu Z. Models and prediction, how and what? Ann Transl Med 2020 Feb;8(4):75-75 [FREE Full text] [CrossRef] [Medline]
  20. Obermeyer Z, Emanuel EJ. Predicting the Future — Big Data, Machine Learning, and Clinical Medicine. N Engl J Med 2016 Sep 29;375(13):1216-1219. [CrossRef]
  21. Collins GS, Reitsma JB, Altman DG, Moons KG. Transparent Reporting of a Multivariable Prediction Model for Individual Prognosis or Diagnosis (TRIPOD). Circulation 2015 Jan 13;131(2):211-219. [CrossRef]
  22. Zeng Y. Chinese Longitudinal Healthy Longevity Survey and some research findings. Geriatr Gerontol Int 2004 Sep;4(s1):S49-S52. [CrossRef]
  23. Center for Healthy Aging and Development Studies. Peking University Open Research Data Platform.   URL: [accessed 2021-02-11]
  24. Zhang Z, Zahner GEP, Román GC, Liu X, Wu C, Hong Z, et al. Socio-demographic variation of dementia subtypes in china: Methodology and results of a prevalence study in Beijing, Chengdu, Shanghai, and Xian. Neuroepidemiology 2006;27(4):177-187 [FREE Full text] [CrossRef] [Medline]
  25. Kim Y, Margonis GA, Prescott JD, Tran TB, Postlewait LM, Maithel SK, et al. Nomograms to Predict Recurrence-Free and Overall Survival After Curative Resection of Adrenocortical Carcinoma. JAMA Surg 2016 Apr 01;151(4):365-373 [FREE Full text] [CrossRef] [Medline]
  26. Lee S, Gui X, Manquen M, Hamilton E. Use of Training, Validation, and Test Sets for Developing Automated Classifiers in Quantitative Ethnography. In: Eagan B, Misfeldt M, Siebert-Evenstone A, editors. Advances in Quantitative Ethnography. ICQE 2019. Communications in Computer and Information Science, vol 1112. Cham: Springer; 2019:117-127.
  27. Luo W, Phung D, Tran T, Gupta S, Rana S, Karmakar C, et al. Guidelines for Developing and Reporting Machine Learning Predictive Models in Biomedical Research: A Multidisciplinary View. J Med Internet Res 2016 Dec 16;18(12):e323 [FREE Full text] [CrossRef] [Medline]
  28. Lei Z, Li J, Wu D, Xia Y, Wang Q, Si A, et al. Nomogram for Preoperative Estimation of Microvascular Invasion Risk in Hepatitis B Virus-Related Hepatocellular Carcinoma Within the Milan Criteria. JAMA Surg 2016 Apr 01;151(4):356-363. [CrossRef] [Medline]
  29. The R Project for Statistical Computing.   URL: [accessed 2021-02-11]
  30. Deo RC. Machine Learning in Medicine. Circulation 2015 Nov 17;132(20):1920-1930. [CrossRef]
  31. Chinese Dementia Cognitive Impairment Diagnosis Treatment Guidelines Writing Group. Diagnosis and treatment of dementia (2018): risk factors and interventions of Alzhemer's disease. Natl Med J China 2018;98(19):1461-1466.
  32. Sundström A, Westerlund O, Kotyrlo E. Marital status and risk of dementia: a nationwide population-based prospective study from Sweden. BMJ Open 2016 Jan 04;6(1):e008565 [FREE Full text] [CrossRef] [Medline]
  33. Sommerlad A, Ruegger J, Singh-Manoux A, Lewis G, Livingston G. Marriage and risk of dementia: systematic review and meta-analysis of observational studies. J Neurol Neurosurg Psychiatry 2018 Mar 28;89(3):231-238 [FREE Full text] [CrossRef] [Medline]
  34. Njegovan V, Hing MM, Mitchell SL, Molnar FJ. The hierarchy of functional loss associated with cognitive decline in older persons. J Gerontol A Biol Sci Med Sci 2001 Oct 01;56(10):M638-M643. [CrossRef] [Medline]
  35. Katz S, Ford AB, Moskowitz RW, Jackson BA, Jaffe MW. Studies of illness in the aged. The index of ADL: a standardized measure of biological and psychosocial function. JAMA 1963 Sep 21;185:914-919. [CrossRef] [Medline]
  36. Calvin CM, Wilkinson T, Starr JM, Sudlow C, Hagenaars SP, Harris SE, et al. Predicting incident dementia 3-8 years after brief cognitive tests in the UK Biobank prospective study of 500,000 people. Alzheimers Dement 2019 Dec 13;15(12):1546-1557. [CrossRef] [Medline]
  37. Hebert LE, Weuve J, Scherr PA, Evans DA. Alzheimer disease in the United States (2010-2050) estimated using the 2010 census. Neurology 2013 Feb 06;80(19):1778-1783. [CrossRef]
  38. Zhou Z, Wang W, Li Y, Jin K, Wang X, Wang Z, et al. In-depth mining of clinical data: the construction of clinical prediction model with R. Ann Transl Med 2019 Dec;7(23):796-796 [FREE Full text] [CrossRef] [Medline]
  39. Kuiper JS, Zuidersma M, Oude Voshaar RC, Zuidema SU, van den Heuvel ER, Stolk RP, et al. Social relationships and risk of dementia: A systematic review and meta-analysis of longitudinal cohort studies. Ageing Res Rev 2015 Jul;22:39-57. [CrossRef] [Medline]
  40. Barberger-Gateau P, Commenges D, Gagnon M, Letenneur L, Sauvel C, Dartigues J. Instrumental activities of daily living as a screening tool for cognitive impairment and dementia in elderly community dwellers. J Am Geriatr Soc 1992 Nov 27;40(11):1129-1134. [CrossRef] [Medline]
  41. Vellas B, Scrase D, Rosenberg G, Andrieu S, Araujo de Carvalho I, Middleton L. Editorial: WHO Guidelines on Community-Level Interventions to Manage Declines in Intrinsic Capacity: The Road for Preventing Cognitive Declines in Older Age? J Prev Alzheimers Dis 2018;5(3):165-167. [CrossRef] [Medline]
  42. World Alzheimer Report 2016-Improving healthcare for people living with dementia: Coverage, quality and costs now and in the future. Alzheimer's Disease International. 2016.   URL: [accessed 2021-01-12]

CLHLS: Chinese Longitudinal Healthy Longevity Survey
CMMSE: Chinese version of the Mini-Mental State Examination
MMSE: Mini-Mental State Examination
NB: naïve Bayes
RF: random forest
RFE: recursive feature elimination
ROC: receiver operating characteristic
TRIPOD: Transparent Reporting of a Multivariable Prediction Model for Individual Prognosis or Diagnosis

Edited by G Eysenbach, Q Zeng; submitted 15.05.20; peer-reviewed by X Ling, C Li; comments to author 28.08.20; revised version received 14.01.21; accepted 18.01.21; published 24.02.21


©Mingyue Hu, Xinhui Shu, Gang Yu, Xinyin Wu, Maritta Välimäki, Hui Feng. Originally published in the Journal of Medical Internet Research (, 24.02.2021.

This is an open-access article distributed under the terms of the Creative Commons Attribution License (, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on, as well as this copyright and license information must be included.