Original Paper
Abstract
Background: Early prediction of treatment outcomes for patients with multidrug-resistant or rifampicin-resistant tuberculosis (MDR/RR-TB) undergoing extended therapy is crucial for enhancing clinical prognoses and preventing the transmission of this deadly disease. However, the absence of validated predictive models remains a significant challenge.
Objective: This study compared a conventional logistic regression model with machine learning (ML) models using demographic and clinical data to predict outcomes at 2 and 6 months of treatment for MDR/RR-TB. The goal was to advance model applications, refine control strategies, and boost MDR/RR-TB cure rates.
Methods: This retrospective study encompassed an internal cohort of 744 patients with MDR/RR-TB examined between January 2017 and June 2023, as well as an external cohort comprising 137 patients with MDR/RR-TB examined between March 2021 and June 2022. Data on culture conversion were collected at 2 and 6 months, and culture conversion was tracked in the external cohort at the same time points. The internal cohort was assigned as the training set, whereas the external cohort was used as the validation set. Logistic regression and 7 ML models were developed to predict the culture conversion of patients with MDR/RR-TB at 2 and 6 months of treatment. Model performance was evaluated using the area under the curve, accuracy, sensitivity, and specificity.
Results: In the internal cohort, culture conversion rates for MDR/RR-TB were 81.9% (485/592) at 2 months and 87.1% (406/466) at 6 months. The odds ratio for treatment success was 8.55 (95% CI 3.31-22.08) at 2 months and 20.33 (95% CI 6.90-59.86) at 6 months after conversion, with sensitivities of 86.5% and 92.2% and specificities of 57.1% and 63.2%, respectively. The artificial neural network model was the best for culture conversion at both 2 and 6 months of treatment, with areas under the curve of 0.82 (95% CI 0.77-0.86) and 0.90 (95% CI 0.86-0.93), respectively. The accuracy, sensitivity, and specificity of the model were 0.74, 0.74, and 0.75 at 2 months of treatment and 0.80, 0.79, and 0.87 at 6 months of treatment, respectively.
Conclusions: The ML models based on 2- and 6-month culture conversion could accurately predict treatment outcomes for patients with MDR/RR-TB. ML models, particularly the artificial neural network model, outperformed the logistic regression model in both stability and generalizability and offer a rapid and effective tool for evaluating therapeutic efficacy in the early stages of MDR/RR-TB treatment.
doi:10.2196/69998
Keywords
Introduction
Background
Multidrug-resistant or rifampicin-resistant tuberculosis (MDR/RR-TB; resistance to rifampicin and isoniazid or rifampicin alone) poses a significant challenge to tuberculosis (TB) control worldwide, particularly in limited-income countries [,]. As of 2023, the World Health Organization (WHO) estimated that 3.2% of new TB cases and 16% of retreatment cases worldwide were MDR/RR-TB []. Despite improvements in treatment options, including new drugs such as linezolid (LZD), bedaquiline (BDQ), and delamanid (DLM), the current cure rate for MDR/RR-TB remains at approximately 60% []. MDR/RR-TB treatment is more expensive, lengthier, and associated with more side effects compared to drug-susceptible TB []. In addition, prolonged treatment can lead to extensively drug-resistant TB, which has an 80% mortality rate [].
Sputum culture is the gold standard for assessing TB treatment efficacy, with culture conversion indicating a favorable outcome. However, sputum cultures are susceptible to contamination and lengthy incubation periods []. Due to the limitations of traditional assessment methods, innovative evaluation strategies have emerged, incorporating novel biomarkers, omics approaches, and therapeutic efficacy models [,]. However, these advanced biomarkers and omics-based analyses often rely on laboratory testing for efficacy assessment, which can be time-consuming and costly []. Furthermore, although some novel biomarkers have been used to evaluate treatment outcomes in TB, their application remains at an early stage and requires further validation []. Consequently, it is feasible to develop efficacy evaluation models based on demographic and clinical parameters such as age, gender, and treatment regimens to predict TB treatment outcomes. Currently, some studies use patient demographic and clinical features to construct predictive models for prognosis [-]. However, most of these models focus on predicting the ultimate outcomes of MDR/RR-TB, with a notable gap in early efficacy prediction models specifically for MDR/RR-TB. Therefore, the use of predictive models to forecast early treatment efficacy in patients with MDR/RR-TB not only advances the development of this field but also aids in monitoring therapeutic outcomes. There is evidence suggesting that culture conversion at 2 and 6 months after treatment initiation serves as an early predictor of successful outcomes in patients with MDR/RR-TB [], yet research in this area is still limited.
Objectives
Machine learning (ML) algorithms based on artificial intelligence are increasingly applied in medical and public health research, showing significant potential in disease prevention, diagnosis, and treatment [-]. The conventional logistic regression model is widely used as a benchmark against more complex ML algorithms to assess whether the latter offer any meaningful improvement in predictive performance []. This study compared conventional logistic regression with several ML models using demographic and clinical data to predict outcomes at 2 and 6 months of treatment for MDR/RR-TB. The goal was to advance model applications, refine control strategies, and boost MDR/RR-TB cure rates.
Methods
Study Design
We conducted a retrospective cohort study collecting data from patients with MDR/RR-TB diagnosed through drug susceptibility testing or molecular diagnostic testing and who had completed 2 or 6 months of treatment at Beijing Chest Hospital, Capital Medical University, and Guangzhou Chest Hospital in China. These patients were divided into a culture conversion group and no–culture conversion group. Predictive models for assessing the efficacy after 2 or 6 months of treatment were then constructed. The inclusion and exclusion criteria for the study are presented in .
Inclusion criteria
- Patients with a definitive laboratory diagnosis of multidrug-resistant or rifampicin-resistant tuberculosis (MDR/RR-TB; drug susceptibility testing or molecular diagnostic testing)
- Patients aged ≥16 years
- Patients who had not yet started MDR/RR treatment or had started treatment but required further adjustment due to poor efficacy
- Patients with positive sputum smear or culture at baseline
Exclusion criteria
- Patients with multidrug-resistant or rifampicin-resistant pulmonary TB without a definitive laboratory diagnosis
- Patients with isolated extrapulmonary TB
- Patients who had not received TB treatment
- Patients who died
- Patients with ≥80% of missing data
From January 3, 2017, to June 29, 2023, data from patients with MDR/RR-TB were collected at Beijing Chest Hospital, Capital Medical University, to form an internal training cohort. Patients were included for predicting treatment outcomes at both 2 and 6 months. Subsequently, an external validation cohort was established by collecting data from patients with MDR/RR-TB at Guangzhou Chest Hospital between March 1, 2021, and June 30, 2022. All patients with MDR/RR-TB included in this study were treated according to the WHO guidelines for MDR/RR-TB treatment regimens.
Data Collection and Processing
Data were collected using a Microsoft Excel form from the electronic patient management system of the hospital, which included demographic information, clinical data, and the outcomes of sputum culture conversion after 2 months of treatment. We gathered general demographic characteristics, treatment-related factors, comorbidities, computed tomography imaging features, immunological indicators, results of sputum smears and cultures, and treatment outcomes for each patient. Treatment success was defined as the achievement of 2 consecutive negative sputum cultures obtained at least 30 days apart following the completion of therapy in patients with MDR/RR-TB. Conversely, treatment failure was characterized by the failure to meet these criteria. After standardizing the data, patients with ≥80% of missing data were excluded. For ≤5% of individual missing data, numerical variables were filled in using the mean, and categorical variables were filled in using the mode, ensuring the clarity, logic, and completeness of the data.
Model Construction
Univariate logistic regression analysis was conducted on the internal training cohort treated for 2 and 6 months to identify factors affecting the sputum culture conversion outcomes in patients with MDR/RR-TB (P<.05). Multicollinearity and Spearman correlation analyses were conducted on these factors. A Spearman correlation coefficient of <0.4 was regarded as indicating a weak correlation. All variables with weak correlations were included in the model, whereas for those with strong correlations, only 1 variable with a greater impact on treatment outcomes was selected for model construction. The internal cohort treated for 2 and 6 months were used as the training sets to build the models, and the external cohort served as the validation sets to evaluate the models. Logistic regression, random forest (RF), support vector machine (SVM), gradient boosting decision tree (GBDT), elastic net (EN), and artificial neural network (ANN) models were developed using the selected variables. In addition, stacking and voting ensemble methods were applied to the single RF, SVM, GBDT, EN, and ANN models for ensemble learning. The predictive performance of the different models was compared.
Statistical Analysis
Baseline characteristics and univariate logistic regression analysis of the internal cohort were conducted using SPSS (version 27; IBM Corp). For count variables, frequency counts and percentages were used, and for quantitative variables, the mean and SD or median and IQR were used based on their normal distribution. To measure differences between groups, count variables were assessed using the chi-square test or Fisher exact test, whereas quantitative variables were evaluated using 2-tailed t tests or Mann-Whitney U tests based on their distribution. The odds ratio (OR) and its 95% CI were used to assess the strength of the association between influencing factors and early treatment outcomes in patients with MDR/RR-TB. Tolerance and variance inflation factor (VIF) tests were conducted to check for multicollinearity among selected variables; a tolerance of <0.1 or a VIF of >10 indicated the presence of multicollinearity. Spearman correlation coefficient matrix analysis was conducted using the corrplot package in RStudio (version 4.4.1; Posit PBC).
Prediction models were constructed using Python’s scikit-learn (Google Summer of Code project) ML library using the synthetic minority oversampling technique to balance the training set and enhance predictive performance. The hyperparameters were optimized using 5-fold cross-validation combined with grid search. The optimal hyperparameters and models for the RF, SVM, GBDT, EN, and ANN models were determined. The best parameter settings for each of the 5 models are shown in Table S1 in . Receiver operating characteristic curves were plotted, and 95% CIs were calculated to evaluate the predictive performance of the models through the area under the curve (AUC). In addition, the accuracy, sensitivity, specificity, and F1-scores of each model were compared, and the importance scores of variables in the explainable models were assessed. A 2-sided significance level was set, with P<.05.
Ethical Considerations
This study received ethics approval from the ethics committee of Beijing Chest Hospital, Capital Medical University, under approval YJS-2022-01 in accordance with the 2013 Declaration of Helsinki. Due to the retrospective nature of the research and the use of anonymized data, the ethics committee waived the requirement for patient informed consent.
Results
Patients and Follow-Up
A total of 2095 patients diagnosed with MDR/RR-TB were identified from 2 local hospitals in China. After applying strict inclusion and exclusion criteria, 42.1% (881/2095) of the patients with MDR/RR-TB were deemed eligible for the analysis. The internal cohort of 744 patients included 592 (79.6%) patients at 2 months of treatment and 466 (62.6%) patients at 6 months of treatment. Of these patients, 81.9% (485/592) achieved culture conversion after 2 months of treatment, and 87.1% (406/466) achieved culture conversion after 6 months of treatment. The external validation cohort included 137 patients, of whom 107 (78.1%) achieved culture conversion after 2 months of treatment and 122 (89.1%) achieved culture conversion after 6 months of treatment. The patient inclusion process is shown in . As shown in , a follow-up analysis of treatment outcomes in the internal cohort indicated that, for patients with MDR/RR-TB, the OR for treatment success following culture conversion after 2 months of treatment was 8.55 (95% CI 3.31-22.08), with a sensitivity of 86.5% (95% CI 81%-90.6%) and specificity of 57.1% (95% CI 34.4%-77.4%), and the OR for treatment success following culture conversion after 6 months of treatment was 20.33 (95% CI 6.90-59.86), with a sensitivity of 92.2% (95% CI 87%-95.5%) and a specificity of 63.2% (95% CI 38.6%-82.8%).

| Month of treatment | Treatment outcomes | ORa (95% CI) | P value | Sensitivityb (95% CI) | Specificityc (95% CI) | PPVd (95% CI) | NPVe (95% CI) | |||||||||||||||
| Number of failures | Number of successes | |||||||||||||||||||||
| 2 months | <.001 | 86.5 (81.0-90.6) | 57.1 (34.4-77.4) | 95.4 (91.1-97.7) | 29.3 (16.6-45.7) | |||||||||||||||||
| No culture conversion | 12 | 29 | 1 (reference) | |||||||||||||||||||
| Culture conversion | 9 | 186 | 8.55 (3.31-22.08) | |||||||||||||||||||
| 6 months | <.001 | 92.2 (87.0-95.5) | 63.2 (38.6-82.8) | 96.0 (91.5-98.2) | 46.2 (27.1-66.3) | |||||||||||||||||
| No culture conversion | 12 | 14 | 1 (reference) | |||||||||||||||||||
| Culture conversion | 7 | 166 | 20.33 (6.90-59.86) | |||||||||||||||||||
aOR: odds ratio.
bThe ability of early-treatment sputum culture status to predict treatment success.
cThe ability of early-treatment sputum culture status to predict treatment failure.
dPPV: positive predictive value.
eNPV: negative predictive value.
Clinical Characteristics
The characteristics of patients with culture conversion and no culture conversion in the internal cohort and external validation cohort are shown in Tables S2, S3, S4, and S5 in , respectively. There were statistically significant differences (P<.05) between patients for treatment of 2 months with culture conversion and no culture conversion in terms of age, occupation, marital status, drug resistance types, number of resistant drugs, medication compliance, pneumoconiosis, hepatitis, diabetes, cavity, mediastinal lymphadenopathy, C-reactive protein levels, lymphocyte percentage, absolute monocyte count, and sputum smear grading, whereas significant differences (P<.05) in age, marital status, contact individuals, TB treatment history, duration of treatment history, drug resistance types, number of resistant drugs, medication compliance, treatment regimens with LZD or BDQ, cavity, mediastinal lymphadenopathy, C-reactive protein levels, lymphocyte percentage, absolute lymphocyte count, and sputum smear grading were found between patients who achieved culture conversion and those who did not after 6 months of treatment. In the external validation cohort, patients exhibited more statistically significant differences in various characteristics between those with and without culture conversion at 2 and 6 months of treatment. After 2 months of treatment, there were no statistically significant differences in the characteristics of occupation, hepatitis, and absolute monocyte count compared to the internal cohort. However, after 6 months of treatment, there were no statistically significant differences in marital status, treatment regimens with BDQ, mediastinal lymphadenopathy, and absolute lymphocyte count compared to the internal cohort.
Predictor Selection
In the internal cohort with 2 months of MDR/RR-TB treatment, univariate logistic regression analysis revealed that age, marital status, drug resistance types, number of resistant drugs, medication compliance, pneumoconiosis, hepatitis, diabetes, cavity, mediastinal lymphadenopathy, lymphocyte percentage, and sputum smear grading (≥3) were associated with culture conversion (). The tolerance of 12 influence factors of >0.1 and VIF of <10 suggested that they did not have multivariate collinearity (Table S6 in ). The Spearman correlation coefficient matrix analysis revealed a strong correlation between age and marital status, as well as between drug resistance types and the number of resistant drugs, whereas the correlation coefficients among other variables were of <0.4 (Figure S1A-C in ). For the internal cohort treated for 2 months, this study selected age, drug resistance pattern, medication adherence, pneumoconiosis, hepatitis, diabetes, cavity, mediastinal lymphadenopathy, lymphocyte percentage, and sputum smear grade as predictors for model development.
| Variable name | β (SE) | Wald c2 | ORa (95% CI) | P value | |||||
| Age (y) | −0.033 (0.007) | 21.721 | 0.967 (0.954-0.981) | <.001b | |||||
| Gender (man) | −0.167 (0.239) | 0.486 | 0.847 (0.530-1.352) | .49 | |||||
| Occupation | |||||||||
| Worker | −0.152 (0.390) | 0.153 | 0.859 (0.400-1.845) | .70 | |||||
| Service staff | 0.211 (0.358) | 0.348 | 1.235 (0.613-2.490) | .56 | |||||
| Student | 0.859 (0.558) | 2.371 | 2.361 (0.791-7.048) | .12 | |||||
| Unemployed individual | 0.442 (0.323) | 1.875 | 1.556 (0.826-2.928) | .17 | |||||
| Others | −0.381 (0.291) | 1.721 | 0.683 (0.386-1.207) | .19 | |||||
| Ethnicity (non-Han) | 0.005 (0.385) | 0.000 | 1.006 (0.473-2.137) | .99 | |||||
| Marital status | |||||||||
| Married | −1.155 (0.284) | 16.594 | 0.315 (0.181-0.549) | <.001b | |||||
| Divorced or widowed | −0.759 (0.464) | 2.677 | 0.468 (0.189-1.162) | .10 | |||||
| Household registration (urban) | −0.159 (0.214) | 0.555 | 0.853 (0.561-1.296) | .46 | |||||
| Current residence (urban) | −0.032 (0.214) | 0.023 | 0.968 (0.636-1.474) | .88 | |||||
| BMI (kg/m2) | 0.044 (0.029) | 2.267 | 1.045 (0.987-1.106) | .13 | |||||
| Smoking habit (yes) | 0.134 (0.225) | 0.354 | 1.144 (0.735-1.778) | .55 | |||||
| Drinking habit (yes) | −0.123 (0.242) | 0.259 | 0.884 (0.551-1.420) | .61 | |||||
| TB close contact (yes) | 0.013 (0.338) | 0.001 | 1.013 (0.522-1.965) | .97 | |||||
| TB treatment history (yes) | −0.160 (0.234) | 0.467 | 0.852 (0.539-1.348) | .49 | |||||
| Duration of treatment history (y) | −0.015 (0.014) | 1.227 | 0.985 (0.959-1.012) | .27 | |||||
| Drug resistance types (XDRc) | −0.904 (0.220) | 16.859 | 0.405 (0.263-0.623) | <.001b | |||||
| Number of resistant drugs | −0.092 (0.033) | 7.714 | 0.912 (0.854-0.973) | .005b | |||||
| Medication compliance (yes) | 0.626 (0.263) | 5.650 | 1.870 (1.116-3.132) | .02b | |||||
| Treatment regimens with LZDd | −0.326 (0.294) | 1.228 | 0.722 (0.405-1.285) | .27 | |||||
| Treatment regimens with BDQe | −0.192 (0.219) | 0.770 | 0.825 (0.537-1.267) | .38 | |||||
| Treatment regimens with DLMf | −0.296 (0.444) | 0.446 | 0.744 (0.312-1.774) | .50 | |||||
| Extrapulmonary TB (yes) | −0.008 (0.339) | 0.001 | 0.992 (0.511-1.926) | .98 | |||||
| Pneumoconiosis (yes) | −1.400 (0.567) | 6.099 | 0.247 (0.081-0.749) | .01b | |||||
| Malignant tumor (yes) | 1.071 (1.043) | 1.055 | 2.919 (0.378-22.561) | .30 | |||||
| Hepatitis (yes) | −1.320 (0.435) | 9.201 | 0.267 (0.114-0.627) | .002b | |||||
| Diabetes (yes) | −0.459 (0.232) | 3.901 | 0.632 (0.401-0.996) | .048b | |||||
| Rheumatism (yes) | −0.576 (0.537) | 1.151 | 0.562 (0.196-1.611) | .28 | |||||
| Cavity (yes) | −0.727 (0.227) | 10.290 | 0.483 (0.310-0.754) | .001b | |||||
| Calcification (yes) | −0.327 (0.265) | 1.516 | 0.721 (0.429-1.213) | .22 | |||||
| Mediastinal lymphadenopathy (yes) | −1.032 (0.245) | 17.765 | 0.356 (0.220-0.576) | <.001b | |||||
| Pleural effusion (yes) | 0.290 (0.270) | 1.154 | 1.337 (0.787-2.270) | .28 | |||||
| Degree of pleural effusion | |||||||||
| Small amount | 0.224 (0.283) | 0.624 | 1.251 (0.718-2.178) | .43 | |||||
| Moderate amount | 1.258 (1.040) | 1.464 | 3.518 (0.458-26.989) | .23 | |||||
| Large amount | −0.352 (1.161) | 0.092 | 0.704 (0.072-6.845) | .76 | |||||
| Pulmonary lesion location | |||||||||
| Upper left | 1.827 (1.005) | 3.302 | 6.214 (0.866-44.578) | .07 | |||||
| Lower left | 1.350 (1.016) | 1.766 | 3.857 (0.527-28.241) | .18 | |||||
| Upper right | 1.099 (0.926) | 1.408 | 3.000 (0.489-18.415) | .24 | |||||
| Lower middle right | 1.504 (1.012) | 2.210 | 4.500 (0.619-32.695) | .14 | |||||
| Multiple lobes or both lungs | 0.562 (0.699) | 0.646 | 1.755 (0.445-6.913) | .42 | |||||
| CRPg (mg/L) | −0.003 (0.002) | 1.745 | 0.997 (0.992-1.002) | .19 | |||||
| Hbh (g/L) | 0.003 (0.005) | 0.275 | 1.003 (0.993-1.013) | .60 | |||||
| Platelet count (×109/L) | −0.001 (0.001) | 2.547 | 0.999 (0.997-1.000) | .11 | |||||
| Lymphocyte percentage | 0.037 (0.012) | 10.374 | 1.038 (1.015-1.062) | .001b | |||||
| ALCi (×109/L) | 0.125 (0.178) | 0.492 | 1.133 (0.800-1.605) | .48 | |||||
| Monocyte percentage | 0.027 (0.040) | 0.457 | 1.027 (0.950-1.110) | .50 | |||||
| AMCj (×109/L) | −0.109 (0.179) | 0.369 | 0.897 (0.632-1.274) | .54 | |||||
| Eosinophil percentage | 0.074 (0.061) | 1.499 | 1.077 (0.956-1.213) | .22 | |||||
| AECk (×109/L) | 0.196 (0.836) | 0.055 | 1.217 (0.237-6.261) | .81 | |||||
| Sputum smear grading of ≥1 | 0.090 (0.271) | 0.109 | 1.094 (0.643-1.860) | .74 | |||||
| Sputum smear grading of ≥2 | −0.443 (0.299) | 2.198 | 0.642 (0.358-1.153) | .14 | |||||
| Sputum smear grading of ≥3 | −1.325 (0.385) | 11.860 | 0.266 (0.125-0.565) | <.001b | |||||
| Sputum smear grading of ≥4 | −0.897 (0.630) | 2.025 | 0.408 (0.119-1.402) | .16 | |||||
aOR: odds ratio.
bVariables associated with culture conversion (P value <.05).
cXDR: extensively drug resistant.
dLZD: linezolid.
eBDQ: bedaquiline.
fDLM: delamanid.
gCRP: C-reactive protein.
hHb: hemoglobin.
iALC: absolute lymphocyte count.
jAMC: absolute monocyte count.
kAEC: absolute eosinophil count.
In the internal cohort with 6 months of MDR/RR-TB treatment, univariate logistic regression analysis showed that age, TB treatment history, treatment regimens with LZD, treatment regimens with BDQ, cavity, sputum smear grading of ≥2, and sputum smear grading of ≥3 were associated with culture conversion (). The tolerance of 6 influence factors of >0.1 and VIF of <10 suggest that they did not have multivariate collinearity (Table S7 in ). The correlation coefficients between variables were of <0.4 (Figure S1D-F in ). For the internal cohort treated for 6 months, this study selected age, TB treatment history, treatment regimens with LZD, treatment regimens with BDQ, cavity, and sputum smear grading as predictors for model development.
| Variable name | β (SE) | Wald c2 | ORa (95% CI) | P value | |||||
| Age (y) | −0.076 (0.021) | 13.548 | 0.926 (0.890-0.965) | <.001b | |||||
| Gender (man) | −0.354 (0.519) | 0.464 | 0.702 (0.254-1.941) | .50 | |||||
| Occupation | |||||||||
| Worker | −0.366 (0.706) | 0.269 | 0.693 (0.174-2.765) | .60 | |||||
| Service staff | −0.264 (0.742) | 0.127 | 0.768 (0.179-3.287) | .72 | |||||
| Student | −0.639 (1.277) | 0.250 | 0.528 (0.043-6.456) | .62 | |||||
| Unemployed individual | −0.134 (0.563) | 0.056 | 0.875 (0.290-2.636) | .81 | |||||
| Others | 0.290 (0.585) | 0.245 | 1.336 (0.425-4.204) | .62 | |||||
| Ethnicity (non-Han) | −0.960 (0.579) | 2.751 | 0.383 (0.123-1.190) | .10 | |||||
| Marital status | |||||||||
| Married | −0.004 (0.682) | 0.000 | 0.996 (0.262-3.791) | >.99 | |||||
| Divorced or widowed | 0.547 (0.938) | 0.340 | 1.728 (0.275-10.854) | .56 | |||||
| Household registration (urban) | 0.642 (0.808) | 0.631 | 1.901 (0.390-9.270) | .43 | |||||
| Current residence (urban) | −0.690 (0.748) | 0.851 | 0.501 (0.116-2.173) | .36 | |||||
| BMI (kg/m2) | 0.021 (0.061) | 0.118 | 1.021 (0.906-1.151) | .73 | |||||
| Smoking habit (yes) | 0.278 (0.499) | 0.309 | 1.320 (0.496-3.511) | .58 | |||||
| Drinking habit (yes) | 0.422 (0.518) | 0.662 | 1.524 (0.552-4.210) | .42 | |||||
| TB close contact (yes) | −0.028 (0.603) | 0.002 | 0.972 (0.298-3.169) | .96 | |||||
| TB treatment history (yes) | −1.466 (0.572) | 6.566 | 0.231 (0.075-0.708) | .01b | |||||
| Duration of treatment history (y) | 0.008 (0.023) | 0.116 | 1.008 (0.964-1.054) | .73 | |||||
| Drug resistance types (XDRc) | −0.464 (0.525) | 0.784 | 0.629 (0.225-1.758) | .38 | |||||
| Number of resistant drugs | −0.038 (0.269) | 0.269 | 0.963 (0.835-1.111) | .60 | |||||
| Medication compliance (yes) | 0.533 (0.430) | 1.534 | 1.704 (0.733-3.959) | .22 | |||||
| Treatment regimens with LZDd | 1.280 (0.439) | 8.493 | 3.595 (1.520-8.501) | .004b | |||||
| Treatment regimens with BDQe | 1.351 (0.464) | 8.484 | 3.862 (1.556-9.587) | .004b | |||||
| Treatment regimens with DLMf | −0.040 (1.167) | 0.001 | 0.961 (0.098-9.462) | .97 | |||||
| Extrapulmonary TB (yes) | 0.484 (0.636) | 0.579 | 1.623 (0.466-5.649) | .45 | |||||
| Pneumoconiosis (yes) | −0.377 (0.870) | 0.187 | 0.686 (0.125-3.773) | .67 | |||||
| Malignant tumor (yes) | 0.905 (1.338) | 0.457 | 2.471 (0.179-34.017) | .50 | |||||
| Hepatitis (yes) | 0.220 (0.690) | 0.101 | 1.246 (0.322-4.816) | .75 | |||||
| Diabetes (yes) | 0.004 (0.420) | 0.000 | 1.004 (0.441-2.289) | .99 | |||||
| Rheumatism (yes) | −0.069 (1.020) | 0.005 | 0.934 (0.127-6.891) | .95 | |||||
| Cavity (yes) | −0.948 (0.450) | 4.441 | 0.388 (0.161-0.936) | .04b | |||||
| Calcification (yes) | 0.609 (0.498) | 1.492 | 1.839 (0.692-4.884) | .22 | |||||
| Mediastinal lymphadenopathy (yes) | −0.070 (0.418) | 0.028 | 0.933 (0.411-2.114) | .87 | |||||
| Pleural effusion (yes) | −0.132 (0.436) | 0.091 | 0.877 (0.373-2.060) | .76 | |||||
| CRPg (mg/L) | −0.001 (0.006) | 0.046 | 0.999 (0.986-1.011) | .83 | |||||
| Hbh (g/L) | −0.001 (0.011) | 0.006 | 0.999 (0.977-1.021) | .94 | |||||
| Platelet count (×109/L) | 0.004 (0.002) | 3.476 | 1.004 (1.000-1.008) | .06 | |||||
| Lymphocyte percentage | 0.011 (0.043) | 0.066 | 1.011 (0.930-1.099) | .80 | |||||
| ALCi (×109/L) | 0.105 (0.592) | 0.032 | 1.111 (0.348-3.542) | .86 | |||||
| Monocyte percentage | −0.137 (0.082) | 2.799 | 0.872 (0.742-1.024) | .09 | |||||
| AMCj (×109/L) | 0.985 (0.964) | 1.045 | 2.678 (0.405-17.701) | .31 | |||||
| Eosinophil percentage | 0.570 (0.320) | 3.182 | 1.769 (0.945-3.309) | .07 | |||||
| AECk (×109/L) | −7.699 (3.933) | 3.832 | 0.000 (0.000-1.009) | .05 | |||||
| Sputum smear grading of ≥1 | −0.499 (0.473) | 1.115 | 0.607 (0.240-1.533) | .29 | |||||
| Sputum smear grading of ≥2 | −1.282 (0.523) | 6.020 | 0.277 (0.100-0.773) | .01b | |||||
| Sputum smear grading of ≥3 | −1.297 (0.611) | 4.510 | 0.273 (0.083-0.905) | .03b | |||||
| Sputum smear grading of ≥4 | −1.656 (1.095) | 2.287 | 0.191 (0.022-1.632) | .13 | |||||
aOR: odds ratio.
bVariables associated with culture conversion (P value <.05).
cXDR: extensively drug resistant.
dLZD: linezolid.
eBDQ: bedaquiline.
fDLM: delamanid.
gCRP: C-reactive protein.
hHb: hemoglobin.
iALC: absolute lymphocyte count.
jAMC: absolute monocyte count.
kAEC: absolute eosinophil count.
Performance Evaluation of Models
The metrics of performance, including the AUC, accuracy, sensitivity, specificity, and F1-score, are detailed in . For patients with MDR/RR-TB with treatment of 2 months, the AUCs for the RF, GBDT, ANN, stacking, and voting ensemble models in the training set were all of >0.80. The sensitivity and specificity of the GBDT, ANN, stacking, and voting ensemble models were close to or exceeded 0.70, with F1-scores of 0.86, 0.82, 0.91, and 0.83, respectively. In the external validation set, the AUCs for the GBDT, ANN, stacking, and voting ensemble models were 0.60, 0.86, 0.62, and 0.83, respectively. The accuracy of the ANN and voting ensemble models was 0.68 and 0.74, respectively. Although the ANN model exhibited a slightly higher AUC value than the voting ensemble model, the latter demonstrated superior accuracy. While the voting ensemble model could be considered a predictive tool, the ANN model may be preferred overall for predicting culture conversion and no culture conversion in patients with MDR/RR-TB after 2 months of treatment given its stability and clinical generalizability (A and 2B).
| Model | AUCa (95% CI) | Accuracy (95% CI) | Sensitivity (95% CI) | Specificity (95% CI) | F1-score (95% CI) | |||||
| 2 months of treatment | ||||||||||
| Training set | ||||||||||
| Logistic regression | 0.70 (0.64-0.75) | 0.65 (0.61-0.69) | 0.65 (0.61-0.69) | 0.64 (0.54-0.73) | 0.75 (0.72-0.79) | |||||
| RFb | 0.82 (0.77-0.86) | 0.79 (0.76-0.82) | 0.83 (0.80-0.86) | 0.59 (0.49-0.69) | 0.86 (0.84-0.89) | |||||
| SVMc | 0.69 (0.63-0.75) | 0.67 (0.63-0.71) | 0.68 (0.64-0.72) | 0.60 (0.50-0.70) | 0.77 (0.74-0.80) | |||||
| GBDTd | 0.89 (0.85-0.91) | 0.79 (0.75-0.82) | 0.78 (0.75-0.82) | 0.80 (0.73-0.87) | 0.86 (0.83-0.88) | |||||
| ENe | 0.70 (0.64-0.75) | 0.71 (0.67-0.75) | 0.76 (0.72-0.80) | 0.49 (0.40-0.58) | 0.81 (0.78-0.84) | |||||
| ANNf | 0.82 (0.77-0.86) | 0.74 (0.70-0.78) | 0.74 (0.70-0.78) | 0.75 (0.66-0.84) | 0.82 (0.79-0.85) | |||||
| Stacking | 0.90 (0.87-0.93) | 0.85 (0.82-0.88) | 0.88 (0.85-0.91) | 0.72 (0.63-0.80) | 0.91 (0.89-0.93) | |||||
| Voting ensemble | 0.81 (0.76-0.85) | 0.75 (0.71-0.79) | 0.76 (0.72-0.80) | 0.69 (0.60-0.78) | 0.83 (0.80-0.86) | |||||
| Validation set | ||||||||||
| Logistic regression | 0.81 (0.72-0.89) | 0.66 (0.57-0.73) | 0.59 (0.49-0.68) | 0.90 (0.79-1.00) | 0.71 (0.64-0.80) | |||||
| RF | 0.71 (0.58-0.82) | 0.72 (0.65-0.80) | 0.78 (0.70-0.85) | 0.53 (0.35-0.70) | 0.81 (0.75-0.87) | |||||
| SVM | 0.81 (0.72-0.88) | 0.69 (0.61-0.77) | 0.67 (0.58-0.76) | 0.77 (0.61-0.92) | 0.77 (0.71-0.84) | |||||
| GBDT | 0.60 (0.47-0.70) | 0.53 (0.45-0.61) | 0.55 (0.46-0.65) | 0.47 (0.27-0.64) | 0.65 (0.56-0.72) | |||||
| EN | 0.82 (0.73-0.90) | 0.78 (0.72-0.85) | 0.78 (0.69-0.86) | 0.80 (0.66-0.93) | 0.85 (0.79-0.90) | |||||
| ANN | 0.86 (0.76-0.93) | 0.68 (0.60-0.76) | 0.63 (0.53-0.72) | 0.87 (0.71-0.97) | 0.75 (0.67-0.80) | |||||
| Stacking | 0.62 (0.49-0.73) | 0.64 (0.55-0.71) | 0.68 (0.59-0.76) | 0.47 (0.27-0.64) | 0.74 (0.67-0.80) | |||||
| Voting ensemble | 0.83 (0.73-0.90) | 0.74 (0.66-0.80) | 0.73 (0.64-0.82) | 0.77 (0.60-0.90) | 0.81 (0.75-0.87) | |||||
| 6 months of treatment | ||||||||||
| Training set | ||||||||||
| Logistic regression | 0.86 (0.82-0.90) | 0.79 (0.75-0.82) | 0.79 (0.75-0.82) | 0.80 (0.70-0.89) | 0.87 (0.84-0.89) | |||||
| RF | 0.93 (0.90-0.95) | 0.86 (0.83-0.89) | 0.86 (0.83-0.89) | 0.83 (0.73-0.92) | 0.92 (0.89-0.93) | |||||
| SVM | 0.86 (0.82-0.90) | 0.79 (0.75-0.82) | 0.79 (0.75-0.83) | 0.78 (0.68-0.88) | 0.87 (0.84-0.89) | |||||
| GBDT | 0.94 (0.92-0.96) | 0.86 (0.83-0.89) | 0.85 (0.82-0.89) | 0.92 (0.84-0.98) | 0.92 (0.89-0.93) | |||||
| EN | 0.86 (0.82-0.90) | 0.82 (0.78-0.84) | 0.82 (0.78-0.85) | 0.77 (0.66-0.87) | 0.89 (0.86-0.91) | |||||
| ANN | 0.90 (0.86-0.93) | 0.80 (0.77-0.83) | 0.79 (0.75-0.83) | 0.87 (0.77-0.94) | 0.88 (0.85-0.90) | |||||
| Stacking | 0.94 (0.91-0.96) | 0.85 (0.82-0.88) | 0.85 (0.82-0.88) | 0.88 (0.78-0.96) | 0.91 (0.89-0.93) | |||||
| Voting ensemble | 0.91 (0.87-0.94) | 0.82 (0.78-0.85) | 0.82 (0.78-0.85) | 0.82 (0.72-0.91) | 0.89 (0.86-0.91) | |||||
| Validation set | ||||||||||
| Logistic regression | 0.77 (0.66-0.87) | 0.57 (0.47-0.66) | 0.52 (0.43-0.62) | 0.93 (0.79-1.00) | 0.68 (0.60-0.76) | |||||
| RF | 0.68 (0.52-0.82) | 0.64 (0.56-0.72) | 0.66 (0.57-0.74) | 0.53 (0.27-0.79) | 0.77 (0.70-0.83) | |||||
| SVM | 0.72 (0.59-0.83) | 0.55 (0.46-0.63) | 0.52 (0.43-0.61) | 0.73 (0.50-0.94) | 0.67 (0.59-0.74) | |||||
| GBDT | 0.70 (0.57-0.82) | 0.66 (0.58-0.74) | 0.66 (0.57-0.74) | 0.67 (0.40-0.92) | 0.78 (0.71-0.83) | |||||
| EN | 0.75 (0.62-0.87) | 0.61 (0.52-0.69) | 0.59 (0.50-0.67) | 0.73 (0.50-0.94) | 0.73 (0.65-0.79) | |||||
| ANN | 0.74 (0.61-0.88) | 0.72 (0.64-0.80) | 0.73 (0.65-0.81) | 0.60 (0.33-0.86) | 0.82 (0.76-0.87) | |||||
| Stacking | 0.72 (0.58-0.85) | 0.70 (0.62-0.78) | 0.72 (0.64-0.80) | 0.53 (0.27-0.80) | 0.81 (0.75-0.87) | |||||
| Voting ensemble | 0.73 (0.59-0.85) | 0.61 (0.51-0.68) | 0.59 (0.50-0.67) | 0.73 (0.50-0.94) | 0.73 (0.65-0.79) | |||||
aAUC: area under the curve.
bRF: random forest.
cSVM: support vector machine.
dGBDT: gradient boosting decision tree.
eEN: elastic net.
fANN: artificial neural network.

For patients with MDR/RR-TB with treatment of 6 months, the AUCs for the logistic regression, RF, SVM, GBDT, EN, ANN, stacking, and voting ensemble models in the training set were all of >0.80 (C). In addition, their sensitivity, specificity, and F1-scores were close to or exceeded 0.80, indicating that the 8 predictive models used in the training set demonstrated good predictive ability for culture conversion after 6 months of treatment. In the external validation set, the AUCs for the logistic regression, SVM, GBDT, EN, ANN, stacking, and voting ensemble models were all of >0.70 (D). Furthermore, the ANN model had an accuracy, sensitivity, specificity, and F1-score of 0.72, 0.73, 0.60, and 0.82, respectively. These results suggest that the ANN model had good predictive capability for culture conversion and no culture conversion in patients with MDR/RR-TB after 6 months of treatment. The relative importance ranking of predictive factors in the models is shown in Table S8 in .
Discussion
Principal Findings
The treatment duration for MDR/RR-TB cases is typically 20 to 24 months. Although the WHO has endorsed a standardized short-course regimen, anti-TB therapy lasts at least 6 months []. Prolonged treatment increases the burden on patients and health care providers, elevating the risks of poor adherence, treatment failure, and disease transmission []. Early prediction of treatment outcomes is crucial for timely modification of therapy regimens and improving cure rates. Sputum culture conversion is the gold standard for assessing treatment efficacy. In this study, we first developed ML models to predict early sputum culture conversion at 2 and 6 months in patients with MDR/RR-TB. We evaluated the performance of different models and identified the ANN model as the best, which can help inform clinical strategies.
Traditional early efficacy evaluation of MDR/RR-TB treatment relies on sputum smear and culture. However, sputum smear has low sensitivity and cannot distinguish between viable and nonviable bacteria [], whereas sputum culture is susceptible to contamination and requires a prolonged turnaround time for negative results []. The development of predictive models for early efficacy aimed to provide timely insights into treatment outcomes for patients with MDR/RR-TB, ultimately contributing to the prevention of disease transmission. Previous research indicates that culture conversion achieved at 2 months of treatment for multidrug resistant TB serves as a reliable predictor of treatment success [-]. In addition, recent evidence shows that the predictive performance of culture conversion varies across follow‑up time points []. Specifically, culture results at months 3, 6, and 24 have been found to correlate strongly with cure rates [,,]. This time‑dependent predictive value may be attributable to differences in the drug composition and combinations used in treatment regimens []. This study found a strong association between sputum culture status at 2 and 6 months and successful treatment outcomes in patients with MDR/RR-TB. Furthermore, the culture conversion rates at these time points in the cohort analyzed were close to or exceeded 80%, likely due to the inclusion of patients receiving regimens that incorporate new anti-TB drugs [,].
Several factors, including age, drug resistance types, number of resistant drugs, medication compliance, hepatitis, diabetes, cavity, and sputum smear grading, can influence treatment outcomes in patients with MDR/RR-TB [-]. This study revealed that these factors also influenced culture conversion after 2 months of treatment. Notably, marital status, pneumoconiosis, mediastinal lymphadenopathy, and lymphocyte percentage also serve as key determinants of culture conversion at the 2-month mark. While marital status and pneumoconiosis are nonmodifiable factors, they can serve as early prognostic markers. Among patients without culture conversion, a higher proportion of married individuals suggested a risk of household transmission, highlighting the urgent need for preventive education among spouses. Pneumoconiosis indicates compromised baseline lung function, which may lead to treatment delays or suboptimal efficacy. Clinicians who identify these features early can categorize patients as high risk, enabling proactive monitoring and intervention. In contrast, lymphocyte percentage can be readily obtained through routine laboratory tests at treatment initiation and is positively correlated with interferon gamma response []. Patients with higher baseline lymphocyte percentages tend to achieve earlier culture conversion, reflecting better immune status. For patients with low lymphocyte percentages, interventions such as nutritional support and immune regulation may help improve early treatment response. Furthermore, the presence of mediastinal lymphadenopathy on initial imaging often indicates more severe or disseminated disease and, thus, can inform clinical decisions regarding treatment intensity and duration.
After 6 months of treatment, factors such as age, cavity, sputum smear grading, TB treatment history, and regimens including LZD or BDQ continued to influence culture conversion, aligning with previous studies on MDR/RR-TB outcomes [,,]. Age as a nonmodifiable factor can serve as a risk marker for poorer outcomes, with older patients potentially requiring closer monitoring and tailored supportive care. The presence of lung cavities and higher sputum smear grading at baseline or during treatment indicate more extensive disease and a higher bacillary burden, which may delay or impair culture conversion. It is essential for clinicians to identify these signs early and strengthen monitoring efforts accordingly. Furthermore, a history of extensive previous treatment suggests the likelihood of resistant strains and a higher risk of delayed conversion, necessitating personalized treatment adjustments and adherence support. Importantly, the inclusion of LZD or BDQ in the regimen appears to positively influence culture conversion, emphasizing the need for regimen optimization based on evolving evidence and individual patient factors. Therefore, in the early stages of treatment, inherent and immutable characteristics should be used to stratify risk and initiate proactive management. These factors can be further integrated with other relevant variables to facilitate the development of more personalized treatment strategies, ultimately aiming to improve outcomes for patients with MDR/RR-TB.
For patients with MDR/RR-TB after 2 months of treatment, multiple models demonstrated good predictive performance for culture conversion in the training set. In the external validation set, the ANN model had strong generalization capabilities, with a prediction rate of 87% for patients without culture conversion. Thus, in clinical practice, the ANN model can be used to predict treatment efficacy in patients with MDR/RR-TB with 2 months of treatment. The 3 most important predictive factors identified by the ANN model were mediastinal lymphadenopathy, medication compliance, and sputum smear grading. For patients with MDR/RR-TB after 6 months of treatment, both the logistic regression and ML models showed good predictive performance regarding culture conversion during the training phase. However, in the external validation set, the ANN model performed better than the other predictive models. The 3 most important predictive factors identified by the ANN model included treatment regimens with BDQ or LZD, indicating that the use of BDQ and LZD is beneficial for culture conversion in patients with MDR/RR-TB after 6 months of treatment. In patients who meet the indications and can afford the costs, it is recommended to use treatment regimens that include BDQ or LZD. The findings of this study provide strategic guidance for the early prediction of treatment outcomes in patients with MDR/RR-TB ().

Although our results may inform clinical practice, several uncertainties remain. The predictive framework is affected by overlapping internal and external sources of uncertainty. Internally, finite-sample estimation of model parameters yields conditional 95% CIs that may shift with larger cohorts. Moreover, the true functional relationship between predictors and culture conversion is unknown, leading to model (nonparametric) uncertainty despite the use of ensembles. Externally, the single–tertiary care setting, single‑country training cohort may limit generalizability because population differences can produce distributional drift and degrade performance. Temporal changes in treatment protocols introduce additional parametric uncertainty, and policy‑level factors act as unmeasured confounders. These sources are not directly observable in real time, challenging reliability and transportability. Recent artificial intelligence methods offer model‑agnostic, multidimensional approaches that can explicitly quantify such uncertainties. One study extended the classic suspicious-infected-death with nonpharmacological policies epidemic model into a multi-input multi-output structure and used autoregressive with exogenous input, autoregressive moving average with exogenous input, and output error system identification frameworks to learn uncertainties in real time, with promising results []. Future research could adopt similar techniques to dynamically estimate random or correlated uncertainties in MDR/RR-TB treatment response, yielding patient-specific probabilistic prediction intervals and thereby enhancing the robustness of early efficacy assessment.
Comparison to Prior Work
The prediction of early treatment outcomes in MDR/RR-TB remains an unexplored area. The ANN model developed in this study demonstrated superior predictive performance compared to various models developed by Sauer et al [], which primarily used clinical and demographic features to predict the final treatment outcomes of patients with TB. Our ANN model achieved higher AUC values, outperforming those in previous studies on TB treatment outcome prediction [,]. In addition, the ANN model we developed for predicting MDR/RR-TB treatment outcomes at 6 months also surpassed the clinical prediction model developed by Lv et al [] for predicting 6-month treatment outcomes in multidrug-resistant TB. To facilitate further exploration in this field and improve clinical practice, we provide open-source tools and data for validation and application by other researchers and clinicians [].
Strengths and Limitations
Despite the comprehensive examination of ML algorithms for predicting early treatment outcomes in patients with MDR/RR-TB, this study has some limitations. First, it is important to note that 5 participants who died were excluded from the analysis. This decision may have introduced bias by potentially underestimating the challenges in predicting treatment outcomes among patients who are critically ill, thus impacting the overall generalizability of our model. Future studies should aim to include patients who are severely ill to validate and enhance the robustness of the model across diverse patient populations. Second, some CIs were relatively wide, primarily due to significant differences in the distribution of certain variables between the culture conversion and no–culture conversion groups, which are mainly limited by the sample size. To address this issue, future research should increase the sample size and optimize data collection. Third, the training data were sourced from a single center, necessitating the expansion to external datasets for validation to assess whether influencing factors differ in other clinical contexts. Finally, model testing was only conducted on 1 external retrospective cohort. To objectively evaluate the generalizability and reliability of the predictive model, it should be validated on multicenter independent datasets and prospective study datasets to determine its effectiveness in real clinical applications.
Conclusions
To conclude, the ML models based on 2- and 6-month culture conversion can accurately predict treatment outcomes for patients with MDR/RR-TB. ML models, particularly the ANN model, outperformed the logistic regression model in both stability and generalizability and offer a rapid and effective tool for evaluating therapeutic efficacy at the early stages of MDR/RR-TB treatment, thereby helping control the risk of MDR/RR-TB transmission.
Acknowledgments
This work was supported by the Beijing Municipal Science & Technology Commission (grants Z221100007422064 and Z221100007422056), Beijing Hospitals Authority Ascent Plan (grant DFL20221401), and Science and Technology Program of Guangzhou (grant 2023B03J1303).
Data Availability
The datasets and code used in this study are publicly available and can be downloaded via the link provided in the final reference. Further analytical details generated during the study are available from the corresponding author upon reasonable request.
Authors' Contributions
FZ and YP had full access to all of the data in the study and take responsibility for the integrity of the data and the accuracy of the data analysis. FZ, LL, and YP contributed to concept and design. FZ, ZY, XG, YD, SL, CY, YS, WR, RL, and HK were involved in the acquisition, analysis, or interpretation of the data. The drafting of the manuscript was carried out by FZ and YP. Administrative, technical, or material support was provided by FZ, ZY, LL, and YP. All authors have read and agreed to the published version of the manuscript.
Conflicts of Interest
None declared.
Additional tables and supplementary plots referred to in the paper.
DOC File , 28507 KBReferences
- Global tuberculosis report 2024. World Health Organization. URL: https://www.who.int/teams/global-tuberculosis-programme/tb-reports/global-tuberculosis-report-2024 [accessed 2024-10-29]
- Tiberi S, Utjesanovic N, Galvin J, Centis R, D'Ambrosio L, van den Boom M, et al. Drug resistant TB - latest developments in epidemiology, diagnostics and management. Int J Infect Dis. Nov 2022;124 Suppl 1:S20-S25. [FREE Full text] [CrossRef] [Medline]
- WHO consolidated guidelines on tuberculosis. Module 4: treatment - drug-resistant tuberculosis treatment, 2022 update. World Health Organization. Dec 15, 2022. URL: https://www.who.int/publications/i/item/9789240063129 [accessed 2022-12-15]
- Ahuja SD, Ashkin D, Avendano M, Banerjee R, Bauer M, Bayona JN, et al. Multidrug resistant pulmonary tuberculosis treatment regimens and patient outcomes: an individual patient data meta-analysis of 9,153 patients. PLoS Med. 2012;9(8):e1001300. [FREE Full text] [CrossRef] [Medline]
- Dheda K, Gumbo T, Maartens G, Dooley KE, McNerney R, Murray M, et al. The epidemiology, pathogenesis, transmission, diagnosis, and management of multidrug-resistant, extensively drug-resistant, and incurable tuberculosis. Lancet Respir Med. Mar 15, 2017:S2213-2600(17)30079-6. [CrossRef] [Medline]
- van Zyl-Smit RN, Binder A, Meldau R, Mishra H, Semple PL, Theron G, et al. Comparison of quantitative techniques including Xpert MTB/RIF to evaluate mycobacterial burden. PLoS One. 2011;6(12):e28815. [FREE Full text] [CrossRef] [Medline]
- Zhang F, Zhang F, Dong Y, Li L, Pang Y. New insights into biomarkers for evaluating therapy efficacy in pulmonary tuberculosis: a narrative review. Infect Dis Ther. Dec 2023;12(12):2665-2689. [FREE Full text] [CrossRef] [Medline]
- Zhang F, Zhang F, Li L, Pang Y. Clinical utilization of artificial intelligence in predicting therapeutic efficacy in pulmonary tuberculosis. J Infect Public Health. Apr 2024;17(4):632-641. [FREE Full text] [CrossRef] [Medline]
- Gaikwad UN, Gaikwad NR. Modalities to monitor the treatment response in tuberculosis. Indian J Tuberc. Apr 2018;65(2):109-117. [CrossRef] [Medline]
- Peetluk LS, Ridolfi FM, Rebeiro PF, Liu D, Rolla VC, Sterling TR. Systematic review of prediction models for pulmonary tuberculosis treatment outcomes in adults. BMJ Open. Mar 02, 2021;11(3):e044687. [FREE Full text] [CrossRef] [Medline]
- Holden IK, Lillebaek T, Seersholm N, Andersen PH, Wejse C, Johansen IS. Predictors for pulmonary tuberculosis treatment outcome in Denmark 2009-2014. Sci Rep. Sep 10, 2019;9(1):12995. [FREE Full text] [CrossRef] [Medline]
- Costa-Veiga A, Briz T, Nunes C. Unsuccessful treatment in pulmonary tuberculosis: factors and a consequent predictive model. Eur J Public Health. Apr 01, 2018;28(2):352-358. [CrossRef] [Medline]
- Lu P, Liu Q, Martinez L, Yang H, Lu W, Ding X, et al. Time to sputum culture conversion and treatment outcome of patients with multidrug-resistant tuberculosis: a prospective cohort study from urban China. Eur Respir J. Mar 22, 2017;49(3):1601558. [FREE Full text] [CrossRef] [Medline]
- Haug CJ, Drazen JM. Artificial intelligence and machine learning in clinical medicine, 2023. N Engl J Med. Mar 30, 2023;388(13):1201-1208. [CrossRef] [Medline]
- Bordoloi D, Singh V, Sanober S, Buhari SM, Ujjan JA, Boddu R. Deep learning in healthcare system for quality of service. J Healthc Eng. Mar 8, 2022;2022:8169203. [FREE Full text] [CrossRef] [Medline]
- Rahman A, Debnath T, Kundu D, Khan MS, Aishi AA, Sazzad S, et al. Machine learning and deep learning-based approach in smart healthcare: recent advances, applications, challenges and opportunities. AIMS Public Health. Jan 05, 2024;11(1):58-109. [FREE Full text] [CrossRef] [Medline]
- Christodoulou E, Ma J, Collins GS, Steyerberg EW, Verbakel JY, Van Calster B. A systematic review shows no performance benefit of machine learning over logistic regression for clinical prediction models. J Clin Epidemiol. Jun 2019;110:12-22. [CrossRef] [Medline]
- Lienhardt C, Raviglione M, Spigelman M, Hafner R, Jaramillo E, Hoelscher M, et al. New drugs for the treatment of tuberculosis: needs, challenges, promise, and prospects for the future. J Infect Dis. May 15, 2012;205 Suppl 2:S241-S249. [CrossRef] [Medline]
- Desikan P. Sputum smear microscopy in tuberculosis: is it still relevant? Indian J Med Res. Mar 2013;137(3):442-444. [FREE Full text] [Medline]
- Chihota VN, Grant AD, Fielding K, Ndibongo B, van Zyl A, Muirhead D, et al. Liquid vs. solid culture for tuberculosis: performance and cost in a resource-constrained setting. Int J Tuberc Lung Dis. Aug 2010;14(8):1024-1031. [Medline]
- Ige OM, Oladokun RE. Time to sputum culture conversion and treatment outcome among the first cohort of multidrug resistant tuberculosis patients in a high burden country. Indian J Tuberc. Oct 2018;65(4):322-328. [CrossRef] [Medline]
- Kurbatova EV, Cegielski JP, Lienhardt C, Akksilp R, Bayona J, Becerra MC, et al. Sputum culture conversion as a prognostic marker for end-of-treatment outcome in patients with multidrug-resistant tuberculosis: a secondary analysis of data from two observational cohort studies. Lancet Respir Med. Mar 2015;3(3):201-209. [FREE Full text] [CrossRef] [Medline]
- Lv L, Li T, Xu K, Shi P, He B, Kong W, et al. Sputum bacteriology conversion and treatment outcome of patients with multidrug-resistant tuberculosis. Infect Drug Resist. Jan 23, 2018;11:147-154. [FREE Full text] [CrossRef] [Medline]
- Phillips PP, Fielding K, Nunn AJ. An evaluation of culture results during treatment for tuberculosis as surrogate endpoints for treatment failure and relapse. PLoS One. May 8, 2013;8(5):e63840. [FREE Full text] [CrossRef] [Medline]
- Tierney DB, Franke MF, Becerra MC, Alcántara Virú FA, Bonilla CA, Sánchez E, et al. Time to culture conversion and regimen composition in multidrug-resistant tuberculosis treatment. PLoS One. Sep 19, 2014;9(9):e108035. [FREE Full text] [CrossRef] [Medline]
- Lee JK, Lee JY, Kim DK, Yoon HI, Jeong I, Heo EY, et al. Substitution of ethambutol with linezolid during the intensive phase of treatment of pulmonary tuberculosis: a prospective, multicentre, randomised, open-label, phase 2 trial. Lancet Infect Dis. Jan 2019;19(1):46-55. [CrossRef] [Medline]
- Holmgaard FB, Guglielmetti L, Lillebaek T, Andersen ÅB, Wejse C, Dahl VN. Efficacy and tolerability of concomitant use of bedaquiline and delamanid for multidrug- and extensively drug-resistant tuberculosis: a systematic review and meta-analysis. Clin Infect Dis. Apr 03, 2023;76(7):1328-1337. [CrossRef] [Medline]
- Ncha R, Variava E, Otwombe K, Kawonga M, Martinson NA. Predictors of time to sputum culture conversion in multi-drug-resistant tuberculosis and extensively drug-resistant tuberculosis in patients at Tshepong-Klerksdorp Hospital. S Afr J Infect Dis. Aug 26, 2019;34(1):111. [FREE Full text] [CrossRef] [Medline]
- Akinsola OJ, Yusuf OB, Ige OM, Okonji PE. Models for predicting time to sputum conversion among multi-drug resistant tuberculosis patients in Lagos, South-West Nigeria. Front Public Health. Nov 27, 2018;6:347. [FREE Full text] [CrossRef] [Medline]
- Tanneau L, Karlsson MO, Svensson EM. Understanding the drug exposure-response relationship of bedaquiline to predict efficacy for novel dosing regimens in the treatment of multidrug-resistant tuberculosis. Br J Clin Pharmacol. May 2020;86(5):913-922. [FREE Full text] [CrossRef] [Medline]
- Chen Q, He W, Du J, Kang W, Zou L, Tang X, et al. Predictors of early and interim culture un-conversion in multidrug-resistant/rifampicin-resistant tuberculosis: a retrospective multi-center cohort study in China. Antimicrob Resist Infect Control. Oct 15, 2024;13(1):126. [FREE Full text] [CrossRef] [Medline]
- Salindri AD, Kipiani M, Kempker RR, Gandhi NR, Darchia L, Tukvadze N, et al. Diabetes reduces the rate of sputum culture conversion in patients with newly diagnosed multidrug-resistant tuberculosis. Open Forum Infect Dis. Jun 16, 2016;3(3):ofw126. [FREE Full text] [CrossRef] [Medline]
- Weldemhret L, Atsbaha AH, Bekuretsion H, Desta A, Legesse L, Kahsay AG, et al. Time to sputum culture conversion and its predictors among multidrug resistant tuberculosis patients in Tigray, Northern Ethiopia: retrospective cohort study. Infect Drug Resist. Jun 09, 2023;16:3671-3681. [FREE Full text] [CrossRef] [Medline]
- Meshesha MD. Predictors of sputum culture conversion time among MDR/RR TB patients on treatment in a low-income setting. PLoS One. Nov 14, 2022;17(11):e0277642. [FREE Full text] [CrossRef] [Medline]
- Yihunie Akalu T, Muchie KF, Alemu Gelaye K. Time to sputum culture conversion and its determinants among multi-drug resistant tuberculosis patients at public hospitals of the Amhara regional state: a multicenter retrospective follow up study. PLoS One. Jun 21, 2018;13(6):e0199320. [FREE Full text] [CrossRef] [Medline]
- Liu Q, Lu P, Martinez L, Yang H, Lu W, Ding X, et al. Factors affecting time to sputum culture conversion and treatment outcome of patients with multidrug-resistant tuberculosis in China. BMC Infect Dis. Mar 06, 2018;18(1):114. [FREE Full text] [CrossRef] [Medline]
- Chedid C, Kokhreidze E, Tukvadze N, Banu S, Uddin MK, Biswas S, et al. Relevance of QuantiFERON-TB Gold Plus and Heparin-Binding Hemagglutinin Interferon-γ release assays for monitoring of pulmonary tuberculosis clearance: a multicentered study. Front Immunol. Feb 2, 2021;11:616450. [FREE Full text] [CrossRef] [Medline]
- Sotgiu G, Centis R, D'Ambrosio L, Alffenaar JW, Anger HA, Caminero JA, et al. Efficacy, safety and tolerability of linezolid containing regimens in treating MDR-TB and XDR-TB: systematic review and meta-analysis. Eur Respir J. Dec 2012;40(6):1430-1442. [FREE Full text] [CrossRef] [Medline]
- Tong E, Wu Q, Chen Y, Liu Z, Zhang M, Zhu Y, et al. The efficacy and safety of bedaquiline in the treatment of pulmonary tuberculosis patients: a systematic review and meta-analysis. Antibiotics (Basel). Aug 31, 2023;12(9):1389. [FREE Full text] [CrossRef] [Medline]
- Tutsoy O, Balikci K, Ozdil NF. Unknown uncertainties in the COVID-19 pandemic: multi-dimensional identification and mathematical modelling for the analysis and estimation of the casualties. Digit Signal Process. Jul 2021;114:103058. [FREE Full text] [CrossRef] [Medline]
- Sauer CM, Sasson D, Paik KE, McCague N, Celi LA, Sánchez Fernández I, et al. Feature selection and prediction of treatment failure in tuberculosis. PLoS One. Nov 20, 2018;13(11):e0207491. [FREE Full text] [CrossRef] [Medline]
- Alene KA, Viney K, Gray DJ, McBryde ES, Xu Z, Clements AC. Development of a risk score for prediction of poor treatment outcomes among patients with multidrug-resistant tuberculosis. PLoS One. Jan 03, 2020;15(1):e0227100. [FREE Full text] [CrossRef] [Medline]
- Peetluk LS, Rebeiro PF, Ridolfi FM, Andrade BB, Cordeiro-Santos M, Kritski A, et al. A clinical prediction model for unsuccessful pulmonary tuberculosis treatment outcomes. Clin Infect Dis. Mar 23, 2022;74(6):973-982. [FREE Full text] [CrossRef] [Medline]
- Lv X, Li Y, Cai B, He W, Wang R, Chen M, et al. Utility of machine learning and radiomics based on cavity for predicting the therapeutic response of MDR-TB. Infect Drug Resist. Oct 28, 2023;16:6893-6904. [FREE Full text] [CrossRef] [Medline]
- Tuberculosis-gif / MDR-RR-TB. GitHub. URL: https://github.com/Tuberculosis-gif/MDR-RR-TB [accessed 2025-09-16]
Abbreviations
| ANN: artificial neural network |
| AUC: area under the curve |
| BDQ: bedaquiline |
| EN: elastic net |
| GBDT: gradient boosting decision tree |
| LZD: linezolid |
| MDR/RR-TB: multidrug-resistant or rifampicin-resistant tuberculosis |
| ML: machine learning |
| OR: odds ratio |
| RF: random forest |
| SVM: support vector machine |
| TB: tuberculosis |
| VIF: variance inflation factor |
| WHO: World Health Organization |
Edited by A Coristine; submitted 12.Dec.2024; peer-reviewed by R Mpofu, S Ahmed, O Tutsoy; comments to author 05.Jun.2025; revised version received 24.Jul.2025; accepted 04.Sep.2025; published 22.Sep.2025.
Copyright©Fuzhen Zhang, Zilong Yang, Xiaonan Geng, Yu Dong, Shanshan Li, Cong Yao, Yuanyuan Shang, Weicong Ren, Ruichao Liu, Haobin Kuang, Liang Li, Yu Pang. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 22.Sep.2025.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research (ISSN 1438-8871), is properly cited. The complete bibliographic information, a link to the original publication on https://www.jmir.org/, as well as this copyright and license information must be included.

