Abstract:
OBJECTIVE To analyze the trend of Mycobacterium tuberculosis (MTB) infection using time series analysis of autoregressive moving average (ARIMA) model, and the distribution characteristics of MTB genotypes in Hami regions from 2018 to 2022.
METHODS Totally 788 strains of MTB were isolated from tuberculosis patients treated in our hospital from Jan. 2018 to Dec. 2022, and the distribution of MTB genotypes was analyzed by McSpoligotyping. The seasonal ARIMA model and the determined model parameters (p, d, q)(P, D, Q)s predicted the number of MTB infections in 2023. By comparing to the observed infection data in 2023, the prediction effect of the model was evaluated.
RESULTS From 2018 to 2022, totally 788 strains of MTB were identified by McSpoligotyping, of which 419 strains belonged to Beijing family with the highest proportion 53.17%, other family strains included EAI (n=24), H (n=64), T (n=79), LAM3 (n=1), Manu2 (n=3), X1(n=2) and other undefined family(n=195). The peak period of MTB infection was during Mar. and May, with a peak in April (13.58%). The best ARIMA model was established to be ARIMA(0, 1, 1)(0, 1, 1)12, of which AIC and BIC values were 3.258 and 3.460, respectively, there was no significant difference by white noise test(Ljung-Box Q=18.797, P=0.279).The actual values observed in 2023 were well within bounds of 95% confidence interval of our selected ARIMA prediction model; and ARIMA model performed good with RMSE, MAPE, MAE of 4.988, 22.977 and 3.646, respectively.
CONCLUSIONS The Beijing family genotype is the predominant for MTB infection in Hami region. Seasonality exists and the peak period of MTB infection is from March to May with the highest peak in April annually. ARIMA model has a good performance in predicting the short-term prevalence of MTB infection.