微信公众号

官网二维码

中国癌症防治杂志 ›› 2023, Vol. 15 ›› Issue (5): 549-555.doi: 10.3969/j.issn.1674-5671.2023.05.13

• 临床研究 • 上一篇    下一篇

上消化道癌及癌前病变高危人群预测模型研究

  

  1. 山西医科大学公共卫生学院流行病学教研室;山西省肿瘤医院/中国医学科学院肿瘤医院山西医院肿瘤防控办公室;阳城县肿瘤医院科研办公室
  • 出版日期:2023-10-25 发布日期:2023-11-03
  • 通讯作者: 张永贞 E-mail:zyzzhang2003@163.com
  • 基金资助:
    国家重点研发计划专项课题(2016YFC1302802)

Prediction model of high-risk population of upper gastrointestinal cancer and precancerous lesions

  • Online:2023-10-25 Published:2023-11-03

摘要: 目的 分析上消化道癌及癌前病变的危险因素,构建上消化道癌高危人群预测模型,识别上消化道癌高危群体。 方法 选取2020年6月至2021年12月参加山西省阳城县“农村上消化道癌早诊早治项目”中的40~69岁人群,根据纳入和排除标准筛选后按7∶3随机分为训练集(n=1 997)和验证集(n=852),分别用于模型的训练和验证。采用χ2 检验进行单因素分析,P<0.2的因素进行最优子集变量筛选,选择赤池信息(akaike information criterion,AIC)最低的变量组合构建logistic回归模型并建立评分量表。绘制受试者工作特征曲线(receiver operating characteristic curve,ROC)并根据曲线下面积(area under the curve,AUC)评估模型区分度,Hosmer⁃Lemeshow(H⁃L)检验和校准曲线评估模型校准度,临床决策曲线(decision curve analysis,DCA)评估临床适用性。结果 建立基于年龄、性别、吸烟、热烫饮食摄入、肿瘤家族史5项危险因素的上消化道癌及癌前病变logistic回归预测模型,训练集和验证集AUC分别为0.759(95%CI:0.688~0.830)和0.743(95%CI:0.606~0.880),校准曲线结合H⁃L检验证明该模型具有较好的校准度(P>0.05)。根据logistic回归模型建立评分模型,分值范围0~27分,分值越高发病风险越高。评分模型训练集和验证集的AUC分别为0.760(95%CI:0.690~0.829)和0.748(95%CI:0.612~0.884),校准曲线结合H⁃L检验证明该模型具有较好的校准度(P>0.05)。DCA表明该模型具有良好的临床适用性。 结论 基于年龄、性别、吸烟、热烫饮食摄入、肿瘤家族史等5个危险因素构成的上消化道癌及癌前病变高危人群预测模型和评分模型具有较好的预测价值,有助于上消化道癌人群筛查。

关键词: 上消化道癌, 预测模型, 筛查, 早诊早治

Abstract: Objective To analyze the risk factors of upper gastrointestinal cancer and precancerous lesions, to construct a prediction model for high⁃risk population of upper gastrointestinal cancer, and to identify high⁃risk groups of upper gastrointestinal cancer. Methods The population group at age 40-69 who participated in the "Rural Upper Gastrointestinal Cancer Early Diagnosis and Treatment Project " in Yangcheng County, Shanxi Province from June 2020 to December 2021 were selected, and randomly divided into a training set (n=1,997) and a validation set (n=852), according to the inclusion and exclusion criteria with a ratio of 7 to 3, used for training and validation of the model, respectively. The chi⁃square test was used for univariate analysis, the factors with P < 0.2 were screened for the optimal subset of variables, and the combination of variables with the lowest akaike information criterion (AIC) was selected to construct a logistic regression model and establish a scoring scale. The receiver operating characteristic curve (ROC) was drawn and the discrimination of the model was evaluated according to the area under the curve (AUC). The calibration of the model was evaluated by the Hosmer⁃Lemeshow (H⁃L) test and the calibration curve. The clinical decision curve analysis (DCA) was used to evaluate the clinical applicability. Results A logistic regression prediction model for the high⁃risk population of upper gastrointestinal cancer and precancerous lesions was established based on five risk factors, including age, sex, smoking, hot food intake and family history of cancer. The AUCs of the training set and the validation set of the model were 0.759 (95%CI: 0.689-0.830) and 0.743 (95%CI: 0.606-0.880), respectively. The calibration curve combined with H⁃L test proved that the model had good calibration (P>0.05). According to the logistic regression model, a scoring scale model was established with a score ranging from 0 to 27 points. The higher the score, the higher the risk of disease. The AUCs of the training set and the validation set were 0.760 (95%CI: 0.690-0.829) and 0.748 (95%CI: 0.612-0.884), respectively. The calibration curve combined with the H⁃L test proved that the model had good calibration (P>0.05).The DCA indicated that the model had good clinical applicability. Conclusions The prediction model and scoring scale model of the high⁃risk population of upper gastrointestinal cancer and precancerous lesions, based on the five risk factors including age, sex, smoking, hot food intake and family history of cancer, have good predictive value, which are helpful for the screening of the upper gastrointestinal cancer population.

Key words:  , Upper gastrointestinal cancer, Prediction model, Screening, Early diagnosis and treatment

中图分类号: 

  •