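Knowledge graph and visualization analysis of prediction models for pulmonary nodules/early-stage lung cancer (Chinese Journal of Clinical Thoracic and Cardiovascular Surgery)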

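Authors:

Ren Yifeng^1, Ma Qiong^1, Jiang Hua^1, Fu Xi^1, Li Xueke^1,2, Shi Wei^3, You Fengming^1,2

1. TCM Regulating Metabolic Diseases Key Laboratory of Sichuan Province, Hospital of Chengdu University of Traditional Chinese Medicine (Chengdu 610075);
2. Cancer Institute, Chengdu University of Traditional Chinese Medicine (Chengdu 610075);
3. Department of Anesthesiology, West China Hospital, Sichuan University (Chengdu 610041)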

Keywords:

Pulmonary nodules; lung cancer; prediction model; knowledge graph

DOI:

10.7507/1007-4848.202304026


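Objective: To characterize the current scientific output on prediction models for pulmonary nodules/early-stage lung cancer and to identify future research trends. Methods: We searched CNKI, Wanfang, VIP, and the Web of Science Core Collection for literature on prediction models for pulmonary nodules/early-stage lung cancer indexed from January 1, 2002 to June 3, 2023. CiteSpace 6.1.R3 and VOSviewer 1.6.18 were used to analyze current hotspots and thematic trends and to visualize the results. Results: A total of 12,581 authors from 2,711 institutions in 64 countries/regions published 2,139 English-language articles in 566 English-language journals, and 1,256 authors in China published 282 Chinese-language articles in 176 journals. The Chinese- and English-language journals that published the most articles on prediction models for pulmonary nodules/early-stage lung cancer were the Journal of Clinical Radiology and Frontiers in Oncology, respectively. Chest was the most frequently cited journal. China and the United States lead the field. Academic institutions, represented by Fudan University, have substantial academic influence in this field. Keyword analysis indicated that multi-omics, nomograms, machine learning, and artificial intelligence are the current focus of research on prediction models for pulmonary nodules/early-stage lung cancer. Conclusion: Over the past 20 years, research on risk prediction models for pulmonary nodules/early-stage lung cancer has attracted growing attention. Prediction, machine learning, artificial intelligence, nomograms, and multi-omics technologies are the current hotspots and development trends in the field. Future research urgently needs to combine multi-omics technologies for in-depth study of pulmonary nodules/early-stage lung cancer, and to conduct multicenter prospective clinical studies that iterate and update the prediction models, with the aim of reducing the global burden of lung cancer.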

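Lung cancer is the most common cause of cancer-related death worldwide, and its early diagnosis remains challenging^[1-4]. Although considerable progress has been made in lung cancer treatment, the poor prognosis of patients with advanced lung cancer has not improved^[5-7]. It is widely accepted that clinical survival outcomes are closely related to disease stage; one study^[8] showed that early diagnosis raises the 5-year relative survival rate from 6% for advanced lung cancer to 33% for intermediate-stage and 60% for early-stage lung cancer. Current management of pulmonary nodules relies mainly on follow-up surveillance, and effective interventions are still lacking for nodules that do not meet surgical indications. This adds to patients' physical and psychological burden and, to some extent, wastes medical resources. Early diagnosis and early treatment is therefore an effective strategy for reducing lung cancer-related mortality and economic burden.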

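With advances in cancer screening technology, particularly the improved resolution of low-dose computed tomography (LDCT), hundreds of thousands of patients are diagnosed with pulmonary nodules every year^[9-10]. One study^[11] showed that, because of increased false-positive rates and overdiagnosis risk, many patients with pulmonary nodules undergo unnecessary surgical procedures. Accurately assessing whether a nodule is benign or malignant before invasive surgery can reduce unnecessary operations and lessen patients' physical and psychological burden; it can also delay the progression of malignant nodules and reduce the waste of medical resources. Notably, risk prediction models for pulmonary nodules/early-stage lung cancer have been shown to significantly reduce the false-positive rate in lung cancer screening, and some guidelines now recommend using prediction models for screening. For example, the lung cancer screening guideline^[12] issued by the National Comprehensive Cancer Network (NCCN) emphasizes the importance of using risk prediction models to identify high-risk individuals with pulmonary nodules.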

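Initially, the variables in prediction models for pulmonary nodules/early-stage lung cancer were based mainly on patients' CT imaging features and clinical information. Because other important characteristic variables, such as biomarkers, were missing, high rates of false positives and of overdiagnosis and overtreatment could not be avoided^[13-14]. Searching for new diagnostic biomarkers with multi-omics technologies (including radiomics, genomics, proteomics, and metabolomics) offers a new entry point for improving the accuracy and sensitivity of lung cancer prediction models. The number of prediction models for pulmonary nodules/early-stage lung cancer is increasing rapidly, yet these studies have not been analyzed systematically and quantitatively. A systematic visual analysis of the existing literature helps researchers see the current status and trends of this research more directly, and thus identify the field's future directions. Although some reviews of risk prediction models for pulmonary nodules/early-stage lung cancer exist^[15], a quantitative assessment of how these models have evolved is still lacking. Bibliometrics can characterize the research dynamics of a discipline and present large volumes of literature data clearly in the form of knowledge graphs, providing a reference for future research.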

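In summary, this study characterized the current state of research on risk prediction models for pulmonary nodules/early-stage lung cancer and used bibliometric and visual analysis to explore research trends and recent developments. It provides a macro-level overview of the field and its hotspots, with the aim of offering a review perspective on possible future research directions.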

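1 Materials and methods

1.1 Data sources and search strategy

This study searched four databases: CNKI, Wanfang, VIP, and Web of Science. The search period was January 1, 2002 to June 3, 2023. The CNKI search query was: SU=（肺结节+肺部结节+肺癌+肺腺癌+肺鳞癌+非小细胞肺癌）AND SU=（预测模型+预后模型+列线图）, that is, the subject terms pulmonary nodule, lung nodule, lung cancer, lung adenocarcinoma, lung squamous cell carcinoma, or non-small cell lung cancer, combined with prediction model, prognostic model, or nomogram. The queries for Wanfang and VIP were adjusted slightly to suit each database's search features. Document types were limited to original articles and reviews.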

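1.2 Literature screening and data cleaning

The inclusion criterion was published original articles or reviews on prediction models for pulmonary nodules/early-stage lung cancer. The exclusion criteria were conference abstracts, unpublished articles, duplicate publications, errata, dissertations, letters, and articles unrelated to the research topic. Retrieved Chinese-language records were imported into NoteExpress in RefWorks format for duplicate checking and were then screened manually by two researchers working independently. For the English-language literature, the Web of Science search results were exported with full records and cited references as plain-text files and saved in "download.txt" format. Two authors independently screened the literature against the inclusion and exclusion criteria. Disagreements were resolved by discussion with a third author until consensus was reached.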
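The duplicate-checking step can be illustrated with a minimal Python sketch. It is not the authors' procedure (they used NoteExpress); the record fields `doi` and `title` and the sample records are assumptions made for illustration only.

```python
import re

def title_key(title: str) -> str:
    """Lowercase a title and drop everything except ASCII letters, digits, and CJK characters."""
    return re.sub(r"[^0-9a-z\u4e00-\u9fff]+", "", title.lower())

def deduplicate(records):
    """Keep the first record seen for each DOI, or for each normalized title when the DOI is missing."""
    seen = set()
    unique = []
    for rec in records:
        doi = (rec.get("doi") or "").strip().lower()
        key = ("doi", doi) if doi else ("title", title_key(rec.get("title", "")))
        if key not in seen:
            seen.add(key)
            unique.append(rec)
    return unique

# Hypothetical records, not taken from the study's dataset.
records = [
    {"doi": "10.1000/example.1", "title": "A risk model for lung cancer"},
    {"doi": "10.1000/EXAMPLE.1", "title": "A Risk Model for Lung Cancer."},
    {"doi": "", "title": "Nomogram for pulmonary nodules"},
    {"doi": None, "title": "Nomogram for Pulmonary Nodules"},
]
unique_records = deduplicate(records)
print(len(unique_records))
```

A record that carries a DOI in one database and none in another would not be matched by this sketch, which is one reason manual screening by two reviewers remains necessary.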

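Articles with complete key information (title, authors, institutions, and keywords) were selected, and the data were then cleaned. Duplicate institutions were merged: where one institution appeared under different names, the currently accepted standard name was used; different schools or colleges of the same university were attributed to the university, and different departments of the same hospital were attributed to the hospital. Duplicate keywords were merged by combining keywords with the same meaning.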
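The merging rules above amount to looking names up in an alias table. The following Python sketch shows the idea; the alias tables are hypothetical examples, since the paper does not list the mappings the authors actually used.

```python
from collections import Counter

# Hypothetical alias tables; the study's real mappings are not given in the paper.
INSTITUTION_ALIASES = {
    "fudan univ": "Fudan University",
    "fudan univ shanghai canc ctr": "Fudan University",
}
KEYWORD_ALIASES = {
    "ldct": "low-dose computed tomography",
    "low dose ct": "low-dose computed tomography",
    "nomograms": "nomogram",
}

def normalize(name: str, aliases: dict) -> str:
    """Map a raw name to its standard form; names without an alias are returned stripped."""
    key = " ".join(name.lower().replace(",", " ").split())
    return aliases.get(key, name.strip())

institution = normalize("Fudan Univ, Shanghai Canc Ctr", INSTITUTION_ALIASES)
raw_keywords = ["LDCT", "low dose CT", "Nomograms", "nomogram"]
keyword_counts = Counter(normalize(k, KEYWORD_ALIASES) for k in raw_keywords)
print(institution, dict(keyword_counts))
```

Keeping the mappings in explicit tables makes each merge decision reviewable, which matters because a wrong merge changes the institution and keyword rankings.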

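1.3 Bibliometric and visual analysis

The cleaned data were analyzed with VOSviewer 1.6.18^[16], CiteSpace 6.1.R3^[17], and an online analysis platform (http://bibliometric.com/)^[18,20] to examine research trends and emerging hotspots in prediction models for pulmonary nodules/early-stage lung cancer over the past 20 years^[19]. Journal impact factors (IF) were taken from the 2021 Journal Citation Reports (JCR). Microsoft Excel 2020 was used to tally annual publication counts and plot line charts, in order to analyze the annual growth in publications and citations and the level of research activity in the field.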
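The annual publication tally can also be reproduced directly from a Web of Science plain-text export, in which each record carries its publication year on a line tagged `PY`. The sketch below is an alternative to the Excel workflow described above, not the authors' method, and the sample text is made up.

```python
from collections import Counter

def count_by_year(export_text: str) -> Counter:
    """Count records per publication year from 'PY' lines of a field-tagged export."""
    counts = Counter()
    for line in export_text.splitlines():
        if line.startswith("PY "):
            counts[int(line[3:].strip())] += 1
    return counts

# Made-up sample in the field-tagged layout (PT = record start, ER = record end).
sample = """PT J
TI Example title one
PY 2020
ER

PT J
TI Example title two
PY 2021
ER

PT J
TI Example title three
PY 2021
ER
"""
annual = count_by_year(sample)
for year in sorted(annual):
    print(year, annual[year])
```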

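Chinese-language records were exported in RefWorks format to VOSviewer 1.6.18 to map author and institution collaboration networks and to perform keyword clustering and temporal evolution analysis. Because the Chinese databases cannot export cited-reference information, reference co-citation analysis was not performed for the Chinese-language literature. After the English-language records had been cleaned and screened, VOSviewer 1.6.18 was used to map journal citation frequency, the collaboration networks of authors, countries, and institutions, and keyword clustering and temporal evolution^[20-21]. CiteSpace was used for cluster analysis of highly co-cited references.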
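Keyword clustering starts from co-occurrence counts: two keywords are linked each time they appear on the same article. The Python sketch below shows only this counting step on hypothetical keyword lists; VOSviewer then applies its own normalization and clustering, which are not reproduced here.

```python
from collections import Counter
from itertools import combinations

def cooccurrence(keyword_lists):
    """Count how many articles each unordered keyword pair shares."""
    pairs = Counter()
    for keywords in keyword_lists:
        # sorted(set(...)) gives each pair one canonical order and ignores repeats within an article
        for a, b in combinations(sorted(set(keywords)), 2):
            pairs[(a, b)] += 1
    return pairs

# Hypothetical keyword lists, one per article.
papers = [
    ["lung cancer", "nomogram", "prognosis"],
    ["lung cancer", "machine learning"],
    ["lung cancer", "nomogram"],
]
pair_counts = cooccurrence(papers)
print(pair_counts[("lung cancer", "nomogram")])
```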

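2 Results

2.1 Search results

A total of 5,687 records on prediction models for pulmonary nodules/early-stage lung cancer were retrieved. After screening of titles, abstracts, and full texts, 2,421 articles were included: 2,139 in English and 282 in Chinese (Figure 1).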

Figure 1. Literature screening flowchart


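No.	English-language journal	Publications (n)	No.	Chinese-language journal	Publications (n)
1	Frontiers in Oncology	120	1	Journal of Clinical Radiology	13
2	Lung Cancer	47	2	Chinese Journal of Clinical Thoracic and Cardiovascular Surgery	9
3	Cancers	42	3	Chinese Journal of Lung Cancer	7
4	BMC Cancer	39	4	National Medical Journal of China	5
5	Frontiers in Genetics	39	5	Chinese Journal of Radiology	5
6	Journal of Thoracic Oncology	37	6	International Journal of Medical Radiology	5
7	Journal of Thoracic Disease	37	7	Radiologic Practice	5
8	Translational Lung Cancer Research	37	8	Journal of Modern Oncology	5
9	Scientific Reports	34	9	Chinese Journal of Nuclear Medicine and Molecular Imaging	4
10	PLoS One	31	10	Chinese Journal of Lung Diseases	4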

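No.	Country	Publications (n)	TLS
1	China	1125	234
2	United States	556	478
3	United Kingdom	127	232
4	Canada	105	190
5	Netherlands	101	203
6	Italy	94	195
7	Germany	87	187
8	Japan	82	42
9	France	70	180
10	Spain	59	138
TLS: total link strength, the total strength of the links between one country and the other countries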
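As a reading aid for the TLS column, the Python sketch below sums, for each country, the weights of all collaboration links that involve it. The link weights are hypothetical and are not the study's data.

```python
from collections import defaultdict

def total_link_strength(link_weights):
    """Sum the weights of every link touching each node."""
    tls = defaultdict(int)
    for (a, b), weight in link_weights.items():
        tls[a] += weight
        tls[b] += weight
    return dict(tls)

# Hypothetical co-authorship link weights (number of jointly authored articles).
link_weights = {
    ("China", "United States"): 5,
    ("China", "United Kingdom"): 2,
    ("United States", "United Kingdom"): 3,
}
tls = total_link_strength(link_weights)
print(tls)
```

On this reading, a country can publish many articles yet have a low TLS if most of them involve no international co-authors.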

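No.	Top 10 institutions by English-language publications	Publications (n)	No.	Top 10 institutions by Chinese-language publications	Publications (n)
1	Fudan University	60	1	Peking Union Medical College	8
2	Peking Union Medical College	56	2	Shanghai Chest Hospital, Shanghai Jiao Tong University	6
3	Zhejiang University	54	3	Chinese Academy of Medical Sciences	4
4	Nanjing Medical University	53	4	Peking University People's Hospital	4
5	University of Texas MD Anderson Cancer Center	48	5	Affiliated Hospital of Qingdao University	4
6	Shandong University	46	6	University of Shanghai for Science and Technology	3
7	Shanghai Jiao Tong University	45	7	Chinese Academy of Sciences	3
8	Tongji University	41	8	West China Hospital, Sichuan University	3
9	Sun Yat-sen University	41	9	Anhui Medical University	3
10	Memorial Sloan Kettering Cancer Center	40	10	Guangdong Provincial People's Hospital	3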

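No.	Top 10 authors by English-language publications	Publications (n)	No.	Top 10 authors by Chinese-language publications	Publications (n)
1	John K Field	17	1	Han Dong	5
2	Jie He	17	2	Yu Nan	4
3	Dirk De Ruysscher	16	3	Zhang Yongkui	4
4	Martin C Tammemagi	16	4	Yu Wei	3
5	Yi Zhang	16	5	Jiang Guanchao	3
6	Philippe Lambin	15	6	Cao Hanbo	3
7	Stephen W Duffy	14	7	Li Qiang	3
8	Pierre P Massion	14	8	Li Yun	3
9	David R Baldwin	13	9	Wang Jun	3
10	Wei Li	13	10	Wang Zhaoyu	3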

No.	Citations (n)	Link strength	Title	Year	First author	Journal	Country	Impact factor
1	291	2461	Reduced lung-cancer mortality with low-dose computed tomographic screening	2011	National Lung Screening Trial Research Team	The New England Journal of Medicine	United States	176.079
2	226	890	Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries	2018	Freddie Bray	CA: A Cancer Journal for Clinicians	United States	286.130
3	132	1531	Selection criteria for lung-cancer screening	2013	Martin C Tammemägi	The New England Journal of Medicine	United States	176.079
4	116	1221	Probability of cancer in pulmonary nodules detected on first screening CT	2013	Annette McWilliams	The New England Journal of Medicine	United States	176.079
5	110	1346	The LLP risk model: an individual risk prediction model for lung cancer	2008	A Cassidy	British Journal of Cancer	United Kingdom	9.082
6	106	1234	Variations in lung cancer risk among smokers	2003	Peter B Bach	Journal of the National Cancer Institute	United States	11.816
7	100	863	The probability of malignancy in solitary pulmonary nodules. Application to small radiologically indeterminate nodules	1997	S J Swensen	Archives of Internal Medicine	United States	1.2
8	96	1118	A risk model for prediction of lung cancer	2007	Margaret R Spitz	Journal of the National Cancer Institute	United States	11.816
9	85	846	Screening for lung cancer: U.S. Preventive Services Task Force recommendation statement	2014	Virginia A Moyer	Annals of Internal Medicine	United States	51.598
10	80	330	Tutorial in biostatistics multivariable prognostic models: Issues in developing models, evaluating assumptions and adequacy, and measuring and reducing errors	1996	Frank E Harrell Jr	Statistics in Medicine	United Kingdom	2.497

1.	Sung H, Ferlay J, Siegel RL, et al. Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin, 2021, 71(3): 209-249.
2.	Siegel RL, Miller KD, Fuchs HE, et al. Cancer statistics, 2022. CA Cancer J Clin, 2022, 72(1): 7-33.
3.	Jonas DE, Reuland DS, Reddy SM, et al. Screening for lung cancer with low-dose computed tomography: Updated evidence report and systematic review for the US Preventive Services Task Force. JAMA, 2021, 325(10): 971-987.
4.	Lovly CM. Expanding horizons for treatment of early-stage lung cancer. N Engl J Med, 2022, 386(21): 2050-2051.
5.	Yee J, Sadar MD, Sin DD, et al. Connective tissue-activating peptide Ⅲ: A novel blood biomarker for early lung cancer detection. J Clin Oncol, 2009, 27(17): 2787-2792.
6.	Pan J, Fang S, Tian H, et al. lncRNA JPX/miR-33a-5p/Twist1 axis regulates tumorigenesis and metastasis of lung cancer by activating Wnt/β-catenin signaling. Mol Cancer, 2020, 19(1): 9.
7.	Li N, Wang L, Hu Y, et al. Global evolution of research on pulmonary nodules: A bibliometric analysis. Future Oncol, 2021, 17(20): 2631-2645.
8.	Li N, Tan F, Chen W, et al. One-off low-dose CT for lung cancer screening in China: A multicentre, population-based, prospective cohort study. Lancet Respir Med, 2022, 10(4): 378-391.
9.	Chan MH, Huang WT, Wang J, et al. Next-generation cancer-specific hybrid theranostic nanomaterials: MAGE-A3 NIR persistent luminescence nanoparticles conjugated to afatinib for in situ suppression of lung adenocarcinoma growth and metastasis. Adv Sci (Weinh), 2020, 7(9): 1903741.
10.	Oudkerk M, Liu S, Heuvelmans MA, et al. Lung cancer LDCT screening and mortality reduction: Evidence, pitfalls and future perspectives. Nat Rev Clin Oncol, 2021, 18(3): 135-151.
11.	Fehlmann T, Kahraman M, Ludwig N, et al. Evaluating the use of circulating microRNA profiles for lung cancer detection in symptomatic patients. JAMA Oncol, 2020, 6(5): 714-723.
12.	Ten Haaf K, Tammemägi MC, Bondy SJ, et al. Performance and cost-effectiveness of computed tomography lung cancer screening scenarios in a population-based setting: A microsimulation modeling analysis in Ontario, Canada. PLoS Med, 2017, 14(2): e1002225.
13.	Lu MT, Raghu VK, Mayrhofer T, et al. Deep learning using chest radiographs to identify high-risk smokers for lung cancer screening computed tomography: Development and validation of a prediction model. Ann Intern Med, 2020, 173(9): 704-713.
14.	Muller DC, Johansson M, Brennan P. Lung cancer risk prediction model incorporating lung function: Development and validation in the UK Biobank prospective cohort study. J Clin Oncol, 2017, 35(8): 861-869.
15.	Cassidy A, Duffy SW, Myles JP, et al. Lung cancer risk prediction: A tool for early detection. Int J Cancer, 2007, 120(1): 1-6.
16.	van Eck NJ, Waltman L. Software survey: VOSviewer, a computer program for bibliometric mapping. Scientometrics, 2010, 84(2): 523-538.
17.	Chen C, Song M. Visualizing a field of research: A methodology of systematic scientometric reviews. PLoS One, 2019, 14(10): e0223994.
18.	Chen C. Searching for intellectual turning points: Progressive knowledge domain visualization. Proc Natl Acad Sci U S A, 2004, 101 Suppl 1(Suppl 1): 5303-5310.
19.	Jung JH, Chiang B, Grossniklaus HE, et al. Ocular drug delivery targeted by iontophoresis in the suprachoroidal space using a microneedle. J Control Release, 2018, 277: 14-22.
20.	Xie L, Chen Z, Wang H, et al. Bibliometric and visualized analysis of scientific publications on atlantoaxial spine surgery based on Web of Science and VOSviewer. World Neurosurg, 2020, 137: 435-442.
21.	Du Y, Duan C, Yang Y, et al. Heart transplantation: A bibliometric review from 1990-2021. Curr Probl Cardiol, 2022, 47(8): 101176.
22.	Toumazis I, Bastani M, Han SS, et al. Risk-based lung cancer screening: A systematic review. Lung Cancer, 2020, 147: 154-186.
23.	Gray EP, Teare MD, Stevens J, et al. Risk prediction models for lung cancer: A systematic review. Clin Lung Cancer, 2016, 17(2): 95-106.
24.	Zhang M, Zhou Y, Lu Y, et al. The 100 most-cited articles on prenatal diagnosis: A bibliometric analysis. Medicine (Baltimore), 2019, 98(38): e17236.
25.	MacMahon H, Li F, Jiang Y, et al. Accuracy of the Vancouver lung cancer risk prediction model compared with that of radiologists. Chest, 2019, 156(1): 112-119.
26.	Qiu YL, Zheng H, Devos A, et al. A meta-learning approach for genomic survival analysis. Nat Commun, 2020, 11(1): 6350.
27.	Shi L, Magee P, Fassan M, et al. A KRAS-responsive long non-coding RNA controls microRNA processing. Nat Commun, 2021, 12(1): 2038.
28.	Swensen SJ, Silverstein MD, Ilstrup DM, et al. The probability of malignancy in solitary pulmonary nodules. Application to small radiologically indeterminate nodules. Arch Intern Med, 1997, 157(8): 849-855.
29.	Spitz MR, Hong WK, Amos CI, et al. A risk model for prediction of lung cancer. J Natl Cancer Inst, 2007, 99(9): 715-726.
30.	Zhang Y, Yang M, Ng DM, et al. Multi-omics data analyses construct TME and identify the immune-related prognosis signatures in human LUAD. Mol Ther Nucleic Acids, 2020, 21: 860-873.
31.	Takahashi S, Asada K, Takasawa K, et al. Predicting deep learning based multi-omics parallel integration survival subtypes in lung cancer using reverse phase protein array data. Biomolecules, 2020, 10(10): 1460.
32.	Li W, Liu B, Wang W, et al. Lung cancer stage prediction using multi-omics data. Comput Math Methods Med, 2022, 2022: 2279044.
33.	Wang T, Shao W, Huang Z, et al. MOGONET integrates multi-omics data using graph convolutional networks allowing patient classification and biomarker identification. Nat Commun, 2021, 12(1): 3445.
34.	Xing W, Sun H, Yan C, et al. A prediction model based on DNA methylation biomarkers and radiological characteristics for identifying malignant from benign pulmonary nodules. BMC Cancer, 2021, 21(1): 263.
35.	Hu F, Huang H, Jiang Y, et al. Discriminating invasive adenocarcinoma among lung pure ground-glass nodules: A multi-parameter prediction model. J Thorac Dis, 2021, 13(9): 5383-5394.
36.	Hosny A, Parmar C, Coroller TP, et al. Deep learning for lung cancer prognostication: A retrospective multi-cohort radiomics study. PLoS Med, 2018, 15(11): e1002711.
37.	Chen K, Sun J, Zhao H, et al. Non-invasive lung cancer diagnosis and prognosis based on multi-analyte liquid biopsy. Mol Cancer, 2021, 20(1): 23.
38.	Aberle DR, Adams AM, et al. Reduced lung-cancer mortality with low-dose computed tomographic screening. N Engl J Med, 2011, 365(5): 395-409.
39.	Moyer VA. Screening for lung cancer: U.S. Preventive Services Task Force recommendation statement. Ann Intern Med, 2014, 160(5): 330-338.
40.	Carrillo-Perez F, Morales JC, Castillo-Secilla D, et al. Machine-learning-based late fusion on multi-omics and multi-scale data for non-small-cell lung cancer diagnosis. J Pers Med, 2022, 12(4): 601.
41.	Li W, Liu B, Wang W, et al. Lung cancer stage prediction using multi-omics data. Comput Math Methods Med, 2022, 2022: 2279044.

