基于MRI多病灶生境影像组学预测肝富血供转移瘤的原发灶来源

分享到微信朋友圈

English 下载PDF 235 6

• 临床研究 •

王荆贾平帆王效春

Cite this article as: WANG J, JIA P F, WANG X C. Prediction of the primary lesion origin of hepatic hypervascular metastases based on MRI multi-lesion habitat radiomics[J]. Chin J Magn Reson Imaging, 2026, 17(4): 70-78.本文引用格式：王荆, 贾平帆, 王效春. 基于MRI多病灶生境影像组学预测肝富血供转移瘤的原发灶来源[J]. 磁共振成像, 2026, 17(4): 70-78. DOI:10.12015/issn.1674-8034.2026.04.010.

摘要

[摘要] 目的基于MRI动脉晚期图像，构建并验证一种多病灶生境影像组学（multi-lesion habitat radiomics, ML-HR）模型，评估其无创预测肝富血供转移瘤（hypervascular liver metastases, HLM）胃肠道（gastrointestinal, GI）与非GI来源的价值。材料与方法 回顾性纳入双中心111例HLM患者的临床及对比增强MRI资料，按7∶3比例随机分为训练集与验证集。在动脉晚期图像上勾画所有病灶的感兴趣体积（volume of interest, VOI），提取局部影像组学特征并划分亚区域，采用14种机器学习算法分别构建传统单病灶影像组学（single-lesion radiomics, SLR）模型、多病灶影像组学（multi-lesion radiomics, MLR）模型及ML-HR模型，以鉴别HLM是否来源于GI。通过受试者工作特征曲线筛选最优算法并确定最佳模型。结果共纳入111名患者（241个病灶），其中训练集（n=77），验证集（n=34）。决策树（decision tree, DT）、径向基核函数支持向量机（radial basis function support vector machine, rbf_SVM）和极限梯度提升（eXtreme Gradient Boosting, XGBoost）分别为SLR、MLR以及ML-HR模型的最佳算法。ML-HR模型性能最优，训练集AUC为0.952（95%置信区间：0.904~0.988），验证集AUC为0.901（95%置信区间：0.765~0.997），显著优于传统模型（P<0.05）。结论 ML-HR模型可有效无创预测HLM的GI与非GI来源，为临床个体化诊疗提供可靠影像学依据。

[Abstract] Objective To develop and validate a multi-lesion habitat radiomics (ML-HR) model based on late arterial phase MRI and evaluate its value in non-invasively predicting the gastrointestinal (GI) versus non-GI origin of hypervascular liver metastases (HLM).Materials and Methods The clinical and contrast-enhanced MRI Data of 111 HLM patients from two centers were retrospectively included and randomly divided into the training set and the validation set in a 7∶3 ratio. The volume of interest (VOI) of all lesions was delineated on the late-stage arterial images. Local radiomics features were extracted and subregions were divided. Fourteen machine learning algorithms were adopted to respectively construct the traditional single-lesion radiomics (SLR) model, the traditional multi-lesion radiomics (MLR) model and the multi-lesion habitat radiomics (ML-HR) model. To identify whether HLM originates from the GI. The optimal algorithm is screened and the best model is determined through the receiver operating characteristic curve.Results A total of 111 patients (241 lesions) were included, among which the training set (n = 77) and the validation set (n = 34) were included. Decision tree (DT), radial basis function support vector machine (rbf_SVM), and eXtreme Gradient Boosting (XGBoost) were identified as the optimal algorithms for SLR, MLR, and ML-HR models, respectively. The ML-HR model has the best performance. The AUC of the training set is 0.952 (95% confidence interval: 0.904 to 0.988), and that of the validation set is 0.901 (95% confidence interval: 0.765 to 0.997), which is significantly better than the traditional model (P < 0.05).Conclusions The ML-HR model can effectively and non-invasively predict the GI versus non-GI origin of HLM, providing a reliable imaging basis for clinical personalized medicine.

[关键词] 富血供肝转移瘤；胃肠道；生境影像组学；影像组学；磁共振成像；个体化诊疗

[Keywords] hypervascular liver metastases；gastroIntestinal；habitat radiomics；radiomics；magnetic resonance imaging；personalized medicine

作者信息

王荆 ^{1,
2} 贾平帆 ³ 王效春 ^1*

¹ 山西医科大学第一医院磁共振影像科，太原　030001

² 长治医学院附属和济医院影像科，长治　046000

³ 长治医学院附属和平医院影像科，长治　046000

通信作者：王效春，E-mail: 2010xiaochun@163.com

作者贡献声明：：王效春设计了本研究的具体方案，对稿件重要的内容进行了修改；王荆起草和撰写稿件，获取、分析和解释本研究的数据；贾平帆获取、分析或解释本研究的数据，对稿件重要内容进行了修改；全体作者都同意发表最后的修改稿，同意对本研究的所有方面负责，确保本研究的准确性和诚信。

出版信息

收稿日期：2025-12-18

接受日期：2026-03-19

中图分类号：R445.2 R735.7

文献标识码：A

DOI: 10.12015/issn.1674-8034.2026.04.010

本文引用格式：王荆, 贾平帆, 王效春. 基于MRI多病灶生境影像组学预测肝富血供转移瘤的原发灶来源[J]. 磁共振成像, 2026, 17(4): 70-78. DOI:10.12015/issn.1674-8034.2026.04.010.

正文

0 引言

肝脏是最常见的血行转移器官，约占所有癌症转移的25%^[¹^]。肝转移瘤（liver metastasis, LM）大多为上皮来源的癌（约70%）^[²^]。富血供LM（hypervascular LM, HLM）在临床中相对少见，原发灶可来源于胃肠道（gastrointestinal, GI）肿瘤（如GI间质瘤、部分胃癌及结直肠癌）及非GI肿瘤（如肾细胞癌、部分神经内分泌肿瘤），影像学表现存在显著重叠，增加了鉴别诊断难度^[³^]。GI来源的局限性肝转移可考虑根治性切除^[⁴^]；而非GI来源的肝转移则对免疫治疗或抗血管生成治疗更为敏感。因此，准确判断原发灶来源对提升患者生存率具有重要临床意义。

影像学是评估肝转移的主要手段，多参数MRI在敏感性和特异性方面均优于其他影像方式^[⁵^,⁶^,⁷^]。HLM的强化方式与常见LM不同，常表现为动脉期环状、结节状或弥漫均匀明显强化，门脉期及延迟期强化程度逐渐下降^[⁸^]。动脉晚期能最清晰显示病灶轮廓并揭示瘤内强化不均，如活性区域明显强化与坏死/纤维化区域弱强化。这种瘤内异质性正是生境影像组学（habitat radiomics, HR）划分亚区的物理基础，对鉴别肿瘤来源至关重要。同时，动脉晚期时相较稳定，可重复性高，尤其利于检出小病灶，避免动脉早期填充不全所致漏诊。因此，选取该期相可为机器学习模型提供信息最丰富、鉴别力最强的影像特征。影像组学已展现出在无创诊断病变性质方面的显著潜力，其性能在多类研究中优于传统的影像判读方法^[⁹^,¹⁰^,¹¹^]，已显示出媲美深度学习的诊断效能^[¹²^]。然而，现有研究大多基于对单病灶的整体勾画，隐含了“瘤内异质性均匀分布”的前提假设^[¹³^,¹⁴^,¹⁵^,¹⁶^]，忽略了肝HLM常见的坏死、出血等瘤内异质性因素^[¹⁷^]，事实上，HLM常以多发病灶形式出现，不同病灶在病理生长方式和微观属性上存在差异^[¹⁸^]，而现有单病灶模型无法捕捉这种瘤间异质性。近年来，多病灶信息整合策略成为重要发展方向^[¹⁹^,²⁰^,²¹^,²²^]，但尚缺乏公认的最佳整合方法^[¹⁸^,²²^]。HR作为新一代技术，通过划分肿瘤内部具有不同生物学特性的亚区域，为深度揭示异质性提供了新途径^[²³^,²⁴^,²⁵^,²⁶^]，该技术已在直肠癌肝转移异质性分析^[²⁷^,²⁸^]、结直肠癌肝转移病理生长模式评估^[²⁹^]及肝细胞癌术后早期复发预测^[³⁰^]等研究中展现出卓越性能。因此，融合多病灶整合策略与HR，有望同时量化瘤内与瘤间异质性，进一步提升HLM原发灶来源的预测效能。

本研究基于MRI动脉晚期图像构建多病灶HR（multi-lesion HR, ML-HR）模型无创预测HLM的GI与非GI来源。首次基于双中心数据将HR应用于HLM溯源，提出多病灶特征整合策略，增强了模型的泛化能力与临床适用性。

1 材料与方法

1.1 研究对象

本双中心回顾性研究遵守《赫尔辛基宣言》，经山西医科大学第一医院伦理委员会、长治医学院附属和济医院伦理委员会批准，免除受试者知情同意，批准文号分别为KYLL-2025-485、K25-105。连续收集了山西医科大学第一医院（中心1）2021年3月至2025年9月的187例及长治医学院附属和济医院（中心2）2021年1月至2025年1月的32例HLM患者的临床及影像资料。纳入标准：（1）所有病例的原发肿瘤均来自腹部消化道并经组织病理学证实；（2）LM需经病理学和/或临床影像学综合诊断，其中部分病例同时具备原发灶与转移灶的病理学证实。排除标准：（1）在对比增强MRI（contrast-enhanced MRI, CE-MRI）检查之前，接受过放疗、化疗或靶向治疗等HLM治疗（n=47）；（2）非腹部消化道来源、两个及以上原发性肿瘤来源；（3）HLM病灶最大径层面直径小于5 mm（n=12）；（4）MRI图像运动伪影过重导致病灶无法准确判断、测量、勾画、提取（n=18）；（5）临床资料不完整（n=31）。

基于MRI检查结果，将患者分为两类：单病灶患者（病理学和/或临床影像学综合诊断显示肝脏内仅有1个转移灶的患者）；多病灶患者（病理学和/或临床影像学综合诊断显示肝脏内存在2个及以上转移灶的患者）。

依据原发肿瘤是否来源于GI，将肝转移患者分为两类：GI来源（GI间质瘤、神经内分泌肿瘤、胃癌及结直肠癌）与非GI来源（肝、胰腺神经内分泌肿瘤、原发性肝癌、胆道癌、胰腺癌），并按照7∶3的比例随机分为训练集和验证集。

1.2 图像采集

双中心4台3.0 T扫描仪（1台skyra，Siemens Healthineers，德国；1台Signa HDxt、2台GE 750w，GE Healthcar，美国）采集LM患者的术前上腹部增强MRI图像，采用16通道矩阵线圈采集图像，扫描范围从膈顶到肝脏下缘。成像序列包括T1WI、T2WI、T2WI脂肪抑制、DWI及多期动态增强扫描等。在采集DWI图像后向肘静脉内以3.0 mL/s的速率注入对比剂（钆喷酸葡胺，0.1 mL/kg），随后以相同流率注入20 mL生理盐水冲洗，分别于15 s，45 s，2.5 min，4.5~6.0 min时采集动脉早晚期、静脉期、延迟期图像。取动态增强动脉晚期序列作为目标图像来源。各扫描仪成像协议详见表1。

点击查看表格

表1 各扫描仪参数值
Tab. 1 Parameter values of each scanner

1.3 病灶勾画和生境亚区域划分

由1名放射科住院医师使用3D Slicer软件（版本5.2.2）勾画所有病灶的感兴趣体积（volume of interest, VOI），另1名放射科主治医师随机抽取20例患者进行二次勾画评估可重复性。对于多病灶患者，勾画肝脏内所有可清晰辨识的转移灶（直径≥5 mm）。

然后将瘤灶内每个像素及其邻近像素组成一个5×5×5的滑动窗口，并使用pyradiomics软件包提取每个像素的局部影像组学特征，包括对比度（Contrast）、差熵（DifferenceEntropy）、连接能量（JointEnergy）、连接熵（JointEntropy）和相关性（Correlation），以形成每个像素的5维特征向量。为保证特征间量级一致，将所有特征值缩放至0~1的范围。采用高斯混合模型（GMM）聚类所有像素识别肿瘤内的相似亚区域，将聚类簇K的范围设为2~9，使用Silhouette系数计算最佳K值并作为亚区域的个数（图1），Silhouette仅评估聚类结构质量，并不直接代表生物学机制，但可为生境划分的合理性提供量化支撑。以上生境亚区域生成采用PixelMed AI平台完成。

点击查看大图

图1 Silhouette系数图。Silhouette系数用于衡量聚类结果的类内紧密性与类间分离度，取值越高表示聚类结构越清晰，当K=3时Silhouette系数明显高于相邻（K）值，说明将病灶划分为3个亚区域可在“区分度”与“稳定性/可解释性”之间取得较佳平衡，因此本研究选取K=3。
Fig. 1 The Silhouette coefficient diagram. The Silhouette coefficient is used to measure the intra-class tightness and inter-class separation degree of clustering results. The higher the value, the clearer the clustering structure. When K = 3, the Silhouette coefficient is significantly higher than the adjacent (K) value, indicating that dividing the lesion into three subregions can achieve a better balance between "discrimination" and "stability/interpretability". Therefore, K = 3 was selected in this study.

1.4 影像组学特征提取

图像重采样至3 mm×3 mm×3 mm（图像用三线性插值，掩膜用最近邻插值）。灰度值离散化为25个等宽箱（箱宽=25）。该箱宽选择基于以下考虑：（1）既往影像组学研究显示，25~32个箱宽能在保持纹理特征稳定性的同时，避免因箱宽过小引入噪声或箱宽过大丢失细节；（2）经测试，25个箱宽在本研究数据集中能最大化特征的可重复性，即组内相关系数（intra-class correlation coefficient, ICC）>0.75的特征比例最高。

分别对图像中的肿瘤区域以及亚区域进行影像组学特征提取。对所有瘤灶的VOI以及每个亚区域的原始图像和8种滤波器：小波（Wavelet）、高斯拉普拉斯（LoG）、梯度（Gradient）、球面谐波的局部二元模式3D（LocalBinaryPattern3D）、指数（Exponential）、平方（Square）、平方根（SquareRoot）和对数（Logarithm），提取未滤波（原始图像）和滤波特征进行分析。分别提取4683个特征。其中：一阶（firstorder）特征（918个）、形状（shape）特征（42个）、灰度共生矩阵（GLCM）特征（1122个）、灰度大小区域矩阵（GLSZM）特征（816个）、灰度游程长度矩阵（GLRLM）特征（816个）、相邻灰度-色调差分矩阵（NGTDM）特征（255），灰度依赖矩阵（GLDM）特征（714）。上述特征均符合成像生物标志物标准化倡议（IBSI）。对训练集、验证集中提取的特征进行Z-score标准化（均值为0，方差为1）处理。所有图像预处理及特征提取均通过Pyradiomics（版本3.0.1）实现。

1.5 特征选择

采用多步筛选：（1）保留ICC≥0.75的特征；（2）t检验保留P<0.05的特征；（3）Pearson相关系数|r|>0.9的特征对中仅保留一项；（4）最小绝对收缩和选择算子（least absolute shrinkage and selection operator, LASSO）回归十折交叉验证确定最优λ，筛选非零系数特征。

1.6 模型构建

为规避同一患者多病灶间的空间自相关性对模型独立性的影响，本研究采用患者水平的数据处理与评估策略：以患者为单位按7∶3随机划分训练集与验证集，确保同一患者的全部病灶仅出现在同一集合中，从源头上避免数据泄露。

对于多病灶患者，模型先对每个病灶独立输出预测概率（病灶影像组学指数RIL；HR指数hRIL），再通过预设聚合公式（公式1~2）整合为单一患者水平评分（RIS/hRIS）。该评分作为最终分类（GI vs. 非GI）与模型评估的依据，确保模型输入、输出及评价均基于独立患者个体，符合临床决策实际。聚合后的RIS/hRIS均为[0，1]连续型指数，其临床分类阈值独立于病灶划分阈值T，通过验证集最大Youden指数确定。ML-HR模型最佳截断值为0.488，即hRIS≥0.488判读为非GI来源，hRIS<0.488判读为GI来源。本聚合策略的思想框架借鉴自SEJBEN等^[³¹^]的多病灶影像组学（multi-lesion radiomics, MLR）研究。

使用scikit-learn机器学习库、采用14种机器学习算法分别构建三种分类模型。机器学习算法包括逻辑回归（logistic regression, LR）、朴素贝叶斯（naive Bayes, NB）、支持向量机（support vector machine, SVM）、决策树（decision tree, DT）、随机森林（random forest, RF）、极端随机树（extra trees, ExtraTrees）、极限梯度提升（eXtreme Gradient Boosting, XGBoost）、自适应提升（adaptive boosting, AdaBoost）、多层感知机（multilayer perceptron, MLP）、梯度提升机（gradient boosting machine, GBM）、轻量梯度提升机（light gradient boosting machine, LightGBM）。分类模型包括传统单病灶影像组学（single-lesion radiomics, SLR）、MLR、ML-HR，模型构建方法对比见表2。使用五重交叉验证确定每个模型的最佳参数。绘制受试者工作特征（receiver operating characteristic, ROC）曲线，根据曲线下面积（area under the curve, AUC）和DeLong检验明确上述模型的最佳算法。具体流程见图2。

公式中，RIS为患者影像组学指数，hRIS为患者生境影像组学指数，RIL为病灶影像组学指数，hRIL为病灶生境影像组学指数，T为阈值，RIL⁺为阳性队列，RIL⁻为阴性队列，n、p分别为RIL⁻、RIL⁺队列中所有病灶的数量，nh、ph分别为hRIL⁻、hRIL⁺队列中所有病灶的数量，RILi⁺、RILj⁻分别为阳性、阴性病灶影像组学得分，hRILi⁺、hRILj⁻分别为阳性、阴性病灶生境影像组学得分。

点击查看大图

图2 模型流程示意图。步骤1展示了病灶最大层面ROI勾画与生境区域划分。步骤2为VOI及特征提取示意图。步骤3展示了特征选择中LASSO系数路径图+正则化参数调优。步骤4展示了模型评估中的验证集的ROC、校准曲线图和DCA。ROI：感兴趣区；VOI：感兴趣体积；LASSO：最小绝对收缩和选择算子；ROC：受试者工作特征；DCA：决策曲线分析。
Fig. 2 Flowchart of model process. Step 1 presents the delineation of the ROI at the maximum level of the lesion and the division of the habitat area. Step 2 is the schematic diagram of VOI and feature extraction. Step 3 demonstrates the LASSO coefficient path map and regularization parameter tuning in feature selection. Step 4 presents the ROC, calibration curve graph and DCA of the validation set in the model evaluation. ROI: region of interest; VOI: volume of interest; LASSO: least absolute shrinkage and selection operator; ROC: receiver operating characteristic curve; DCA: decision curve analysis.

点击查看表格

表2 三种影像组学模型构建方法对比
Tab. 2 Comparison of Three radiomics Model Construction Methods

1.7 统计学分析

统计分析使用R（版本4.0.2，https：//www.r-project.org）和Python（版本3.8，https：//www.python.org）软件进行。符合正态分布的连续变量表示为平均值±标准差，使用t检验比较组间差异。非正态分布的连续变量用中位数和四分位间距表示，使用Mann-Whitney U检验进行组间比较。分类变量表示为频数和百分比，使用卡方检验进行分类数据的比较。DeLong检验用于比较不同模型之间AUC性能。决策曲线分析（decision curve analysis, DCA）评估各模型的临床实用性。Hosmer-Lemeshow检验用于检验模型的拟合优度。双侧P<0.05为差异具有统计学意义。

2 结果

2.1 患者资料

共纳入111个（241个病灶）病例，男71例，女40例，年龄（63.1±11.7）岁（范围34~87岁）。其中：GI来源52例（胃癌26例、结直肠癌20例、小肠癌6例；共117个病灶），非GI来源59例（肝癌24例、胆囊或胆管癌14例、胰腺癌21例；124个病灶）。

训练集与验证集在年龄、病灶数量、影像学特征及结局指标等方面差异均无统计学意义（均P>0.05），但性别分布差异具有统计学意义（P=0.024，表3），提示性别可能为潜在混杂因素。为进一步降低基线差异影响，本研究补充进行了基于性别的倾向性评分匹配分析：以组别为因变量、性别为协变量建立logistic回归模型计算倾向性评分，采用最近邻1∶1匹配获得匹配后样本。匹配后两组性别分布达到平衡（P>0.05），经匹配后验证，模型性能保持稳定，表明性别因素已排除，研究结论稳健可靠。

点击查看表格

表3 患者临床基线资料
Tab. 3 Patients' clinical baseline data

2.2 肿瘤VOI标注和亚区域划分

一致性分析结果显示，所有最终入选特征的ICC均大于0.75（表4），表明病灶勾画具有良好的观察者间可靠性。对所有肿瘤病灶进行体积勾画后，采用高斯混合模型将每个VOI自动划分为多个生境亚区域。选择5种局部纹理特征作为聚类输入，主要基于以下考虑：（1）在聚类阶段纳入过多特征（如形态或动态增强参数）会显著增加计算负担，影响流程效率与可重复性；（2）所选纹理特征能有效表征肿瘤内部异质性，且已被既往HR研究验证可行^[³⁰^]，故本研究沿用该成熟方案。肿瘤被划分为三个异质性亚区域，验证了该方法在量化瘤内空间异质性方面的有效性（图3）。体素分布散点图进一步显示了肿瘤内部明显的异质结构（图4）。

点击查看大图

图3 肝转移瘤患者的肿瘤全体积标注及亚区域划分示意图。3A：单病灶影像组学（SLR）模型示例（胃肠道来源）；3B：多病灶影像组学（MLR）模型及多病灶生境影像组学（ML-HR）模型示例（非胃肠道来源）；3C：MLR及ML-HR模型示例（胃肠道来源）。基于高斯混合模型的生境划分依据影像组学特征所定义的瘤内异质性进行无监督聚类，将肿瘤划分为三个生境亚区（亚区域1~3）。此过程独立于肝转移瘤的原发灶来源，体现了该方法发现固有生物学特征的普适性。绿色、红色、黄色分别代表亚区域1、2、3。
Fig. 3 Schematic diagram of tumor total volume annotation and subregion division in patients with liver metastases. 3A: An example of a traditional single-lesion radiomics (SLR) model (gastrointestinal source); 3B: Examples of traditional multi-lesion radiomics (MLR) model and multi-lesion habitat radiomics (ML-HR) model (non-gastrointestinal source); 3C: Examples of MLR and ML-HR models (gastrointestinal source). The habitat division based on the Gaussian mixture model conducts unsupervised clustering according to the intratumoral heterogeneity defined by radiomics features, and divides the tumor into three habitat subregions (subregions 1 to 3). This process is independent of the primary source of liver metastases, demonstrating the universality of this method in discovering inherent biological characteristics. Green, red and yellow represent sub-regions 1, 2 and 3 respectively.

点击查看大图

图4 基于五种局部纹理特征构建的三维散点图。不同颜色代表GMM聚类得到的三个生境亚区。图中散点的聚集与分离情况直观反映了各类生境在纹理特征空间内的类内一致性及类间差异性，为亚区域划分的合理性提供可视化佐证。GMM：高斯混合模型。
Fig. 4 A three-dimensional scatter plot constructed based on five local texture features, with different colors representing the three Habitat subregions obtained by GMM clustering. The aggregation and separation of the scattered points in the figure directly reflect the intra-class consistency and inter-class differences of various habitats within the texture feature space, providing visual evidence for the rationality of sub-region division. GMM: Gaussian mixture model.

点击查看表格

表4 筛选后特征的观察者间可重复性分析
Tab. 4 Inter-observer repeatability analysis of the screened features

2.3 影像组学特征提取与筛选

SLR、MLR、ML-HR共提取到4683个特征。经过ICC、t检验、Pearson相关系数以及LASSO特征筛选后分别保留5个单病灶VOI特征、6个多发病灶VOI特征以及7个多发病灶亚区域特征（图5）。

点击查看大图

图5 特征筛选图。5A：单病灶影像组学（SLR）特征；5B：多病灶影像组学（MLR）特征；5C：多病灶生境影像组学（ML-HR）特征。
Fig. 5 Feature screening map. 5A: Single-lesion radiomics (SLR) features; 5B: Multi-lesion radiomics (MLR) features; 5C: Multi-lesion habitat radiomics (ML-HR) features.

2.4 模型建立及评估

通过对每种模型的14种机器学习算法进行全面的指标评估，三种模型最佳算法预测效能指标见表5，SLR模型中的最佳算法是DT，训练集和验证集AUC分别为0.656（95% CI：0.580~0.759）和0.615（95% CI：0.458~0.797）；MLR模型中的最佳算法是径向基核函数支持向量机（radial basis function support vector machine, rbf_SVM），训练集和验证集AUC分别为0.808（95% CI：0.695~0.898）和0.694（95% CI：0.482~0.839）；ML-HR的最佳模型为XGBoost，训练集和验证集AUC分别为0.952（95% CI：0.904~0.988）和0.901（95% CI：0.765~0.997）。三种模型的最佳算法使用DeLong检验比较后，发现ML-HR模型的AUC最高，相较另外两种影像组学模型差异有统计学意义（P<0.05），详见表6。三种模型的ROC曲线、DCA、校正曲线见图6。ROC曲线分析显示，ML-HR模型在训练集和验证集中均表现出最优的区分能力。DCA分析显示，ML-HR模型在全阈值范围内临床净获益最高。训练集中，ML-HR在0.23~0.91阈值区间净获益均优于MLR、SLR及treat all；验证集中优势区间为0.35~0.88。以代表性阈值为例：Pt=0.5，ML-HR在训练集与验证集的净获益分别为0.403和0.353，显著高于MLR（0.236、0.118）、SLR（0.196、0.147）及treat all（0.067、0.059）；Pt=0.6，ML-HR净获益为0.344和0.351，treat all已为负净获益（-0.181、-0.192），较treat none每100名患者净减少34例和35例不必要干预。校准曲线显示，三种模型在训练集和验证集中均表现出良好的拟合优度，其中ML-HR模型的校准斜率及截距最接近理想参考线，表7显示Hosmer-Lemeshow test均>0.05，提示其预测概率与实际发生概率具有良好的一致性。三折交叉验证各折的AUC表现稳定、方差较小，提示各模型拟合度良好。

点击查看大图

图6 模型效能评估。三种模型在训练集（6A）、验证集（6B）的受试者工作特征曲线，训练集和验证集中ML-HR模型都展现出最优的区分能力，其曲线下面积显著高于MLR模型与SLR模型。三种模型在训练集（6C）、验证集（6D）的决策曲线，显示ML-HR模型具有最大净收益。三种模型在训练集（6E）、验证集（6F）的校准曲线，校准曲线接近对角线，表明预测概率与实际风险吻合良好。ML-HR模型在训练集、验证集的概率校准效果优于SLR及MLR模型。SLR：单病灶影像组学；MLR：多病灶影像组学；ML-HR：多病灶生境影像组学。
Fig. 6 Performance evaluation of models. The receiver operating characteristic curves of the three models in the training set (6A) and the validation set (6B) showed that the ML-HR model demonstrated the best discriminatory ability in both the training set and the validation set, and its area under the curve was significantly higher than that of the MLR model and the SLR model. The decision curves of the three models in the training set (6C) and validation set (6D) show that the ML-HR model has the maximum net benefit. The calibration curves of the three models in the training set (6E) and validation set (6F) are close to the diagonal, indicating that the predicted probabilities are in good agreement with the actual risks. The probability calibration effect of the ML-HR model in the training set and validation set is superior to that of the single-lesion (rad) and the traditional multi-lesion radiomics model (rad_multi). SLR: single-lesion radiomics; MLR: multi-lesion radiomics; ML-HR: multi-lesion habitat radiomics.

点击查看表格

表5 三种模型最佳算法预测效能指标
Tab. 5 Prediction Performance indicators of the best algorithms for the three models

点击查看表格

表6 模型间AUC比较的DeLong检验P值矩阵
Tab. 6 P-value matrix of DeLong test for AUC comparison between models

点击查看表格

表7 Hosmer-Lemeshow检验表
Tab. 7 Hosmer-Lemeshow test table

2.5 SHAP可视化分析

我们通过SHAP方法分析了ML-HR模型的决策机制。整体SHAP条形图（图7A）显示了各影像组学特征的重要性排序。特征SHAP依赖图（图7B）中，点的颜色代表样本的类别（红色：非GI来源，蓝色：GI来源），其水平位置（SHAP值）表示特征对预测的贡献。分析发现，sub1_logarithm_firstorder特征值的增加会提升模型预测为非GI来源的概率（SHAP值趋向正方向）；sub1_wavelet_LLH_firstorder特征值的增加则提升模型预测为GI来源的概率（SHAP值趋向负方向）；而sub1_wavelet_LLH_glrlm的影响则更为复杂，呈现非线性关系。

点击查看大图

图7 ML-HR模型SHAP可解释性分析。7A：基于SHAP值的影像组学特征重要性排名。特征按其对模型输出的平均影响力（|SHAP值|）排序。结果显示，sub1_logarithm_firstorder_Energy是对模型区分Class 0（蓝色；胃肠道来源）与Class 1（红色；非胃肠道来源）贡献最大的特征。7B：可视化证实了关键的影像组学特征与模型输出之间存在的关系，蓝色高能量值倾向于将预测推向Class 0（蓝色；胃肠道来源），而红色高能量值则将预测推向Class 1（红色；非胃肠道来源）。
Fig. 7 SHAP interpretability analysis of the ML-HR model. 7A: The ranking of the importance of radiomics features based on SHAP values. Features are sorted by their average influence on the model output (SHAP value). The results show that sub1_logarithm_firstorder_Energy is the feature that contributes the most to the model's distinction between Class 0 (blue; gastrointestinal origin) and Class 1 (red; non-gastrointestinal origin). 7B: Visualization confirmed the relationship between key radiomics features and model outputs: blue high-energy values tend to push predictions towards Class 0 (blue; gastrointestinal origin), while red high-energy values push predictions towards Class 1 (red; non-gastrointestinal origin).

3 讨论

本研究基于双中心MRI动脉晚期图像，构建并验证了一种ML-HR模型，用于无创预测肝HLM的GI与非GI来源。结果显示，ML-HR模型在训练集和验证集中的AUC分别达0.952和0.901，显著优于SLR及MLR模型（P<0.05），表明该模型具有良好的鉴别效能与泛化能力。本研究的主要创新性体现在：首次将HR应用于HLM原发灶来源鉴别，通过量化瘤内异质性提升模型对肿瘤生物学行为的表征能力；首次提出多病灶特征整合策略，采用证据加权投票法将多病灶指数聚合为患者水平评分，有效规避单一病灶取样偏倚；基于双中心数据开展研究，增强了模型的泛化能力与临床适用性。从临床价值而言，该模型既可为原发灶不明患者提供无创溯源辅助，减少不必要筛查（如PET-CT、内镜检查），降低医疗成本并缩短诊断时间；亦可为病史明确患者提供前置、定向的个体化治疗决策支持。例如预测为GI来源时，可优先考虑结直肠癌或胃肠间质瘤的化疗/靶向方案；预测为非GI来源时，则可倾向于神经内分泌肿瘤或肾细胞癌的相应治疗策略。

3.1 ML-HR在鉴别LM原发灶中的优势

ML-HR模型的性能优势源于其对肿瘤异质性的双维度量化：生境划分捕获了瘤内异质性（如坏死、出血区域），多病灶整合策略融合了瘤间异质性信息。传统SLR模型仅反映单一病灶宏观特征，MLR模型虽纳入多病灶但忽略瘤内异质性，均未能全面刻画肿瘤真实生物学行为。与HOU等^[¹⁶^,³²^]基于CT的模型（AUC=0.738~0.926）相比，本研究ML-HR模型性能相当或略优，且基于单期相MRI避免了多期相配准误差，通过SHAP分析提供了更强的可解释性。

3.2 关键特征的生物学意义与可解释性

SHAP分析显著增强了机器学习模型的可解释性^[³³^,³⁴^]。sub1_logarithm_firstorder_Energy是贡献度最高的预测特征，其高值倾向于预测非GI来源，可能反映亚区域1血供丰富、强化均匀的特性，与非GI来源HLM（如神经内分泌肿瘤）的弥漫均匀强化模式相符。sub1_wavelet_LLH_firstorder及sub1_wavelet_LLH_glrlm特征的高值与GI来源预测正相关，提示肿瘤内部强度变异大、纹理粗糙，与GI来源HLM（如结直肠癌转移）常见的中心坏死、不均匀强化模式一致。此外，sub3_wavelet_HHH_glcm等特征表明亚区域3的局部细微均匀性亦具重要贡献。动脉晚期图像能敏感揭示微血管分布差异所致的强化不均，这种模式差异正是区分GI与非GI来源HLM的病理生理基础。本研究通过生境划分捕获这些肉眼难辨的细微差异，结合SHAP分析实现模型决策的可解释化，突破了既往研究的“黑箱”局限^[³⁵^]。

3.3 本研究的局限性与未来展望

第一，这是一项具有内部验证的双中心回顾性研究，样本量较小，研究结果仍需要未来的多中心或外部验证。双中心设备间的固有物理差异可能仍有残留影响，未来研究尽可能采用前瞻性标准化协议。第二，部分肝转移灶未经病理证实，依据影像学及临床随访诊断，可能存在选择偏倚。第三，仅选取动脉晚期单期相，未能全面反映LM血供特征。第四，本研究未整合临床指标（如年龄、肿瘤标志物）；第五，受限于样本量，未采用深度学习方法。未来将开展多中心前瞻性研究，融合多期相特征及临床变量，探索深度学习与HR的结合，进一步验证并优化模型性能。

4 结论

本研究构建的ML-HR模型通过量化瘤内及瘤间异质性，可有效无创预测HLM的GI与非GI来源，性能显著优于传统模型，为临床个体化诊疗提供了可靠的影像学工具。

参考文献

[1]

QIN H, WU Y Q, LIN P, et al. Ultrasound image-based radiomics: an innovative method to identify primary tumorous sources of liver metastases[J]. J Ultrasound Med, 2021, 40(6): 1229-1244. DOI: 10.1002/jum.15506.

[2]

SIEGEL R L, GIAQUINTO A N, JEMAL A. Cancer statistics, 2024[J]. CA Cancer J Clin, 2024, 74(1): 12-49. DOI: 10.3322/caac.21820.

[3]

BHAYANA R, BALIYAN V, KORDBACHEH H, et al. Hepatobiliary phase enhancement of liver metastases on gadoxetic acid MRI: assessment of frequency and patterns[J]. Eur Radiol, 2021, 31(3): 1359-1366. DOI: 10.1007/s00330-020-07228-3.

[4]

WANG F, CHEN G, ZHANG Z, et al. The Chinese Society of Clinical Oncology (CSCO): Clinical guidelines for the diagnosis and treatment of colorectal cancer, 2024 update[J]. Cancer Commun (Lond), 2025, 45(3): 332-379. DOI: 10.1002/cac2.12639.

[5]

MAINO C, VERNUCCIO F, CANNELLA R, et al. Liver metastases: The role of magnetic resonance imaging[J]. World J Gastroenterol, 2023, 29(36): 5180-5197. DOI: 10.3748/wjg.v29.i36.5180.

[6]

NIEKEL M C, BIPAT S, STOKER J. Diagnostic imaging of colorectal liver metastases with ct, mr imaging, fdg pet, and/or fdg pet/ct: a meta-analysis of prospective studies including patients who have not previously undergone treatment[J]. Radiology, 2010, 257(3): 674-684. DOI: 10.1148/radiol.10100729.

[7]

FLORIANI I, TORRI V, RULLI E, et al. Performance of imaging modalities in diagnosis of liver metastases from colorectal cancer: a systematic review and meta-analysis[J]. J Magn Reson Imaging, 2010, 31(1): 19-31. DOI: 10.1002/jmri.22010.

[8]

MATOS A P, ALTUN E, RAMALHO M, et al. An overview of imaging techniques for liver metastases management[J]. Expert Rev Gastroenterol Hepatol, 2015, 9(12): 1561-1576. DOI: 10.1586/17474124.2015.1092873.

[9]

LAMBIN P, RIOS-VELAZQUEZ E, LEIJENAAR R, et al. Radiomics: extracting more information from medical images using advanced feature analysis[J]. Eur J Cancer, 2012, 48(4): 441-446. DOI: 10.1016/j.ejca.2011.11.036.

[10]

KNIEP H C, MADESTA F, SCHNEIDER T, et al. Radiomics of brain MRI: utility in prediction of metastatic tumor type[J]. Radiology, 2019, 290(2): 479-487. DOI: 10.1148/radiol.2018180946.

[11]

LAMBIN P, LEIJENAAR R T H, DEIST T M, et al. Radiomics: the bridge between medical imaging and personalized medicine[J]. Nat Rev Clin Oncol, 2017, 14(12): 749-762. DOI: 10.1038/nrclinonc.2017.141.

[12]

THARMASEELAN H, VELLALA A K, HERTEL A, et al. Tumor classification of gastrointestinal liver metastases using CT-based radiomics and deep learning[J/OL]. Cancer Imaging, 2023, 23(1): 95 [2025-08-04]. https://pubmed.ncbi.nlm.nih.gov/37798797/. DOI: 10.1186/s40644-023-00612-4.

[13]

DU K P, HUANG W P, LIU S Y, et al. Application of computed tomography-based radiomics in differential diagnosis of adenocarcinoma and squamous cell carcinoma at the esophagogastric junction[J]. World J Gastroenterol, 2022, 28(31): 4363-4375. DOI: 10.3748/wjg.v28.i31.4363.

[14]

SUN R, LEROUSSEAU M, BRIEND-DIOP J, et al. Radiomics to evaluate interlesion heterogeneity and to predict lesion response and patient outcomes using a validated signature of CD8 cells in advanced melanoma patients treated with anti-PD1 immunotherapy[J/OL]. J Immunother Cancer, 2022, 10(10): e004867 [2025-08-04]. https://pubmed.ncbi.nlm.nih.gov/36307149/. DOI: 10.1136/jitc-2022-004867.

[15]

BEN-COHEN A, KLANG E, DIAMANT I, et al. CT image-based decision support system for categorization of liver metastases into primary cancer sites: initial results[J]. Acad Radiol, 2017, 24(12): 1501-1509. DOI: 10.1016/j.acra.2017.06.008.

[16]

HOU C S, WANG F, PRINCE M, et al. CT-based liver peritumoural radiomics features predict hepatic metastases sources as gastrointestinal or non-gastrointestinal[J]. Br J Radiol, 2025, 98(1167): 458-468. DOI: 10.1093/bjr/tqae248.

[17]

LATACZ E, HÖPPENER D, BOHLOK A, et al. Histopathological growth patterns of liver metastasis: updated consensus guidelines for pattern scoring, perspectives and recent mechanistic insights[J]. Br J Cancer, 2022, 127(6): 988-1013. DOI: 10.1038/s41416-022-01859-7.

[18]

CHANG E, JOEL M Z, CHANG H Y, et al. Comparison of radiomic feature aggregation methods for patients with multiple tumors[J/OL]. Sci Rep, 2021, 11(1): 9758 [2025-08-12]. https://pubmed.ncbi.nlm.nih.gov/33963236/. DOI: 10.1038/s41598-021-89114-6.

[19]

GEADY C, PATEL H, PEOPLES J, et al. Radiomic-based approaches in the multi-metastatic setting: a quantitative review[J/OL]. BMC Cancer, 2025, 25(1): 538 [2025-08-12]. https://pubmed.ncbi.nlm.nih.gov/40133834/. DOI: 10.1186/s12885-025-13850-5.

[20]

HU Y L, ZHONG L S, LIU H Y, et al. Lung CT-based multi-lesion radiomic model to differentiate between nontuberculous mycobacteria and Mycobacterium tuberculosis[J]. Med Phys, 2025, 52(2): 1086-1095. DOI: 10.1002/mp.17537.

[21]

LU W X, WU G Y, MIAO X Y, et al. The radiomics nomogram predicts the prognosis of pancreatic cancer patients with hepatic metastasis after chemoimmunotherapy[J/OL]. Cancer Immunol Immunother, 2024, 73(5): 87 [2025-08-12]. https://pubmed.ncbi.nlm.nih.gov/38554161/. DOI: 10.1007/s00262-024-03644-2.

[22]

付佳, 唐磊, 李子禹, 等. CT区分SiewertⅡ型食管胃结合部腺癌cT3与cT4a: 国际抗癌联盟/美国癌症联合委员会(UICC/AJCC)第8版分期与国际胃癌协会(IGCA)第4版分期对照研究[J]. 中华胃肠外科杂志, 2018, 21(9): 1013-1018. DOI: 10.3760/cma.j.issn.1671-0274.2018.09.008.

FU J, TANG L, LI Z Y, et al. CT in differentiation of cT3 and cT4a Siewert type esophagogastric junction adenocarcinoma: a comparison study based on UICC/AJCC 8th edition and IGCA 4th edition[J]. Chin J Gastrointest Surg, 2018, 21(9): 1013-1018. DOI: 10.3760/cma.j.issn.1671-0274.2018.09.008.

[23]

CHANG X, GUO X, LI X L, et al. Potential value of radiomics in the identification of stage T3 and T4a esophagogastric junction adenocarcinoma based on contrast-enhanced CT images[J/OL]. Front Oncol, 2021, 11: 627947 [2025-08-12]. https://pubmed.ncbi.nlm.nih.gov/33747947/. DOI: 10.3389/fonc.2021.627947.

[24]

LUO X, PIAO S R, LI H Q, et al. Multi-lesion radiomics model for discrimination of relapsing-remitting multiple sclerosis and neuropsychiatric systemic lupus erythematosus[J]. Eur Radiol, 2022, 32(8): 5700-5710. DOI: 10.1007/s00330-022-08653-2.

[25]

WANG S X, LIU X W, WU Y, et al. Habitat-based radiomics enhances the ability to predict lymphovascular space invasion in cervical cancer: A multi-center study[J/OL]. Front Oncol, 2023, 13: 1252074 [2025-10-04].https://pubmed.ncbi.nlm.nih.gov/37954078/. DOI: 10.3389/fonc.2023.1252074.

[26]

XU S J, YING Y S, HU Q L, et al. Fusion model integrating multi-sequence MRI radiomics and habitat imaging for predicting pathological complete response in breast cancer treated with neoadjuvant therapy[J/OL]. Cancer Imaging, 2025, 25: 108 [2025-10-04]. https://pubmed.ncbi.nlm.nih.gov/40883826/. DOI: 10.1186/s40644-025-00929-2.

[27]

SHI S M, JIANG T, LIU H, et al. Habitat radiomics based on MRI for predicting metachronous liver metastasis in locally advanced rectal cancer: a two-center study[J]. Acad Radiol, 2025, 32(6): 3370-3383. DOI: 10.1016/j.acra.2025.02.046.

[28]

ZHAN G L, ZHAO E D, LIU X H, et al. Interpretable habitat radiomics model based on multi-sequence MRI for risk prediction of metachronous liver metastasis in rectal cancer: a multicenter study[J]. Jpn J Radiol, 2026, 44(2): 346-359. DOI: 10.1007/s11604-025-01889-0.

[29]

HAN Y Q, CHAI F, WEI J W, et al. Identification of predominant histopathological growth patterns of colorectal liver metastasis by multi-habitat and multi-sequence based radiomics analysis[J/OL]. Front Oncol, 2020, 10: 1363 [2025-10-04]. https://pubmed.ncbi.nlm.nih.gov/32923388/. DOI: 10.3389/fonc.2020.01363.

[30]

ZHANG Y B, MA H Y, LEI P, et al. Prediction of early postoperative recurrence of hepatocellular carcinoma by habitat analysis based on different sequence of contrast-enhanced CT[J/OL]. Front Oncol, 2024, 14: 1522501 [2025-10-04]. https://pubmed.ncbi.nlm.nih.gov/39830646/. DOI: 10.3389/fonc.2024.1522501.

[31]

SEJBEN A, ABBASI P, PÓSFAI B, et al. Examination of histopathological growth patterns of liver metastases in a retrospective, consecutive, single-center, cohort study[J/OL]. Pathol Oncol Res, 2025, 31: 1612161 [2025-10-04]. https://pubmed.ncbi.nlm.nih.gov/41245580/. DOI: 10.3389/pore.2025.1612161.

[32]

侯承师, 胡景卉, 黄京城, 等. 增强CT影像组学在鉴别肝转移性腺癌来源的运用[J]. 临床放射学杂志, 2024, 43(1): 73-78. DOI: 10.13437/j.cnki.jcr.2024.01.022.

HOU C S, HU J H, HUANG J C, et al. The application of contrast-enhanced CT radiomics in the differential diagnosis of hepatic metastatic adenocarcinomas source[J]. J Clin Radiol, 2024, 43(1): 73-78. DOI: 10.13437/j.cnki.jcr.2024.01.022.

[33]

GOU W L, LING C W, HE Y, et al. Interpretable machine learning framework reveals robust gut microbiome features associated with type 2 diabetes[J]. Diabetes Care, 2021, 44(2): 358-366. DOI: 10.2337/dc20-1536.

[34]

LIU C L, TAIN Y L, LIN Y C, et al. Prediction and clinically important factors of acute kidney injury non-recovery[J/OL]. Front Med, 2021, 8: 789874 [2025-12-04]. https://pubmed.ncbi.nlm.nih.gov/35111778/. DOI: 10.3389/fmed.2021.789874.

[35]

王云. 影像组学在结直肠癌肝转移鉴别诊断及与微卫星不稳定性相关性研究[D]. 保定: 河北大学, 2021. DOI: 10.27103/d.cnki.ghebu.2021.001540.

WANG Y. Radiomics study on differential diagnosis of colorectal liver metastasis and its correlation with microsatellite instability[D]. Baoding: Hebei University, 2021. DOI: 10.27103/d.cnki.ghebu.2021.001540.

上一篇 基于心脏磁共振探析肥胖患者心外膜脂肪对左心室纤维化的预测价值

下一篇 基于磁共振成像影像组学的肿瘤内异质性评分模型在卵巢肿瘤恶性风险评估中的应用价值