深入解析决策曲线指标（DCA）源码实现与应用

2025-03-23 16:05:02 股市动态 facai888

9|0条评论

在医疗决策分析中，决策曲线指标（Decision Curve Analysis, DCA）是一种评估预测模型性能的重要工具，DCA通过比较不同阈值下的风险和收益，帮助决策者理解模型在实际应用中的价值，本文将详细介绍DCA的概念、源码实现以及如何应用这一指标来评估预测模型。

决策曲线指标（DCA）简介

决策曲线指标是一种图形化工具，用于评估预测模型的临床有效性，与传统的性能评估指标（如AUC、敏感性、特异性）不同，DCA关注的是模型在不同决策阈值下对患者管理的影响，通过比较模型预测结果与实际结果，DCA可以展示在不同阈值下采取行动（例如治疗或不治疗）的净效益。

DCA源码实现

为了更好地理解DCA，我们可以通过编写源码来实现这一指标，以下是一个基于Python的简单DCA实现示例：

Python

import numpy as np
import matplotlib.pyplot as plt
def decision_curve(y_true, y_pred, thresholds):
    # y_true: 真实的二分类结果
    # y_pred: 模型预测的概率
    # thresholds: 决策阈值数组
    tprs = []
    fps = []
    tns = []
    fns = []
    for threshold in thresholds:
        tp = np.sum((y_true == 1) & (y_pred >= threshold))
        fp = np.sum((y_true == 0) & (y_pred >= threshold))
        tn = np.sum((y_true == 0) & (y_pred < threshold))
        fn = np.sum((y_true == 1) & (y_pred < threshold))
        tprs.append(tp / (tp + fn))
        fps.append(fp / (fp + tn))
        tns.append(tn)
        fns.append(fn)
    
    net_benefit = [tprs[i] - fps[i] for i in range(len(tprs))]
    return net_benefit, thresholds
示例数据
y_true = np.array([1, 1, 0, 1, 0, 1, 0, 0, 1, 0])
y_pred = np.array([0.9, 0.8, 0.7, 0.6, 0.5, 0.4, 0.3, 0.2, 0.1, 0.05])
计算决策曲线
thresholds = np.linspace(0, 1, 100)
net_benefit, thresholds = decision_curve(y_true, y_pred, thresholds)
绘制决策曲线
plt.plot(thresholds, net_benefit)
plt.xlabel('Threshold')
plt.ylabel('Net Benefit')
plt.title('Decision Curve')
plt.show()