指标

可用于模型训练的指标定义

核心指标

本节定义了将 scikit-learn 指标转换为 fastai 指标的函数。除非您想了解 fastai 的所有内部细节，否则可以跳过本节。

AccumMetric

 AccumMetric (func, dim_argmax=None, activation='no', thresh=None,
              to_np=False, invert_arg=False, flatten=True, name=None,
              **kwargs)

在 CPU 上累积存储预测和目标，以便使用 func 执行最终计算。

仅在请求 value 属性时（例如在验证/训练阶段结束时，与 Learner 及其 Recorder 配合使用）才将 func 应用于累积的预测/目标。func 的签名应为 inp,targ（其中 inp 是模型的预测，targ 是相应的标签）。

对于单标签分类问题，预测需要先经过 softmax 然后 argmax 转换，再与目标进行比较。由于 softmax 不改变数字的顺序，我们可以只应用 argmax。传递 dim_argmax 参数即可让 AccumMetric 执行此操作（通常 -1 效果很好）。如果您需要将概率而不是预测传递给指标，请使用 softmax=True。

对于多标签分类问题，或者如果您的目标是 one-hot 编码，预测可能需要先经过 sigmoid（如果模型中未包含）然后与给定阈值进行比较（用于决定 0 和 1），如果传递 sigmoid=True 和/或 thresh 的值，AccumMetric 会完成此操作。

如果您想使用 scikit-learn.metrics 的指标函数，您需要使用 to_np=True 将预测和标签转换为 numpy 数组。此外，scikit-learn 指标采用 y_true，y_preds 的约定，这与我们相反，因此您需要传递 invert_arg=True 来让 AccumMetric 为您进行反转。

#For testing: a fake learner and a metric that isn't an average
@delegates()
class TstLearner(Learner):
    def __init__(self,dls=None,model=None,**kwargs): self.pred,self.xb,self.yb = None,None,None

def _l2_mean(x,y): return torch.sqrt((x.float()-y.float()).pow(2).mean())

#Go through a fake cycle with various batch sizes and computes the value of met
def compute_val(met, x1, x2):
    met.reset()
    vals = [0,6,15,20]
    learn = TstLearner()
    for i in range(3):
        learn.pred,learn.yb = x1[vals[i]:vals[i+1]],(x2[vals[i]:vals[i+1]],)
        met.accumulate(learn)
    return met.value

x1,x2 = torch.randn(20,5),torch.randn(20,5)

tst = AccumMetric(_l2_mean)
test_close(compute_val(tst, x1, x2), _l2_mean(x1, x2))
test_eq(torch.cat(tst.preds), x1.view(-1))
test_eq(torch.cat(tst.targs), x2.view(-1))

#test argmax
x1,x2 = torch.randn(20,5),torch.randint(0, 5, (20,))
tst = AccumMetric(_l2_mean, dim_argmax=-1)
test_close(compute_val(tst, x1, x2), _l2_mean(x1.argmax(dim=-1), x2))

#test thresh
x1,x2 = torch.randn(20,5),torch.randint(0, 2, (20,5)).bool()
tst = AccumMetric(_l2_mean, thresh=0.5)
test_close(compute_val(tst, x1, x2), _l2_mean((x1 >= 0.5), x2))

#test sigmoid
x1,x2 = torch.randn(20,5),torch.randn(20,5)
tst = AccumMetric(_l2_mean, activation=ActivationType.Sigmoid)
test_close(compute_val(tst, x1, x2), _l2_mean(torch.sigmoid(x1), x2))

#test to_np
x1,x2 = torch.randn(20,5),torch.randn(20,5)
tst = AccumMetric(lambda x,y: isinstance(x, np.ndarray) and isinstance(y, np.ndarray), to_np=True)
assert compute_val(tst, x1, x2)

#test invert_arg
x1,x2 = torch.randn(20,5),torch.randn(20,5)
tst = AccumMetric(lambda x,y: torch.sqrt(x.pow(2).mean()))
test_close(compute_val(tst, x1, x2), torch.sqrt(x1.pow(2).mean()))
tst = AccumMetric(lambda x,y: torch.sqrt(x.pow(2).mean()), invert_arg=True)
test_close(compute_val(tst, x1, x2), torch.sqrt(x2.pow(2).mean()))

轮次	训练损失	验证损失	l1	l2	时间
0	15.296746	12.515826	3.019884	9.495943	00:00
1	13.290909	8.719325	2.454751	6.264574	00:00

核心指标

AccumMetric

skm_to_fastai

optim_metric

单标签分类

准确率

错误率

Top-k 准确率

二分类平均精度得分

平衡准确率

Brier 得分

Cohen Kappa 系数

F1 得分

FBeta 得分

汉明损失

Jaccard 得分

精确率

召回率

ROC AUC

二分类 ROC AUC

Matthews 相关系数

多标签分类

多标签准确率

多标签平均精度得分

多标签 Brier 得分

多标签 F1 得分

多标签 FBeta 得分

多标签汉明损失

多标签 Jaccard 得分

多标签 Matthews 相关系数

多标签精确率

多标签召回率

多标签 ROC AUC

回归

均方误差

均方根误差

平均绝对误差

均方对数误差

指数均方根百分比误差

解释方差

R平方得分

Pearson 相关系数

Spearman 相关系数

分割

前景准确率

Dice 系数

多类别 Dice 系数

Jaccard 系数

多类别 Jaccard 系数

NLP

语料库 BLEU 指标

困惑度

损失指标

损失指标集