【bug】KeyError: “The metric_for_best_model training argument is set to ‘eval_loss’, which is not foun

LittleSeedling

已于 2025-03-26 10:46:41 修改

阅读量1.5k

点赞数 28

分类专栏： BUG 文章标签： bug

于 2024-11-20 09:43:51 首次发布

本文链接：https://blog.csdn.net/LittleSeedling/article/details/143902509

版权

BUG 专栏收录该内容

8 篇文章

订阅专栏

文章目录

问题
解决
分析

问题

KeyError: “The metric_for_best_model training argument is set to ‘eval_loss’, which is not found in the evaluation metrics. The available evaluation metrics are: [‘eval_runtime’, ‘eval_samples_per_second’, ‘eval_steps_per_second’, ‘epoch’]. Consider changing the metric_for_best_model via the TrainingArguments.”

在Transformer的Trainer中添加的compute_metrics没起作用。

参考：
***eval_loss not found when training a peft model using trainer.py #33420
It just doesn’t calculate the metrics #782

当前transformers版本为4.46.0。

def compute_metrics(eval_preds, **kwargs):
    preds, labels, *inputs_losses = eval_preds
    
    # preprocess labels
    # take label which not equal to -100
    preds = preds[labels != -100]
    labels = labels[labels != -100]
    labels = np.where(labels > 0.5, 1, 0)

    accuracy = accuracy_score(labels, preds)
    f1 = f1_score(labels, preds)
    auc = roc_auc_score(labels, preds)
    precision = precision_score(labels, preds)
    recall = recall_score(labels, preds)
    
    return dict(
        accuracy=accuracy,
        f1=f1,
        auc=auc,
        precision=precision,
        recall=recall
    )

trainer = Trainer(
    model=model,
    processing_class=tokenizer,
    args=training_args,
    train_dataset=train_dataset,
    eval_dataset=valid_dataset,
    data_collator=DataCollatorWithPadding(tokenizer),
    preprocess_logits_for_metrics=preprocess_logits_for_metrics,
    compute_metrics=compute_metrics,
    compute_loss_func=compute_loss_func,
)

尝试过在training_args中添加include_for_metrics = ['loss']，但是不起作用。compute_metrics中的内容没有返回。

解决

本质上由于使用的是PeftModel导致self.label_names为[]。
所以，在trainer.py的732行附近，修改为如下代码。
原代码：

default_label_names = find_labels(self.model.__class__)
self.label_names = default_label_names if self.args.label_names is None else self.args.label_names
self.can_return_loss = can_return_loss(self.model.__class__)

修改为如下：

if _is_peft_model(self.model):
    if hasattr(self.model, "get_base_model"):
        model_to_inspect = self.model.get_base_model()
        default_label_names = find_labels(model_to_inspect.__class__)
        self.can_return_loss = can_return_loss(model_to_inspect.__class__)
else:
    default_label_names = find_labels(self.model.__class__)
    self.can_return_loss = can_return_loss(self.model.__class__)
self.label_names = default_label_names if self.args.label_names is None else self.args.label_names