BIG-bench Open-Source Project Tutorial

Latest recommended article published on 2024-08-21 09:10:06

秋或依

Article link: https://blog.csdn.net/gitblog_00035/article/details/141049824

BIG-bench project repository: https://gitcode.com/gh_mirrors/bi/BIG-bench

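Project Overview

BIG-bench (Beyond the Imitation Game Benchmark) is a collaborative benchmark intended to probe large language models and extrapolate their future capabilities. The project contains more than 200 tasks, which the repository indexes both by keyword and by task name. According to the project, the BIG-bench paper is under review and is available as a preprint. The organizers can be reached by email at bigbench@googlegroups.com.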

Quick Start

Installation

First, clone the repository and set up the environment:

git clone https://github.com/google/BIG-bench.git
cd BIG-bench
python setup.py sdist
pip install -e .
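
To confirm that the editable install succeeded, a quick check along the following lines may help. This is only a minimal sketch: the helper name `is_importable` is hypothetical, and it assumes the packages are importable under the top-level names `bigbench` and `seqio`, as used in the import statements of this tutorial.

```python
import importlib.util


def is_importable(module_name: str) -> bool:
    """Return True if Python can locate the top-level module `module_name`."""
    return importlib.util.find_spec(module_name) is not None


# `bigbench` and `seqio` are the top-level names imported in this tutorial.
for name in ("bigbench", "seqio"):
    status = "found" if is_importable(name) else "missing"
    print(f"{name}: {status}")
```

If either name is reported as missing, rerunning the install step inside the active virtual environment is a reasonable first thing to try.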

Loading and Evaluating BIG-bench JSON Tasks with SeqIO

The following short example shows how to load and inspect BIG-bench JSON tasks with SeqIO:

import seqio
# Importing this module registers the BIG-bench tasks with SeqIO.
from bigbench.bbseqio import tasks

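# Load the mixture of all JSON tasks and print the first few subtask names.
# Note: registered names may carry extra suffixes (for example vocabulary or
# shot count); if a lookup fails, check the names registered by bigbench.bbseqio.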
bb_mix = seqio.get_mixture_or_task("bigbench:all_json")
all_subtasks = [t.name for t in bb_mix.tasks]
print("\n".join(all_subtasks[:7]) + "\n")

# Inspect the first example of a single task
task = seqio.get_mixture_or_task("bigbench:simple_arithmetic_json")
ds = task.get_dataset(split="all", sequence_length={"inputs": 32, "targets": 32})
print(next(iter(ds)))
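
The dataset printed above comes from a JSON task definition. To illustrate roughly what such a task holds and how an exact-match score can be computed, here is a minimal pure-Python sketch. The toy task dictionary, the `exact_match_score` helper, and the `toy_model` function are all hypothetical and are not part of the BIG-bench or SeqIO APIs. The field names (`name`, `description`, `keywords`, `metrics`, `examples`, `input`, `target`) are assumed to mirror the layout of a BIG-bench `task.json` file.

```python
from typing import Callable, Dict, List

# A toy task in the assumed task.json layout (illustrative data, not from the repository).
toy_task: Dict = {
    "name": "toy_addition",
    "description": "Add two small integers.",
    "keywords": ["arithmetic"],
    "metrics": ["exact_str_match"],
    "examples": [
        {"input": "1 + 2 =", "target": "3"},
        {"input": "4 + 5 =", "target": "9"},
        {"input": "7 + 8 =", "target": "15"},
    ],
}


def exact_match_score(examples: List[Dict[str, str]], model: Callable[[str], str]) -> float:
    """Fraction of examples whose model output equals the target (whitespace stripped)."""
    if not examples:
        return 0.0
    hits = sum(model(ex["input"]).strip() == ex["target"].strip() for ex in examples)
    return hits / len(examples)


def toy_model(prompt: str) -> str:
    """Hypothetical stand-in for a language model: parses 'a + b =' and adds."""
    left, right = prompt.rstrip("= ").split("+")
    return str(int(left) + int(right))


score = exact_match_score(toy_task["examples"], toy_model)
print(f"{toy_task['name']}: exact match = {score:.2f}")
```

In an actual evaluation, the stand-in model would be replaced by calls to a real language model, and scoring would go through the library's own metrics rather than this helper.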