FastEval 开源项目教程

史锋燃Gardner

于 2024-08-31 07:21:57 发布

阅读量727

点赞数 24

本文链接：https://blog.csdn.net/gitblog_00229/article/details/141733374

版权

FastEval 开源项目教程

FastEvalFast & more realistic evaluation of chat language models. Includes leaderboard.项目地址:https://gitcode.com/gh_mirrors/fas/FastEval

1、项目的目录结构及介绍

FastEval 项目的目录结构如下：

FastEval/
├── README.md
├── requirements.txt
├── setup.py
├── fasteval/
│   ├── __init__.py
│   ├── main.py
│   ├── config.py
│   ├── benchmarks/
│   │   ├── __init__.py
│   │   ├── mt_bench.py
│   │   ├── human_eval_plus.py
│   │   ├── ds1000.py
│   │   ├── cot/
│   │   │   ├── __init__.py
│   │   │   ├── gsm8k.py
│   │   │   ├── math.py
│   │   │   ├── bbh.py
│   │   │   ├── mmlu.py
│   │   ├── custom_test_data/
│   │   │   ├── __init__.py
│   │   │   ├── custom_data.py
│   ├── utils/
│   │   ├── __init__.py
│   │   ├── prompt_templates.py
│   │   ├── model_utils.py
├── tests/
│   ├── __init__.py
│   ├── test_main.py
│   ├── test_config.py
│   ├── test_benchmarks.py
│   ├── test_utils.py

目录结构介绍

README.md: 项目说明文档。
requirements.txt: 项目依赖文件。
setup.py: 项目安装脚本。
fasteval/: 项目主目录。
- __init__.py: 初始化文件。
- main.py: 项目启动文件。
- config.py: 项目配置文件。
- benchmarks/: 基准测试目录。
  - __init__.py: 初始化文件。
  - mt_bench.py: MT-Bench 基准测试。
  - human_eval_plus.py: HumanEval+ 基准测试。
  - ds1000.py: DS-1000 基准测试。
  - cot/: Chain of Thought 基准测试目录。
    - __init__.py: 初始化文件。
    - gsm8k.py: GSM8K 基准测试。
    - math.py: Math 基准测试。
    - bbh.py: BBH 基准测试。
    - mmlu.py: MMLU 基准测试。
  - custom_test_data/: 自定义测试数据目录。
    - __init__.py: 初始化文件。
    - custom_data.py: 自定义数据测试。
- utils/: 工具目录。
  - __init__.py: 初始化文件。
  - prompt_templates.py: 提示模板工具。
  - model_utils.py: 模型工具。
tests/: 测试目录。
- __init__.py: 初始化文件。
- test_main.py: 主程序测试。
- test_config.py: 配置文件测试。
- test_benchmarks.py: 基准测试测试。
- test_utils.py: 工具测试。