RLHF-Label-Tool-simple is a simplified version of RLHF-Label-Tool, a tool for annotating preference datasets.
Install dependencies
- Python 3.x
- Install the required packages:
pip install -r requirements.txt
Quick start
- The dataset to be annotated is the file input_file.jsonl. Each line is one JSON record, for example:
{"question": "How are you doing?", "response": ["I am good", "I am bad","Terrible","Mind your own business"],"reference":"Normally the answer should be nice."}
{"question": "who are you?", "response": ["LLM", "Apple","Banana",&#