基于brat对数据集.txt文件进行标注构造.ann文件

你与民谣我与欢喜

于 2024-05-30 20:37:43 发布

阅读量441

点赞数 4

文章标签：自然语言处理

本文链接：https://blog.csdn.net/2201_75499442/article/details/139329356

版权

一、brat的安装

参考同伴的csdn创新实训-BRAT的安装-CSDN博客

在官网下载好了brat的安装包，在ubuntu中解压后利用解压包中的install脚本即可实现安装，在设定好自己的用户名密码及邮箱后，导入自己的数据，并导入与小组讨论得出的配置文件，便可实现brat的标注功能。现将配置文件放在此处。

注意：配置文件要和自己的数据在同一层目录下，如下图所示：

命名为：annotation.conf

该标注实体中的具体内涵参考创新实训-BRAT使用-CSDN博客，实体内容由小组讨论得出，并且更加适用于RCT随机对照试验。

[entities]

total-partcipants
intervention-participants
control-participants
age
eligibility
condition  
location
ethinicity 
intervention
control
outcome
outcome-Measure
iv-bin-abs
cv-bin-abs
iv-bin-percent
cv-bin-percent
iv-cont-mean
cv-cont-mean
iv-cont-median
cv-cont-median
iv-cont-sd
cv-cont-sd

此外，我们还需要在该目录的上一层中利用指令：find 文件夹名 -name '*.txt'|sed -e 's|\.txt|.ann|g'|xargs touch实现每个.ann文件的创建，在一切上述工作都完成后，即可进行brat的标注工作。