python爬取电子病历_Med7:临床电子病历可迁移自然语言处理模型

Med7是一个可移植的临床自然语言处理模型,用于电子健康记录的命名实体识别任务,训练自MIMIC-III数据集,识别7类信息。模型包括tagger、parser和临床NER组件。通过预训练,模型在多种配置下表现良好,最高达到0.957的F1分数。安装推荐使用虚拟环境和spaCy 2.2.3。模型可用于识别药物、剂量、频率等实体,并能直观展示识别结果。
摘要由CSDN通过智能技术生成

Med7

This repository dedicated to the first release of Med7: a transferable clinical natural language processing model for electronic health records, compatible with spaCy, for clinical named-entity recognition (NER) tasks. The en_core_med7_lg model is trained on MIMIC-III free-text electronic health records and is able to recognise 7 categories:

Screenshot%202020-02-26%20at%2018.18.54.png

The trained model comprises three components in its pipeline:

tagger

parser

clinical NER with seven categories.

Self-supervised pre-training has shown its efficiency in achieving good results even with a small number of gold-annotated training data. We have experimented with the spacy pretrain approach and trained a number of weights for model initialisation for various parameters of the width a

评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值