A Rough List of NLP (Natural Language Processing) Datasets

This list was compiled in a hurry; accuracy is not guaranteed.

dataset

| Index | Dataset | Abbreviation | Task | Note |
|---|---|---|---|---|
| 1 | LibriSpeech | | Automatic speech recognition | |
| 2 | WSJ | | Automatic speech recognition | |
| 3 | Hub5’00 Evaluation | | Automatic speech recognition | |
| 4 | Rich Transcriptions | | Automatic speech recognition | |
| 5 | Fisher | RT03S FSH | Automatic speech recognition | |
| 6 | TED-LIUM | | Automatic speech recognition | |
| 7 | CHiME | CHiME | Automatic speech recognition | noisy speech |
| 8 | TIMIT | | Automatic speech recognition | |
| 9 | CCGBank | | CCG supertagging | |
| 10 | Event2Mind | | Common sense | |
| 11 | Situations with Adversarial Generations | SWAG | Common sense | |
| 12 | Winograd Schema Challenge | | Common sense | |
| 13 | Visual Commonsense Reasoning | VCR | Common sense | |
| 14 | Penn Treebank | | Constituency parsing | |
| 15 | CoNLL 2012 | | Coreference resolution | |
| 16 | Penn Treebank | | Dependency parsing | |
| 17 | Penn Treebank | | Unsupervised dependency parsing | |
| 18 | Switchboard corpus | | Dialogue | Dialogue act classification |
| 19 | Switchboard Dialogue Act Corpus | SwDA | Dialogue | Dialogue act classification |
| 20 | ICSI Meeting Recorder Dialog Act corpus | MRDA | Dialogue | Dialogue act classification |
| 21 | Second dialogue state tracking challenge | DSTC2 | Dialogue | Dialogue state tracking |
| 22 | Wizard-of-Oz | | Dialogue | Dialogue state tracking |
| 23 | Ubuntu Corpus | | Dialogue | Retrieval-based Chatbot |
| 24 | Multi-Domain Sentiment Dataset | | Domain adaptation | Sentiment analysis |
| 25 | AIDA CoNLL-YAGO Dataset | | Entity Linking | |
| 26 | TAC KBP English Entity Linking Comprehensive and Evaluation Data 2010 | | Entity Linking | |
| 27 | CoNLL-2014 Shared Task | | Grammatical Error Correction | |
| 28 | CoNLL-2014 10 Annotations | | Grammatical Error Correction | |
| 29 | JFLEG | | Grammatical Error Correction | |
| 30 | Base | | Information Extraction | Open Knowledge Graph Canonicalization |
| 31 | Ambiguous | | Information Extraction | Open Knowledge Graph Canonicalization |
| 32 | ReVerb45K | | Information Extraction | Open Knowledge Graph Canonicalization |
| 33 | Penn Treebank | | Language modeling | Word Level Models |
| 34 | WikiText-2 | | Language modeling | Word Level Models |
| 35 | WikiText-103 | | Language modeling | Word Level Models |
| 36 | 1B Words / Google Billion Word benchmark | | Language modeling | Word Level Models |
| 37 | Hutter Prize | | Language modeling | Character Level Models |
| 38 | Text8 | | Language modeling | Character Level Models |
| 39 | Penn Treebank | | Language modeling | Character Level Models |
| 40 | LexNorm | | Lexical Normalization | |
| 41 | LexNorm2015 | | Lexical Normalization | |
| 42 | WMT 2014 EN-DE | | Machine translation | |
| 43 | WMT 2014 EN-FR | | Machine translation | |
| 44 | decaNLP | | Multi-task learning | |
| 45 | GLUE | | Multi-task learning | |
| 46 | IEMOCAP | | Multimodal | Multimodal Emotion Recognition |
| 47 | | | Multimodal | Multimodal Metaphor Recognition |
| 48 | MOSI | | Multimodal | Multimodal Sentiment Analysis |
| 49 | CoNLL 2003 (English) | | Named entity recognition | |
| 50 | Long-tail emerging entities | | Named entity recognition | |
| 51 | OntoNotes v5 | | Named entity recognition | |
| 52 | Stanford Natural Language Inference Corpus | SNLI | Natural language inference | |
| 53 | Multi-Genre Natural Language Inference corpus | MultiNLI | Natural language inference | |
| 54 | SciTail | | Natural language inference | |
| 55 | Penn Treebank | | Part-of-speech tagging | |
| 56 | Social media | | Part-of-speech tagging | |
| 57 | Universal Dependencies | | Part-of-speech tagging | |
| 58 | AI2 Reasoning Challenge | ARC | Question answering | |
| 59 | ShARC | ShARC | Question answering | |
| 60 | CLiCR | CLiCR | Question answering | Reading comprehension |
| 61 | CNN/Daily Mail | | Question answering | Reading comprehension |
| 62 | CoQA | | Question answering | Reading comprehension |
| 63 | HotpotQA | | Question answering | Reading comprehension |
| 64 | MS MARCO | | Question answering | Reading comprehension |
| 65 | MultiRC | | Question answering | Reading comprehension |
| 66 | NewsQA | | Question answering | Reading comprehension |
| 67 | QAngaroo | | Question answering | Reading comprehension |
| 68 | QuAC | | Question answering | Reading comprehension |
| 69 | RACE | | Question answering | Reading comprehension |
| 70 | Stanford Question Answering Dataset | SQuAD | Question answering | Reading comprehension |
| 71 | Story Cloze Test | | Question answering | Reading comprehension |
| 72 | RecipeQA | | Question answering | Reading comprehension |
| 73 | NarrativeQA | | Question answering | Reading comprehension |
| 74 | DuoRC | | Question answering | Reading comprehension |
| 75 | DuReader | | Question answering | Open-domain Question Answering |
| 76 | Quasar | | Question answering | Open-domain Question Answering |
| 77 | SearchQA | | Question answering | Open-domain Question Answering |
| 78 | Freebase-15K-237 | FB15K-237 | Relation Prediction | |
| 79 | WordNet-18-RR | WN18RR | Relation Prediction | |
| 80 | New York Times Corpus | | Relationship Extraction | |
| 81 | SemEval-2010 Task 8 | | Relationship Extraction | |
| 82 | TACRED | TACRED | Relationship Extraction | |
| 83 | Few-Shot Relation Classification Dataset | FewRel | Relationship Extraction | |
| 84 | SentEval | | Semantic textual similarity | |
| 85 | Quora Question Pairs | | Semantic textual similarity | Paraphrase identification |
| 86 | LDC2014T12 | | Semantic parsing | AMR parsing |
| 87 | LDC2015E86 | | Semantic parsing | AMR parsing |
| 88 | LDC2016E25 | | Semantic parsing | AMR parsing |
| 89 | ATIS | | Semantic parsing | SQL parsing |
| 90 | Advising | | Semantic parsing | SQL parsing |
| 91 | GeoQuery | | Semantic parsing | SQL parsing |
| 92 | Scholar | | Semantic parsing | SQL parsing |
| 93 | Spider | | Semantic parsing | SQL parsing |
| 94 | WikiSQL | | Semantic parsing | SQL parsing |
| 95 | Smaller Datasets | | Semantic parsing | SQL parsing |
| 96 | OntoNotes | | Semantic role labeling | |
| 97 | IMDb | | Sentiment analysis | |
| 98 | Stanford Sentiment Treebank | SST | Sentiment analysis | |
| 99 | Yelp Review dataset | Yelp | Sentiment analysis | |
| 100 | SemEval | | Sentiment analysis | |
| 101 | Sentihood | | Sentiment analysis | Aspect-based sentiment analysis |
| 102 | SemEval-2014 Task 4 | | Sentiment analysis | Aspect-based sentiment analysis |
| 103 | Subjectivity dataset | SUBJ | Sentiment analysis | Subjectivity analysis |
| 104 | Penn Treebank | | Shallow syntax | Chunking |
| 105 | Main-Simple English Wikipedia | | Simplification | Sentence Simplification |
| 106 | PWKP/WikiSmall | | Simplification | Sentence Simplification |
| 107 | Coster and Kauchak | | Simplification | Sentence Simplification |
| 108 | Turk Corpus | | Simplification | Sentence Simplification |
| 108 | Newsela | | Simplification | Sentence Simplification |
| 109 | RumourEval | | Stance detection | |
| 110 | CNN/Daily Mail | | Summarization | |
| 110 | Gigaword | | Summarization | |
| 111 | DUC 2004 Task 1 | | Summarization | |
| 112 | Webis-TLDR-17 Corpus | | Summarization | |
| 113 | Google Dataset | | Summarization | Sentence Compression |
| 114 | SemEval 2018 | | Taxonomy Learning | Hypernym Discovery |
| 115 | APW | | Temporal Processing | Document Dating (Time-stamping) |
| 116 | NYT | | Temporal Processing | Document Dating (Time-stamping) |
| 117 | TimeBank | | Temporal Processing | Temporal Information Extraction |
| 118 | TempEval-3 | | Temporal Processing | Temporal Information Extraction |
| 119 | TimeBank | | Temporal Processing | Timex normalisation |
| 120 | PNT | | Temporal Processing | Timex normalisation |
| 121 | AG News corpus | | Text classification | |
| 122 | DBpedia | | Text classification | |
| 123 | TREC | | Text classification | |
| 124 | Fine-grained WSD | | Word Sense Disambiguation | |
| 125 | AIDA CoNLL-YAGO Dataset | | Entity linking | |
| 126 | Chinese Treebank 6 | | Chinese Word Segmentation | |
| 127 | Chinese Treebank 7 | | Chinese Word Segmentation | |
| 128 | AS | | Chinese Word Segmentation | |
| 129 | CityU | | Chinese Word Segmentation | |
| 130 | PKU | | Chinese Word Segmentation | |
| 131 | MSR | | Chinese Word Segmentation | |