"New Papers of the Week" series, 2020 Week 13: Natural Language Processing
Highlights this week:
- Google: [38], [40]
- Microsoft: [13]
- Facebook: [2]
- Other: [1]
March 27, 2020
[1]. TLDR: Token Loss Dynamic Reweighting for Reducing Repetitive Utterance Generation
Link | https://arxiv.org/abs/2003.11963
Authors | Shaojie Jiang, Thomas Wolf, Christof Monz, Maarten de Rijke
Affiliations | University of Amsterdam; Hugging Face
[2]. Rat big, cat eaten! Ideas for a useful deep-agent protolanguage
Link | https://arxiv.org/abs/2003.11922
Authors | Marco Baroni
Affiliations | Facebook AI Research
[3]. Common-Knowledge Concept Recognition for SEVA
Link | https://arxiv.org/abs/2003.11687
Authors | Jitin Krishnan, Patrick Coronado, Hemant Purohit, Huzefa Rangwala
[4]. Word2Vec: Optimal Hyper-Parameters and Their Impact on NLP Downstream Tasks
Link | https://arxiv.org/abs/2003.11645
Authors | Tosin P. Adewumi, Foteini Liwicki, Marcus Liwicki
[5]. Multi-Label Text Classification using Attention-based Graph Neural Network
Link | https://arxiv.org/abs/2003.11644
Authors | Ankit Pal, Muru Selvakumar, Malaikannan Sankarasubbu
[6]. Sentiment Analysis in Drug Reviews using Supervised Machine Learning Algorithms
Link | https://arxiv.org/abs/2003.11643
Authors | Sairamvinay Vijayaraghavan, Debraj Basu
[7]. Author2Vec: A Framework for Generating User Embedding
Link | https://arxiv.org/abs/2003.11627
Authors | Xiaodong Wu, Weizhe Lin, Zhilin Wang, Elena Rastorgueva
Affiliations | University of Cambridge
[8]. Predicting Unplanned Readmissions with Highly Unstructured Data
Link | https://arxiv.org/abs/2003.11622
Authors | Constanza Fierro, Jorge Pérez, Javier Mora
[9]. Cost-Sensitive BERT for Generalisable Sentence Classification with Imbalanced Data
Link | https://arxiv.org/abs/2003.11563
Authors | Harish Tayyar Madabushi, Elena Kochkina, Michael Castelle
Affiliations | University of Birmingham; Alan Turing Institute
Notes | NLP4IF 2019
[10]. Finnish Language Modeling with Deep Transformer Models
Link | https://arxiv.org/abs/2003.11562
Authors | Abhilash Jain
[11]. Predicting Legal Proceedings Status: an Approach Based on Sequential Text Data
Link | https://arxiv.org/abs/2003.11561
Authors | Felipe Maia Polo, Itamar Ciochetti, Emerson Bertolo
[12]. Forensic Authorship Analysis of Microblogging Texts Using N-Grams and Stylometric Features
Link | https://arxiv.org/abs/2003.11545
Authors | Nicole Mariah Sharon Belvisi, Naveed Muhammad, Fernando Alonso-Fernandez
Notes | Accepted for publication at the 8th International Workshop on Biometrics and Forensics (IWBF 2020)
[13]. VIOLIN: A Large-Scale Dataset for Video-and-Language Inference
Link | https://arxiv.org/abs/2003.11618
Authors | Jingzhou Liu, Wenhu Chen, Yu Cheng, Zhe Gan, Licheng Yu, Yiming Yang, Jingjing Liu
Affiliations | Carnegie Mellon University; University of California, Santa Barbara; Microsoft
Notes | Accepted to CVPR 2020
[14]. Heavy-tailed Representations, Text Polarity Classification & Data Augmentation
Link | https://arxiv.org/abs/2003.11593
Authors | Hamid Jalalzai, Pierre Colombo, Chloé Clavel, Eric Gaussier, Giovanna Varni, Emmanuel Vignon, Anne Sabourin
March 26, 2020
[15]. The Medical Scribe: Corpus Development and Model Performance Analyses
Link |