2021
- Quality Estimation without Human-labeled Data. EACL 2021: 619-625
2020
- Unsupervised Quality Estimation for Neural Machine Translation. Trans. Assoc. Comput. Linguistics 8: 539-555 (2020)
- Multimodal Quality Estimation for Machine Translation. ACL 2020: 1233-1240
- TransQuest: Translation Quality Estimation with Cross-lingual Transformers. COLING 2020: 5070-5081
- Intermediate Self-supervised Learning for Machine Translation Quality Estimation. COLING 2020: 4355-4360
- Revisiting Round-trip Translation for Quality Estimation. EAMT 2020: 91-104
- QE Viewer: an Open-Source Tool for Visualization of Machine Translation Quality Estimation Results. EAMT 2020: 453-454
- Efficient Transfer Learning for Quality Estimation with Bottleneck Adapter Layer. EAMT 2020: 29-34
- Computer Assisted Translation with Neural Quality Estimation and Auotmatic Post-Editing. EMNLP (Findings) 2020: 2175-2186
- Translation Quality Estimation by Jointly Learning to Score and Rank. EMNLP (1) 2020: 2592-2598
- An Exploratory Study on Multilingual Quality Estimation. AACL/IJCNLP 2020: 366-377
- Sentence Level Human Translation Quality Estimation with Attention-based Neural Networks. LREC 2020: 1858-1865
- Ensemble Distilling Pretrained Language Models for Machine Translation Quality Estimation. NLPCC (2) 2020: 231-243
Sentence Level Human Translation Quality Estimation with Attention-based Neural Networks. arxiv. 2020
On the Limitations of Cross-lingual Encoders as Exposed by Reference-Free Machine Translation Evaluation. arxiv. 2020
Quality Estimation without Human-labeled Data. arxiv. 2021
On the Language Neutrality of Pre-trained Multilingual Representations. arxiv. 2020
Evaluating Multilingual Text Encoders for Unsupervised Cross-Lingual Retrieval. arxiv. 2021
CLIPScore: A Reference-free Evaluation Metric for Image Captioning. arxiv. 2021
Assessing Reference-Free Peer Evaluation for Machine Translation. arxiv. 2021
KoBE: Knowledge-Based Machine Translation Evaluation. arxiv. 2020
Experts, Errors, and Context: A Large-Scale Study of Human Evaluation for Machine Translation. arxiv. 2021
Incorporate Semantic Structures into Machine Translation Evaluation via UCCA. arxiv. 2020
BLEU Neighbors: A Reference-less Approach to Automatic Evaluation. arxiv. 2020
Assessing Reference-Free Peer Evaluation for Machine Translation. arxiv. 2021
Automatic Machine Translation Evaluation in Many Languages via Zero-Shot Paraphrasing. arxiv. 2020
BLEU might be Guilty but References are not Innocent. arxiv. 2020
Human-Paraphrased References Improve Neural Machine Translation. arxiv. 2020
Unsupervised Reference-Free Summary Quality Evaluation via Contrastive Learning. arxiv. 2020
SumQE: a BERT-based Summary Quality Estimation Model
ParBLEU: Augmenting Metrics with Automatic Paraphrases for the WMT’20 Metrics Shared Task
Fact-based Content Weighting for Evaluating Abstractive Summarisation
Intrinsic Evaluation of Summarization Datasets
Identifying Elements Essential for BERT’s Multilinguality
COMET: A Neural Framework for MT Evaluation
Verdi: Quality Estimation and Error Detection for Bilingual Corpora