An Error-Driven Word-Character Hybrid Model for Joint Chinese Word Segmentation and POS Tagging
Authors: Canasai Kruengkrai, Kiyotaka Uchimoto, Jun’ichi Kazama, Yiou Wang, Kentaro Torisawa, and Hitoshi Isahara (Kobe University)
Source: Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP of the AFNLP, pages 513–521, Suntec, Singapore, 2-7 August 2009.
Word-character hybrid tagging combined with the MIRA algorithm; a further improvement on the approach developed by Tetsuji Nakagawa over 2004-2007.
Introduction
Joint word segmentation and POS tagging received very wide attention from 2004 to 2009 (Ng and Low, 2004; Nakagawa and Uchimoto, 2007; Zhang and Clark, 2008; Jiang et al., 2008a; Jiang et al., 2008b).
The word-character hybrid tagging model was proposed in 2004, using a word-level Markov