Hidden Markov Models
Rabiner, L. A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition. (Proceedings of the IEEE 1989)
Freitag and McCallum, 2000, Information Extraction with HMM Structures Learned by Stochastic Optimization, (AAAI'00)
Maximum Entropy
Adwait R. A Maximum Entropy Model for POS tagging, (1994)
A. Berger, S. Della Pietra, and V. Della Pietra. A maximum entropy approach to natural language processing. (CL'1996)
A. Ratnaparkhi. Maximum Entropy Models for Natural Language Ambiguity Resolution. PhD thesis, University of Pennsylvania, 1998.
Hai Leong Chieu, 2002. A Maximum Entropy Approach to Information Extraction from Semi-Structured and Free Text, (AAAI'02)
MEMM
McCallum et al., 2000, Maximum Entropy Markov Models for Information Extraction and Segmentation, (ICML'00)
Punyakanok & Roth, 2001, The Use of Classifiers in Sequential Inference. (NIPS'01)
Perceptron
McCallum, 2002 Discriminative Training Methods for Hidden Markov Models: Theory and Experiments with Perceptron Algorithms (EMNLP'02)
Y. Li, K. Bontcheva, and H. Cunningham. Using Uneven-Margins SVM and Perceptron for Information Extraction. (CoNLL'05)
SVM
Z. Zhang. Weakly-Supervised Relation Classification for Information E