Pipeline Model of Text Interpretation
The steps of text preprocessing
1.Language identification
2.Tokenization
3.Morphological analysis (simplest form: stemming)
4.Sentence splitting
5.Part of speech (POS) tagging
6.Parsing
The tool of language identification
The tools of tokenizers
The two methods of this Morphological Analysis
The tools of stemming
Sentences Splitting
The tools of Sentences Splitting
Part-of-Speech (POS) Tagging
The tools of POS tagging
Parsing