Table of Contents
Preface
1. Language Processing and Python
1.1 Computing with Language: Texts and Words
1.2 A Closer Look at Python: Texts as Lists of Words
1.3 Computing with Language: Simple Statistics
1.4 Back to Python: Making Decisions and Taking Control
1.5 Automatic Natural Language Understanding
1.6 Summary
1.7 Further Reading
1.8 Exercises
2. Accessing Text Corpora and Lexical Resources
2.1 Accessing Text Corpora
2.2 Conditional Frequency Distributions
2.3 More Python: Reusing Code
2.4 Lexical Resources
2.5 WordNet
2.6 Summary
2.7 Further Reading
2.8 Exercises
3. Processing Raw Text
3.1 Accessing Text from the Web and from Disk
3.2 Strings: Text Processing at the Lowest Level
3.3 Text Processing with Unicode
3.4 Regular Expressions for Detecting Word Patterns
3.5 Useful Applications of Regular Expressions
3.6 Normalizing Text
3.7 Regular Expressions for Tokenizing Text
3.8 Segmentation
3.9 Formatting: From Lists to Strings
3.10 Summary
3.11 Further Reading
3.12 Exercises
4. Writing Structured Programs
4.1 Back to the Basics
4.2 Sequences
4.3 Questions of Style
4.4 Functions: The Foundation of Structured Programming
4.5 Doing More with Functions
4.6 Program Development
4.7 Algorithm Design
4.8 A Sample of Python Libraries
4.9 Summary
4.10 Further Reading
4.11 Exercises
5. Categorizing and Tagging Words
5.1 Using a Tagger
5.2 Tagged Corpora
5.3 Mapping Words to Properties Using Python Dictionaries
5.4 Automatic Tagging
5.5 N-Gram Tagging
5.6 Transformation-Based Tagging
5.7 How to Determine the Category of a Word
5.8 Summary
5.9 Further Reading
5.10 Exercises
6. Learning to Classify Text
6.1 Supervised Classification
6.2 Further Examples of Supervised Classification
6.3 Evaluation
6.4 Decision Trees
6.5 Naive Bayes Classifiers
6.6 Maximum Entropy Classifiers
6.7 Modeling Linguistic Patterns
6.8 Summary
6.9 Further Reading
6.10 Exercises
7. Extracting Information from Text
7.1 Information Extraction
7.2 Chunking
7.3 Developing and Evaluating Chunkers
7.4 Recursion in Linguistic Structure
7.5 Named Entity Recognition
7.6 Relation Extraction
7.7 Summary
7.8 Further Reading
7.9 Exercises
8. Analyzing Sentence Structure
8.1 Some Grammatical Dilemmas
8.2 What’s the Use of Syntax?
8.3 Context-Free Grammar
8.4 Parsing with Context-Free Grammar
8.5 Dependencies and Dependency Grammar
8.6 Grammar Development
8.7 Summary
8.8 Further Reading
8.9 Exercises
9. Building Feature-Based Grammars
9.1 Grammatical Features
9.2 Processing Feature Structures
9.3 Extending a Feature-Based Grammar
9.4 Summary
9.5 Further Reading
9.6 Exercises
10. Analyzing the Meaning of Sentences
10.1 Natural Language Understanding
10.2 Propositional Logic
10.3 First-Order Logic
10.4 The Semantics of English Sentences
10.5 Discourse Semantics
10.6 Summary
10.7 Further Reading
10.8 Exercises
11. Managing Linguistic Data
11.1 Corpus Structure: A Case Study
11.2 The Life Cycle of a Corpus
11.3 Acquiring Data
11.4 Working with XML
11.5 Working with Toolbox Data
11.6 Describing Language Resources Using OLAC Metadata
11.7 Summary
11.8 Further Reading
11.9 Exercises
Afterword: The Language Challenge
Bibliography
NLTK Index
General Index
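Creative Commons Attribution-Noncommercial-No Derivative Works 3.0 License
The chapters listed above all come from Natural Language Processing with Python, a work of shared effort, Copyright © 2009. The content is distributed together with NLTK at http://www.nltk.org/ . The text and related materials are covered by the Creative Commons license named above.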