先上官方文档:
http://scikit-learn.org/stable/user_guide.html
API:
http://scikit-learn.org/stable/modules/classes.html
加载文本语料的方法doc文档为
http://scikit-learn.org/stable/modules/generated/sklearn.datasets.load_files.html#sklearn.datasets.load_files
语料库目录结构:
container_folder/
category_1_folder/
file_1.txt file_2.txt ... file_42.txt
category_2_folder/
file_43.txt file_44.txt ...
源码分析: