Lately I keep running into third-party libraries that refuse to install. jieba is one example.
1. What jieba does
jieba ("结巴", Chinese for "to stutter") is a Chinese text segmentation library, built to be the best Python Chinese word segmentation module.
See README.md for the full documentation.
GitHub: https://github.com/fxsjy/jieba
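2. Features
- Supports three segmentation modes:
  - Accurate mode: tries to cut the sentence as precisely as possible; suited to text analysis.
  - Full mode: scans out every word that can be formed from the sentence; very fast, but cannot resolve ambiguity.
  - Search engine mode: builds on accurate mode by splitting long words again to improve recall; suited to search engine tokenization.
- Supports Traditional Chinese segmentation
- Supports custom dictionaries
- MIT license
Online demo: http://jiebademo.ap01.aws.af.cm/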
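To make the three modes a bit more concrete, here is a minimal sketch. The helper name segment_three_ways and the sample sentence are just picks for illustration; jieba.lcut and jieba.lcut_for_search are the list-returning counterparts of jieba.cut and jieba.cut_for_search. Since this post is about installs that fail, the helper returns None instead of crashing when jieba is not importable yet.

```python
import importlib.util


def segment_three_ways(sentence):
    """Segment a sentence in jieba's three modes; return None if jieba is missing."""
    if importlib.util.find_spec("jieba") is None:
        return None
    import jieba  # imported lazily so the check above can run without it

    return {
        "accurate": jieba.lcut(sentence, cut_all=False),  # accurate mode (the default)
        "full": jieba.lcut(sentence, cut_all=True),       # full mode
        "search": jieba.lcut_for_search(sentence),        # search engine mode
    }


result = segment_three_ways("我来到北京清华大学")
if result is None:
    print("jieba is not installed yet")
else:
    for mode, words in result.items():
        print(mode, "/".join(words))
```

Full mode and search engine mode may return overlapping words, so only the accurate-mode output is expected to join back into the original sentence.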
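3. Installation
The code is compatible with both Python 2 and Python 3.
- Fully automatic install: easy_install jieba, or pip install jieba / pip3 install jieba
- Semi-automatic install: download the package from https://pypi.python.org/pypi/jieba/ , extract it, then run python setup.py install
- Manual install: place the jieba directory in the current directory or in site-packages, then use it via import jieba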
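For the manual route you need to know where site-packages actually lives for the interpreter you are running. A small sketch using only the standard library follows; the function name manual_install_locations is hypothetical, and the "purelib" entry of sysconfig.get_paths() is the directory where pure-Python packages normally go.

```python
import sys
import sysconfig


def manual_install_locations():
    """Report the running interpreter and its site-packages directory."""
    return {
        "interpreter": sys.executable,
        "site_packages": sysconfig.get_paths()["purelib"],
    }


for name, path in manual_install_locations().items():
    print(f"{name}: {path}")
```

If several Pythons are installed, running this with each of them shows which site-packages a given python or pip3 command is really tied to.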
4. Hands-on
Run the command:
pip3 install jieba
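Here is the catch: so much for "fully automatic", pip simply could not find a matching package.
Fine, I went with the semi-automatic install instead.
4.1 Download the jieba offline package
Download it from the official site, following the screenshot below.
4.2 Extract and install
Extract jieba-0.42.1.tar.gz, then change into the extracted directory and run the install.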
D:\Python38-32\jieba-0.42.1\jieba-0.42.1>python setup.py install
running install
running build
running build_py
creating build
creating build\lib
creating build\lib\jieba
copying jieba\_compat.py -> build\lib\jieba
copying jieba\__init__.py -> build\lib\jieba
copying jieba\__main__.py -> build\lib\jieba
copying jieba\dict.txt -> build\lib\jieba
creating build\lib\jieba\finalseg
copying jieba\finalseg\prob_emit.p -> build\lib\jieba\finalseg
copying jieba\finalseg\prob_emit.py -> build\lib\jieba\finalseg
copying jieba\finalseg\prob_start.p -> build\lib\jieba\finalseg
copying jieba\finalseg\prob_start.py -> build\lib\jieba\finalseg
copying jieba\finalseg\prob_trans.p -> build\lib\jieba\finalseg
copying jieba\finalseg\prob_trans.py -> build\lib\jieba\finalseg
copying jieba\finalseg\__init__.py -> build\lib\jieba\finalseg
creating build\lib\jieba\analyse
copying jieba\analyse\analyzer.py -> build\lib\jieba\analyse
copying jieba\analyse\idf.txt -> build\lib\jieba\analyse
copying jieba\analyse\textrank.py -> build\lib\jieba\analyse
copying jieba\analyse\tfidf.py -> build\lib\jieba\analyse
copying jieba\analyse\__init__.py -> build\lib\jieba\analyse
creating build\lib\jieba\posseg
copying jieba\posseg\char_state_tab.p -> build\lib\jieba\posseg
copying jieba\posseg\char_state_tab.py -> build\lib\jieba\posseg
copying jieba\posseg\prob_emit.p -> build\lib\jieba\posseg
copying jieba\posseg\prob_emit.py -> build\lib\jieba\posseg
copying jieba\posseg\prob_start.p -> build\lib\jieba\posseg
copying jieba\posseg\prob_start.py -> build\lib\jieba\posseg
copying jieba\posseg\prob_trans.p -> build\lib\jieba\posseg
copying jieba\posseg\prob_trans.py -> build\lib\jieba\posseg
copying jieba\posseg\viterbi.py -> build\lib\jieba\posseg
copying jieba\posseg\__init__.py -> build\lib\jieba\posseg
creating build\lib\jieba\lac_small
copying jieba\lac_small\creator.py -> build\lib\jieba\lac_small
copying jieba\lac_small\nets.py -> build\lib\jieba\lac_small
copying jieba\lac_small\predict.py -> build\lib\jieba\lac_small
copying jieba\lac_small\reader_small.py -> build\lib\jieba\lac_small
copying jieba\lac_small\utils.py -> build\lib\jieba\lac_small
copying jieba\lac_small\__init__.py -> build\lib\jieba\lac_small
copying jieba\lac_small\tag.dic -> build\lib\jieba\lac_small
copying jieba\lac_small\word.dic -> build\lib\jieba\lac_small
creating build\lib\jieba\lac_small\model_baseline
copying jieba\lac_small\model_baseline\crfw -> build\lib\jieba\lac_small\model_baseline
copying jieba\lac_small\model_baseline\fc_0.b_0 -> build\lib\jieba\lac_small\model_baseline
copying jieba\lac_small\model_baseline\fc_0.w_0 -> build\lib\jieba\lac_small\model_baseline
copying jieba\lac_small\model_baseline\fc_1.b_0 -> build\lib\jieba\lac_small\model_baseline
copying jieba\lac_small\model_baseline\fc_1.w_0 -> build\lib\jieba\lac_small\model_baseline
copying jieba\lac_small\model_baseline\fc_2.b_0 -> build\lib\jieba\lac_small\model_baseline
copying jieba\lac_small\model_baseline\fc_2.w_0 -> build\lib\jieba\lac_small\model_baseline
copying jieba\lac_small\model_baseline\fc_3.b_0 -> build\lib\jieba\lac_small\model_baseline
copying jieba\lac_small\model_baseline\fc_3.w_0 -> build\lib\jieba\lac_small\model_baseline
copying jieba\lac_small\model_baseline\fc_4.b_0 -> build\lib\jieba\lac_small\model_baseline
copying jieba\lac_small\model_baseline\fc_4.w_0 -> build\lib\jieba\lac_small\model_baseline
copying jieba\lac_small\model_baseline\gru_0.b_0 -> build\lib\jieba\lac_small\model_baseline
copying jieba\lac_small\model_baseline\gru_0.w_0 -> build\lib\jieba\lac_small\model_baseline
copying jieba\lac_small\model_baseline\gru_1.b_0 -> build\lib\jieba\lac_small\model_baseline
copying jieba\lac_small\model_baseline\gru_1.w_0 -> build\lib\jieba\lac_small\model_baseline
copying jieba\lac_small\model_baseline\gru_2.b_0 -> build\lib\jieba\lac_small\model_baseline
copying jieba\lac_small\model_baseline\gru_2.w_0 -> build\lib\jieba\lac_small\model_baseline
copying jieba\lac_small\model_baseline\gru_3.b_0 -> build\lib\jieba\lac_small\model_baseline
copying jieba\lac_small\model_baseline\gru_3.w_0 -> build\lib\jieba\lac_small\model_baseline
copying jieba\lac_small\model_baseline\word_emb -> build\lib\jieba\lac_small\model_baseline
running install_lib
creating D:\Python38-32\Lib\site-packages\jieba
creating D:\Python38-32\Lib\site-packages\jieba\analyse
copying build\lib\jieba\analyse\analyzer.py -> D:\Python38-32\Lib\site-packages\jieba\analyse
copying build\lib\jieba\analyse\idf.txt -> D:\Python38-32\Lib\site-packages\jieba\analyse
copying build\lib\jieba\analyse\textrank.py -> D:\Python38-32\Lib\site-packages\jieba\analyse
copying build\lib\jieba\analyse\tfidf.py -> D:\Python38-32\Lib\site-packages\jieba\analyse
copying build\lib\jieba\analyse\__init__.py -> D:\Python38-32\Lib\site-packages\jieba\analyse
copying build\lib\jieba\dict.txt -> D:\Python38-32\Lib\site-packages\jieba
creating D:\Python38-32\Lib\site-packages\jieba\finalseg
copying build\lib\jieba\finalseg\prob_emit.p -> D:\Python38-32\Lib\site-packages\jieba\finalseg
copying build\lib\jieba\finalseg\prob_emit.py -> D:\Python38-32\Lib\site-packages\jieba\finalseg
copying build\lib\jieba\finalseg\prob_start.p -> D:\Python38-32\Lib\site-packages\jieba\finalseg
copying build\lib\jieba\finalseg\prob_start.py -> D:\Python38-32\Lib\site-packages\jieba\finalseg
copying build\lib\jieba\finalseg\prob_trans.p -> D:\Python38-32\Lib\site-packages\jieba\finalseg
copying build\lib\jieba\finalseg\prob_trans.py -> D:\Python38-32\Lib\site-packages\jieba\finalseg
copying build\lib\jieba\finalseg\__init__.py -> D:\Python38-32\Lib\site-packages\jieba\finalseg
creating D:\Python38-32\Lib\site-packages\jieba\lac_small
copying build\lib\jieba\lac_small\creator.py -> D:\Python38-32\Lib\site-packages\jieba\lac_small
creating D:\Python38-32\Lib\site-packages\jieba\lac_small\model_baseline
copying build\lib\jieba\lac_small\model_baseline\crfw -> D:\Python38-32\Lib\site-packages\jieba\lac_small\model_baseline
copying build\lib\jieba\lac_small\model_baseline\fc_0.b_0 -> D:\Python38-32\Lib\site-packages\jieba\lac_small\model_baseline
copying build\lib\jieba\lac_small\model_baseline\fc_0.w_0 -> D:\Python38-32\Lib\site-packages\jieba\lac_small\model_baseline
copying build\lib\jieba\lac_small\model_baseline\fc_1.b_0 -> D:\Python38-32\Lib\site-packages\jieba\lac_small\model_baseline
copying build\lib\jieba\lac_small\model_baseline\fc_1.w_0 -> D:\Python38-32\Lib\site-packages\jieba\lac_small\model_baseline
copying build\lib\jieba\lac_small\model_baseline\fc_2.b_0 -> D:\Python38-32\Lib\site-packages\jieba\lac_small\model_baseline
copying build\lib\jieba\lac_small\model_baseline\fc_2.w_0 -> D:\Python38-32\Lib\site-packages\jieba\lac_small\model_baseline
copying build\lib\jieba\lac_small\model_baseline\fc_3.b_0 -> D:\Python38-32\Lib\site-packages\jieba\lac_small\model_baseline
copying build\lib\jieba\lac_small\model_baseline\fc_3.w_0 -> D:\Python38-32\Lib\site-packages\jieba\lac_small\model_baseline
copying build\lib\jieba\lac_small\model_baseline\fc_4.b_0 -> D:\Python38-32\Lib\site-packages\jieba\lac_small\model_baseline
copying build\lib\jieba\lac_small\model_baseline\fc_4.w_0 -> D:\Python38-32\Lib\site-packages\jieba\lac_small\model_baseline
copying build\lib\jieba\lac_small\model_baseline\gru_0.b_0 -> D:\Python38-32\Lib\site-packages\jieba\lac_small\model_baseline
copying build\lib\jieba\lac_small\model_baseline\gru_0.w_0 -> D:\Python38-32\Lib\site-packages\jieba\lac_small\model_baseline
copying build\lib\jieba\lac_small\model_baseline\gru_1.b_0 -> D:\Python38-32\Lib\site-packages\jieba\lac_small\model_baseline
copying build\lib\jieba\lac_small\model_baseline\gru_1.w_0 -> D:\Python38-32\Lib\site-packages\jieba\lac_small\model_baseline
copying build\lib\jieba\lac_small\model_baseline\gru_2.b_0 -> D:\Python38-32\Lib\site-packages\jieba\lac_small\model_baseline
copying build\lib\jieba\lac_small\model_baseline\gru_2.w_0 -> D:\Python38-32\Lib\site-packages\jieba\lac_small\model_baseline
copying build\lib\jieba\lac_small\model_baseline\gru_3.b_0 -> D:\Python38-32\Lib\site-packages\jieba\lac_small\model_baseline
copying build\lib\jieba\lac_small\model_baseline\gru_3.w_0 -> D:\Python38-32\Lib\site-packages\jieba\lac_small\model_baseline
copying build\lib\jieba\lac_small\model_baseline\word_emb -> D:\Python38-32\Lib\site-packages\jieba\lac_small\model_baseline
copying build\lib\jieba\lac_small\nets.py -> D:\Python38-32\Lib\site-packages\jieba\lac_small
copying build\lib\jieba\lac_small\predict.py -> D:\Python38-32\Lib\site-packages\jieba\lac_small
copying build\lib\jieba\lac_small\reader_small.py -> D:\Python38-32\Lib\site-packages\jieba\lac_small
copying build\lib\jieba\lac_small\tag.dic -> D:\Python38-32\Lib\site-packages\jieba\lac_small
copying build\lib\jieba\lac_small\utils.py -> D:\Python38-32\Lib\site-packages\jieba\lac_small
copying build\lib\jieba\lac_small\word.dic -> D:\Python38-32\Lib\site-packages\jieba\lac_small
copying build\lib\jieba\lac_small\__init__.py -> D:\Python38-32\Lib\site-packages\jieba\lac_small
creating D:\Python38-32\Lib\site-packages\jieba\posseg
copying build\lib\jieba\posseg\char_state_tab.p -> D:\Python38-32\Lib\site-packages\jieba\posseg
copying build\lib\jieba\posseg\char_state_tab.py -> D:\Python38-32\Lib\site-packages\jieba\posseg
copying build\lib\jieba\posseg\prob_emit.p -> D:\Python38-32\Lib\site-packages\jieba\posseg
copying build\lib\jieba\posseg\prob_emit.py -> D:\Python38-32\Lib\site-packages\jieba\posseg
copying build\lib\jieba\posseg\prob_start.p -> D:\Python38-32\Lib\site-packages\jieba\posseg
copying build\lib\jieba\posseg\prob_start.py -> D:\Python38-32\Lib\site-packages\jieba\posseg
copying build\lib\jieba\posseg\prob_trans.p -> D:\Python38-32\Lib\site-packages\jieba\posseg
copying build\lib\jieba\posseg\prob_trans.py -> D:\Python38-32\Lib\site-packages\jieba\posseg
copying build\lib\jieba\posseg\viterbi.py -> D:\Python38-32\Lib\site-packages\jieba\posseg
copying build\lib\jieba\posseg\__init__.py -> D:\Python38-32\Lib\site-packages\jieba\posseg
copying build\lib\jieba\_compat.py -> D:\Python38-32\Lib\site-packages\jieba
copying build\lib\jieba\__init__.py -> D:\Python38-32\Lib\site-packages\jieba
copying build\lib\jieba\__main__.py -> D:\Python38-32\Lib\site-packages\jieba
byte-compiling D:\Python38-32\Lib\site-packages\jieba\analyse\analyzer.py to analyzer.cpython-38.pyc
byte-compiling D:\Python38-32\Lib\site-packages\jieba\analyse\textrank.py to textrank.cpython-38.pyc
byte-compiling D:\Python38-32\Lib\site-packages\jieba\analyse\tfidf.py to tfidf.cpython-38.pyc
byte-compiling D:\Python38-32\Lib\site-packages\jieba\analyse\__init__.py to __init__.cpython-38.pyc
byte-compiling D:\Python38-32\Lib\site-packages\jieba\finalseg\prob_emit.py to prob_emit.cpython-38.pyc
byte-compiling D:\Python38-32\Lib\site-packages\jieba\finalseg\prob_start.py to prob_start.cpython-38.pyc
byte-compiling D:\Python38-32\Lib\site-packages\jieba\finalseg\prob_trans.py to prob_trans.cpython-38.pyc
byte-compiling D:\Python38-32\Lib\site-packages\jieba\finalseg\__init__.py to __init__.cpython-38.pyc
byte-compiling D:\Python38-32\Lib\site-packages\jieba\lac_small\creator.py to creator.cpython-38.pyc
byte-compiling D:\Python38-32\Lib\site-packages\jieba\lac_small\nets.py to nets.cpython-38.pyc
byte-compiling D:\Python38-32\Lib\site-packages\jieba\lac_small\predict.py to predict.cpython-38.pyc
byte-compiling D:\Python38-32\Lib\site-packages\jieba\lac_small\reader_small.py to reader_small.cpython-38.pyc
byte-compiling D:\Python38-32\Lib\site-packages\jieba\lac_small\utils.py to utils.cpython-38.pyc
byte-compiling D:\Python38-32\Lib\site-packages\jieba\lac_small\__init__.py to __init__.cpython-38.pyc
byte-compiling D:\Python38-32\Lib\site-packages\jieba\posseg\char_state_tab.py to char_state_tab.cpython-38.pyc
byte-compiling D:\Python38-32\Lib\site-packages\jieba\posseg\prob_emit.py to prob_emit.cpython-38.pyc
byte-compiling D:\Python38-32\Lib\site-packages\jieba\posseg\prob_start.py to prob_start.cpython-38.pyc
byte-compiling D:\Python38-32\Lib\site-packages\jieba\posseg\prob_trans.py to prob_trans.cpython-38.pyc
byte-compiling D:\Python38-32\Lib\site-packages\jieba\posseg\viterbi.py to viterbi.cpython-38.pyc
byte-compiling D:\Python38-32\Lib\site-packages\jieba\posseg\__init__.py to __init__.cpython-38.pyc
byte-compiling D:\Python38-32\Lib\site-packages\jieba\_compat.py to _compat.cpython-38.pyc
byte-compiling D:\Python38-32\Lib\site-packages\jieba\__init__.py to __init__.cpython-38.pyc
byte-compiling D:\Python38-32\Lib\site-packages\jieba\__main__.py to __main__.cpython-38.pyc
running install_egg_info
Writing D:\Python38-32\Lib\site-packages\jieba-0.42.1-py3.8.egg-info
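The extract step at the start of this section can also be scripted instead of done by hand. The following is a rough sketch using the standard tarfile module; the function name extract_sdist and the paths in the usage comment are hypothetical, and it assumes the archive comes from a source you trust.

```python
import tarfile
from pathlib import Path


def extract_sdist(archive_path, dest_dir):
    """Unpack a .tar.gz source package and return the folder that holds setup.py."""
    dest = Path(dest_dir)
    with tarfile.open(str(archive_path), "r:gz") as tar:
        tar.extractall(str(dest))  # only unpack archives from a trusted source
    # A source package normally unpacks into one top-level folder,
    # so pick the shallowest setup.py that was extracted.
    candidates = list(dest.rglob("setup.py"))
    if not candidates:
        return None
    return min(candidates, key=lambda p: len(p.parts)).parent


# Hypothetical usage (paths are placeholders):
# project_dir = extract_sdist("jieba-0.42.1.tar.gz", "unpacked")
# then run "python setup.py install" from inside project_dir
```

The returned folder is the one to change into before running the install command shown above.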
4.3 Check the jieba version to verify the install
Run the command:
pip show jieba
The output looks like this:
D:\Python38-32\jieba-0.42.1\jieba-0.42.1>pip show jieba
Name: jieba
Version: 0.42.1
Summary: Chinese Words Segmentation Utilities
Home-page: https://github.com/fxsjy/jieba
Author: Sun, Junyi
Author-email: ccnusjy@gmail.com
License: MIT
Location: d:\python38-32\lib\site-packages
Requires:
Required-by:
You can also open the Python interactive prompt and run import jieba to check that the install worked. I just happen to like pip show jieba, hehe.
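Both checks can also be done from a script. Here is a minimal sketch, assuming Python 3.8 or newer (where importlib.metadata is in the standard library); the helper name installed_version is made up for this post.

```python
from importlib import metadata


def installed_version(package):
    """Return the installed version string of a package, or None if it is absent."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


print("jieba version:", installed_version("jieba"))
```

This reads the same package metadata that pip show reports, so a None here suggests the install did not land in the interpreter you are running.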
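5. Closing thoughts
Not much more to say; all the hands-on steps are above. Lately I have been at war with pip install <package name>, and direct installs just will not work.
If you want to see web-scraping examples from an expert, search msdn for the author called "不正经的kimol君",
or look up the WeChat official account "Python干货铺子".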