python安装tesserocr时候出现如下错误:
一、ERROR: Command errored out with exit status 1: python setup.py egg_info Check the logs for full command output.
解决方法:更新setuptools,输入命令如下:
python -m pip install --upgrade setuptools
二、错误为:
ERROR: Command errored out with exit status 1: ‘c:\python38\python.exe’ -u -c ‘import sys, setuptools, tokenize; sys.argv[0] = ‘"’"‘C:\Users\sumengtin\AppData\Local\Temp\pip-install-fse30vja\tesserocr\setup.py’"’"’; file=’"’"‘C:\Users\sumengtin\AppData\Local\Temp\pip-install-fse30vja\tesserocr\setup.py’"’"’;f=getattr(tokenize, ‘"’"‘open’"’"’, open)(file);code=f.read().replace(’"’"’\r\n’"’"’, ‘"’"’\n’"’"’);f.close();exec(compile(code, file, ‘"’"‘exec’"’"’))’ install --record ‘C:\Users\sumengtin\AppData\Local\Temp\pip-record-az0i4_x7\install-record.txt’ --single-version-externally-managed --compile --install-headers ‘c:\python38\Include\tesserocr’ Check the logs for full command output.
解决方法:
1)打开cmd,输入python,查看下载的python是什么版本
2)在https://www.lfd.uci.edu/~gohlke/pythonlibs/这个网址中找到wordcloud对应的版本,cp38是python3.8版本,amd64是windows64位
3)输入命令pip install wordcloud-1.7.0-cp38-cp38-win_amd64.whl下载
三、错误提示:
UnicodeDecodeError: ‘utf-8’ codec can’t decode byte 0xc3 in position 0: invalid continuation byte
原因:保存Tesseract-OCR的路径名称用了中文
解决方法:将路径的文件名改为英文
四、错误提示:
pytesseract.pytesseract.TesseractError: (1, 'Error opening data file \selenium_code\Tesseract-OCR\eng.traineddata Please make sure the TESSDATA_PREFIX environment variable is set to your “tessdata” directory. Failed loading language ‘eng’ Tesseract c
解决方法:
1.键盘点击ctrl+r打开cmd窗口
2.在cmd窗口输入SET TESSDATA_PREFIX=E:\selenium_code\Tesseract-OCR,再按回车键就可以了
五、错误提示:tesserocr is not a supported wheel on this platform
原因:同一级目录里有2个不同的tesserocr文件
解决方法:去掉一个就可以下载了