1. 机器学习
- scikit-learn: 机器学习
https://scikit-learn.org/stable/ - imbalanced-learn: 应对机器学习中的不平衡数据集
https://imbalanced-learn.org/stable/ - scikit-feature: 机器学习特征选择
https://jundongl.github.io/scikit-feature/ - tsfresh: 时间序列数据特征提取
https://tsfresh.readthedocs.io/en/latest/ - XGBoost
2. 深度学习
- pytorch
- Tensorflow
- Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow and JAX, 可用于下载和训练预训练模型
https://huggingface.co/docs/transformers/index - Keras
Keras中文文档
注:TensorFlow与Pytorch相比,个人推荐Pytorch.
- SpeechRecognition: 语音识别
https://pypi.org/project/SpeechRecognition/
3. 数据分析及处理
-
Numpy中文
https://www.numpy.org.cn/ -
Pandas
https://www.pypandas.cn/ (中文网)
https://pandas.pydata.org/ -
jieba: 用于中文分词
https://pypi.org/project/jieba/ -
pypinyin: 汉语转拼音
https://pypi.org/project/pypinyin/ -
Moviepy: 视频编辑库
https://zulko.github.io/moviepy/ -
networkx源码: 用于图/网络的分析
https://github.com/networkx -
matplotlib: 数据可视化
https://matplotlib.org/
https://www.matplotlib.org.cn (中文网) -
pyecharts: 数据可视化
https://pyecharts.org/#/
e.g.