1.计算均值、最大值、最小值、上下四分位数
ndarray.min() / np.min(ndarray)
ndarray.max() / np.max(ndarray)
ndarray.mean() / np.mean(ndarray)
numpy.percentile(a, q, axis=None, out=None, overwrite_input=False, interpolation='linear', keepdims=False)
2.Matplotlib绘制密度直方图
统计文本长度分布情况
new_lengths = np.array(new_lengths)
s = pd.Series(new_lengths)
plt.hist(s, bins=10, rwidth=0.9)
plt.savefig(pic_path)
注意:当linux服务器没有GUI时,需要添加如下代码:
import matplotlib as mpl
mpl.use('Agg')
import matplotlib.pyplot as plt