例题
对于这一组电影数据,如果我们想知道rating,runtime的分布情况,应该如何呈现数据?
解
df.info()
==============================================================
==============================================================
<class 'pandas.core.frame.DataFrame'>
RangeIndex: 1000 entries, 0 to 999
Data columns (total 12 columns):
# Column Non-Null Count Dtype
--- ------ -------------- -----
0 Rank 1000 non-null int64
1 Title 1000 non-null object
2 Genre 1000 non-null object
3 Description 1000 non-null object
4 Director 1000 non-null object
5 Actors 1000 non-null object
6 Year 1000 non-null int64
7 Runtime (Minutes) 1000 non-null int64
8 Rating 1000 non-null float64
9 Votes 1000 non-null int64
10 Revenue (Millions) 872 non-null float64
11 Metascore 936 non-null float64
dtypes: float64(3), int64(4), object(5)
memory usage: 93.9+ KB
统计runtime的分布情况(直方图)
代码
# -*- coding: utf-8 -*-
'''
@Time : 2020/12/15 19:25
@Author : yuhui
@Email : 3476237164@qq.com
@FileName: pandas_1.py
@Software: PyCharm
'''
"""30【统计方法和字符串离散化】02电影数据直方图"""
"""对于这一组电影数据,如果我们想知道rating,runtime的分布情况,应该如何呈现数据?"""
import pandas as pd
import matplotlib.pyplot as plt
# 解决中文乱码
plt.rcParams["font.sans-serif"]=["KaiTi"]
plt.rcParams["font.family"]="sans-serif"
# 解决符号无法显示的问题
plt.rcParams['axes.unicode_minus']