python pandas
weixin_45414611
这个作者很懒,什么都没留下…
展开
-
python pandas行操作 10
import pandas as pd page_001 = pd.read_excel('Students.xlsx',sheet_name = 'Page_001') page_002 = pd.read_excel('Students.xlsx',sheet_name = 'Page_002') students = page_001.append(page_002)reset_index(drop = True) # 不管原来index stu = pd.Series({'ID':41,'Name原创 2021-05-13 23:21:27 · 102 阅读 · 0 评论 -
python pandas透视表pivot table 09
方法一: import pandas as pd pd.options.display.max_columns = 999 orders = pd.read_excel('Orders.xlsx') orders['Year'] = pd.DatetimeIndex(orders['Date']).year pt1 = orders.pivot_table(index = 'Category',columns='Year',values= 'Total',aggfunc = np.sum) 方法二:原创 2021-05-13 22:43:17 · 131 阅读 · 0 评论 -
python pandas 一列拆分08
把一列拆分成两列 例如name拆分成first name+last name import pandas as pd employees = pd.read_excel('employee.xlsx',index_col = 'ID') df = employees['Full Name'].str.split(n=3,expand = True) # 默认空格切割 n:最多保留个数 employees['Fiest Name'] = df[0].str.upper() # 全大写 employees原创 2021-05-13 22:21:25 · 129 阅读 · 0 评论 -
python pandas 数据校验07
import pandas as pd def score_validation(row): try: assert 0<=row.Score<=100 except: print(f'#{row.ID}student{row.Name}has an invalid score {row.Score}.') students = pd.read_excel('bookds.xlsx') students.apply(score_validation,axis=1) import原创 2021-05-13 18:20:43 · 222 阅读 · 0 评论 -
python pandas多表联合VLOOKUP 06
excel =if NA(vlookup(),0) ----当NA的时候为0 方法一 import pandas as pd students = pd.read_excel('bookds.xlsx', sheet_name='Students',index_col ='ID') scores = pd.read_excel('bookds.xlsx', sheet_name='Scores',index_col ='ID') # 一旦设为index就从普通列中消失了 table = student原创 2021-05-13 18:09:29 · 481 阅读 · 0 评论 -
python pandas 可视化图表 05
import pandas as pd import matplotlib.pyplot as plt books = pd.read_excel('bookds.xlsx', index_col='ID') books['Price'] = books['ListPrice'] * books['Discount'] 课时09原创 2021-05-13 17:32:12 · 49 阅读 · 0 评论 -
python pandas 筛选04
方法优化 import pandas as pd def age_18_to_30(a): return 18<=a<30 def level_a(s): return 85<=s<=100 students = pd.read_excel('students.xlsx', index_col='ID') students = students.loc[students['Age'].apply[age_18_to_30]].loc[students['Score'].appl原创 2021-05-13 17:29:47 · 53 阅读 · 0 评论 -
python pandas 函数运用 筛选 03
公式计算* 方法一 import pandas as pd books = pd.read_excel('bookds.xlsx', index_col='ID') books['Price'] = books['ListPrice'] * books['Discount'] # 操作符的重载 方法二:用for loop import pandas as pd books = pd.read_excel('bookds.xlsx', index_col='ID') for i in books.inde原创 2021-05-13 17:19:30 · 79 阅读 · 0 评论 -
python pandas series 02
序列series 理解 import pandas as pd # dict 很像 key value pair d =('x':100) d =('x':100) s1 = pd.Series(d) # index是 x value 是100 import pandas as pd L1 = [100,200,300] L2 = ['x','y','z'] s1 = pd.Series(L1, index = L2) # 或者直接值放进去 s1 = pd.Series([100,200,300], in原创 2021-05-13 16:34:04 · 53 阅读 · 0 评论 -
python pandas 文件读取 01
创建文件 df = pd.DataFrame() # 一个 worksheet df = pd.DataFrame({'ID':[1,2,3]}) # 加数据 此时会多出一列012···作为索引 # 若想让ID作为索引 df = df.set_index('ID') 读文件 import pandas as pd all_player = pd.read_excel('allplayer.xls') games = pd.read_excel('allplayer.xlsx') res = pd.merg原创 2021-05-13 14:55:21 · 277 阅读 · 0 评论