Converting between Python dict, DataFrame, and Series

赵孝正

Last modified 2025-04-17 08:59:02

Categories: Python Basics, Python for Data Analysis. Tags: python, numpy, machine learning

First published 2022-06-28 16:01:06

Article link: https://blog.csdn.net/weixin_46713695/article/details/125503821

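This article shows how to convert a nested dict (a dict of dicts) or a dict of equal-length lists or NumPy arrays into a pandas DataFrame: converting a nested dict directly, converting a dict through its items(), and building a DataFrame from equal-length data. It also demonstrates converting a pd.Series to a dict and renaming DataFrame columns.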

1. Converting a nested dict (a dict of dicts) to a DataFrame

import pandas as pd

pop = {'Nevada': {2001: 2.4, 2002: 2.9},
       'Ohio': {2000: 1.5, 2001: 1.7, 2002: 3.6}}
frame = pd.DataFrame(pop)

frame
Out[3]: 
      Nevada  Ohio
2001     2.4   1.7
2002     2.9   3.6
2000     NaN   1.5

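The outer keys become the columns and the inner keys become the row index. The frame can be transposed much like a NumPy array: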

frame.T
Out[4]: 
        2001  2002  2000
Nevada   2.4   2.9   NaN
Ohio     1.7   3.6   1.5
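If the outer keys are wanted as rows from the start, the transpose can be skipped. The sketch below uses pd.DataFrame.from_dict with orient='index' on the same pop dict; the variable name by_state is only illustrative and is not from the original article.

```python
import pandas as pd

pop = {'Nevada': {2001: 2.4, 2002: 2.9},
       'Ohio': {2000: 1.5, 2001: 1.7, 2002: 3.6}}

# orient='index' treats the outer keys as row labels instead of column labels
by_state = pd.DataFrame.from_dict(pop, orient='index')

# Years missing for a state are filled with NaN, exactly as in frame.T
print(by_state)
```

The column order of the result may differ between pandas versions, so look values up by label rather than by position.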

1.2 Building a dict and converting it to a DataFrame

ans_weight = {'class1': 10.23, 'class3': 18.38}
difference = ans_weight['class3'] - ans_weight['class1']
ans_weight['total'] = difference
print(ans_weight)

{'class1': 10.23, 'class3': 18.38, 'total': 8.149999999999999}

ans_weight.items()
Out[15]: dict_items([('class1', 10.23), ('class3', 18.38), ('total', 8.149999999999999)])
list(ans_weight.items())
Out[16]: [('class1', 10.23), ('class3', 18.38), ('total', 8.149999999999999)]

Convert to a DataFrame (pandas rounds floats when displaying, so the last value prints as 8.15):

# Convert the ans_weight dictionary to a pandas DataFrame
df_ans_weight = pd.DataFrame(list(ans_weight.items()), columns=['Class', 'Value'])

df_ans_weight
Out[17]: 
    Class  Value
0  class1  10.23
1  class3  18.38
2   total   8.15
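Since the title also covers Series, here is a minimal sketch of the same flat dict going through a pd.Series: the dict keys become the index, and reset_index turns that index into a regular column. The names s, df, and back are illustrative only.

```python
import pandas as pd

ans_weight = {'class1': 10.23, 'class3': 18.38}

# dict -> Series: keys become the index, values become the data
s = pd.Series(ans_weight)

# Series -> two-column DataFrame: name the index, then move it into a column
df = s.rename_axis('Class').reset_index(name='Value')

# Series -> dict: the round trip recovers the original mapping
back = s.to_dict()
```

Going through a Series is convenient when the index is the natural key; the list(ans_weight.items()) form above is more direct when only the two-column table is needed.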

2. Building a DataFrame from a dict of equal-length lists or NumPy arrays

Note: all of the lists or arrays must have the same length.

data = {'state': ['Ohio', 'Ohio', 'Ohio', 'Nevada', 'Nevada', 'Nevada'],
        'year': [2000, 2001, 2002, 2001, 2002, 2003],
        'pop': [1.5, 1.7, 3.6, 2.4, 2.9, 3.2]}
frame = pd.DataFrame(data)

frame
Out[8]: 
    state  year  pop
0    Ohio  2000  1.5
1    Ohio  2001  1.7
2    Ohio  2002  3.6
3  Nevada  2001  2.4
4  Nevada  2002  2.9
5  Nevada  2003  3.2
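To illustrate why the equal-length requirement matters, the sketch below passes lists of different lengths and catches the resulting ValueError, then shows one possible workaround: wrapping each list in a pd.Series so that pandas aligns on the index and pads with NaN. The dict ragged is made up for this example.

```python
import pandas as pd

# Hypothetical data: 'b' is one element shorter than 'a'
ragged = {'a': [1, 2, 3], 'b': [4, 5]}

try:
    pd.DataFrame(ragged)
    raised = False
except ValueError:
    # pandas refuses plain lists of unequal length
    raised = True

# Workaround: Series are aligned by index, so the short column is padded with NaN
padded = pd.DataFrame({key: pd.Series(values) for key, values in ragged.items()})
```

Padding changes the column to a float dtype because NaN is a float, which may or may not be acceptable depending on the data.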

Specifying the column order:

pd.DataFrame(data, columns=['year', 'state', 'pop'])
Out[9]: 
   year   state  pop
0  2000    Ohio  1.5
1  2001    Ohio  1.7
2  2002    Ohio  3.6
3  2001  Nevada  2.4
4  2002  Nevada  2.9
5  2003  Nevada  3.2
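For the reverse direction, DataFrame.to_dict can turn a frame back into a dict. A short sketch on a two-row version of the data above (shortened here for brevity) shows two commonly used orient values:

```python
import pandas as pd

data = {'state': ['Ohio', 'Nevada'],
        'year': [2000, 2001],
        'pop': [1.5, 2.4]}
frame = pd.DataFrame(data)

# orient='list': {column -> list of values}, the same shape as the input dict
as_lists = frame.to_dict(orient='list')

# orient='records': one dict per row, handy for JSON-like output
as_records = frame.to_dict(orient='records')
```

The default orient ('dict') returns a nested {column -> {index -> value}} mapping, which mirrors the nested-dict input in section 1.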

3. Converting a pd.Series to a dict

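# C is a list of labels; it is rebuilt here so that the counts match the output below
C = [0] * 6367 + [-1] * 1103 + [1] * 18
pd.Series(C).value_counts().to_dict()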

Output (each distinct value mapped to its count, most frequent first):

{0: 6367, -1: 1103, 1: 18}
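Series.to_dict maps each index label to its value, so the key order of the dict follows the order of the Series. A small sketch with made-up labels shows the default count ordering and how sort_index changes it:

```python
import pandas as pd

# Hypothetical labels, chosen so every count is different
labels = [0, 0, -1, 0, 1, -1]

counts = pd.Series(labels).value_counts()

# Default: ordered by count, most frequent first
as_dict = counts.to_dict()

# Sorted by label instead of by count
by_label = counts.sort_index().to_dict()
```

Both dicts hold the same pairs; only the iteration order differs.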

4. Quickly renaming DataFrame columns with columns=dict(zip(list1, list2))

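# df_10min and wind_profile_clustering.height are defined elsewhere (not shown here);
# zip pairs each existing column name with the new name at the same position
df_10min = df_10min.rename(columns=dict(zip(df_10min.columns, wind_profile_clustering.height)))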
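Because the line above depends on objects that are not shown, here is a self-contained sketch of the same pattern. The frame df, its column names, and the heights list are hypothetical stand-ins, not the original data.

```python
import pandas as pd

# Hypothetical frame with placeholder column names
df = pd.DataFrame([[5.1, 6.3, 7.0],
                   [4.8, 6.0, 6.9]], columns=['c0', 'c1', 'c2'])

# Hypothetical new names, one per existing column, in the same order
heights = [10, 50, 100]

# zip pairs old and new names by position; dict turns the pairs into a mapping
mapping = dict(zip(df.columns, heights))

# rename returns a new DataFrame; df itself is left unchanged
renamed = df.rename(columns=mapping)
```

zip stops at the shorter input, so if the new-name list is shorter than the column list, the remaining columns simply keep their old names.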

5. Building a DataFrame from lists

Build a DataFrame from two lists (height_ws and height_wd) that may differ in length:

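import re
from itertools import zip_longest

# new_columns is a list of column names defined elsewhere (not shown here);
# the number after the first '_' or '-' in each name is extracted as an int
height_ws = sorted(
    {int(re.split('[_-]', hh)[1]) for hh in new_columns if hh != 'time' and 'ws' in hh})
height_wd = sorted(
    {int(re.split('[_-]', hh)[1]) for hh in new_columns if hh != 'time' and 'wd' in hh})
# zip_longest pads the shorter list with None so that no value is dropped
zipped = list(zip_longest(height_ws, height_wd, fillvalue=None))
df2 = pd.DataFrame(zipped, columns=['ws', 'wd'])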
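The snippet above cannot run on its own because new_columns is not shown. The sketch below fills that gap with a hypothetical list of column names (an assumption, not the original data) and wraps the extraction in a small illustrative helper called heights.

```python
import re
from itertools import zip_longest

import pandas as pd

# Hypothetical column names; the original new_columns is not shown in the article
new_columns = ['time', 'ws_10', 'ws_50', 'ws_100', 'wd_10', 'wd-50']


def heights(columns, tag):
    """Return the sorted, de-duplicated heights of the columns containing tag."""
    return sorted({int(re.split('[_-]', name)[1]) for name in columns
                   if name != 'time' and tag in name})


height_ws = heights(new_columns, 'ws')
height_wd = heights(new_columns, 'wd')

# Pair the lists row by row; the shorter one is padded with None
rows = list(zip_longest(height_ws, height_wd, fillvalue=None))
df2 = pd.DataFrame(rows, columns=['ws', 'wd'])
print(df2)
```

With a plain zip the extra 'ws' height would be silently dropped, which is why zip_longest is used here; the padded cell shows up as a missing value in the 'wd' column.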