Pandas学习笔记1 pandas数据结构

最新推荐文章于 2024-07-25 10:26:51 发布

Chovy_Yao

最新推荐文章于 2024-07-25 10:26:51 发布

阅读量116

点赞数

文章标签：数据结构 python

本文链接：https://blog.csdn.net/qq_46512555/article/details/119937778

版权

前言

随着人工智能的不断发展，机器学习这门技术也越来越重要，很多人都开启了学习机器学习，本文就介绍了机器学习的基础内容。

示例：pandas 是基于NumPy 的一种工具，该工具是为了解决数据分析任务而创建的。

代码如下（示例）：

import pandas as pd
import numpy as np

用我们目前所知道的numpy来生成数组

arr1 = np.arange(10)
print(arr1)

[0 1 2 3 4 5 6 7 8 9]

ser1 = pd.Series(arr1)
print(ser1)

0 0
1 1
2 2
3 3
4 4
5 5
6 6
7 7
8 8
9 9
dtype: int32

我们再尝试下字符串可不可以作为列表呢

ser2 = pd.Series(arr1,list('ircolasefm'))
print(ser2)

i 0
r 1
c 2
o 3
l 4
a 5
s 6
e 7
f 8
m 9
dtype: int32

观察一下行索引

ser2.values

array([0, 1, 2, 3, 4, 5, 6, 7, 8, 9])

自定义索引

ser2.index

Index([‘i’, ‘r’, ‘c’, ‘o’, ‘l’, ‘a’, ‘s’, ‘e’, ‘f’, ‘m’], dtype=‘object’)

ser1.index

RangeIndex(start=0, stop=10, step=1)

pd.Series([1,2,3])

0 1
1 2
2 3
dtype: int64

我们再新建一个索引

ser4 = pd.Series({'name':'zhangsan','age':'18'})

查看索引的对象

ser4.values

array([‘zhangsan’, ‘18’], dtype=object)

在查看一下行索引

ser4.index

Index([‘name’, ‘age’], dtype=‘object’)

创建一个3行4列0-50的数组

arr2 = np.random.randint(0,50,(3,4))
print(arr2)

生成索引

df1 = pd.DataFrame(arr2)
print(df1)

在这里插入图片描述

我们把索引修改一下

df2 = pd.DataFrame(arr2,index=list('ABC'),columns=list('abcd'))
df2

在这里插入图片描述

我们尝试下生成一个字典

new_dict = {
    'A':1,
    'B':[1,2,3,4],
    'C':'asd123',
    'D':np.arange(4),
    'E':pd.Series(1,index=range(4))
}

pd.DataFrame(new_dict)

在这里插入图片描述
如果是单个数据，直接补充到最多个数，如果是多个数据，注意数据个数必须保持相同

关注