不同类别的图像数量可视化

最新推荐文章于 2022-12-01 08:30:00 发布

范德彪陕西分彪

最新推荐文章于 2022-12-01 08:30:00 发布

阅读量295

点赞数

分类专栏： pytorch学习文章标签： pytorch 深度学习 python

本文链接：https://blog.csdn.net/weixin_46815330/article/details/122483077

版权

pytorch学习专栏收录该内容

138 篇文章 12 订阅

订阅专栏

文章目录

鲸鱼数据
可视化chest-xray

鲸鱼数据

https://www.kaggle.com/mmm4a1/seresnext101-pytorch-starter/edit

利用train.csv对其不同类别的图像数量进行可视化。
首先读取一下

train_df = pd.read_csv('../input/train.csv')

看一下数据的样子，我们要用到Id这一列。

train_df.head()

在这里插入图片描述
看一下一共有多少种鲸鱼图像？

NCLASSES = len(train_df['Id'].unique())
print(NCLASSES)

然后使用函数统计一下,这个返回值是series类型也可以和列表一样切片。

train_df['Id'].value_counts()[:10]

new_whale    9664
w_23a388d      73
w_9b5109b      65
w_9c506f6      62
w_0369a5c      61
w_700ebb4      57
w_3de579a      54
w_564a34b      51
w_fd3e556      50
w_88e4537      49
Name: Id, dtype: int64

直接plot即可

train_df['Id'].value_counts()[1:15].plot(kind = 'bar')

在这里插入图片描述

可视化chest-xray

如果没有csv文件怎么办？
首先看一下目录结构
在这里插入图片描述
normal,pneumonia中各存了照片。我们可视化一下两个文件夹中的图像数量的柱状图。

path = '../input/chest-xray-pneumonia/chest_xray/chest_xray'

train_samplesize = pd.DataFrame.from_dict(
    {'Normal': [len([os.path.join(path+'/train/NORMAL', filename) 
                     for filename in os.listdir(path+'/train/NORMAL')])], 
     'Pneumonia': [len([os.path.join(path+'/train/PNEUMONIA', filename) 
                        for filename in os.listdir(path+'/train/PNEUMONIA')])]})

import seaborn as sns

sns.barplot(data=train_samplesize).set_title('Training Set Data Inbalance', fontsize=20)
plt.show()

这样写更方便

path = '../input/chest-xray-pneumonia/chest_xray/chest_xray'

train_samplesize = pd.DataFrame.from_dict(
    {'Normal': [len(os.listdir(path+'/train/NORMAL'))], 
     'Pneumonia': [len(os.listdir(path+'/train/PNEUMONIA'))]})

import seaborn as sns

sns.barplot(data=train_samplesize).set_title('Training Set Data Inbalance', fontsize=20)
plt.show()

在这里插入图片描述

结果是一样滴
在这里插入图片描述

范德彪陕西分彪

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
不同类别的图像数量可视化

文章目录鲸鱼数据可视化chest-xray鲸鱼数据https://www.kaggle.com/mmm4a1/seresnext101-pytorch-starter/edit利用train.csv对其不同类别的图像数量进行可视化。首先读取一下train_df = pd.read_csv('../input/train.csv')看一下数据的样子，我们要用到Id这一列。train_df.head()看一下一共有多少种鲸鱼图像？NCLASSES = len(train_df['Id'
复制链接

扫一扫