Understanding Clouds from Satellite Images比赛的discussion调研与colab数据集下载配置

 

colab数据集下载配置代码:

%%time
!pip install -U -q kaggle
!mkdir -p ~/.kaggle
 
!echo '{"username":"pupil1","key":"ae776d041bf94ae1bfa9a3843797ad6d"}' > ~/.kaggle/kaggle.json
 
!chmod 600 ~/.kaggle/kaggle.json
!mkdir -p understanding_cloud_organization
!kaggle competitions download -c understanding_cloud_organization
!mv *.zip understanding_cloud_organization/
!mv *.csv understanding_cloud_organization/
!cd /content/understanding_cloud_organization/;unzip train_images.zip
!cd /content/understanding_cloud_organization;mkdir train_images;mv *.jpg train_images/
!cd /content/understanding_cloud_organization/;unzip train.csv.zip
!cd /content/understanding_cloud_organization/;unzip test_images.zip
!cd /content/understanding_cloud_organization;mkdir test_images;mv *.jpg test_images

 

 

 

 

根据[2]的描述

The remaining area, which has not been covered by two succeeding orbits, is marked black.0

所以图片中如果出现黑色区域,就是两颗卫星都没有扫描到的地方。如下:

 

 

使用pupil1账号视角,凡是变色的都是看过的,实在极其没有意义的不予收录.

 

链接备注
Train with crops, Predict with full images发帖子的人得分不高
How effective is pseudo-labeling?(看完了)半监督

[LB 0.628] simple segmentation approach

threshold is high?

threshold的用法

Overlapping Labels in Train Data?

 

Can a pixel be considered as multiple classes?

(看完了)

根据第一个链接,一个像素可以属于多个类别.

Each image was labeled by several people (2-4), so the labels can overlap. In addition, there was no restriction that the labels from a single labeler cannot overlap. To create the masks for this competition, we simply used the union of all labels for each class. So naturally there will be some overlap.

AdamAccumulate(看完了)提到了AdamAccumulate的版本兼容性问题
Hints for late joiners?(看完了)提到使用steel比赛的方案
Bounding Boxes instead of Segmentation

(看完了)评论中提到:

举办方不鼓励对象检测的方式,但是帖子的作者认为线性的模型比非线性的模型跟容易泛化,所以坚持使用Bounding Boxes(对象检测)的方式

use linknetunet> linknet > fpn
Correct Dice Metric(看完了)讨论误差函数机制
Instance Segmentation->Request for list of past competition参考资料

Information: Bad image list

 

Corrupt and Mislabeled Images

 

Information: Bad image list

一些损坏的数据
Question about the black area in the image有很好的可视化
ResNet34 implementation of Unet works but ResNet 50 and 101 fails?(看完了)改变模型如果爆内存就减少batch_size
Flowers are easy to pick ?介绍了一些树算法
Single model performance最佳单模
A best description of Generating mask from encoded pixel涉及encoded pixel

Adding TTA to the model before optimisation could help

 

Augmentations Strategies for this Competition. TTA?

使用时间强化

Augmentations thred

 

Augmentations released version 0.4.0

图像增强的讨论
Questions about the origin of the data讨论快照功能

More Tricks to Train w/ Bigger Batches (pytorch)

 

Some tricks to train faster (pytorch)

 

A trick to use bigger batches for training: gradient accumulation

讨论训练技巧
Simple Descriptions of Cloud Types / Labeling Process讨论肉眼区分类别
Fast data loading [Experiments] 快速读取数据
Deeper, Stronger, Better?

发现resnet18有效

resnext50_32x4defficientnet-b5无效 

Beware of Pandas value_counts method for validation split指出几个代码的pandas使用有误
Efficient Net B4-B7评论区提到修补小batch_size的办法是使用 gradient accumulation

Improving code quality with utility scripts

 

Utility scripts for Keras users

 

Using High-level frameworks is not learner friendly

代码推销
Object Detection vs Instance Segmentation很多概念
Hybrid convolutional and bidirectional LSTM or RNN使用RNN网络
EfficientNets are now available in pytorch segmentation model repo.没看懂这个是干嘛的,房之后再看
New method to tackle severe label noise处理label噪音的一篇论文
FPN or Unet: Which one is better?提到了FPN以及Unet
Some thoughts on this competitionkernel grandmaster的一些想法
what is the label to be taken for overlapping masks? for example, in the image 0011165.jpg, Fish and Flower masks overlap each other for some region.mask重合
Must read material一些资料

Ideas for merging ensemble's predictions

 

How to effectively ensemble models with Keras

讨论模型融合
Instance Segmentation->How to predict classes讨论UNET的输出怎么改成多分类
What does it mean to use a pretrained resnet encoder with UNET?讨论UNET使用预训练的resnet编码器是什么意思?
Regular image segmentation approach提到进行语义分割任务的都有两个数据集
Discussing post processing讨论后处理
Weakly supervised segmentation弱监督分割
Must-see Kernels and topics - Understanding Clouds from Satellite Images对于资料的自行总结
RLE Decode in C++提到了RLE技术
Hints from a late joiner's persepctive提到了后处理
Impact of using classier for removing the masks考虑去掉mask编码
A Late Joiner's Understanding and Notes需要细看
LPT: See what's going on with that commit ?介绍了一个有用的训练的可视化工具
Knock Knock can send you email notification (or slack notification)一个工具用来提醒你训练结束的时候发信息到你邮件通知你

 

一些统计数据来自[1]:

Useful Stats::

no. of empty mask = 7055
no. of non-empty mask = 7737
no. of non-empty mask for Fish = 1864
no. of non-empty mask for Flower = 1509
no. of non-empty mask for Gravel = 1982
no. of non-empty mask for Sugar = 2382

 

Reference:

[1]Public TestSet Distribution via LB probing

[2]https://www.kaggle.com/c/understanding_cloud_organization/data

  • 1
    点赞
  • 1
    收藏
    觉得还不错? 一键收藏
  • 0
    评论

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值