Colab code to download and set up the dataset:
%%time
!pip install -U -q kaggle
!mkdir -p ~/.kaggle
!echo '{"username":"pupil1","key":"ae776d041bf94ae1bfa9a3843797ad6d"}' > ~/.kaggle/kaggle.json
!chmod 600 ~/.kaggle/kaggle.json
!mkdir -p understanding_cloud_organization
!kaggle competitions download -c understanding_cloud_organization
!mv *.zip understanding_cloud_organization/
!mv *.csv understanding_cloud_organization/
# Extract each image archive straight into its own folder (unzip -d creates it),
# so re-running the cell does not fail on an existing directory.
!cd /content/understanding_cloud_organization/; unzip -q train_images.zip -d train_images
!cd /content/understanding_cloud_organization/; unzip -q train.csv.zip
!cd /content/understanding_cloud_organization/; unzip -q test_images.zip -d test_images
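After the setup cell finishes, it can be worth confirming that the images actually landed in the two folders. The following is a minimal sketch, not part of the original notes: `count_images` is a hypothetical helper, and the default root path is simply the directory used in the cell above.

```python
from pathlib import Path


def count_images(root="/content/understanding_cloud_organization"):
    """Count the .jpg files in train_images/ and test_images/ under root.

    A folder that does not exist is reported as 0 instead of raising.
    """
    root = Path(root)
    counts = {}
    for sub in ("train_images", "test_images"):
        folder = root / sub
        counts[sub] = len(list(folder.glob("*.jpg"))) if folder.is_dir() else 0
    return counts


print(count_images())
```

A count of 0 for either folder suggests the download or the extraction step did not complete.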
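According to the description in [2]:
The remaining area, which has not been covered by two succeeding orbits, is marked black.
So any black region in an image is an area that neither of the two succeeding satellite orbits covered, as in the example below: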
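To get a feel for how much of an image is uncovered, the black area can be measured directly. The sketch below is an illustration only: `black_region_mask`, `black_fraction`, and the `threshold` parameter are hypothetical names, and it assumes that "black" means every colour channel is at or below the threshold. Because the images are JPEGs, compression may leave the black region slightly above 0, so a small positive threshold may be needed in practice. It runs on a synthetic array so that it does not depend on the dataset being present.

```python
import numpy as np


def black_region_mask(image, threshold=0):
    """Boolean H x W mask, True where all channels are <= threshold."""
    image = np.asarray(image)
    return (image <= threshold).all(axis=-1)


def black_fraction(image, threshold=0):
    """Fraction of pixels that fall inside the black (uncovered) region."""
    return float(black_region_mask(image, threshold).mean())


# Synthetic 4x4 RGB image: the leftmost column is black, the rest is grey.
demo = np.full((4, 4, 3), 200, dtype=np.uint8)
demo[:, :1, :] = 0

print(black_fraction(demo))  # 0.25
```

For a real image, the array could be loaded with any image library and passed in unchanged, as long as its last axis holds the colour channels.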
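Viewed from the pupil1 account: any link that has changed color is one I have already read, and the ones that are truly pointless are not included.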
Some statistics from [1]:
Useful Stats:
no. of empty mask = 7055
no. of non-empty mask = 7737
no. of non-empty mask for Fish = 1864
no. of non-empty mask for Flower = 1509
no. of non-empty mask for Gravel = 1982
no. of non-empty mask for Sugar = 2382
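The figures above are easier to compare as proportions. The sketch below only does arithmetic on the numbers quoted from [1]. It assumes one mask slot per (image, class) pair with four classes, so the number of images is the total mask count divided by 4; that interpretation is an assumption and is not stated in the source.

```python
# Counts copied from the statistics quoted from [1].
empty_total = 7055
non_empty = {"Fish": 1864, "Flower": 1509, "Gravel": 1982, "Sugar": 2382}

non_empty_total = sum(non_empty.values())
total_masks = empty_total + non_empty_total

# Assumption: every image contributes exactly one slot per class.
n_images = total_masks // len(non_empty)

print(f"non-empty masks: {non_empty_total}, all masks: {total_masks}")
print(f"implied number of images: {n_images}")
print(f"overall non-empty share: {non_empty_total / total_masks:.1%}")

for name, count in non_empty.items():
    # Share of images whose mask for this class is non-empty.
    print(f"{name}: {count / n_images:.1%} of images")
```

The per-class counts add up to the reported total of non-empty masks, which is a quick consistency check on the quoted numbers.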
References:
[1] Public TestSet Distribution via LB probing
[2] https://www.kaggle.com/c/understanding_cloud_organization/data