呼吁开放外网
Getting a dataset with images is not easy if you want to use it for a course or a book. Yes, there are many datasets with images, but few of them are suitable for commercial or educational use.
如果您想将其用于课程或书籍,则获取带有图像的数据集并不容易。 是的,有很多带有图像的数据集,但是很少有适合商业或教育用途的数据集。
To solve this issue, I decided to collect a dataset with clothing. All the pictures will be shared under the CC0 license. This means that anyone can use this data for any purpose.
为了解决这个问题,我决定收集衣服数据集。 所有图片将在CC0许可证下共享。 这意味着任何人都可以出于任何目的使用此数据。
For example:
例如:
- Creating a tutorial or a course (free or paid) 创建教程或课程(免费或付费)
- Writing a book 写书
- Kaggle competitions (as an external dataset) Kaggle竞赛(作为外部数据集)
- Training an internal model at any company 在任何公司培训内部模型
I already collected more than 1,000 pictures, but it’s not easy to do alone. I need your help.
我已经收集了1000多张照片,但是要单独完成并不容易。 我需要你的帮助。
我该如何帮忙? (How can I help?)
There are many ways you can help.
您可以通过多种方式提供帮助。
Spread the word about it. Share it on social media, send it to your colleagues and friends.
散布关于它的话 。 在社交媒体上分享,并将其发送给您的同事和朋友。
Upload your pictures. If don’t want to go through your entire wardrobe and take a picture of every item — it’s okay. Even one image is helpful. Perhaps there’s a t-shirt nearby, jeans, or shoes? Take a picture and upload it using this form. See the next section for details on how to take pictures.
上载您的图片。 如果不想翻遍整个衣橱,为每件照片拍照-没关系。 甚至一张图像也是有帮助的。 也许附近有一件T恤,牛仔裤或鞋子? 拍照并使用此表格上传。 有关如何拍照的详细信息,请参见下一部分。
The form works on mobile too!
该表格也可以在移动设备上使用!
Upload many pictures at once. If you have more than a couple of images, using the previous form is not convenient. There are other options:
一次上传许多图片。 如果您有多个图像,则使用前一个表格不方便。 还有其他选择:
- Google Photos. The app can automatically synchronize all your images. Just move the pictures of clothes to a separate album and share the link. Google相簿。 该应用程序可以自动同步所有图像。 只需将衣服图片移到单独的相册中并共享链接即可。
- Dropbox, Google Drive, Yandex Disk, or any similar cloud storage. Upload a folder or a zip archive and share the link. Dropbox,Google云端硬盘,Yandex磁盘或任何类似的云存储。 上载文件夹或zip存档并共享链接。
WeTransfer.com. You can use it to upload files up to 2GB without registering.
WeTransfer.com 。 您可以使用它上传最大2GB的文件而无需注册。
Once you have a link, use another form to submit it:
有了链接后,请使用其他表单提交它:
图片 (Images)
There are the following categories of clothes:
有以下几类衣服:
- T-shirts T恤衫
- Long sleeves, sweaters, hoodies 长袖,毛衣,连帽衫
- Shirts 上衣
- Jeans, pants, shorts 牛仔裤,裤子,短裤
- Dresses, skirts 连衣裙,裙子
- Shoes 鞋类
- Jackets, coats 外套,大衣
- Hats 礼帽
- Clothes for kids 孩子们的衣服
To make a picture, put the item on a floor or a bed:
要拍照,请将物品放在地板或床上:
Pictures of hanging clothes are fine, but make sure the item is visible:
可以挂衣服的图片很好,但是请确保物品可见:
The item shouldn’t be crumpled or packed:
该物品不应该被弄皱或包装:
The background should be contrasting enough to see the item:
背景的对比度应足以看到该项目:
An image should contain only one item:
图像应仅包含一项:
And there should be no people:
而且应该没有人:
If you’re not sure about something, just share it, and I’ll figure it out.
如果您不确定某件事,请分享一下,我会解决的。
我怎么知道什么时候数据准备好了? (How can I know when the data is ready?)
When I collect enough pictures, I’ll annotate them and upload the result to Kaggle. If you provide your email when sharing images, I’ll inform you when it happens.
当我收集到足够的图片时,将对其进行批注并将结果上传到Kaggle。 如果您在共享图像时提供了电子邮件,则在发生这种情况时会通知您。
I will also post in other places:
我还将在其他地方发布:
Data Science Insider on Medium
My Twitter account: @Al_Grigor
我的Twitter帐户: @Al_Grigor
My LinkedIn account: agrigorev
我的LinkedIn帐户: agrigorev
The #datasets channel in ods.ai
在#datasets通道ods.ai
The /r/datasets/ subreddit
/ r / datasets / subreddit
I’d like to collect 10,000 images and I need your help!
我想收集10,000张图片,需要您的帮助!
Upload a few images right now: https://airtable.com/shr7Go5VUAGKRx2sW
立即上传一些图片: https : //airtable.com/shr7Go5VUAGKRx2sW
Batch-upload more images later: https://airtable.com/shrJHj9bxUuQQaWNR
稍后批量上传更多图像: https : //airtable.com/shrJHj9bxUuQQaWNR
翻译自: https://medium.com/data-science-insider/clothing-dataset-call-for-action-3cad023246c1
呼吁开放外网