pytorch 人脸修复
黑客数据科学工作流程(Hacking data science workflows)
I came across an interesting problem recently. A teammate and I were working on a series of Deep Learning experiments that involved an image dataset that spanned hundreds of gigabytes. Me, being the indecisive goof I am, wanted to understand whether the data was suited to a plethora of classification tasks, all spanning different configurations of the dataset.
我最近遇到了一个有趣的问题。 我和一个队友正在进行一系列深度学习实验,这些实验涉及跨越数百GB的图像数据集。 作为优柔寡断的我,我想了解数据是否适合于过多的分类任务,这些任务都跨越了数据集的不同配置。
This led us down a PyTorch DataLoader shaped rabbit hole for hours, before we nearly gave up in frustration. Thankfully though, the only thing more frustrating than writing scaffolding code is waiting for a virtual machine to finish copying files hordes across arbitrary directories.
在我们几乎放弃沮丧之前,这导致我们在一个PyTorch DataLoader形兔子Kong上钻了几个小时。 值得庆幸的是,唯一比编写脚手架代码更令人沮丧的是,等待虚拟机完成跨任意目录复制成群的文件。
Fortunately, we soon stumbled upon a solution and decided that it was time to give the DataLoader class a facelift. We took our messy scaffolding code, cleaned it up, and added the ability to not only dynamically label training data, but also specify subsets, perform cus