计算机视觉中的深度学习
Deep learning has achieved great success in computer vision since AlexNet was proposed in 2012. This success is mainly related to two factors: a well-designed deep learning model, and a large-scale annotated data set to train the model.
自从AlexNet于2012年提出以来,深度学习在计算机视觉方面就取得了巨大的成功。这一成功主要与两个因素有关:设计良好的深度学习模型和训练模型的大规模带注释数据集。
Nowadays, deep learning has become a go-to method on computer vision projects. Solving a supervised learning problem in computer vision such as classification, detection, and segmentation commonly takes two steps:
如今,深度学习已成为计算机视觉项目的首选方法。 解决计算机视觉中的监督学习问题,例如分类,检测和分段通常需要两个步骤:
- choosing and downloading a pretrained model which is suitable for the problem 选择并下载适合该问题的预训练模型
- retraining the model using customized annotated data by applying transfer learning 通过应用转移学习使用定制的带注释数据重新训练模型
Many pretrained models are available to download from the internet. The second step — retraining the model using the customized annotated dataset — is therefore the main issue.
许多预训练的模型可以从互联网上下载。 因此,第二步(使用定制的带注释的数据集重新训练模型)是主要问题。
Annotating images is a time-consuming task. People normally start from a small dataset and then apply image augmentation to increase the size of the dataset. Image augmentation has been widely used in deep learning of computer vision. It uses traditional image processing, such as blurring, adding no