课件网址cs231n.stanford.edu/slides/2022/
- Learning objectives
Formalize computer vision applications into tasks
-Formalize inputs and outputs for vision-related problems
-Understand what data and computational requirements you need to train a model
Develop and train vision models
-Learn to code, debug, and train convolutional neural networks.
-Learn how to use software frameworks like PyTorch and TensorFlow
Gain an understanding of where the field is and where it is headed
-What new research has come out in the last 0-5 years?
-What are open research challenges?
-What ethical and societal considerations should we consider before deployment?
- Why should you take this class?
Become a vision researcher (an incomplete list of conferences)
-
Get involved with
vision research at Stanford
: apply
using this form
.
-
CVPR 2022 conference
-
ICCV 2021 conference
Become a vision engineer in industry (an incomplete list of industry teams)
-
Perception team at Google AI
,
Vision at Google Cloud
-
Vision at Meta AI
-
Vision at Amazon AWS
-
Nvidia, Tesla, Apple, Salesforce, ……
General interest(用爱发电)
- 人工智能各学科之间的关系
- ImageNet历年获奖的网络及表现
- Deep Learning Explosion
- CV的应用(选了几个感觉比较重要的)
- Computer Vision Can Cause Harm
下方左图把黑人识别成了Gorillas大猩猩,这显然是会导致种族歧视的。
- Image Classification: A core task in Computer Vision
There are many visual recognition problems that are related to image classification, such as
object detection
,
image captioning
,
image segmentation
,
visual question answering
,
visual
instruction navigation, video understanding, etc
(不明觉厉)
- 推荐资源
Deep Learning
-
by Goodfellow, Bengio, and Courville
-
Here is a
free version
Mathematics of deep learning
-Chapters 5, 6 7 are useful to understand vector calculus and continuous optimization
-
Free online version
Dive into deep learning
-An interactive deep learning book with code, math, and discussions, based on the NumPy
interface.
-
Free online version