Ross B. Girshick
Research Scientist
Facebook AI Research (FAIR)
Researcher
Microsoft Research, Redmond
Postdoctoral fellow
University of California, Berkeley, EECS
r......@eecs.berkeley.edu
cv / google scholar / Ph.D. thesis
papers: arXiv / journal / conference
MSR website
Research Scientist
Facebook AI Research (FAIR)
r......@eecs.berkeley.edu
cv / google scholar / Ph.D. thesis
papers: arXiv / journal / conference
MSR website
from: http://www.cs.berkeley.edu/~rbg/
About me
I finished my Ph.D. in computer vision at
The University of Chicago under the supervision of
Pedro Felzenszwalb in April 2012. Then, I spent two unbelievably wonderful years as a postdoc at
UC Berkeleyunder
Jitendra Malik. From Berkeley, I spent just over one year as a Researcher at Microsoft Research, Redmond. Now, I'm off on a new adventure as a Research Scientist with the terrific group of researchers and engineers in Facebook AI Research (FAIR).
My main research interests are in computer vision, AI, and machine learning. I'm particularly focused on building models for object detection and recognition. These models aim to incorporate the "right" biases so that machine learning algorithms can understand image content from moderate to large-scale datasets. I always have an eye towards fast systems that work well in practice.
During my Ph.D., I spent time as a research intern at Microsoft Research Cambridge, UK working on human pose estimation from (Kinect) depth images. I also participated in several first-place entries into the PASCAL VOC object detection challenge, and was awarded a "lifetime achievement" prize for my work on deformable part models. I think this refers to the lifetime of the PASCAL challenge—and not mine!
My main research interests are in computer vision, AI, and machine learning. I'm particularly focused on building models for object detection and recognition. These models aim to incorporate the "right" biases so that machine learning algorithms can understand image content from moderate to large-scale datasets. I always have an eye towards fast systems that work well in practice.
During my Ph.D., I spent time as a research intern at Microsoft Research Cambridge, UK working on human pose estimation from (Kinect) depth images. I also participated in several first-place entries into the PASCAL VOC object detection challenge, and was awarded a "lifetime achievement" prize for my work on deformable part models. I think this refers to the lifetime of the PASCAL challenge—and not mine!
News
-
Sean Bell led our team, together with Kavita Bala and Larry Zitnick, to 3rd place in the 2015 MS COCO object detection challenge! Sean also won the prize for the best student-led entry. Check out our tech report describing the ION (inside-outside network) detector:
-
Faster R-CNN: paper / Python code / Matlab code
-
Fast R-CNN: paper / code
Project pages
- Faster R-CNN github repository (Python source code for training and using Faster R-CNN)
- Fast R-CNN github repository (Python source code for training and using Fast R-CNN)
- R-CNN github repository (Matlab source code for training and using R-CNN)
- <a href="http://www.eecs.berkeley.edu/Research/Projects/CS/vision/shape/sds/" <="" a="" style="margin: 0pt; padding: 0pt; border: 0pt none; font-family: inherit; font-style: inherit; font-weight: inherit; outline: invert none 0pt; vertical-align: baseline; color: rgb(23, 114, 208); text-decoration: none;">SDS: Simultaneous Detection and Segmentation
- R-CNN (and more!) for RGB-D data
- LSDA: Large Scale Detection through Adaptation
- Deformable Part Models (DPM) (voc-release5 — dissertation code / includes cascade and NIPS grammar model)
- Cascade object detection with deformable part models (add-on code for voc-release4.01)
arXiv tech reports
![Inside-Outside Net: Detecting Objects in Context with Skip Pooling and Recurrent Neural Networks](http://www.cs.berkeley.edu/~rbg/images/ion.jpg)
Inside-Outside Net: Detecting Objects in Context with Skip Pooling and Recurrent Neural Networks
Sean Bell, C. Lawrence Zitnick, Kavita Bala, Ross Girshick
arXiv [cs.CV]
abstract / bibtex
Sean Bell, C. Lawrence Zitnick, Kavita Bala, Ross Girshick
arXiv [cs.CV]
abstract / bibtex
![Object Detection Networks on Convolutional Feature Maps](http://www.cs.berkeley.edu/~rbg/images/noc.png)
Object Detection Networks on Convolutional Feature Maps
Shaoqing Ren, Kaiming He, Ross Girshick, Xiangyu Zhang, Jian Sun
arXiv [cs.CV]
bibtex
Shaoqing Ren, Kaiming He, Ross Girshick, Xiangyu Zhang, Jian Sun
arXiv [cs.CV]
bibtex
Journal papers
![Region-based Convolutional Networks for Accurate Object Detection and Segmentation](http://www.cs.berkeley.edu/~rbg/images/rcnn_pami.png)
Region-based Convolutional Networks for Accurate Object Detection and Semantic Segmentation
R. Girshick, J. Donahue, T. Darrell, J. Malik
IEEE Transactions on Pattern Analysis and Machine Intelligence (accepted May 18, 2015)
abstract / code / CVPR'14 version
R. Girshick, J. Donahue, T. Darrell, J. Malik
IEEE Transactions on Pattern Analysis and Machine Intelligence (accepted May 18, 2015)
abstract / code / CVPR'14 version
![Indoor Scene Understanding with RGB-D Images: Bottom-up Segmentation, Object Detection and Semantic Segmentation](http://www.cs.berkeley.edu/~rbg/images/ijcv-depth.png)
Indoor Scene Understanding with RGB-D Images: Bottom-up Segmentation, Object Detection and Semantic Segmentation
Saurabh Gupta, Pablo Arbeláez, Ross Girshick, Jitendra Malik
International Journal of Computer Vision (IJCV), 2014
code / bibtex
Saurabh Gupta, Pablo Arbeláez, Ross Girshick, Jitendra Malik
International Journal of Computer Vision (IJCV), 2014
code / bibtex
![Kinect pose estimation, PAMI 2013](http://www.cs.berkeley.edu/~rbg/images/kinect-pami.png)
Efficient Human Pose Estimation from Single Depth Images
J. Shotton, R. Girshick, A. Fitzgibbon, T. Sharp, M. Cook, M. Finocchio, R. Moore, P. Kohli, A. Criminisi, A. Kipman, A. Blake
IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 35, No. 12, Dec. 2013
abstract / bibtex
J. Shotton, R. Girshick, A. Fitzgibbon, T. Sharp, M. Cook, M. Finocchio, R. Moore, P. Kohli, A. Criminisi, A. Kipman, A. Blake
IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 35, No. 12, Dec. 2013
abstract / bibtex
![DPM, PAMI 2010](http://www.cs.berkeley.edu/~rbg/images/dpm.png)
Object Detection with Discriminatively Trained Part Based Models
†
P. Felzenszwalb, R. Girshick, D. McAllester, D. Ramanan
IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 32, No. 9, Sep. 2010
abstract / PAMI code / latest code (voc-release5) / bibtex
P. Felzenszwalb, R. Girshick, D. McAllester, D. Ramanan
Communications of the ACM, no. 9 (2013): 97-105
P. Felzenszwalb, R. Girshick, D. McAllester, D. Ramanan
IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 32, No. 9, Sep. 2010
abstract / PAMI code / latest code (voc-release5) / bibtex
See also, CACM Research Highlight:
Visual Object Detection with Deformable Part Models
P. Felzenszwalb, R. Girshick, D. McAllester, D. Ramanan
Communications of the ACM, no. 9 (2013): 97-105
Conference papers
2015
![Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks](http://www.cs.berkeley.edu/~rbg/images/faster_rcnn.png)
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
Shaoqing Ren, Kaiming He, Ross Girshick, Jian Sun
Neural Information Processing Systems (NIPS), 2015
Python code / Matlab code / bibtex
Shaoqing Ren, Kaiming He, Ross Girshick, Jian Sun
Neural Information Processing Systems (NIPS), 2015
Python code / Matlab code / bibtex
![Fast R-CNN](http://www.cs.berkeley.edu/~rbg/images/fast_rcnn.png)
Fast R-CNN
Ross Girshick
IEEE International Conference on Computer Vision (ICCV), 2015
oral presentation
code / slides / bibtex
Ross Girshick
IEEE International Conference on Computer Vision (ICCV), 2015
oral presentation
code / slides / bibtex
![Contextual Action Recognition with R*CNN](http://www.cs.berkeley.edu/~rbg/images/rstar.png)
Contextual Action Recognition with R*CNN
Georgia Gkioxari, Ross Girshick, Jitendra Malik
IEEE International Conference on Computer Vision (ICCV), 2015
code / bibtex
Georgia Gkioxari, Ross Girshick, Jitendra Malik
IEEE International Conference on Computer Vision (ICCV), 2015
code / bibtex
![Actions and Attributes from Wholes and Parts](http://www.cs.berkeley.edu/~rbg/images/wholesandparts.png)
Actions and Attributes from Wholes and Parts
Georgia Gkioxari, Ross Girshick, Jitendra Malik
IEEE International Conference on Computer Vision (ICCV), 2015
bibtex
Georgia Gkioxari, Ross Girshick, Jitendra Malik
IEEE International Conference on Computer Vision (ICCV), 2015
bibtex
![Hypercolumns for object segmentation and fine-grained localization](http://www.cs.berkeley.edu/~rbg/papers/cvpr15/hypercolumn.png)
Hypercolumns for object segmentation and fine-grained localization
Bharath Hariharan, Pablo Arbeláez, Ross Girshick, Jitendra Malik
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015
oral presentation
bibtex
Bharath Hariharan, Pablo Arbeláez, Ross Girshick, Jitendra Malik
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015
oral presentation
bibtex
![Aligning 3D models to RGB-D images of cluttered scenes](http://www.cs.berkeley.edu/~rbg/papers/cvpr15/align2rgbd.png)
Aligning 3D models to RGB-D images of cluttered scenes
Saurabh Gupta, Pablo Arbeláez, Ross Girshick, Jitendra Malik
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015
bibtex
Saurabh Gupta, Pablo Arbeláez, Ross Girshick, Jitendra Malik
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015
bibtex
![Deformable Part Models are Convolutional Neural Networks](https://i-blog.csdnimg.cn/blog_migrate/61e4de17d26bf540d9f8feb65693b5ba.png)
Deformable Part Models are Convolutional Neural Networks
Ross Girshick, Forrest Iandola, Trevor Darrell, Jitendra Malik
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015
bibtex
Ross Girshick, Forrest Iandola, Trevor Darrell, Jitendra Malik
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015
bibtex
2014
![LSDA: Large Scale Detection through Adaptation](http://www.cs.berkeley.edu/~rbg/images/lsda.png)
LSDA: Large Scale Detection through Adaptation
Judy Hoffman, Sergio Guadarrama, Eric Tzeng, Ronghang Hu, Jeff Donahue, Ross Girshick, Trevor Darrell, Kate Saenko
Neural Information Processing Systems (NIPS), 2014
project, code, models / bibtex
Judy Hoffman, Sergio Guadarrama, Eric Tzeng, Ronghang Hu, Jeff Donahue, Ross Girshick, Trevor Darrell, Kate Saenko
Neural Information Processing Systems (NIPS), 2014
project, code, models / bibtex
![Simultaneous Detection and Segmentation, ECCV 2014](http://www.cs.berkeley.edu/~rbg/images/sds.png)
Simultaneous Detection and Segmentation
Bharath Hariharan, Pablo Arbeláez, Ross Girshick, Jitendra Malik
European Conference on Computer Vision (ECCV), 2014
project page (with code) / bibtex
Bharath Hariharan, Pablo Arbeláez, Ross Girshick, Jitendra Malik
European Conference on Computer Vision (ECCV), 2014
project page (with code) / bibtex
![Learning Rich Features from RGB-D Images for Object Detection and Segmentation, ECCV 2014](http://www.cs.berkeley.edu/~rbg/images/toilet.png)
Learning Rich Features from RGB-D Images for Object Detection and Segmentation
Saurabh Gupta, Ross Girshick, Pablo Arbeláez, Jitendra Malik
European Conference on Computer Vision (ECCV), 2014
code, models, and data / bibtex
Saurabh Gupta, Ross Girshick, Pablo Arbeláez, Jitendra Malik
European Conference on Computer Vision (ECCV), 2014
code, models, and data / bibtex
![Analyzing the Performance of Multilayer Neural Networks for Object Recognition, ECCV 2014](https://i-blog.csdnimg.cn/blog_migrate/11db409486d6bbaba1b34bb9ad654df4.png)
Analyzing the Performance of Multilayer Neural Networks for Object Recognition
Pulkit Agrawal, Ross Girshick, Jitendra Malik
European Conference on Computer Vision (ECCV), 2014
bibtex
Pulkit Agrawal, Ross Girshick, Jitendra Malik
European Conference on Computer Vision (ECCV), 2014
bibtex
![Part-based R-CNNs for Fine-grained Category Detection, ECCV 2014](http://www.cs.berkeley.edu/~rbg/images/part-rcnn.png)
Part-based R-CNNs for Fine-grained Category Detection
Ning Zhang, Jeff Donahue, Ross Girshick, Trevor Darrell
European Conference on Computer Vision (ECCV), 2014
oral presentation
bibtex
Ning Zhang, Jeff Donahue, Ross Girshick, Trevor Darrell
European Conference on Computer Vision (ECCV), 2014
oral presentation
bibtex
![On learning to localize objects with minimal supervision, ICML 2014](https://i-blog.csdnimg.cn/blog_migrate/83d9bf5c8c17974a481f43e23039da92.png)
On Learning to Localize Objects with Minimal Supervision
Hyun Oh Song, Ross Girshick, Stefanie Jegelka, Julien Mairal, Zaid Harchaoui, Trevor Darrell
International Conference on Machine Learning (ICML), 2014
code / bibtex
Hyun Oh Song, Ross Girshick, Stefanie Jegelka, Julien Mairal, Zaid Harchaoui, Trevor Darrell
International Conference on Machine Learning (ICML), 2014
code / bibtex
![Rich feature hierarchies for accurate object detection and semantic segmentation, CVPR 2014](http://www.cs.berkeley.edu/~rbg/images/rcnn_pami.png)
Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation
R. Girshick, J. Donahue, T. Darrell, J. Malik
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2014
oral presentation
arXiv tech report (includes ImageNet results) / supplement / code / poster / slides / bibtex
R. Girshick, J. Donahue, T. Darrell, J. Malik
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2014
oral presentation
arXiv tech report (includes ImageNet results) / supplement / code / poster / slides / bibtex
![Using k-poselets for Detecting People and Localizing their Keypoints, CVPR 2014](https://i-blog.csdnimg.cn/blog_migrate/5bd4935a5b9553f5b018c8711d8f47fd.png)
Using k-poselets for Detecting People and Localizing their Keypoints
G. Gkioxari*, B. Hariharan*, R. Girshick, J. Malik
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2014
* equal contribution
project page / code / github / bibtex
G. Gkioxari*, B. Hariharan*, R. Girshick, J. Malik
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2014
* equal contribution
project page / code / github / bibtex
![Understanding Objects in Detail with Fine-grained Attributes, CVPR 2014](http://www.cs.berkeley.edu/~rbg/images/oid.png)
Understanding Objects in Detail with Fine-grained Attributes
A. Vedaldi, S. Mahendran, S. Tsogkas, S. Maji, R. Girshick, J. Kannala, E. Rahtu, I. Kokkinos, M. B. Blaschko, D. Weiss, B. Taskar, K. Simonyan, N. Saphra, S. Mohamed
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2014
dataset / bibtex
A. Vedaldi, S. Mahendran, S. Tsogkas, S. Maji, R. Girshick, J. Kannala, E. Rahtu, I. Kokkinos, M. B. Blaschko, D. Weiss, B. Taskar, K. Simonyan, N. Saphra, S. Mohamed
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2014
dataset / bibtex
2013
![Training deformable part models with decorrelated features, ICCV 2013](http://www.cs.berkeley.edu/~rbg/images/dpm-learning.png)
Training Deformable Part Models with Decorrelated Features
R. Girshick, J. Malik
IEEE International Conference on Computer Vision (ICCV), 2013
supplement / LM-LLDA DPM training code / bibtex
R. Girshick, J. Malik
IEEE International Conference on Computer Vision (ICCV), 2013
supplement / LM-LLDA DPM training code / bibtex
![Discriminatively activated sparselets, ICML 2013](http://www.cs.berkeley.edu/~rbg/images/das.png)
Discriminatively Activated Sparselets
R. Girshick*, H. O. Song*, T. Darrell
International Conference on Machine Learning (ICML), 2013
oral presentation
supplement / Caltech-101 demo code / bibtex
R. Girshick*, H. O. Song*, T. Darrell
International Conference on Machine Learning (ICML), 2013
oral presentation
supplement / Caltech-101 demo code / bibtex
2012 and earlier
![Sparselets, ECCV 2012](http://www.cs.berkeley.edu/~rbg/images/sparselets.png)
Sparselet Models for Efficient Multiclass Object Detection
H.O. Song, S. Zickler, T. Althoff, R. Girshick, M. Fritz, C. Geyer, P. Felzenszwalb, T. Darrell
European Conference on Computer Vision (ECCV), 2012
code / bibtex
H.O. Song, S. Zickler, T. Althoff, R. Girshick, M. Fritz, C. Geyer, P. Felzenszwalb, T. Darrell
European Conference on Computer Vision (ECCV), 2012
code / bibtex
![Object detection with grammar models, NIPS 2011](https://i-blog.csdnimg.cn/blog_migrate/4fb37c5a053f7eec5be7ea68f7cb58e3.png)
Object Detection with Grammar Models
R. Girshick, P. Felzenszwalb, D. McAllester
Neural Information Processing Systems (NIPS), 2011
spotlight video / code (voc-release5) / bibtex
R. Girshick, P. Felzenszwalb, D. McAllester
Neural Information Processing Systems (NIPS), 2011
spotlight video / code (voc-release5) / bibtex
![Efficient regression of general-activity human poses from depth images, ICCV 2011](http://www.cs.berkeley.edu/~rbg/images/offset-regression.png)
Efficient Regression of General-Activity Human Poses from Depth Images
R. Girshick, J. Shotton, P. Kohli, A. Criminisi, A. Fitzgibbon
IEEE International Conference on Computer Vision (ICCV), 2011
supplement / video / bibtex
R. Girshick, J. Shotton, P. Kohli, A. Criminisi, A. Fitzgibbon
IEEE International Conference on Computer Vision (ICCV), 2011
supplement / video / bibtex
![Cascade object detection with deformable part models, CVPR 2010](http://www.cs.berkeley.edu/~rbg/images/cascade.png)
Cascade Object Detection with Deformable Part Models
†
P. Felzenszwalb, R. Girshick, D. McAllester
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2010
oral presentation
slides (pdf) / slides (keynote) / talk / code (voc-release5) / bibtex
P. Felzenszwalb, R. Girshick, D. McAllester
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2010
oral presentation
slides (pdf) / slides (keynote) / talk / code (voc-release5) / bibtex
![Visibility constraints on features of 3D objects, CVPR 2009](http://www.cs.berkeley.edu/~rbg/images/visibility.png)
Visibility Constraints on Features of 3D Objects
†
R. Basri, P. Felzenszwalb, R. Girshick, D. Jacobs, C. Klivans
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2009
bibtex
R. Basri, P. Felzenszwalb, R. Girshick, D. Jacobs, C. Klivans
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2009
bibtex
![Simulating chinese brush painting, SIGGRAPH poster 2004](https://i-blog.csdnimg.cn/blog_migrate/564989fc7775f5237974dcc61640df8f.png)
Simulating Chinese Brush Painting: the Parametric Hairy Brush
R. Girshick
ACM SIGGRAPH Posters, 2004
Session: Nonphotorealistic Animation and Rendering
bibtex
R. Girshick
ACM SIGGRAPH Posters, 2004
Session: Nonphotorealistic Animation and Rendering
bibtex
†Authors listed alphabetically
Ph.D. dissertation
![My dissertation](http://www.cs.berkeley.edu/~rbg/images/dissertation.png)
From Rigid Templates to Grammars: Object Detection with Structured Models
R. Girshick
Ph.D. dissertation, The University of Chicago, Apr. 2012
slides / bibtex
R. Girshick
Ph.D. dissertation, The University of Chicago, Apr. 2012
slides / bibtex
![MS](http://www.cs.berkeley.edu/~rbg/images/ms.png)
Object Detection with Heuristic Coarse-to-Fine Search
R. Girshick
M.S. thesis, The University of Chicago, Dec. 2009
bibtex
R. Girshick
M.S. thesis, The University of Chicago, Dec. 2009
bibtex