ICCV Tutorial on Sparse Coding and Dictionary Learning for Image Analysis



ICCV Tutorial on Sparse Coding and Dictionary Learning for Image Analysis

General Information

Organizers

Time

September 28th, Morning

Duration

3 hours

Short Description

Sparse modeling calls for constructing efficient representations of data as a (often linear) combination of a few typical patterns (atoms) learned from the data itself. Significant contributions to the theory and practice of learning such collections of atoms (usually called dictionaries or codebooks), and of representing the actual data in terms of them, leading to state-of-the-art results in many signal and image processing and analysis tasks. The first critical component of this topic is how to sparsely encode a signal given the dictionary. After introducing the topic, the tutorial will describe the state-of-the-art approaches in this area, ranging from greedy algorithms to l1-optimization all the way to simultaneous sparse coding of collections of signals.

The actual dictionary plays a critical role, and it has been shown once and again that learned dictionaries significantly outperforms off-the-shelf ones such as wavelets. The second part of this tutorial will present efficient optimization methods for learning dictionaries adapted for a reconstruction task, and image processing applications where it leads to state-of-the-art results such as image denoising, inpainting or demosaicking.

The third part of the tutorial will discuss numerous applications where the dictionary is not only adapted to reconstruct the data, but also learned for a specific task, such as classification, edge detection and compressed sensing. The last part presents recent new sparse models that go beyond classical sparse regularization. The tutorial concludes with the discussion of other frameworks closely related to sparse signal modeling and dictionary learning, as well as with a description of important open problems.

Preliminary Syllabus

  • Introduction: What sparse coding and dictionary learning are about and why one should care?
  • Part I:Optimization techniques for sparse coding.
    • Greedy algorithms
    • Lasso and LARS
    • Soft-thresholding based optimization
  • Part II: Dictionary learning for reconstruction.
    • Efficient optimization
    • Image denoising
    • Image inpainting and demosaicking
  • Part III: Learning for the task
    • Learning discriminative dictionaries
    • Patch classification, edge detection/classification
    • Learning for the sensing pipeline
  • Part IV: New sparse models
    • Grouped sparsity
    • Structured sparsity
  • Open questions and discussion

Course Material and Software

A Matlab toolbox for sparse decomposition and dictionary learning is available here.

The slides are now available:

Relevant References

Among the vast literature on sparse coding, here are a few selected publications. The list is preliminary and subject to modifications:

  • B. Efron, T. Hastie, I. Johnstone, and R. Tibshirani. Least angle regression. Annals of statistics, 32(2):407--499, 2004.
  • J. Friedman, T. Hastie, H. Hölfling, and R. Tibshirani. Pathwise coordinate optimization. Annals of statistics, 1(2):302--332, 2007.
  • B. A. Olshausen and D. J. Field. Sparse coding with an overcomplete basis set: A strategy employed by V1? Vision Research, 37:3311--3325, 1997.
  • M. Elad and M. Aharon. Image denoising via sparse and redundant representations over learned dictionaries.IEEE Transactions on Image Processing, 54(12):3736--3745, December 2006.
  • J. Mairal, F. Bach, J. Ponce, and G. Sapiro. Online dictionary learning for sparse coding. In Proceedings of the International Conference on Machine Learning (ICML), 2009.
  • J. Mairal, M. Elad and G. Sapiro. Sparse representation for color image restoration. IEEE Transactions on Image Processing 17(1):53--69. 2008.
  • J. Mairal, F. Bach, J. Ponce, G. Sapiro and A. Zisserman. Discriminative Learned Dictionaries for Local Image Analysis. IEEE Conference on Computer Vision and Pattern Recognition, 2008.
  • J. M. Duarte-Carvajalino and G. Sapiro. Learning to sense sparse signals: Simultaneous sensing matrix and sparsifying dictionary optimization. IEEE Trans. Image Processing, 2009, to appear.
  • B. A. Turlach, W. N. Venables, and S. J. Wright. Simultaneous variable selection. Technometrics, 47(3):349--363, 2005.
  • J. A. Tropp, A. C. Gilbert, and M. J. Strauss. Algorithms for simultaneous sparse approximation. Part I: Greedy pursuit Signal Processing, special issue "Sparse approximations in signal and image processing," 86:572--588, 2006.

Biographies

Francis Bach is a researcher in the Willow INRIA project-team, in the Computer Science Department of the Ecole Normale Supérieure, Paris, France. He graduated from the Ecole Polytechnique, Palaiseau, France, in 1997, and earned his PhD in 2005 from the Computer Science division at the University of California, Berkeley. His research interests include machine learning, statistics, optimization, graphical models, kernel methods, sparse methods and statistical signal processing.

Julien Mairal received the graduate degree from Ecole Polytechnique and Ecole Nationale Supérieure des Téléecommunications, Paris, and the MS degree from the Ecole Normale Supérieure, Cachan. He is currently pursuing the Ph.D. degree under the supervision of Jean Ponce and Francis Bach at Ecole Normale Supérieure, Paris. His research interests include machine learning, computer vision and image processing.

Jean Ponce is a computer science professor at Ecole Normale Supérieure (ENS) in Paris, France, where he heads the joint ENS/INRIA/CNRS oject-team WILLOW. Before joining ENS, he spent most of his career in the US, with positions at MIT, Stanford, and the University of Illinois at Urbana-Champaign, where he was a full professor until 2005. Jean Ponce is the author of over 120 technical publications in computer vision and robotics, including the textbook ``Computer Vision: A Modern Approach'', which has been translated in Chinese, Japanese, and Russian. He is an IEEE Fellow, served as editor-in-chief for the International Journal of Computer Vision from 2003 to 2008, and chaired the IEEE Conference on Computer Vision and Pattern Recognition in 1997 and 2000, and the European Conference on Computer Vision in 2008.

Guillermo Sapiro was born in Montevideo, Uruguay, on April 3, 1966. He received his B.Sc. (summa cum laude), M.Sc., and Ph.D. from the Department of Electrical Engineering at the Technion, Israel Institute of Technology, in 1989, 1991, and 1993 respectively. After post-doctoral research at MIT, Dr. Sapiro became Member of Technical Staff at the research facilities of HP Labs in Palo Alto, California. He is currently with the Department of Electrical and Computer Engineering at the University of Minnesota, where he holds the position of Distinguished McKnight University Professor and Vincentine Hermes-Luh Chair in Electrical and Computer Engineering. He works on differential geometry and geometric partial differential equations, both in theory and applications in computer vision, computer graphics, medical imaging, and image analysis. He recently co-edited a special issue of IEEE Image Processing in this topic and a second one in the Journal of Visual Communication and Image Representation. He has authored and co-authored numerous papers in this area and has written a book published by Cambridge University Press, January 2001. He was awarded the Gutwirth Scholarship for Special Excellence in Graduate Studies in 1991, the Ollendorff Fellowship for Excellence in Vision and Image Understanding Work in 1992, the Rothschild Fellowship for Post-Doctoral Studies in 1993, the Office of Naval Research Young Investigator Award in 1998, the Presidential Early Career Awards for Scientist and Engineers (PECASE) in 1998, and the National Science Foundation Career Award in 1999. He is the funding Editor-in-Chief of the SIAM Journal on Imaging Sciences.


http://lear.inrialpes.fr/people/mairal/tutorial_iccv09/

Sponsors

  • 0
    点赞
  • 1
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值