Paper Reading - Long-term Recurrent Convolutional Networks for Visual Recognition and Description ( ...

最新推荐文章于 2024-06-14 20:43:21 发布

dichunpu6524

最新推荐文章于 2024-06-14 20:43:21 发布

阅读量140

点赞数

文章标签：人工智能

原文链接：http://www.cnblogs.com/zlian2016/p/9467093.html

版权

Link of the Paper: https://arxiv.org/abs/1411.4389

Main Points:

A novel Recurrent Convolutional Architecture ( CNN + LSTM ): both Spatially and Temporally Deep.
The recurrent long-term models are directly connected to modern visual convnet models and can be jointly trained to simultaneously learn temporal dynamics and convolutional perceptual representations.

Other Key Points:

A significant limitation of simple RNN models which strictly integrate state information over time is known as the "vanishing gradient" effect: the ability to backpropogate an error signal through a long-range temporal interval becomes increasingly impossible in practice.
The authors show LSTM-type models provide for improved recognition on conventional video activity challenges and enable a novel end-to-end optimizable mapping from image pixels to sentence-level natural language descriptions.

posted on 2018-08-13 11:31 LZ_Jaja 阅读( ...) 评论( ...) 编辑收藏

转载于:https://www.cnblogs.com/zlian2016/p/9467093.html

确定要放弃本次机会？

福利倒计时

: :

立减 ¥

普通VIP年卡可用

关注关注