跨媒体检索--有监督方法

最新推荐文章于 2024-11-02 17:24:50 发布

原创最新推荐文章于 2024-11-02 17:24:50 发布 · 964 阅读

·

0

·

CC 4.0 BY-SA版权

版权声明：本文为博主原创文章，遵循 CC 4.0 BY-SA 版权协议，转载请附上原文出处链接和本声明。

本文综述了跨模态检索领域的最新进展，包括对应自动编码器、深度视觉语义对齐、多模态卷积神经网络等技术，探讨了图像描述生成、图像与文本匹配、双注意力网络及深度结构保留嵌入等方法，旨在为该领域研究提供全面视角。

Cross-modal Retrieval with Correspondence Autoencoder

对应自动编码器的跨模态检索

~~Deep Visual-Semantic Alignments for Generating Image Descriptions~~

~~用于生成图像描述的深度视觉语义对齐~~

Multimodal Convolutional Neural Networks for Matching Image and Sentence

多模态卷积神经网络用于图像和句子匹配

~~Order-embeddings of images and language~~

~~图片和语言的顺序嵌入~~

Learning Deep Structure-Preserving Image-Text Embeddings

学习保留深层结构的图像-文本嵌入

Learning Deep Representations of Fine-Grained Visual Descriptions

学习细粒度视觉描述的深度表示

Dual Attention Networks for Multimodal Reasoning and Matching

双注意力网络用于多模式推理和匹配

Linking Image and Text with 2-Way Nets

通过2向网络链接图像和文本

Learning Cross-modal Embeddings for Cooking Recipes and Food Images

学习用于烹饪食谱和食物图像的跨模式嵌入

Person Search with Natural Language Description

具有自然语言描述的人搜索

Learning a Recurrent Residual Fusion Network for Multimodal Matching

学习递归残差融合网络以进行多模式匹配

Song Retrieval via Bridging Image Content and Lyric Words

通过桥接图像内容和歌词来检索歌曲

Improving Visual-Semantic Embeddings with Hard Negatives

用硬否定词改善视觉语义嵌入

Adversarial Cross-Modal Retrieval

对抗式跨模态检索

Learning two-branch neural networks for image-text matching tasks

学习用于图像文本匹配任务的两分支神经网络

Dual-Path Convolutional Image-Text Embedding

双路径卷积图像文本嵌入

~~Learning Semantic Concepts and Order for Image and Sentence Matching~~

~~学习语义概念和图像和句子匹配的顺序~~

Look, Imagine and Match: Improving Textual-Visual Cross-Modal Retrieval with Generative Models

外观，想象和匹配：使用生成模型改进文本视觉跨模态检索

Stacked Cross Attention for Image-Text Matching

堆叠式交叉注意用于图像-文本匹配

Saliency-Guided Attention Network for Image-Sentence Matching

显着性注意力网络用于图像句子匹配

Cross-modal Scene Graph Matching for Relationship-aware

用于关系识别图像-文本检索的跨模式场景图匹配

Su_Deep_Joint-Semantics_Reconstructing_Hashing_for_Large-Scale_Unsupervised_Cross-Modal_Retrieval_ICCV_2019_paper

面向大规模无监督跨模式检索的深度联合语义重构哈希算法

Coupled CycleGAN Unsupervised Hashing Network for Cross-Modal Retrieval

Coupled CycleGAN：用于跨模态检索的无监督哈希网络

Deep Semantic-Alignment Hashing for Unsupervised Cross-Modal Retrieval

用于无监督跨模态检索的深度语义对齐散列

Joint-modal Distribution-based Similarity Hashing for Large-scale Unsupervised Deep Cross-modal Retrieval

大规模无监督深度交叉模式检索的基于联合模式分布的相似性哈希

Multi-Pathway Generative Adversarial Hashing for Unsupervised Cross-Modal Retrieval

无监督跨模态检索的多路径生成对抗性哈希

Multi-Task Consistency-Preserving Adversarial Hashing for Cross-Modal Retrieval

跨模态检索的多任务保持一致性对抗散列

评论

被折叠的条评论为什么被折叠?

到【灌水乐园】发言

查看更多评论

添加红包

成就一亿技术人!

hope_wisdom

发出的红包

实付元

使用余额支付

点击重新获取

扫码支付

钱包余额 0

抵扣说明：

1.余额是钱包充值的虚拟货币，按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载，可以购买VIP、付费专栏及课程。