pinkshell_1314-CSDN博客

原创 GQA数据集结构的详细描述

vqa数据集

2023-03-22 14:11:27 731

原创 GQA数据集简介

最终构建好的GQA数据集包含22,669,678个问题和113,018张图片，要想回答这些问题，需要模型具有多种的推理技巧和推理步骤。（数据集中覆盖的词汇量有3,097个，答案类型有1,878个。）

2023-03-17 04:06:35 1696 1

原创 Coarse-to-Fine Reasoning for Visual Question Answering

面向视觉问答的由粗到细推理方法

2023-03-17 02:44:30 386

原创 Greedy Gradient Ensemble for Robust Visual Question Answering

基于贪婪梯度集成的鲁棒视觉问答算法

2023-03-10 03:52:28 258

原创 Florence: A New Foundation Model for Computer Vision

Florence:计算机视觉的一个新的基础模型

2023-01-05 20:58:22 803

原创 LXMERT: Learning Cross-Modality Encoder Representationsfrom Transformers

LXMERT:学习Transformer的跨模态编码器表示

2022-12-19 05:20:26 665

原创 Towards Robust Visual Question Answering: Making the Most of BiasedSamples via Contrastive Learning

走向鲁棒的视觉问题回答: 通过对比学习，最大限度地利用有偏样本

2022-12-02 03:35:22 628

原创 VQA v2.0数据集图像问题答案对

一个完整的VQA v2.0数据集问题答案图像对

2022-11-11 21:18:36 1763 1

原创 Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering

图像字幕和视觉问答中的自下而上和自上而下的注意力机制

2022-11-11 03:56:45 2032 3

原创 Lecture11 检测和分割

Lecture11 检测和分割

2022-09-16 10:18:22 880

原创 Lecture10 循环神经网络(RNN)

Lecture10 循环神经网络(RNN)

2022-09-09 11:09:08 802

原创 Lecture6 训练神经网络(3)

Lecture6 训练神经网络(3)

2022-09-08 09:28:30 204

原创 Lecture6 训练神经网络(2)

Lecture6 训练神经网络(2)

2022-09-02 04:34:24 350

原创 Lecture6 训练神经网络(1)

Lecture6 训练神经网络(1)

2022-09-01 06:34:08 233

原创 Lecture4 神经网络与反向传播(2)

Lecture4 神经网络与反向传播(2)

2022-08-25 03:01:23 371

原创 Lecture4 神经网络与反向传播(1)

Lecture4 神经网络与反向传播(1)

2022-08-21 22:36:48 444

原创 Lecture3 损失函数和优化损失函数

Lecture3 损失函数和优化损失函数

2022-08-19 00:35:56 1285

原创 Lecture2 图像分类数据驱动方法

Lecture2 图像分类数据驱动方法

2022-08-17 21:42:15 294

原创强化学习笔记

强化学习

2022-08-04 16:20:12 477

原创人工智能深度学习环境搭建

python3.9+CUDA11.3+pytorch1.12.0+tensorflow2.6.0+numpy1.19.5+pandas1.2.4+matplotib3.3.2版本对应关系

2022-07-28 20:43:40 1143

pinkshell_1314的博客