最近在做image caption的任务,因为有自己的数据集,因此需要自己创建字典、annotation等信息。而COCO官方针对coco image caption任务的数据集写了一个API接口(pycocotools包),因此如果想偷懒就老老实实按照coco image caption数据集的格式准备数据。否则自己写pycocotools接口,可能会出现各种各样的bug。因此我们应该首先看一下imgae caption的数据格式。
{
"info" : info, "images" : [image], "annotations" : [annotation], "licenses" : [license],
}
info{
"year" : int, "version" : str, "description" : str, "contributor" : str, "url" : str, "date_created" : datetime,
}
image{
"id" : int, "width" : int, "height" : int, "file_name" : str, "license" : int, "flickr_url" : str, "coco_url" : str, "date_captured" : datetime,
}
annotation{
"image_id":int ,"id":int ,"caption":"giraffe eating food from the top of the tree"
}
license{
"id" : int, "name" : str