Caffe使用：如何将一维数据或其他非图像数据转换成lmdb

最新推荐文章于 2024-04-13 12:01:04 发布

夏天7788

最新推荐文章于 2024-04-13 12:01:04 发布

阅读量681

点赞数

分类专栏： caffe

caffe 专栏收录该内容

12 篇文章 0 订阅

订阅专栏

转载地址：http://www.cnblogs.com/dcsds1/p/5205669.html

在caffe的google group里我找到了这个网址：http://deepdish.io/2015/04/28/creating-lmdb-in-Python/

[python]view plaincopy 
   
 import numpy as np  
 import lmdb  
 import caffe  
   
 N = 1000  
   
 # Let's pretend this is interesting data  
 X = np.zeros((N, 3, 32, 32), dtype=np.uint8)  
 y = np.zeros(N, dtype=np.int64)  
   
 # We need to prepare the database for the size. We'll set it 10 times  
 # greater than what we theoretically need. There is little drawback to  
 # setting this too big. If you still run into problem after raising  
 # this, you might want to try saving fewer entries in a single  
 # transaction.  
 map_size = X.nbytes * 10  
   
 env = lmdb.open('mylmdb', map_size=map_size)  
   
 with env.begin(write=True) as txn:  
     # txn is a Transaction object  
     for i in range(N):  
         datum = caffe.proto.caffe_pb2.Datum()  
         datum.channels = X.shape[1]  
         datum.height = X.shape[2]  
         datum.width = X.shape[3]  
         datum.data = X[i].tobytes()  # or .tostring() if numpy < 1.9  
         datum.label = int(y[i])  
         str_id = '{:08}'.format(i)  
   
         # The encode is only essential in Python 3  
         txn.put(str_id.encode('ascii'), datum.SerializeToString())  

这是用python将数据转为lmdb的代码，但是我用这个处理完数据再使用caffe会出现std::bad_alloc错误，后来经过艰苦奋斗，查阅了大量的资料，我发现了问题所在：

1、caffe的数据格式默认为四维(n_samples, n_channels, height, width) .所以必须把我的数据处理成这种格式

2、最后一行txn.put(str_id.encode('ascii'), datum.SerializeToString())一定要加上，我一开始一维python2不用写这个，结果老是出错，后来才发现这行必须写！

3、如果出现mdb_put: MDB_MAP_FULL: Environment mapsize limit reached的错误，是因为lmdb默认的map_size比较小，我把lmdb/cffi.py里面的map_size默认值改了一下，改成了1099511627776（也就是1Tb），我也不知道是不是这么改，然后我又把上面python程序里map_size = X.nbytes 这句改成了map_size = X.nbytes * 10，然后就成功了！

找资料的过程中，我还发现了用python写leveldb的程序，网址在这里：https://github.com/BVLC/caffe/issues/745和http://stackoverflow.com/questions/32707393/whats-caffes-input-format

用python写HDF5的程序在这里：http://stackoverflow.com/questions/31774953/test-labels-for-regression-caffe-float-not-allowed/31808324#31808324

参考：

　　1.http://stackoverflow.com/questions/30983213/how-to-use-1-dim-vector-as-input-for-caffe/30991590#30991590

　　2.关于lmdb的map_size大小的问题：https://github.com/BVLC/caffe/issues/1298和http://stackoverflow.com/questions/31820976/lmdb-increase-map-size

夏天7788

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
Caffe使用：如何将一维数据或其他非图像数据转换成lmdb

转载地址：http://www.cnblogs.com/dcsds1/p/5205669.html在caffe的google group里我找到了这个网址：http://deepdish.io/2015/04/28/creating-lmdb-in-Python/[python] view plain copy
复制链接

扫一扫

专栏目录