python的fread_What is the equivalent of 'fread' from Matlab in Python?

最新推荐文章于 2021-02-15 20:54:01 发布

史愿

最新推荐文章于 2021-02-15 20:54:01 发布

阅读量300

点赞数

文章标签： python的fread

本文链接：https://blog.csdn.net/weixin_32836713/article/details/111911036

版权

I have practically no knowledge of Matlab, and need to translate some parsing routines into Python. They are for large files, that are themselves divided into 'blocks', and I'm having difficulty right from the off with the checksum at the top of the file.

What exactly is going on here in Matlab?

status = fseek(fid, 0, 'cof');

fposition = ftell(fid);

disp(' ');

disp(['** Block ',num2str(iBlock),' File Position = ',int2str(fposition)]);

% ----------------- Block Start ------------------ %

[A, count] = fread(fid, 3, 'uint32');

if(count == 3)

magic_l = A(1);

magic_h = A(2);

block_length = A(3);

else

if(fposition == file_length)

disp(['** End of file OK']);

else

disp(['** Cannot read block start magic ! Note File Length = ',num2str(file_length)]);

end

ok = 0;

break;

end

fid is the file currently being looked at iBlock is a counter for which 'block' you're in within the file

magic_l and magic_h are to do with checksums later, here is the code for that (follows straight from the code above):

disp(sprintf(' Magic_L = %08X, Magic_H = %08X, Length = %i', magic_l, magic_h, block_length));

correct_magic_l = hex2dec('4D445254');

correct_magic_h = hex2dec('43494741');

if(magic_l ~= correct_magic_l | magic_h ~= correct_magic_h)

disp(['** Bad block start magic !']);

ok = 0;

return;

end

remaining_length = block_length - 3*4 - 3*4; % We read Block Header, and we expect a footer

disp(sprintf(' Remaining Block bytes = %i', remaining_length));

What is going on with the %08X and the hex2dec stuff?

Also, why specify 3*4 instead of 12?

Really though, I want to know how to replicate [A, count] = fread(fid, 3, 'uint32'); in Python, as io.readline() is just pulling the first 3 characters of the file. Apologies if I'm missing the point somewhere here. It's just that using io.readline(3) on the file seems to return something it shouldn't, and I don't understand how the block_length can fit in a single byte when it could potentially be very long.

Thanks for reading this ramble. I hope you can understand kind of what I want to know! (Any insight at all is appreciated.)

python

matlab

numpy

readline

fread

edited Feb 8 '13 at 14:15

Matthew Rankin 184k 26 93 135 asked Jan 27 '10 at 10:25

Duncan Tait 544 2 10 21 You might want to think about splitting the question and move the second part into another question, the title is a bit misleading. –

Torsten Marek Jan 27 '10 at 10:45

4 Answers

---Accepted---Accepted---Accepted---

From the documentation of fread, it is a function to read binary data. The second argument specifies the size of the output vector, the third one the size/type of the items read.

In order to recreate this in Python, you can use the array module:

f = open(...)

import array

a = array.array("L") # L is the typecode for uint32

a.fromfile(f, 3)

This will read read three uint32 values from the file f, which are available in a afterwards. From the documentation of fromfile:

Read n items (as machine values) from the file object f and append them to the end of the array. If less than n items are available, EOFError is raised, but the items that were available are still inserted into the array. f must be a real built-in file object; something else with a read() method won’t do.

Arrays implement the sequence protocol and therefore support the same operations as lists, but you can also use the .tolist() method to create a normal list from the array.

answered Jan 27 '10 at 10:44

Torsten Marek 44.1k 16 67 88 Thanks, this really helped - it's all working now! –

Duncan Tait Jan 27 '10 at 12:14

Python Code for Reading a 1-Dimensional Array

When replacing Matlab with Python, I wanted to read binary data into a numpy.array, so I used numpy.fromfile to read the data into a 1-dimensional array:

import numpy as np

with open(inputfilename, 'rb') as fid:

data_array = np.fromfile(fid, np.int16)

Some advantages of using numpy.fromfile versus other Python solutions include:([1; 1], [1 1 1]). How would I accomplish this python matlab numpy

this question edited Sep 10 '14 at 1:58 dbliss 2,474 2 20 40 asked Nov 12 '09 at 12:20 vernomcrp 1,209 5 22 35 | 5 Answers

5 active oldest v

Not having to manually determine the number of items to be read. You can specify them using the count= argument, but it defaults to -1 which indicates reading the entire file.

Being able to specify either an open file object (as I did above with fid) or you can specify a filename. I prefer using an open file object, but if you wanted to use a filename, you could replace the two lines above with: data_array = numpy.fromfile(inputfilename, numpy.int16)

Matlab Code for a 2-Dimensional Array

Matlab's fread has the ability to read the data into a matrix of form [m, n] instead of just reading it into a column vector. For instance, to read data into a matrix with 2 rows use:

fid = fopen(inputfilename, 'r');

data_array = fread(fid, [2, inf], 'int16');

fclose(fid);

Equivalent Python Code for a 2-Dimensional Array

You can handle this scenario in Python using Numpy's shape and transpose.

import numpy as np

with open(inputfilename, 'rb') as fid:

data_array = np.fromfile(fid, np.int16).reshape((-1, 2)).T

The -1 tells numpy.reshape to infer the length of the array for that dimension based on the other dimension—the equivalent of Matlab's inf infinity representation.

The .T transposes the array so that it is a 2-dimensional array with the first dimension—the axis—having a length of 2.

edited Feb 6 '13 at 15:01 answered Feb 6 '13 at 2:36

Matthew Rankin 184k 26 93 135 Great answer, well explained with good clear examples. –

Jack Dec 1 '15 at 16:22

Really though, I want to know how to replicate [A, count] = fread(fid, 3, 'uint32');

In Matlab, one of fread()'s signatures is fread(fileID, sizeA, precision). This reads in the first sizeA elements (not bytes) of a file, each of a size sufficient for precision. In this case, since you're reading in uint32, each element is of size 32 bits, or 4 bytes.

So, instead, try io.readline(12) to get the first 3 4-byte elements from the file.

edited Feb 8 '13 at 14:16

Matthew Rankin 184k 26 93 135 answered Jan 27 '10 at 10:34

John Feminella 188k 30 283 311

The first part is covered by Torsten's answer... you're going to need array or numarray to do anything with this data anyway.

As for the %08X and the hex2dec stuff, %08X is just the print format for those unit32 numbers (8 digit hex, exactly the same as Python), and hex2dec('4D445254') is matlab for 0x4D445254.

Finally, ~= in matlab is a bitwise compare; use == in Python.

answered Jan 27 '10 at 11:37

Andrew McGregor 11.7k 2 20 26

binary file is encoded as a 'double float,' thus read by MATLAB with the following line: fread(fopen(fileName), 'float64'); In Python, I'm not really sure how to replicate this line. I thought using Numpy would be a good place to start, s