Making training mini-batches

最新推荐文章于 2024-03-24 19:53:47 发布

学海无涯子

最新推荐文章于 2024-03-24 19:53:47 发布

阅读量342

点赞数

本文链接：https://blog.csdn.net/u010552731/article/details/89347729

版权

Making training mini-batches

Here is where we'll make our mini-batches for training. Remember that we want our batches to be multiple sequences of some desired number of sequence steps. Considering a simple example, our batches would look like this:

We have our text encoded as integers as one long array in encoded. Let's create a function that will give us an iterator for our batches. I like using generator functions to do this. Then we can pass encoded into this function and get our batch generator.

The first thing we need to do is discard some of the text so we only have completely full batches. Each batch contains

After that, we need to split arr into

Now that we have this array, we can iterate through it to get our batches. The idea is each batch is a

y[:, :-1], y[:, -1] = x[:, 1:], x[:, 0]

where x is the input batch and y is the target batch.

The way I like to do this window is use range to take steps of size n_steps from

 1 def get_batches(arr, n_seqs, n_steps):
 2     '''Create a generator that returns batches of size
 3        n_seqs x n_steps from arr.
 4        
 5        Arguments
 6        ---------
 7        arr: Array you want to make batches from
 8        n_seqs: Batch size, the number of sequences per batch
 9        n_steps: Number of sequence steps per batch
10     '''
11     # Get the number of characters per batch and number of batches we can make
12     characters_per_batch = n_seqs * n_steps
13     n_batches = len(arr) // characters_per_batch
14     
15     # Keep only enough characters to make full batches
16     arr = arr[:n_batches*characters_per_batch]
17     
18     # Reshape into n_seqs rows
19     arr = arr.reshape((n_seqs, -1))
20     
21     for n in range(0, arr.shape[1], n_steps):
22         # The features
23         x = arr[:, n:n+n_steps]
24         # The targets, shifted by one
25         y = np.zeros_like(x)
26         y[:, :-1], y[:, -1] = x[:, 1:], x[:, 0]
27         yield x, y
28 
29 batches = get_batches(encoded, 10, 50)
30 x, y = next(batches)
31 
32 print('x\n', x[:10, :10])
33 print('\ny\n', y[:10, :10])