tf.nn.ctc_beam_search_decoder

最新推荐文章于 2024-07-26 16:49:46 发布

田野星

最新推荐文章于 2024-07-26 16:49:46 发布

阅读量5.1k

点赞数

分类专栏： tesseract

tesseract 专栏收录该内容

6 篇文章 0 订阅

订阅专栏

这篇博客介绍了TensorFlow中的ctc_beam_search_decoder函数，用于神经网络的连接主义时间分类（CTC）的波束搜索解码。该函数接受logits和序列长度作为输入，返回解码后的输出、最佳路径的类标识以及相应的概率。文章特别指出，当设置特定参数时，可以优化解码速度和结果输出。

摘要由CSDN通过智能技术生成

tf.nn.ctc_beam_search_decoder

tf.nn.ctc_beam_search_decoder(
    inputs,
    sequence_length,
    beam_width=100,
    top_paths=1,
    merge_repeated=True
)

定义于tensorflow/python/ops/ctc_ops.py。

参见指南：神经网络>连接主义时间分类（CTC）

对输入中给出的logits执行波束搜索解码。

注意这ctc_greedy_decoder是 ctc_beam_search_decoderwith top_paths=1和beam_width=1（但解码器在这种特殊情况下更快）的特殊情况。

如果merge_repeated是True，则合并输出光束中的重复类。这意味着如果梁中的连续条目相同，则仅发出第一个条目。也就是说，当顶部路径为时A B B B B，返回值为：

A B如果merge_repeated = True。
A B B B B如果merge_repeated = False。

ARGS：

inputs：3-D float Tensor，尺寸 [max_time x batch_size x num_classes]。The logits.
sequence_length：int32包含序列长度的1-D 载体，具有大小[batch_size]。
beam_width：int标量> = 0（波束搜索波束宽度）。
top_paths：int标量> = 0，<= beam_width（控制输出大小）。
merge_repeated：布尔值。默认值：True。

元组(decoded, log_probabilities)，其中* decoded：长度top_paths的列表，其中，decoded[j] 是SparseTensor包含经解码输出： decoded[j].indices：指数矩阵

(total_decoded_outputs[j] x 2) 中的行存储：[batch, time]。 decoded[j].values：值向量，大小(total_decoded_outputs[j])。向量存储波束j的解码类。 decoded[j].dense_shape：形状矢量，大小(2)。形状值为：[batch_size, max_decoded_length[j]]。

* log_probability：包含序列对数概率的float矩阵(batch_size x top_paths)。

田野星

关注

0
点赞
踩
2

收藏

觉得还不错? 一键收藏
1
评论
tf.nn.ctc_beam_search_decoder

tf.nn.ctc_beam_search_decodertf.nn.ctc_beam_search_decoder( inputs, sequence_length, beam_width=100, top_paths=1, merge_repeated=True)定义于tensorflow/python/ops/ctc_ops.py。参见指南：神经网络&gt;连接...
复制链接

扫一扫