tensorflow serving部署Bert预训练模型

最新推荐文章于 2023-01-16 18:46:17 发布

黄然大悟

最新推荐文章于 2023-01-16 18:46:17 发布

阅读量1.7k

点赞数

分类专栏：自然语言处理文章标签： BERT tfserving Bert向量表示

本文链接：https://blog.csdn.net/huanghaocs/article/details/111939507

版权

本文详述了如何使用TensorFlow Serving部署BERT预训练模型，包括将BERT的ckpt模型转换为saved_model格式，通过Docker进行模型部署，以及如何发送HTTP和gRPC请求获取句子向量表示。

摘要由CSDN通过智能技术生成

目前没有整理完善，先留个坑~

Bert模型介绍

BERT的关键技术创新是将Transformers双向训练作为一种流行的注意力模型应用到语言建模中。Masked LM (MLM)在向BERT输入单词序列之前，每个序列中有15%的单词被[MASK]token替换。然后，该模型试图根据序列中其他非MASK词提供的上下文来预测MASK词的原始值。

本文主要记录使用tensorflow serving部署训练好的bert模型，并根据模型获取句子向量表示。

ckpt转saved_model格式

google bert原始预训练模型保存的事ckpt格式，用tfserving部署需要saved_model的pb格式，这里需要一个转化过程。

import json
import os
import tensorflow as tf
import argparse

import modeling

def create_model(bert_config, is_training, input_ids):
    # 通过传入的训练数据，进行representation
    model = modeling.BertModel(config=bert_config, is_training=is_training, input_ids=input_ids)
    output = model.get_pooled_output()
    # output = model.get_sequence_output()

    return output

def transfer_saved_model(args):

    gpu_config = tf.ConfigProto()
    gpu_config.gpu_options.allow_growth = True
    sess = tf.Session(config=gpu_config)

    print("going to restore checkpoint")
    bert_config_file = os.path.join(args.model_path, 'bert_config.json')
    bert_config = modeling.BertConfig.from_json_file(bert_config_file)

    input_ids = tf.placeholder(tf.int32, [None, args.max_seq_len], name="input_ids")
    output = create_