Overview
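This project attempts to reproduce the results of the paper A Neural Conversational Model (aka the Google chatbot).
It uses a recurrent neural network (seq2seq model) for sentence prediction, and is implemented in Python with TensorFlow.
The corpus-loading part of the program is inspired by the Torch neuralconvo project from macournoyer.
DeepQA currently supports the following dialog corpora: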
* Cornell Movie Dialogs corpus (default). Already included when cloning the repository.
* OpenSubtitles (thanks to Eschnou). A much bigger corpus, but also noisier. To use it, follow its installation instructions and pass the flag --corpus opensubs.
* Supreme Court Conversation Data (thanks to julien-c). Available using --corpus scotus. See the instructions for installation.
* Ubuntu Dialogue Corpus (thanks to julien-c). Available using -