CharpYu-CSDN博客

原创 Pytorch nn.CosineEmbeddingLoss() 学习

cosine损失1. 余弦相似度的计算pytorch存在一个计算两个向量的余弦相似度的方法，torch.cosine_similarity输入：(N,D)(N, D)(N,D)和(N,D)(N, D)(N,D)，返回(N)(N)(N)。2. cosine损失的计算Pytorch自带的Loss为：CosineEmbeddingLoss公式：详情见官方文档3.代码实现这里用两种不同的方式实现了cosine loss的功能。import torchimport torch.nn as nn

2021-07-30 15:51:24 15613

原创 Adapter-Bot开源了

标题：《The Adapter-Bot: All-In-One Controllable Conversational Model》作者：香港科技大学时间：2020年8月过去对话系统的问题：have little or no control of the generated responses and miss two important features：（1） continuous on-demand dialogue skills integration：连续性对话技术整合(e.g., em

2021-04-15 17:16:12 257

原创 MultiWOZ 2.4最新版本：通过改良标注提升DST

标题：《MultiWOZ 2.4: A Multi-Domain Task-Oriented Dialogue Dataset with Essential Annotation Corrections to Improve State Tracking Evaluation》作者：伦敦大学时间：2021年4月中文：《MultiWOZ2.4版本，通过改良标注提升DST》内容：作者关注2.1版本的标注中噪声非常多导致各种DST模型在测试集上joint accuracy总是卡在55%以下的问题，决心花大

2021-04-13 22:07:45 664

原创《DIET: Lightweight Language Understanding for Dialogue Systems》

标题：《DIET: Lightweight Language Understanding for Dialogue Systems》中文：用于对话系统的轻量语言理解方法时间：2020年5月作者：RASA简介：这个是RASA团队针对对话系统中NLU任务，设计的一种新框架，名叫Dual Intent and Entity Transformer (DIET，双重意图与实体Transformer ) 。成果是，DIET在不利用pre-trained embeddings.的情况下，达到了可比的性能，即la

2021-01-25 13:01:25 444 1

原创《Best Practices for Data-Efficient Modeling in NLG:How to Train Production-Ready Neural Models with

标题：《Best Practices for Data-Efficient Modeling in NLG:How to Train Production-Ready Neural Models with Less Data》作者：Facebook时间：2020项目地址：https://github.com/facebookresearch/DataEfficientNLG（只是个数据集仓库，暂时还没有开放code）中文：数据高效建模的最佳实践NLG：如何用较少的数据训练可落地的神经网络模型简介

2020-12-29 21:13:26 161

原创 PLUG AND PLAY LANGUAGE MODELS

标题：《PLUG AND PLAY LANGUAGE MODELS: A SIMPLE APPROACH TO CONTROLLED TEXT GENERATION》时间：2020年3月作者：Uber AI内容：本文关注可控生成，或条件生成问题。提出了一个Plug and Play Language Model (PPLM) 模型，它结合了一个预训练LM和一个或若干个属性分类器（attribute classifiers）来引导文本生成，而不需要进一步训练LM。源码：https://github.c

2020-11-20 09:23:38 1201

原创 RiSAWOZ中文任务型对话数据集

RiSAWOZ中文任务型对话数据集标题：《RiSAWOZ: A Large-Scale Multi-DomainWizard-of-Oz Dataset with Rich Semantic Annotations for Task-Oriented Dialogue Modeling》源码：https://github.com/terryqj0107/RiSAWOZ时间：2020年10月作者：苏州大学、天津大学内容：一个新的中文任务型对话数据集，包含12个领域，是目前最大的。标注很丰富，包含go

2020-11-14 11:38:28 1965 1

原创《STAR: A Schema-Guided Dialog Dataset for Transfer Learning》论文阅读

《STAR: A Schema-Guided Dialog Dataset for Transfer Learning》标题：《STAR: A Schema-Guided Dialog Dataset for Transfer Learning》作者：Rasa，卡耐基梅隆大学时间：2020年10月源码：https://github.com/RasaHQ/STAR内容：作者公开了名叫STAR的schema-guided任务型对话的新数据集。特别地，作者提出了新式的对话数据模式，解决了过去数据集的问题

2020-10-24 20:06:47 596

原创《DialoGLUE》任务型对话新Benchmark & ConvBERT模型

DialoGLUE标题：《DialoGLUE: A Natural Language Understanding Benchmark for Task-Oriented Dialogue》作者：卡内基梅隆大学，Amazon Alexa AI时间：2020年10月内容：为了发展更通用的面向任务型对话系统，作者提出了一个大型公开benchmark，以鼓励学术界对representation-based transfer, domain adaptation, 以及sample-efficient t

2020-10-18 09:59:33 681

原创《MultiWOZ 2.3》MultiWOZ数据集的新版本

标题《MultiWOZ 2.3: A multi-domain task-oriented dataset enhanced with annotation corrections and co-reference annotation》时间：2020年10月关键词：co-reference features内容：老版本数据集的问题，1、dialogue state annotations导致dialogue act annotations untouched. 2、the critical co

2020-10-18 09:54:15 1005 2

原创论文阅读：Adapter-Bot【融合异质对话任务-工程范式】

《The Adapter-Bot: All-In-One Controllable Conversational Model》标题：《The Adapter-Bot: All-In-One Controllable Conversational Model》作者：香港科技大学时间：2020年8月过去对话系统的问题：have little or no control of the generated responses and miss two important features：（1） con

2020-10-10 21:04:32 544

原创论文阅读：MinTL【数据库查询结果的embedding】

标题：《MinTL: Minimalist Transfer Learning for Task-Oriented Dialogue Systems》作者：香港科技大学内容：也是基于Transformers预训练语言模型的任务型对话，与SimpleTOD，SOLOIST，BERT-TOD合称四大天王（狗头）。源码：https://github.com/zlinao/MinTLBert-TOD使用的是BERT，SimpltTOD，SOLOIST都使用的GPT-2，其中SOLOIST实现去dialogu

2020-10-10 21:00:24 746

原创强化学习trick：RBS

强化学习trick：RBS来自2017年论文《Efficient Dialogue Policy Learning with BBQ-Networks》arXiv:1608.05081v3RBS = replay buffer spiking = spike the replay buffer with a few experiencesRBS是强化学习的一个简单的tricky，即pre-fill the experience replay buffer with a small set of t

2020-10-03 19:50:29 347

weixin_44385551的博客

原创 Pytorch nn.CosineEmbeddingLoss() 学习

原创 Adapter-Bot开源了

原创 MultiWOZ 2.4最新版本：通过改良标注提升DST

原创《DIET: Lightweight Language Understanding for Dialogue Systems》

原创《Best Practices for Data-Efficient Modeling in NLG:How to Train Production-Ready Neural Models with

原创 PLUG AND PLAY LANGUAGE MODELS

原创 RiSAWOZ中文任务型对话数据集

原创《STAR: A Schema-Guided Dialog Dataset for Transfer Learning》论文阅读

原创《DialoGLUE》任务型对话新Benchmark & ConvBERT模型

原创《MultiWOZ 2.3》MultiWOZ数据集的新版本

原创论文阅读：Adapter-Bot【融合异质对话任务-工程范式】

原创论文阅读：MinTL【数据库查询结果的embedding】

原创强化学习trick：RBS

原创 Deep Dyna-Q 阅读笔记

原创《Meta Dialogue Policy Learning》Meta-DTQN (DP + RL)

原创强化学习备忘录

原创 Uncertainty Loss不确定损失

原创【略解】copy机制与SpanPtr

原创 MADA & DAMD

原创最新模型-TRADE【Transferable Dialogue state generator】

原创最新模型-SUMBT【slot-utterance matching belief tracker】

原创最新模型：COMER【Conditional Memory Relation Network】

原创论文阅读：《Find or Classify Dual Strategy for Slot-Value Predictions on Multi-Domain Dialog State Trackin

原创论文阅读：《Efficient Dialogue State Tracking by Selectively Overwriting Memory》

原创多领域多轮问答调研报告3

原创论文阅读：《Towards Scalable Multi-domain Conversational Agents:The Schema-Guided Dialogue Dataset》

原创论文阅读：《Hybrid Code Networks》

原创论文阅读：《Frames: A Corpus for Adding Memory to Goal-Oriented Dialogue Systems》

原创论文阅读：《What Question Answering can Learn from Trivia Nerds》

原创论文阅读：《SIM: A Slot-Independent Neural Model for Dialogue State Tracking》

原创论文阅读：GLAD《Global-Locally Self-Attentive Dialogue State Tracker》

空空如也

空空如也