At bottom this is still a linear combination; related reading: learning POMDPs.
How Time Matters: Learning Time-Decay Attention for Contextual Spoken Language Understanding in Dialogues
Time Masking: Leveraging Temporal Information in Spoken Dialogue Systems
Decay-Function-Free Time-Aware Attention to Context and Speaker Indicator for Spoken Language Understanding
1. Model the system and the user separately (by role):
$$\begin{aligned} \mathbf{v}_{\mathrm{cur}} &= \operatorname{BLSTM}\left(\mathbf{x},\, W_{\mathrm{his}} \cdot \mathbf{v}_{\mathrm{his}}\right) \\ \mathbf{o} &= \operatorname{sigmoid}\left(W_{\mathrm{SLU}} \cdot \mathbf{v}_{\mathrm{cur}}\right) \end{aligned}$$
$$\begin{aligned} \mathbf{v}_{\mathrm{his}} &= \sum_{\mathrm{role}} \mathbf{v}_{\mathrm{his,role}} \\ &= \sum_{\mathrm{role}} \operatorname{BLSTM}_{\mathrm{role}}\left(x_{t,\mathrm{role}}\right) \end{aligned}$$
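A minimal numpy sketch of this role-separated history encoding and the sigmoid SLU output. All names, dimensions, and data here are hypothetical, and a mean-pool + linear map stands in for the role-specific BLSTM encoders; it only shows how the per-role history summaries are summed into v_his and fed into the current-utterance prediction.

```python
import numpy as np

rng = np.random.default_rng(0)
D = 8  # toy hidden size

def encode(utterances, W):
    """Stand-in for BLSTM_role: mean-pool word vectors across the role's
    utterances, then apply a linear map. A real model runs a BLSTM."""
    return np.tanh(W @ np.mean(np.vstack(utterances), axis=0))

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# Hypothetical dialogue history: word-vector matrices per role.
history = {
    "user":   [rng.normal(size=(3, D)), rng.normal(size=(4, D))],
    "system": [rng.normal(size=(2, D))],
}
W_role = {r: rng.normal(size=(D, D)) / np.sqrt(D) for r in history}

# v_his = sum over roles of BLSTM_role(x_{t,role})
v_his = sum(encode(utts, W_role[r]) for r, utts in history.items())

# Current-utterance encoding conditioned on the history summary
# (concatenation + linear map stands in for BLSTM(x, W_his · v_his)).
x = rng.normal(size=(5, D))                   # current utterance word vectors
W_his = rng.normal(size=(D, D)) / np.sqrt(D)
W_cur = rng.normal(size=(D, 2 * D)) / np.sqrt(2 * D)
v_cur = np.tanh(W_cur @ np.concatenate([np.mean(x, axis=0), W_his @ v_his]))

# o = sigmoid(W_SLU · v_cur): per-label probabilities for multi-label SLU.
W_slu = rng.normal(size=(4, D)) / np.sqrt(D)  # 4 hypothetical semantic labels
o = sigmoid(W_slu @ v_cur)
print(o.shape)  # (4,)
```

The sigmoid (rather than softmax) output matches the multi-label nature of SLU: an utterance can trigger several intent/slot labels at once.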
$$\mathbf{v}_{\mathrm{his}}^{U} = \sum_{\mathrm{role}} \operatorname{BLSTM}_{\mathrm{role}}\left(x_{t,\mathrm{role}},\, \left\{\alpha_{u_{j}} \mid u_{j} \in \mathrm{role}\right\}\right)$$
$$\mathbf{v}_{\mathrm{his}}^{R} = \sum_{\mathrm{role}} \alpha_{\mathrm{role}} \cdot \mathbf{v}_{\mathrm{his,role}}, \qquad \alpha_{\mathrm{role}} = \max_{j}\left(\alpha_{u_{j}}\right)$$
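The utterance-level weights α_{u_j} and the role-level weight α_role = max(α_{u_j}) can be sketched in numpy as follows. The dot-product scoring against the current utterance is an assumption for illustration (the papers above also derive these weights from time-decay functions); all tensors are toy data.

```python
import numpy as np

rng = np.random.default_rng(1)
D = 8  # toy hidden size

# Hypothetical per-utterance history encodings for each role.
utt_enc = {
    "user":   rng.normal(size=(2, D)),  # 2 past user utterances
    "system": rng.normal(size=(3, D)),  # 3 past system utterances
}
v_cur = rng.normal(size=D)              # current-utterance summary vector

def softmax(z):
    z = z - z.max()
    e = np.exp(z)
    return e / e.sum()

# Utterance-level attention alpha_{u_j}, normalized within each role
# (dot-product scoring against v_cur is an assumed scoring function).
alpha_u = {r: softmax(encs @ v_cur) for r, encs in utt_enc.items()}

# v_his^U: attention-weighted sum inside each role, then summed over roles.
v_his_U = sum(alpha_u[r] @ utt_enc[r] for r in utt_enc)

# Role-level attention: alpha_role = max_j alpha_{u_j}, reweighting each
# role's (unweighted) history summary v_{his,role}.
v_his_role = {r: utt_enc[r].mean(axis=0) for r in utt_enc}
alpha_role = {r: alpha_u[r].max() for r in utt_enc}
v_his_R = sum(alpha_role[r] * v_his_role[r] for r in utt_enc)
print(v_his_U.shape, v_his_R.shape)  # (8,) (8,)
```

Taking the max of a role's utterance weights is a cheap way to let the most relevant single utterance decide how much that whole role's history matters, without adding new parameters.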