长命百岁 - CSDN Blog

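[Original] [UCAS NLP Assignment 2] Training FFN, RNN, and Attention Language Models and Computing PPL on the Test Set

Trains feed-forward, recurrent, and attention-based language models and computes perplexity (PPL) on the test set.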

2023-11-25 21:39:39 758

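[Original] [UCAS NLP Assignment 1] Scraping Chinese and English Data with BeautifulSoup, Computing Entropy, and Verifying Zipf's Law

This post scrapes a Chinese corpus and an English corpus, computes the entropy of each language, and checks Zipf's law on both. Code: GitHub.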

2023-10-22 22:14:23 1077 2
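
As a rough illustration of the two measurements that summary mentions, here is a minimal sketch in plain Python. It is not the code from the post: the function names `unigram_entropy`, `rank_frequency`, and `zipf_slope` are hypothetical, and it assumes the corpus has already been scraped and tokenized into a list of strings.

```python
import math
from collections import Counter


def unigram_entropy(tokens):
    """Shannon entropy, in bits per token, of the empirical unigram distribution."""
    counts = Counter(tokens)
    total = sum(counts.values())
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def rank_frequency(tokens):
    """(rank, frequency) pairs, most frequent token first, ranks starting at 1."""
    counts = Counter(tokens)
    return [(rank, freq) for rank, (_, freq) in enumerate(counts.most_common(), start=1)]


def zipf_slope(pairs):
    """Least-squares slope of log(frequency) against log(rank).

    Zipf's law predicts a slope close to -1 for natural-language corpora.
    """
    xs = [math.log(rank) for rank, _ in pairs]
    ys = [math.log(freq) for _, freq in pairs]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    num = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    den = sum((x - mean_x) ** 2 for x in xs)
    return num / den


if __name__ == "__main__":
    # Toy input (an assumption for illustration); a real run would use the scraped corpus.
    tokens = "the cat sat on the mat and the dog sat too".split()
    print(unigram_entropy(tokens))
    print(zipf_slope(rank_frequency(tokens)))
```

On a real corpus, a fitted slope near -1 on the log-log rank-frequency plot is the usual sign that the data is consistent with Zipf's law.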

[Original] [SIGIR-AP 2023] A Comparative Study of Training Objectives for Clarification Facet Generation

2023-10-20 19:43:08 251

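[Original] [Paper Reading] The Development of Retrieval Augmentation and a Summary of Related Papers

A summary of retrieval-augmentation papers: `Knn-LM`->`REALM`->`DPR`->`RAG`->`FID`->`COG`->`GenRead`->`REPLUG`->`Adaptive retrieval`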

2023-09-19 11:32:13 813 3

[Original] How to Compute Text Perplexity (PPL)

This post covers how to compute perplexity (PPL) in PyTorch and why the cross-entropy loss can stand in for PPL.

2023-07-31 15:34:31 4439
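
The link between the two quantities in that summary is that perplexity is the exponential of the mean per-token cross-entropy (measured in nats). The sketch below shows this in plain Python rather than PyTorch so that it runs without dependencies; it is not the post's code, and the helper names `log_softmax`, `cross_entropy`, and `perplexity` are hypothetical. In PyTorch, the same mean negative log-likelihood is what `torch.nn.functional.cross_entropy` returns with its default `reduction="mean"`.

```python
import math


def log_softmax(logits):
    """Numerically stable log-softmax over one token's vocabulary logits."""
    m = max(logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - log_z for x in logits]


def cross_entropy(logits_per_token, targets):
    """Mean negative log-likelihood (nats) of the target token ids."""
    nll = [-log_softmax(logits)[t] for logits, t in zip(logits_per_token, targets)]
    return sum(nll) / len(nll)


def perplexity(logits_per_token, targets):
    """Perplexity = exp(mean cross-entropy)."""
    return math.exp(cross_entropy(logits_per_token, targets))


if __name__ == "__main__":
    # A model that is uniform over a 4-token vocabulary has perplexity 4.
    uniform = [[0.0, 0.0, 0.0, 0.0], [0.0, 0.0, 0.0, 0.0]]
    print(perplexity(uniform, [1, 3]))
```

The uniform case is a handy sanity check: a model that spreads probability evenly over V tokens should always score a perplexity of V.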

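[Original] [Paper Reading] Unified Multi-Dimensional Automatic Evaluation for Open-Domain Conversations with LLMs

The paper proposes a method for evaluating open-domain dialogue with large language models. It relies on a single prompt that instructs the LLM to output several metrics in one pass.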

2023-07-19 17:13:10 335

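[Original] [Paper Reading] Notes on Several Multi-Turn Dialogue Papers from ACL 2023

This post shares three ACL 2023 multi-turn dialogue papers I read yesterday. All three control the output using additional attributes, and their evaluation setups are similar enough to borrow from.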

2023-07-18 17:49:52 1351

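[Original] [Paper Reading] Scaling Laws for Neural Language Models

This post briefly introduces the paper's main conclusions. In my view, the specific values of the symbols in the formulas matter less than the relationships and ratios between the different factors.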

2023-07-13 10:47:40 2864

[Original] [Paper Reading] Learning to summarize from human feedback

This repository is updated continuously.

2023-06-16 20:13:16 1307 1

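[Original] [Paper Reading] Language Models are Few-Shot Learners (GPT-3)

This post briefly covers GPT-3's background, model architecture, training data, and training method. The training details and experimental results are extensive and can be consulted when needed.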

2023-06-09 22:40:10 1192

[Original] [Paper Reading] REPLUG: Retrieval-Augmented Black-Box Language Models

2023-05-19 22:54:37 2201 6

[Original] CS 224N Summary

This post records some questions and personal takeaways from studying CS 224N.

2023-05-14 23:49:12 372

[Original] torch.nn.Embedding

2023-03-25 15:52:31 518

[Original] torch.bmm()

An introduction to the PyTorch function torch.bmm.

2023-03-24 10:34:38 221

[Original] [Paper Reading] MIMICS: A Large-Scale Data Collection for Search Clarification

2023-03-17 15:50:57 363

[Original] [Paper Reading, SIGIR'19] Asking Clarifying Questions in Open-Domain Information-Seeking Conversations

2023-03-11 23:03:49 163 2

[Original] [Paper Reading] Learning to Ask Good Questions: Ranking Clarification Questions using Neural Expected Value of Perfect Information

2023-03-10 11:09:48 132

[Original] [Paper Reading, WWW'23] Zero-shot Clarifying Question Generation for Conversational Search

2023-03-05 21:59:50 890 1

[Original] An Introduction to BPE (Byte-Pair Encoding)

2023-02-20 16:40:05 3416 1

[Original] Common Tokenization Methods

Common tokenization methods: word-based, character-based, and subword-based tokenization.

2023-02-19 23:34:05 318

[Original] BART Pretraining with Fairseq

2023-02-19 16:09:49 2186 13

[Original] [Huggingface Series] Fine-tuning a Pretrained Model

2023-02-12 18:09:20 1829

[Original] [Huggingface Series] Using Transformers

2023-02-10 22:47:23 732

[Original] Python @ Decorators (Function and Class Decorators)

What @ does in Python.

2023-01-27 16:27:01 934

[Original] [Paper Reading, T5] Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

2023-01-16 00:37:42 460 1

[Original] [Paper Reading, CIKM2011] Finding Dimensions for Queries

2023-01-15 00:43:06 767 3

[Original] [Paper Reading, CIKM2014] Extending Faceted Search to the General Web

2023-01-14 17:15:28 430 2

[Original] [Paper Reading, CIKM 2022] Stochastic Optimization of Text Set Generation for Learning Multiple Query Intent Representations

2023-01-13 19:55:40 140

[Original] The Three Most Common Datasets in the Clarifying Question Field

2023-01-12 22:20:40 778

[Original] .from_pretrained() in the transformers Library

The pretrained-model loading function .from_pretrained() in the Transformers library.

2022-12-28 22:34:53 14799 15

[Original] [Information Retrieval and Data Mining Final Exam Notes] (6) Link Analysis

2022-12-26 23:44:46 822

[Original] [Paper Reading, CIKM'2021] Learning Multiple Intent Representations for Search Queries

2022-12-05 23:23:50 376

[Original] [Information Retrieval and Data Mining Final Exam Review] (5) Language Model

2022-12-05 19:41:12 650

[Original] [Information Retrieval and Data Mining Final Exam Notes] (4) Probabilistic Retrieval Models

2022-12-02 11:24:17 1167

[Original] [Paper Reading, ICTIR'2022] Revisiting Open Domain Query Facet Extraction and Generation

2022-11-30 23:08:21 746

[Original] [Information Retrieval and Data Mining Final Exam Notes] (3) Document Scoring

2022-11-30 17:46:24 369

[Original] [Information Retrieval and Data Mining Final Exam Notes] (2) IR Evaluation

2022-11-30 17:38:38 487

[Original] [Paper Reading, WSDM'2022] Evaluating Mixed-initiative Conversational Search Systems via User Simulation

2022-11-28 22:00:37 192

[Original] [Information Retrieval and Data Mining Final Exam Notes] (1) Introduction

2022-11-28 21:15:24 1237

[Original] A Survey of Work on Dialog and Clarifying Questions

A literature roundup on dialog and clarifying questions.

2022-11-19 21:54:12 780 2
