AI Paper Writing Tips: A Personal Summary

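  1. State the model's initial state clearly.
  2. Consider including a small worked example in the appendix; it makes the paper much easier for readers to follow.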

Well-written sentences worth collecting

However, increasingly we see richer models appearing that set out to harness an ever expanding set of more powerful features.
In other words, richer models keep appearing, and they draw on an ever larger set of more powerful features.

The approach taken by LambdaRank was to abandon the attempt to define an explicit smooth objective, and instead only work with an implicit objective via the definition of gradient functions with intuitively desirable properties.

Correct use of "i.e."

ML theory gives us some basic definitions like generalization gap and excess risk (i.e. the difference between training and testing losses)

Using an analogy

Similar to how humans naturally deliberate when presented with a multi-step reasoning problem, it might be beneficial if language models could analogously generate a coherent chain of thought before arriving at the answer.

Use of "e.g."

Consider one’s own thought process when solving a type-2 task such as a multi-step math word problem, where it is typical to decompose the problem into intermediate steps and solve each before giving the final answer (e.g., “After Jane gives 2 flowers to her mom she has 10 then after she gives 3 to her dad she will have 7 so the answer is 7.”).

Opening sentence for summarizing a model's advantages

Chain of thought prompting has several attractive properties as an approach for facilitating reasoning in language models.
