刘顺琦 CSCI 561 midterm3

本次中期考试涵盖了搜索、博弈、合同满足、逻辑等多个主题,涉及MDP、决策树、神经网络和贝叶斯网络等内容。重点考察了贝叶斯定理的应用,条件概率,无监督学习中的kNN算法,以及Markov决策过程(MDP)的理解,包括其与部分可观测MDP(POMDP)的区别。还涉及到决策树的构建,神经网络的学习机制,以及近似最优化算法如Q-Learning。
摘要由CSDN通过智能技术生成

Outline

60% midterm2 后 + 40%midterm2 前,search+game playing+contract satisfaction+logic

MDP+DT+NN+BN

image-20210504233250069image-20210505201458234

试卷

  1. A preference relation admits a utility function representation if it is complete, transitive, and continuous.

  2. 2.1prior probability: (is the probability of an event before new data is collected. This is the best rational assessment of the probability of an outcome based on the current knowledge before an experiment is performed.)

    2.2Posterior probability is the revised probability of an event occurring after taking into consideration new information. Posterior probability is calculated by updating the prior probability by using Bayes’ theorem. => Bayes’ Theorem converts a prior belief into a posterior belief after you’ve collected some additional evidence. Bayes’ theorem thus gives the probability of an event based on new information that is, or may be related, to that event.

    image-20210505000946680
  3. Conditional probability is the likelihood of an outcome occurring, based on a previous outcome occurring

  4. So kNN is an exception to general workflow for building/testing supervised machine learning models. In particular, the model created via kNN is just the available labeled data, placed in some metric space.

    In other words, for kNN, there is no training step because there is no model to build.

    image-20210505003614984

5.The minimax algorithm can be considered to be solving a determi

  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值