REINFORCEMENT LEARNING USING QUANTUM BOLTZMANN MACHINES
Abstract. We investigate whether quantum annealers with select chip layouts can outperform classical computers in reinforcement learning tasks. We associate a transverse field Ising spin Hamiltonian with a layout of qubits similar to that of a deep Boltzmann machine (DBM) and use simulated quantum annealing (SQA) to numerically simulate quantum sampling from this system. We design a reinforcement learning algorithm in which the visible nodes, representing the states and actions of an optimal policy, form the first and last layers of the deep network. In the absence of a transverse field, our simulations show that DBMs are trained more effectively than restricted Boltzmann machines (RBMs) with the same number of nodes. We then develop a framework for training the network as a quantum Boltzmann machine (QBM) in the pres