Paper Reading: A 288μW Programmable Deep-Learning Processor with 270KB On-Chip Weight Storage

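This paper presents a low-power programmable deep-learning accelerator (DLA) published at ISSCC 2017. The design focuses on reducing data movement overhead by placing four processing elements within the memory and by adopting a non-uniform memory hierarchy, which trades off small, low-power memory banks for frequently accessed data against large, high-density memory banks. The chip's performance and die photo are shown in the post.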

1. Introduction

        This paper was published at ISSCC 2017. Interest in deep learning for mobile and IoT devices has grown recently, with the goal of enabling intelligence at the edge, and this makes low power a critical design constraint. The researchers introduce a low-power, programmable deep learning accelerator (DLA).

2. Innovation points

        The top-level diagram of the proposed DLA is shown below.

        2.1 Four processing elements (PEs) are located amidst the weight storage memory

                The accelerator relies almost entirely on on-chip storage, which minimizes data movement overhead. However, I think the cost of this much on-chip memory also needs to be taken into account.
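To make the on-chip storage point concrete, here is a minimal sketch that checks whether a network's weights fit within the 270KB of weight storage named in the paper title. The layer shapes, the 8-bit weight width, and the 1KB = 1024 bytes convention are all assumptions chosen for illustration; they are not taken from the paper.

```python
# Illustrative only: layer shapes and bit width are hypothetical.
# 270KB comes from the paper title; 1KB = 1024 bytes is an assumption.
ON_CHIP_WEIGHT_BYTES = 270 * 1024


def weight_bytes(layer_shapes, bits_per_weight):
    """Total weight storage in bytes for fully connected layers.

    `layer_shapes` is a list of (inputs, outputs) pairs.
    """
    total_weights = sum(n_in * n_out for n_in, n_out in layer_shapes)
    total_bits = total_weights * bits_per_weight
    return (total_bits + 7) // 8  # round up to whole bytes


# A made-up three-layer fully connected network.
hypothetical_layers = [(256, 256), (256, 128), (128, 10)]
needed = weight_bytes(hypothetical_layers, bits_per_weight=8)
fits_on_chip = needed <= ON_CHIP_WEIGHT_BYTES
print(needed, fits_on_chip)
```

If the weights fit, no off-chip memory traffic is needed for them during inference, which is the saving this section describes.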

        2.2 Adopt a non-uniform memory hierarchy  

                The non-uniform memory hierarchy trades off small, low-power memory banks for frequently used data against larger, high-density banks that consume more power per access and hold the large amount of infrequently accessed data.
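The trade-off can be sketched as an access-weighted average of energy per access. The function name, the access fractions, and the energy values below are hypothetical placeholders in arbitrary units, not figures from the paper.

```python
# Illustrative only: every energy figure and access fraction below is a
# made-up placeholder, not a measurement from the paper.

def average_access_energy(banks):
    """Return the access-weighted mean energy per access.

    `banks` is a list of (access_fraction, energy_per_access) pairs;
    the access fractions must sum to 1.
    """
    total_fraction = sum(fraction for fraction, _ in banks)
    if abs(total_fraction - 1.0) > 1e-9:
        raise ValueError("access fractions must sum to 1")
    return sum(fraction * energy for fraction, energy in banks)


# Hypothetical uniform design: every access pays the large-bank cost.
uniform = average_access_energy([(1.0, 10.0)])

# Hypothetical non-uniform design: 90% of accesses hit a small bank
# costing 1 unit, and 10% hit a large bank costing 10 units.
non_uniform = average_access_energy([(0.9, 1.0), (0.1, 10.0)])

print(uniform, non_uniform)
```

Under these assumed numbers the non-uniform design has a much lower average energy per access, and the gap grows as more of the accesses land in the small banks.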

3. Summary

        The chip's performance and die photo are shown below.
