FPGA 2016-2018 and FPL 2017 Conference Paper Collection

fuli_fox

Category: Paper Reading Notes. Tags: FPGA, conference papers, hardware

Original link: https://blog.csdn.net/EdwardBao1993/article/details/85985541

FPGA 2016 Conference Papers

Paper download link: https://dl.acm.org/citation.cfm?id=2847263&picked=prox

Workshop on Overlay Architectures for FPGAs

----------------------------------------------------------------------------------------------------------------------

Hayden Kwok-Hay So, John Wawrzynek:

OLAF'16: Second International Workshop on Overlay Architectures for FPGAs.1

Designers' Day Session 1: Hardware Features

----------------------------------------------------------------------------------------------------------------------

Gregg Baeckler:

HyperPipelining of High-Speed Interface Logic.2

Pankaj Shanker:

Spatial Debug & Debug Without Re-programming in FPGAs: On-Chip debugging in FPGAs.3

Designers' Day Session 2: System Level Methodology

----------------------------------------------------------------------------------------------------------------------

Vinod Kathail, James Hwang, Welson Sun, Yogesh Chobe, Tom Shui, Jorge Carrillo:

SDSoC: A Higher-level Programming Environment for Zynq SoC and Ultrascale+ MPSoC.4

Tan Nguyen, Swathi T. Gurumani, Kyle Rupnow, Deming Chen:

FCUDA-SoC: Platform Integration for Field-Programmable SoC with the CUDA-to-FPGA Compiler.5-14

Shlomi Alkalay, Hari Angepat, Adrian M. Caulfield, Eric S. Chung, Oren Firestein, Michael Haselman, Stephen Heil, Kyle Holohan, Matt Humphrey, Tamás Juhász, Puneet Kaur, Sitaram Lanka, Daniel Lo, Todd Massengill, Kalin Ovtcharov, Michael Papamichael, Andrew Putnam, Raja Seera, Rimon Tadros, Jason Thong, Lisa Woods, Derek Chiou, Doug Burger:

Agile Co-Design for a Reconfigurable Datacenter.15

Technical Session 1: Neural Networks and OpenCL

----------------------------------------------------------------------------------------------------------------------

Naveen Suda, Vikas Chandra, Ganesh Dasika, Abinash Mohanty, Yufei Ma, Sarma B. K. Vrudhula, Jae-sun Seo, Yu Cao:

Throughput-Optimized OpenCL-based FPGA Accelerator for Large-Scale Convolutional Neural Networks.16-25

Jiantao Qiu, Jie Wang, Song Yao, Kaiyuan Guo, Boxun Li, Erjin Zhou, Jincheng Yu, Tianqi Tang, Ningyi Xu, Sen Song, Yu Wang, Huazhong Yang:

Going Deeper with Embedded FPGA Platform for Convolutional Neural Network.26-35

Bingzhe Li, M. Hassan Najafi, David J. Lilja:

Using Stochastic Computing to Reduce the Hardware Requirements for a Restricted Boltzmann Machine Classifier.36-41

Shih-Hao Hung, Min-Yu Tsai, Bo-Yi Huang, Chia-Heng Tu:

A Platform-Oblivious Approach for Heterogeneous Computing: A Case Study with Monte Carlo-based Simulation for Medical Applications.42-47

Nadesh Ramanathan, John Wickerson, Felix Winterstein, George A. Constantinides:

A Case for Work-stealing on FPGAs with OpenCL Atomics.48-53

Technical Session 2: Cooling and Clocking

----------------------------------------------------------------------------------------------------------------------

Zhiyuan Yang, Ankur Srivastava:

Physical Design of 3D FPGAs Embedded with Micro-channel-based Fluidic Cooling.54-63

Carl Ebeling, Dana How, David M. Lewis, Herman Schmit:

Stratix™ 10 High Performance Routable Clock Networks.64-73

Henri Fraisse, Abhishek Joshi, Dinesh Gaitonde, Alireza Kaviani:

Boolean Satisfiability-Based Routing and Its Application to Xilinx UltraScale Clock Network.74-79

Technical Session 3: Circuit Design, Graph Processing Applications

----------------------------------------------------------------------------------------------------------------------

Grace Zgheib, Manana Lortkipanidze, Muhsen Owaida, David Novo, Paolo Ienne:

FPRESSO: Enabling Express Transistor-Level Exploration of FPGA Architectures.80-89

Safeen Huda, Jason Anderson:

Towards PVT-Tolerant Glitch-Free Operation in FPGAs.90-99

Timothy A. Linscott, Benjamin Gojman, Raphael Rubin, André DeHon:

Pitfalls and Tradeoffs in Simultaneous, On-Chip FPGA Delay Measurement.100-104

Guohao Dai, Yuze Chi, Yu Wang, Huazhong Yang:

FPGP: Graph Processing Framework on FPGA: A Case Study of Breadth-First Search.105-110

Tayo Oguntebi, Kunle Olukotun:

GraphOps: A Dataflow Library for Graph Analytics Acceleration.111-117

Technical Session 4: Applications and System-level Tools

----------------------------------------------------------------------------------------------------------------------

Nikolaos Alachiotis, Gabriel Weisz:

High Performance Linkage Disequilibrium: FPGAs Hold the Key.118-127

Hsin-Jung Yang, Kermin Fleming, Michael Adler, Felix Winterstein, Joel S. Emer:

LMC: Automatic Resource-Aware Program-Optimized Memory Partitioning.128-137

Jincheng Su, Fan Yang, Xuan Zeng, Dian Zhou:

Efficient Memory Partitioning for Parallel Data Access via Data Reuse.138-147

Evening Panel

----------------------------------------------------------------------------------------------------------------------

Derek Chiou:

Intel Acquires Altera: How Will the World of FPGAs be Affected?148

Technical Session 5: Architecture and Tools

----------------------------------------------------------------------------------------------------------------------

Tuan D. A. Nguyen, Akash Kumar:

PRFloor: An Automatic Floorplanner for Partially Reconfigurable FPGA Systems.149-158

David M. Lewis, Gordon R. Chiu, Jeffrey Chromczak, David R. Galloway, Ben Gamsa, Valavan Manohararajah, Ian Milton, Tim Vanderhoek, John Van Dyken:

The Stratix™ 10 Highly Pipelined FPGA Architecture.159-168

Que Yanghua, Chinnakkannu Adaikkala Raj, Harnhua Ng, Kirvy Teo, Nachiket Kapre:

Case for Design-Specific Machine Learning in Timing Closure of FPGA Designs.169-172

Sen Ma, Zeyad Aklah, David Andrews:

Just In Time Assembly of Accelerators.173-178

Paul Grigoras, Pavel Burovskiy, Wayne Luk:

CASK: Open-Source Custom Architectures for Sparse Kernels.179-184

Technical Session 6: System-level Tools

----------------------------------------------------------------------------------------------------------------------

Nachiket Kapre, Deheng Ye:

GPU-Accelerated High-Level Synthesis for Bitwidth Optimization of FPGA Datapaths.185-194

Janarbek Matai, Dustin Richmond, Dajung Lee, Zac Blair, Qiongzhi Wu, Amin Abazari, Ryan Kastner:

Resolve: Generation of High-Performance Sorting Architectures from High-Level Synthesis.195-204

Michael J. Wirthlin, Andrew M. Keller, Chase McCloskey, Parker Ridd, David Lee, Jeffrey Draper:

SEU Mitigation and Validation of the LEON3 Soft Processor Using Triple Modular Redundancy for Space Processing.205-214

Technical Session 7: High-level Synthesis and Tools

----------------------------------------------------------------------------------------------------------------------

François Serre, Thomas Holenstein, Markus Püschel:

Optimal Circuits for Streamed Linear Permutations Using RAM.215-223

Xinheng Liu, Yao Chen, Tan Nguyen, Swathi T. Gurumani, Kyle Rupnow, Deming Chen:

High Level Synthesis of Complex Applications: An H.264 Video Decoder.224-233

Xitong Gao, John Wickerson, George A. Constantinides:

Automatically Optimizing the Latency, Area, and Accuracy of C Programs for High-Level Synthesis.234-243

Technical Session 8: Applications

----------------------------------------------------------------------------------------------------------------------

David Boland:

Reducing Memory Requirements for High-Performance and Numerically Stable Gaussian Elimination.244-253

Muhammed Al Kadi, Benedikt Janßen, Michael Hübner:

FGPU: An SIMT-Architecture for FPGAs.254-263

Gabriel Weisz, Joseph Melber, Yu Wang, Kermin Fleming, Eriko Nurvitadhi, James C. Hoe:

A Study of Pointer-Chasing Performance on Shared-Memory Processor-FPGA Systems.264-273

Poster Session 1

----------------------------------------------------------------------------------------------------------------------

Mohammed Shaaban Ibraheem, Syed Zahid Ahmed, Khalil Hachicha, Sylvain Hochberg, Patrick Garda:

A Low DDR Bandwidth 100FPS 1080p Video 2D Discrete Wavelet Transform Implementation on FPGA (Abstract Only).274

Ehsan Ghasemi, Paul Chow:

A Scalable Heterogeneous Dataflow Architecture For Big Data Analytics Using FPGAs (Abstract Only).274

Ze-ke Wang, Hui Yan Cheah, Johns Paul, Bingsheng He, Wei Zhang:

Accelerating Database Query Processing on OpenCL-based FPGAs (Abstract Only).274

Daolu Zha, Xi Jin, Tian Xiang:

An Improved Global Stereo-Matching on FPGA for Real-Time Applications (Abstract Only).274

Wenchao Qian, Christopher Babecki, Robert Karam, Swarup Bhunia:

ENFIRE: An Energy-efficient Fine-grained Spatio-temporal Reconfigurable Computing Fabric (Abstract Only).275

Pingakshya Goswami, Dinesh Bhatia:

Floorplanning of Partially Reconfigurable Design on Heterogeneous FPGA (Abstract Only).275

Matthias Hinkfoth, Ralf Salomon:

Increasing the Utility of Self-Calibration Methods in High-Precision Time Measurement Systems (Abstract Only).275

James J. Davis, Eddie Hung, Joshua M. Levine, Edward A. Stott, Peter Y. K. Cheung, George A. Constantinides:

Knowledge is Power: Module-level Sensing for Runtime Optimisation (Abstract Only).276

Li Ting, Harri Wijaya, Nachiket Kapre:

Machine-Learning driven Auto-Tuning of High-Level Synthesis for FPGAs (Abstract Only).276

Ronak Kogta, Suresh Purini, Ajit Mathew:

Re-targeting Optimization Sequences from Scalar Processors to FPGAs in HLS compilers (Abstract Only).276

Poster Session 2

----------------------------------------------------------------------------------------------------------------------

Jie Lei, Yu-Ting Chen, Yunsong Li, Jason Cong:

A High-throughput Architecture for Lossless Decompression on FPGA Designed Using HLS (Abstract Only).277

Girish Deshpande, Dinesh K. Bhatia:

An Activity Aware Placement Approach For 3D FPGAs (Abstract Only).277

Tianqi Wang, Bo Peng, Xi Jin:

An Extensible Heterogeneous Multi-FPGA Framework for Accelerating N-body Simulation (Abstract Only).277

Sabrina Zereen, Sundeep Lal, Mohammed A. S. Khalid, Sazzadur Chowdhury:

An FPGA-Based Controller for a 77 GHz MEMS Tri-Mode Automotive Radar (Abstract Only).278

Bo Peng, Tianqi Wang, Xi Jin, Chuanjun Wang:

An FPGA-SOC Based Accelerating Solution for N-body Simulations in MOND (Abstract Only).278

Liwei Yang, Swathi T. Gurumani, Suhaib A. Fahmy, Deming Chen, Kyle Rupnow:

Automated Verification Code Generation in HLS Using Software Execution Traces (Abstract Only).278

Jing Ye, Yu Hu, Xiaowei Li:

DCPUF: Placement and Routing Constraint based Dynamically Configured Physical Unclonable Function on FPGA (Abstract Only).279

Sebastien Bellon, Claudio Favi, Miroslaw Malek, Marco Macchetti, Francesco Regazzoni:

Evaluating the Impact of Environmental Factors on Physically Unclonable Functions (Abstract Only).279

Yu Bai, Mingjie Lin:

Stochastic-Based Spin-Programmable Gate Array with Emerging MTJ Device Technology (Abstract Only).279

Zhen Yang, Jian Wang, Meng Yang, Jinmei Lai:

Testing FPGA Local Interconnects Based on Repeatable Configuration Modules (Abstract Only).280

Poster Session 3

----------------------------------------------------------------------------------------------------------------------

Stefan Visser, Harald Homulle, Edoardo Charbon:

A 1 GSa/s, Reconfigurable Soft-core FPGA ADC (Abstract Only).281

Xifan Tang, Pierre-Emmanuel Gaillardon, Giovanni De Micheli:

A Full-Capacity Local Routing Architecture for FPGAs (Abstract Only).281

Yu-Ting Chen, Jason Cong, Zhenman Fang, Peipei Zhou:

ARAPrototyper: Enabling Rapid Prototyping and Evaluation for Accelerator-Rich Architecture (Abstract Only).281

Aaron Landy, Greg Stitt:

Doubling FPGA Throughput via a Soft SerDes Architecture for Full-Bandwidth Serial Pipelining (Abstract Only).282

Cédric Marchand, Lilian Bossuet, Abdelkarim Cherkaoui:

Enhanced TERO-PUF Implementations and Characterization on FPGAs (Abstract Only).282

Yunxuan Yu, Lei He:

FPGA Power Estimation Using Automatic Feature Selection (Abstract Only).282

Sizhuo Zhang, Hari Angepat, Derek Chiou:

HGum: Messaging Framework for Hardware Accelerators (Abstract Only).283

Sayeh Sharifymoghaddam, Ali Sheikholeslami:

Low-Swing Signaling for FPGA Power Reduction (Abstract Only).283

Mohammed Alawad, Mingjie Lin:

Stochastic-Based Convolutional Networks with Reconfigurable Logic Fabric (Abstract Only).283

Nimish Agashiwala, Satya Prakash Upadhyay, Kia Bazargan:

t-QuadPlace: Timing Driven Quadratic Placement using Quadrisection Partitioning for FPGAs (Abstract Only).284

FPGA 2017 Conference Papers

Paper download link: https://dl.acm.org/citation.cfm?id=3020078&picked=prox

FPGA'17 Workshops

----------------------------------------------------------------------------------------------------------------------

Hayden Kwok-Hay So, John Wawrzynek:

OLAF'17: Third International Workshop on Overlay Architectures for FPGAs.1

Special Session: The Role of FPGAs in Deep Learning

----------------------------------------------------------------------------------------------------------------------

Andrew Ling, Jason Anderson:

The Role of FPGAs in Deep Learning. 3

Eriko Nurvitadhi, Ganesh Venkatesh, Jaewoong Sim, Debbie Marr, Randy Huang, Jason Ong Gee Hock, Yeong Tat Liew, Krishnan Srivatsan, Duncan J. M. Moss, Suchit Subhaschandra, Guy Boudoukh:

Can FPGAs Beat GPUs in Accelerating Next-Generation Deep Neural Networks? 5-14

Ritchie Zhao, Weinan Song, Wentao Zhang, Tianwei Xing, Jeng-Hau Lin, Mani B. Srivastava, Rajesh Gupta, Zhiru Zhang:

Accelerating Binarized Convolutional Neural Networks with Software-Programmable FPGAs. 15-24

Jialiang Zhang, Jing Li:

Improving the Performance of OpenCL-based FPGA Accelerator for Convolutional Neural Network. 25-34

Chi Zhang, Viktor K. Prasanna:

Frequency Domain Acceleration of Convolutional Neural Networks on CPU-FPGA Shared Memory System. 35-44

Yufei Ma, Yu Cao, Sarma B. K. Vrudhula, Jae-sun Seo:

Optimizing Loop Operation and Dataflow in FPGA Acceleration of Deep Convolutional Neural Networks. 45-54

Machine Learning

----------------------------------------------------------------------------------------------------------------------

Utku Aydonat, Shane O'Connell, Davor Capalija, Andrew C. Ling, Gordon R. Chiu:

An OpenCL™ Deep Learning Accelerator on Arria 10. 55-64

Yaman Umuroglu, Nicholas J. Fraser, Giulio Gambardella, Michaela Blott, Philip Heng Wai Leong, Magnus Jahre, Kees A. Vissers:

FINN: A Framework for Fast, Scalable Binarized Neural Network Inference. 65-74

Song Han, Junlong Kang, Huizi Mao, Yiming Hu, Xin Li, Yubin Li, Dongliang Xie, Hong Luo, Song Yao, Yu Wang, Huazhong Yang, William (Bill) J. Dally:

ESE: Efficient Speech Recognition Engine with Sparse LSTM on FPGA. 75-84

Interconnect and Routing

----------------------------------------------------------------------------------------------------------------------

Hans Giesen, Raphael Rubin, Benjamin Gojman, André DeHon:

Quality-Time Tradeoffs in Component-Specific Mapping: How to Train Your Dynamically Reconfigurable Array of Gates with Outrageous Network-delays. 85-94

Alex Rodionov, Jonathan Rose:

Synchronization Constraints for Interconnect Synthesis. 95-104

Minghua Shen, Guojie Luo:

Corolla: GPU-Accelerated FPGA Routing Based on Subgraph Dynamic Expansion.105-114

Architecture

----------------------------------------------------------------------------------------------------------------------

Sadegh Yazdanshenas, Kosuke Tatsumura, Vaughn Betz:

Don't Forget the Memory: Automatic Block RAM Modelling, Optimization, and Architecture Exploration.115-124

Hsin-Jung Yang, Kermin Fleming, Felix Winterstein, Annie I. Chen, Michael Adler, Joel S. Emer:

Automatic Construction of Program-Optimized FPGA Memory Networks. 125-134

Zhihong Huang, Xing Wei, Grace Zgheib, Wei Li, Yu Lin, Zhenghong Jiang, Kaihui Tu, Paolo Ienne, Haigang Yang:

NAND-NOR: A Compact, Fast, and Delay Balanced FPGA Logic Element. 135-140

Chethan Kumar H. B, Prashant Ravi, Gourav Modi, Nachiket Kapre:

120-core microAptiv MIPS Overlay for the Terasic DE5-NET FPGA board. 141-146

CAD Tools

----------------------------------------------------------------------------------------------------------------------

Gai Liu, Zhiru Zhang:

A Parallelized Iterative Improvement Approach to Area Optimization for LUT-Based Technology Mapping.147-156

Chang Xu, Gai Liu, Ritchie Zhao, Stephen Yang, Guojie Luo, Zhiru Zhang:

A Parallel Bandit-Based Approach for Autotuning FPGA Compilation. 157-166

Panel: FPGAs in the Cloud

----------------------------------------------------------------------------------------------------------------------

George A. Constantinides:

FPGAs in the Cloud. 167

High-Level Synthesis -- Tools and Applications

----------------------------------------------------------------------------------------------------------------------

Nadesh Ramanathan, Shane T. Fleming, John Wickerson, George A. Constantinides:

Hardware Synthesis of Weakly Consistent C Concurrency. 169-178

Yuan Zhou, Khalid Musa Al-Hawaj, Zhiru Zhang:

A New Approach to Automatic Memory Banking using Trace-Based Address Mining. 179-188

Steve Dai, Ritchie Zhao, Gai Liu, Shreesha Srinath, Udit Gupta, Christopher Batten, Zhiru Zhang:

Dynamic Hazard Resolution for Pipelining Irregular Loops in High-Level Synthesis. 189-194

Nitish Kumar Srivastava, Steve Dai, Rajit Manohar, Zhiru Zhang:

Accelerating Face Detection on Programmable SoC Using C-Based Synthesis. 195-200

Daniel Rozhko, Geoffrey Elliott, Daniel Ly-Ma, Paul Chow, Hans-Arno Jacobsen:

Packet Matching on FPGAs Using HMC Memory: Towards One Million Rules. 201-206

Graph Processing Applications

----------------------------------------------------------------------------------------------------------------------

Jialiang Zhang, Soroosh Khoram, Jing Li:

Boosting the Performance of FPGA-based Graph Processor using Hybrid Memory Cube: A Case for Breadth First Search. 207-216

Guohao Dai, Tianhao Huang, Yuze Chi, Ningyi Xu, Yu Wang, Huazhong Yang:

ForeGraph: Exploring Large-scale Graph Processing on Multi-FPGA Architecture. 217-226

Xiaoyu Ma, Dan Zhang, Derek Chiou:

FPGA-Accelerated Transactional Execution of Graph Workloads.227-236

Virtualization and Applications

----------------------------------------------------------------------------------------------------------------------

Naif Tarafdar, Thomas Lin, Eric Fukuda, Hadi Bannazadeh, Alberto Leon-Garcia, Paul Chow:

Enabling Flexible Network FPGA Clusters in a Heterogeneous Cloud Data Center. 237-246

Dennis Weller, Fabian Oboril, Dimitar Lukarski, Jürgen Becker, Mehdi Baradaran Tahoori:

Energy Efficient Scientific Computing on FPGAs using OpenCL. 247-256

Xin Fang, Stratis Ioannidis, Miriam Leeser:

Secure Function Evaluation Using an FPGA Overlay Architecture. 257-266

Applications

----------------------------------------------------------------------------------------------------------------------

Zhuolun He, Guojie Luo:

FPGA Acceleration for Computational Glass-Free Displays. 267-274

Sitao Huang, Gowthami Jayashri Manikandan, Anand Ramachandran, Kyle Rupnow, Wen-mei W. Hwu, Deming Chen:

Hardware Acceleration of the Pair-HMM Algorithm for DNA Variant Calling. 275-284

Poster Session 1

----------------------------------------------------------------------------------------------------------------------

Andy Gean Ye, Karthik Ganesan:

Measuring the Power-Constrained Performance and Energy Gap between FPGAs and Processors (Abstract Only).285

Yue Zha, Jialiang Zhang, Zhiqiang Wei, Jing Li:

A Mixed-Signal Data-Centric Reconfigurable Architecture enabled by RRAM Technology (Abstract Only). 285

Shuo Wang, Yun Liang:

A Framework for Iterative Stencil Algorithm Synthesis on FPGAs from OpenCL Programming Model (Abstract Only). 285-286

Yanqiang Liu, Yao Li, Weilun Xiong, Meng Lai, Cheng Chen, Zhengwei Qi, Haibing Guan:

Scala Based FPGA Design Flow (Abstract Only). 286

Girish Deshpande, Dinesh K. Bhatia:

Thermal Flattening in 3D FPGAs Using Embedded Cooling (Abstract Only). 286

Gary William Grewal, Shawki Areibi, Matthew Westrik, Ziad Abuowaimer, Betty Zhao:

A Machine Learning Framework for FPGA Placement (Abstract Only). 286

Ralf Salomon, Ralf Joost:

Precise Coincidence Detection on FPGAs: Three Case Studies (Abstract Only). 287

Mostafa Koraei, Magnus Jahre, S. Omid Fatemi:

Towards Efficient Design Space Exploration of FPGA-based Accelerators for Streaming HPC Applications (Abstract Only). 287

Ahmed M. Abdelsalam, J. M. Pierre Langlois, Farida Cheriet:

Accurate and Efficient Hyperbolic Tangent Activation Function on FPGA using the DCT Interpolation Filter (Abstract Only). 287

Thomas Luinaud, Yvon Savaria, J. M. Pierre Langlois:

An FPGA Overlay Architecture for Cost Effective Regular Expression Search (Abstract Only).287-288

Poster Session 2

----------------------------------------------------------------------------------------------------------------------

Zhipeng Zhao, James C. Hoe:

Using Vivado-HLS for Structural Design: a NoC Case Study (Abstract Only). 289

Christophe Bobda, Taylor J. L. Whitaker, Charles A. Kamhoua, Kevin A. Kwiat, Laurent Njilla:

Automatic Generation of Hardware Sandboxes for Trojan Mitigation in Systems on Chip (Abstract Only). 289

Haohuan Fu, Conghui He, Huabin Ruan, Itay Greenspon, Wayne Luk, Yongkang Zheng, Junfeng Liao, Qing Zhang, Guangwen Yang:

Accelerating Financial Market Server through Hybrid List Design (Abstract Only). 289-290

Tianyi Lu, Shouyi Yin, Xianqing Yao, Zhicong Xie, Leibo Liu, Shaojun Wei:

Joint Modulo Scheduling and Memory Partitioning with Multi-Bank Memory for High-Level Synthesis (Abstract Only). 290

Hiroki Nakahara, Haruyoshi Yonekawa, Hisashi Iwamoto, Masato Motomura:

A Batch Normalization Free Binarized Convolutional Deep Neural Network on an FPGA (Abstract Only). 290

Yixing Li, Zichuan Liu, Kai Xu, Hao Yu, Fengbo Ren:

A 7.663-TOPS 8.2-W Energy-efficient FPGA Accelerator for Binary Convolutional Neural Networks (Abstract Only). 290-291

Jason Cong, Zhenman Fang, Muhuan Huang, Libo Wang, Di Wu:

CPU-FPGA Co-Optimization for Big Data Applications: A Case Study of In-Memory Samtool Sorting (Abstract Only). 291

Mohammed Alawad, Mingjie Lin:

Stochastic-Based Multi-stage Streaming Realization of a Deep Convolutional Neural Network (Abstract Only).291

Stylianos I. Venieris, Christos-Savvas Bouganis:

fpgaConvNet: Automated Mapping of Convolutional Neural Networks on FPGAs (Abstract Only). 291-292

Poster Session 3

----------------------------------------------------------------------------------------------------------------------

Emanuele Pezzotti, Alex Iacobucci, Gregory Nash, Umer I. Cheema, Paolo Vinella, Rashid Ansari:

FPGA-based Hardware Accelerator for Image Reconstruction in Magnetic Resonance Imaging (Abstract Only).293

Yongming Shen, Michael Ferdman, Peter A. Milder:

Storage-Efficient Batching for Minimizing Bandwidth of Fully-Connected Neural Network Layers (Abstract Only).293

Subho S. Banerjee, Mohamed El-Hadedy, Jong Bin Lim, Daniel Chen, Zbigniew T. Kalbarczyk, Deming Chen, Ravishankar K. Iyer:

ASAP: Accelerated Short Read Alignment on Programmable Hardware (Abstract Only).293-294

Atieh Lotfi, Rajesh K. Gupta:

RxRE: Throughput Optimization for High-Level Synthesis using Resource-Aware Regularity Extraction (Abstract Only).294

Haoyang Wu, Tao Wang, Zhiwei Li, Boyan Ding, Xiaoguang Li, Tianfu Jiang, Jun Liu, Songwu Lu:

GRT 2.0: An FPGA-based SDR Platform for Cognitive Radio Networks (Abstract Only).294-295

Srinivas Siripurapu, Aman Gayasen, Padmini Gopalakrishnan, Nitin Chandrachoodan:

FPGA Implementation of Non-Uniform DFT for Accelerating Wireless Channel Simulations (Abstract Only).295

Shouyi Yin, Dajiang Liu, Lifeng Sun, Xinhan Lin, Leibo Liu, Shaojun Wei:

Learning Convolutional Neural Networks for Data-Flow Graph Mapping on Spatial Programmable Architectures (Abstract Only).295

Sumanta Chaudhuri:

Cache Timing Attacks from The SoCFPGA Coherency Port (Abstract Only).295-296

Fubing Mao, Wei Zhang, Bingsheng He, SiewKei Lam:

Dynamic Partitioning for Library based Placement on Heterogeneous FPGAs (Abstract Only).296

Wei Ting Loke, Chin Yang Koay:

An Energy-Efficient Design-Time Scheduler for FPGAs Leveraging Dynamic Frequency Scaling Emulation (Abstract Only).296

FPGA 2018 Conference Papers

Paper download link: https://dl.acm.org/citation.cfm?id=3174243&picked=prox

Special Session: Deep Learning

----------------------------------------------------------------------------------------------------------------------

Bita Darvish Rouhani, Mohammad Ghasemzadeh, Farinaz Koushanfar:

CausaLearn: Automated Framework for Scalable Streaming-based Causal Bayesian Learning using FPGAs.1-10

Shuo Wang, Zhe Li, Caiwen Ding, Bo Yuan, Qinru Qiu, Yanzhi Wang, Yun Liang:

C-LSTM: Enabling Efficient LSTM using Structured Compression Techniques on FPGAs.11-20

Chang Gao, Daniel Neil, Enea Ceolini, Shih-Chii Liu, Tobi Delbrück:

DeltaRNN: A Power-efficient Recurrent Neural Network Accelerator.21-30

Hiroki Nakahara, Haruyoshi Yonekawa, Tomoya Fujii, Shimpei Sato:

A Lightweight YOLOv2: A Binarized CNN with A Parallel Support Vector Regression for an FPGA.31-40

Session 1: Architecture

----------------------------------------------------------------------------------------------------------------------

Stephen M. Williams, Mingjie Lin:

Architecture and Circuit Design of an All-Spintronic FPGA.41-50

Yue Zha, Jing Li:

Liquid Silicon: A Data-Centric Reconfigurable Architecture Enabled by RRAM Technology.51-60

Wenyi Feng, Jonathan W. Greene, Alan Mishchenko:

Improving FPGA Performance with a S44 LUT Structure.61-66

Session 2: CAD

----------------------------------------------------------------------------------------------------------------------

Chin Hau Hoo, Akash Kumar:

ParaDRo: A Parallel Deterministic Router Based on Spatial Partitioning and Scheduling.67-76

Soheil Mohajer, Zhiheng Wang, Kia Bazargan:

Routing Magic: Performing Computations Using Routing Networks and Voting Logic on Unary Encoded Data.77-86

Shenghsun Cho, Mrunal Patel, Han Chen, Michael Ferdman, Peter Milder:

A Full-System VM-HDL Co-Simulation Framework for Servers with PCIe-Connected FPGAs.87-96

Session 3: Deep Learning

----------------------------------------------------------------------------------------------------------------------

Junzhong Shen, You Huang, Zelong Wang, Yuran Qiao, Mei Wen, Chunyuan Zhang:

Towards a Uniform Template-based Architecture for Accelerating 2D and 3D CNNs on FPGA.97-106

Duncan J. M. Moss, Krishnan Srivatsan, Eriko Nurvitadhi, Piotr Ratuszniak, Chris Johnson, Jaewoong Sim, Asit K. Mishra, Debbie Marr, Suchit Subhaschandra, Philip Heng Wai Leong:

A Customizable Matrix Multiplication Framework for the Intel HARPv2 Xeon+FPGA Platform: A Deep Learning Case Study.107-116

Hanqing Zeng, Ren Chen, Chi Zhang, Viktor K. Prasanna:

A Framework for Generating High Throughput CNN Implementations on FPGAs.117-126

Session 4: High Level Synthesis 1

----------------------------------------------------------------------------------------------------------------------

Lana Josipovic, Radhika Ghosal, Paolo Ienne:

Dynamically Scheduled High-level Synthesis.127-136

Steve Dai, Gai Liu, Zhiru Zhang:

A Scalable Approach to Exact Resource-Constrained Scheduling Based on a Joint SDC and SAT Formulation.137-146

Jeferson Santiago da Silva, François-Raymond Boyer, J. M. Pierre Langlois:

P4-Compatible High-Level Synthesis of Low Latency 100 Gb/s Streaming Packet Parsers in FPGAs.147-152

Session 5: Applications 1

----------------------------------------------------------------------------------------------------------------------

Hamid Reza Zohouri, Artur Podobas, Satoshi Matsuoka:

Combined Spatial and Temporal Blocking for High-Performance Stencil Computation on FPGAs Using OpenCL.153-162

Jan Dürre, Dario Paradzik, Holger Blume:

A HOG-based Real-time and Multi-scale Pedestrian Detector Demonstration System on FPGA.163-172

Greg Stitt, Abhay Gupta, Madison N. Emas, David Wilson, Austin Baylis:

Scalable Window Generation for the Intel Broadwell+Arria 10 and High-Bandwidth FPGA Systems.173-182

Martin Langhammer, Bogdan Pasca:

High-Performance QR Decomposition for FPGAs.183-188

Session 6: High Level Synthesis 2

----------------------------------------------------------------------------------------------------------------------

Ho-Cheung Ng, Shuanglong Liu, Wayne Luk:

ADAM: Automated Design Analysis and Merging for Speeding up FPGA Development.189-198

Juan Escobedo, Mingjie Lin:

Graph-Theoretically Optimal Memory Banking for Stencil-Based Computing Kernels.199-208

Al-Shahna Jamal, Jeffrey Goeders, Steven J. E. Wilton:

Architecture Exploration for HLS-Oriented FPGA Debug Overlays.209-218

Session 7: Circuits and Computation Engines

----------------------------------------------------------------------------------------------------------------------

François Serre, Markus Püschel:

Memory-Efficient Fast Fourier Transform on Streaming Data by Fusing Permutations.219-228

Jialiang Zhang, Jing Li:

Degree-aware Hybrid Graph Traversal on FPGA-HMC Platform.229-238

Soroosh Khoram, Jialiang Zhang, Maxwell Strange, Jing Li:

Accelerating Graph Analytics by Co-Optimizing Storage and Access on an FPGA-HMC Platform.239-248

Session 8: Applications 2

----------------------------------------------------------------------------------------------------------------------

Jakub Cabal, Pavel Benácek, Lukas Kekely, Michal Kekely, Viktor Pus, Jan Korenek:

Configurable FPGA Packet Parser for Terabit Networks with Guaranteed Wire-Speed Throughput.249-258

Shijie Zhou, Rajgopal Kannan, Yu Min, Viktor K. Prasanna:

FASTCF: FPGA-based Accelerator for STochastic-Gradient-Descent-based Collaborative Filtering.259-268

Yuan Zhou, Udit Gupta, Steve Dai, Ritchie Zhao, Nitish Kumar Srivastava, Hanchen Jin, Joseph Featherston, Yi-Hsiang Lai, Gai Liu, Gustavo Angarita Velasquez, Wenping Wang, Zhiru Zhang:

Rosetta: A Realistic High-Level Synthesis Benchmark Suite for Software Programmable FPGAs.269-278

Sean Fox, David Boland, Philip Heng Wai Leong:

FPGA Fastfood - A High Speed Systolic Implementation of a Large Scale Online Kernel Method.279-284

Poster Session 1

----------------------------------------------------------------------------------------------------------------------

Zheming Jin, Kazutomo Yoshii:

Optimizations of Sequence Alignment on FPGA: A Case Study of Extended Sequence Alignment (Abstract Only).285

Ruizhe Zhao, Xinyu Niu, Wayne Luk:

Automatic Optimising CNN with Depthwise Separable Convolution on FPGA: (Abstract Only).285

Kenichi Koizumi, Kei Hiraki, Mary Inaba:

Continuous Skyline Computation Accelerator with Parallelizing Dominance Relation Calculations: (Abstract Only).285

Nachiket Kapre, Tushar Krishna:

FastTrack: Exploiting Fast FPGA Wiring for Implementing NoC Shortcuts (Abstract Only).286

Yuze Chi, Peipei Zhou, Jason Cong:

An Optimal Microarchitecture for Stencil Computation with Data Reuse and Fine-Grained Parallelism: (Abstract Only).286

Haiyue Song, Xiang Song, Tianjian Li, Hao Dong, Naifeng Jing, Xiaoyao Liang, Li Jiang:

A FPGA Friendly Approximate Computing Framework with Hybrid Neural Networks: (Abstract Only).286

Eriko Nurvitadhi, Jeffrey J. Cook, Asit K. Mishra, Debbie Marr, Kevin Nealis, Philip Colangelo, Andrew C. Ling, Davor Capalija, Utku Aydonat, Sergey Shumarayev, Aravind Dasu:

In-Package Domain-Specific ASICs for Intel® Stratix® 10 FPGAs: A Case Study of Accelerating Deep Learning Using TensorTile ASIC (Abstract Only).287

Zheming Jin, Hal Finkel:

Evaluation of OpenCL Performance-oriented Optimizations for Streaming Kernels on the FPGA: (Abstract Only).287

Jason Cong, Zhenman Fang, Yao Hu, Di Wu:

K-Flow: A Programming and Scheduling Framework to Optimize Dataflow Execution on CPU-FPGA Platforms: (Abstract Only).287

Zhe Chen, Andrew Howe, Hugh T. Blair, Jason Cong:

FPGA-based LSTM Acceleration for Real-Time EEG Signal Processing: (Abstract Only).288

Jason Cong, Zhenman Fang, Michael Lo, Hanrui Wang, Jingxian Xu, Shaochong Zhang:

Understanding Performance Differences of FPGAs and GPUs: (Abstract Only).288

Poster Session 2

----------------------------------------------------------------------------------------------------------------------

Nan Ding, Wei Zhang, Yanhua Ma, Zhenguo Gao:

Software/Hardware Co-design for Multichannel Scheduling in IEEE 802.11p MLME: (Abstract Only).289

Juexiao Su, Lei He:

Solving Satisfiability Problem on Quantum Annealer: A Lesson from FPGA CAD Tools: (Abstract Only).289

Chongchong Xu, Chao Wang, Yiwei Zhang, Lei Gong, Xi Li, Xuehai Zhou:

Domino: An Asynchronous and Energy-efficient Accelerator for Graph Processing: (Abstract Only).289

Minghua Shen, Wentai Zhang, Nong Xiao, Guojie Luo:

Towards Serial-Equivalent Parallel Routing for FPGAs: (Abstract Only).289

Matej Bartík, Sven Ubik, Pavel Kubalík, Tomás Benes:

Performance Comparison of Multiple Approaches of Status Register for Medium Density Memory Suitable for Implementation of a Lossless Compression Dictionary: (Abstract Only).290

Minghua Shen, Jiaxi Zhang, Nong Xiao, Guojie Luo:

BoxPlacer: Force Directed-Based Timing-Driven Placement for Large-Scale FPGAs: (Abstract Only).290

Gai Liu, Ecenur Ustun, Shaojie Xiang, Chang Xu, Guojie Luo, Zhiru Zhang:

DATuner: An Extensible Distributed Autotuning Framework for FPGA Design and Design Automation: (Abstract Only).290

Wentai Zhang, Jiaxi Zhang, Minghua Shen, Nong Xiao, Guojie Luo:

Mapping Large-Scale DNNs on Asymmetric FPGAs: (Abstract Only).291

Yankang Du, Qinrang Liu, Shuai Wei, Chen Gao:

Software-Defined FPGA-Based Accelerator for Deep Convolutional Neural Networks: (Abstract Only).291

Daisuke Suzuki, Takahiro Hanyu:

Design of an MTJ-Based Nonvolatile LUT Circuit with a Data-Update Minimized Shift Operation for an Ultra-Low-Power FPGA: (Abstract Only).291

Weikang Qiao, Jieqiong Du, Zhenman Fang, Libo Wang, Michael Lo, Mau-Chung Frank Chang, Jason Cong:

High-Throughput Lossless Compression on Tightly Coupled CPU-FPGA Platforms: (Abstract Only).291

Poster Session 3

----------------------------------------------------------------------------------------------------------------------

Fady Hussein, Luka Daoud, Nader Rafla:

HexCell: a Hexagonal Cell for Evolvable Systolic Arrays on FPGAs: (Abstract Only).293

Xiaoyu Yu, Dong Ye:

Performance Comparison of Multiples and Target Detection with Imager-driven Processing Mode for Ultrafast-Imager: (Abstract Only).293

Shuanglong Liu, Xinyu Niu, Wayne Luk:

A Low-Power Deconvolutional Accelerator for Convolutional Neural Network Based Segmentation on FPGA: (Abstract Only).293

Mikhail Asiatici, Damian Maiorano, Paolo Ienne:

FPGAs in the Datacenters: the Case of Parallel Hybrid Super Scalar String Sample Sort (pHS5) (Abstract Only).294

Luka Daoud, Muhammad Kamran Latif, Nader Rafla:

SIFT Keypoint Descriptor Matching Algorithm: A Fully Pipelined Accelerator on FPGA (Abstract Only).294

Oluseyi A. Ayorinde, He Qi, Benton H. Calhoun:

FGC: A Tool-flow for Generating and Configuring Custom FPGAs (Abstract Only).294

Philip Colangelo, Nasibeh Nasiri, Eriko Nurvitadhi, Asit K. Mishra, Martin Margala, Kevin Nealis:

Exploration of Low Numeric Precision Deep Learning Inference Using Intel® FPGAs: (Abstract Only).294

Andrea Guerrieri, Sahand Kashani-Akhavan, Mikhail Asiatici, Pasquale Lombardi, Bilel Belhadj, Paolo Ienne:

LEOSoC: An Open-Source Cross-Platform Embedded Linux Library for Managing Hardware Accelerators in Heterogeneous System-on-Chips (Abstract Only).295