目录
论文整理打包链接:https://pan.baidu.com/s/1e_C3m-A0sgXfUUNtmCTpJA,提取码:8owl
FPGA2016会议论文
论文下载地址:https://dl.acm.org/citation.cfm?id=2847263&picked=prox
Workshop on Overlay Architectures for FPGAFPGA 覆盖架构研讨会
----------------------------------------------------------------------------------------------------------------------
OLAF'16: Second International Workshop on Overlay Architectures for FPGAs.1
Designers' Day Session 1:Hardware Features 设计师日会议1:硬件功能
----------------------------------------------------------------------------------------------------------------------
HyperPipelining of High-Speed Interface Logic.2
Spatial Debug & Debug Without Re-programming in FPGAs: On-Chip debugging in FPGAs.3
Designers' Day Session 2:System Level Methodology 设计师日第二场:系统级方法
----------------------------------------------------------------------------------------------------------------------
SDSoC: A Higher-level Programming Environment for Zynq SoC and Ultrascale+ MPSoC.4
FCUDA-SoC: Platform Integration for Field-Programmable SoC with the CUDA-to-FPGA Compiler.5-14
- Shlomi Alkalay, Hari Angepat, Adrian M. Caulfield, Eric S. Chung, Oren Firestein, Michael Haselman, Stephen Heil, Kyle Holohan, Matt Humphrey, Tamás Juhász, Puneet Kaur, Sitaram Lanka, Daniel Lo, Todd Massengill, Kalin Ovtcharov, Michael Papamichael, Andrew Putnam, Raja Seera, Rimon Tadros, Jason Thong, Lisa Woods, Derek Chiou, Doug Burger:
Agile Co-Design for a Reconfigurable Datacenter.15
Technical Session 1:Neural Networks and OpenCL 技术会议1:神经网络和OpenCL
----------------------------------------------------------------------------------------------------------------------
- Naveen Suda, Vikas Chandra, Ganesh Dasika, Abinash Mohanty, Yufei Ma, Sarma B. K. Vrudhula, Jae-sun Seo, Yu Cao:
Throughput-Optimized OpenCL-based FPGA Accelerator for Large-Scale Convolutional Neural Networks.16-25
- Jiantao Qiu, Jie Wang, Song Yao, Kaiyuan Guo, Boxun Li, Erjin Zhou, Jincheng Yu, Tianqi Tang, Ningyi Xu, Sen Song, Yu Wang, Huazhong Yang:
Going Deeper with Embedded FPGA Platform for Convolutional Neural Network.26-35
Using Stochastic Computing to Reduce the Hardware Requirements for a Restricted Boltzmann Machine Classifier.36-41
A Platform-Oblivious Approach for Heterogeneous Computing: A Case Study with Monte Carlo-based Simulation for Medical Applications.42-47
A Case for Work-stealing on FPGAs with OpenCL Atomics.48-53
Technical Session 2:Cooling and Clocking 技术会议2:冷却和时钟
----------------------------------------------------------------------------------------------------------------------
Physical Design of 3D FPGAs Embedded with Micro-channel-based Fluidic Cooling.54-63
Stratix™ 10 High Performance Routable Clock Networks.64-73
Boolean Satisfiability-Based Routing and Its Application to Xilinx UltraScale Clock Network.74-79
Technical Session 3:Circuit Design, Graph Processing Applications 技术会议3:电路设计,图形处理应用
----------------------------------------------------------------------------------------------------------------------
FPRESSO: Enabling Express Transistor-Level Exploration of FPGA Architectures.80-89
Towards PVT-Tolerant Glitch-Free Operation in FPGAs.90-99
Pitfalls and Tradeoffs in Simultaneous, On-Chip FPGA Delay Measurement.100-104
FPGP: Graph Processing Framework on FPGA A Case Study of Breadth-First Search.105-110
GraphOps: A Dataflow Library for Graph Analytics Acceleration.111-117
Technical Session 4:Applications and System-level Tools 技术会议4:应用和系统级工具
----------------------------------------------------------------------------------------------------------------------
High Performance Linkage Disequilibrium: FPGAs Hold the Key.118-127
LMC: Automatic Resource-Aware Program-Optimized Memory Partitioning.128-137
Efficient Memory Partitioning for Parallel Data Access via Data Reuse.138-147
Evening Panel 晚上小组
----------------------------------------------------------------------------------------------------------------------
Intel Acquires Altera: How Will the World of FPGAs be Affected?148
Technical Session 5:Architecture and Tools 技术会议5:架构和工具
----------------------------------------------------------------------------------------------------------------------
PRFloor: An Automatic Floorplanner for Partially Reconfigurable FPGA Systems.149-158
- David M. Lewis, Gordon R. Chiu, Jeffrey Chromczak, David R. Galloway, Ben Gamsa, Valavan Manohararajah, Ian Milton, Tim Vanderhoek, John Van Dyken:
The Stratix™ 10 Highly Pipelined FPGA Architecture.159-168
Case for Design-Specific Machine Learning in Timing Closure of FPGA Designs.169-172
Just In Time Assembly of Accelerators.173-178
CASK: Open-Source Custom Architectures for Sparse Kernels.179-184
Technical Session 6:System-level Tools 技术会议6:系统级工具
----------------------------------------------------------------------------------------------------------------------
GPU-Accelerated High-Level Synthesis for Bitwidth Optimization of FPGA Datapaths.185-194
Resolve: Generation of High-Performance Sorting Architectures from High-Level Synthesis.195-204
SEU Mitigation and Validation of the LEON3 Soft Processor Using Triple Modular Redundancy for Space Processing.205-214
Technical Session 7:High-level Synthesis and Tools 技术会议7:高级综合和工具
----------------------------------------------------------------------------------------------------------------------
Optimal Circuits for Streamed Linear Permutations Using RAM.215-223
High Level Synthesis of Complex Applications: An H.264 Video Decoder.224-233
Automatically Optimizing the Latency, Area, and Accuracy of C Programs for High-Level Synthesis.234-243
Technical Session 8:Applications 技术会议8:应用
----------------------------------------------------------------------------------------------------------------------
Reducing Memory Requirements for High-Performance and Numerically Stable Gaussian Elimination.244-253
FGPU: An SIMT-Architecture for FPGAs.254-263
A Study of Pointer-Chasing Performance on Shared-Memory Processor-FPGA Systems.264-273
Poster Session 1 海报会议1
----------------------------------------------------------------------------------------------------------------------
A Low DDR Bandwidth 100FPS 1080p Video 2D Discrete Wavelet Transform Implementation on FPGA (Abstract Only).274
A Scalable Heterogeneous Dataflow Architecture For Big Data Analytics Using FPGAs (Abstract Only).274
Accelerating Database Query Processing on OpenCL-based FPGAs (Abstract Only).274
An Improved Global Stereo-Matching on FPGA for Real-Time Applications (Abstract Only).274
ENFIRE: An Energy-efficient Fine-grained Spatio-temporal Reconfigurable Computing Fabric (Abstact Only).275
Floorplanning of Partially Reconfigurable Design on Heterogeneous FPGA (Abstract Only).275
Increasing the Utility of Self-Calibration Methods in High-Precision Time Measurement Systems (Abstract Only).275
- James J. Davis, Eddie Hung, Joshua M. Levine, Edward A. Stott, Peter Y. K. Cheung, George A. Constantinides:
Knowledge is Power: Module-level Sensing for Runtime Optimisation (Abstact Only).276
Machine-Learning driven Auto-Tuning of High-Level Synthesis for FPGAs (Abstract Only).276
Re-targeting Optimization Sequences from Scalar Processors to FPGAs in HLS compilers (Abstract Only).276
Poster Session 2 海报会议2
----------------------------------------------------------------------------------------------------------------------
A High-throughput Architecture for Lossless Decompression on FPGA Designed Using HLS (Abstract Only).277
An Activity Aware Placement Approach For 3D FPGAs (Abstract Only).277
an Extensible Heterogeneous Multi-FPGA Framework for Accelerating N-body Simulation (Abstract Only).277
An FPGA-Based Controller for a 77 GHz MEMS Tri-Mode Automotive Radar (Abstract Only).278
An FPGA-SOC Based Accelerating Solution for N-body Simulations in MOND (Abstract Only).278
Automated Verification Code Generation in HLS Using Software Execution Traces (Abstract Only).278
DCPUF: Placement and Routing Constraint based Dynamically Configured Physical Unclonable Function on FPGA (Abstact Only).279
Evaluating the Impact of Environmental Factors on Physically Unclonable Functions (Abstract Only).279
Stochastic-Based Spin-Programmable Gate Array with Emerging MTJ Device Technology (Abstract Only).279
Testing FPGA Local Interconnects Based on Repeatable Configuration Modules (Abstract Only).280
Poster Session 3 海报会议3
----------------------------------------------------------------------------------------------------------------------
A 1 GSa/s, Reconfigurable Soft-core FPGA ADC (Abstract Only).281
A Full-Capacity Local RoutingArchitecture for FPGAs (Abstract Only).281
ARAPrototyper: Enabling Rapid Prototyping and Evaluation for Accelerator-Rich Architecture (Abstact Only).281
Doubling FPGA Throughput via a Soft SerDes Architecture for Full-Bandwidth Serial Pipelining (Abstract Only).282
Enhanced TERO-PUF Implementations and Characterization on FPGAs (Abstract Only).282
FPGA Power Estimation Using Automatic Feature Selection (Abstract Only).282
HGum: Messaging Framework for Hardware Accelerators (Abstact Only).283
Low-Swing Signaling for FPGA Power Reduction (Abstract Only).283
Stochastic-Based Convolutional Networks with Reconfigurable Logic Fabric (Abstract Only).283
t-QuadPlace: Timing Driven Quadratic Placement using Quadrisection Partitioning for FPGAs (Abstact Only).284
FPGA2017会议论文
论文下载地址:https://dl.acm.org/citation.cfm?id=3020078&picked=prox
FPGA'17 Workshops FPGA'17研讨会
----------------------------------------------------------------------------------------------------------------------
OLAF'17: Third International Workshop on Overlay Architectures for FPGAs.1
Special Session:The Role of FPGAs in Deep Learning 特别会议:FPGA在深度学习中的作用
----------------------------------------------------------------------------------------------------------------------
The Role of FPGAs in Deep Learning. 3
- Eriko Nurvitadhi, Ganesh Venkatesh, Jaewoong Sim, Debbie Marr, Randy Huang, Jason Ong Gee Hock, Yeong Tat Liew, Krishnan Srivatsan, Duncan J. M. Moss, Suchit Subhaschandra, Guy Boudoukh:
Can FPGAs Beat GPUs in Accelerating Next-Generation Deep Neural Networks? 5-14
- Ritchie Zhao, Weinan Song, Wentao Zhang, Tianwei Xing, Jeng-Hau Lin, Mani B. Srivastava, Rajesh Gupta, Zhiru Zhang:
Accelerating Binarized Convolutional Neural Networks with Software-Programmable FPGAs. 15-24
Improving the Performance of OpenCL-based FPGA Accelerator for Convolutional Neural Network. 25-34
Frequency Domain Acceleration of Convolutional Neural Networks on CPU-FPGA Shared Memory System. 35-44
Optimizing Loop Operation and Dataflow in FPGA Acceleration of Deep Convolutional Neural Networks. 45-54
Machine Learning 机器学习
----------------------------------------------------------------------------------------------------------------------
An OpenCL™ Deep Learning Accelerator on Arria 10. 55-64
- Yaman Umuroglu, Nicholas J. Fraser, Giulio Gambardella, Michaela Blott, Philip Heng Wai Leong, Magnus Jahre, Kees A. Vissers:
FINN: A Framework for Fast, Scalable Binarized Neural Network Inference. 65-74
- Song Han, Junlong Kang, Huizi Mao, Yiming Hu, Xin Li, Yubin Li, Dongliang Xie, Hong Luo, Song Yao, Yu Wang, Huazhong Yang, William (Bill) J. Dally:
ESE: Efficient Speech Recognition Engine with Sparse LSTM on FPGA. 75-84
Interconnect and Routing 互连和路由
----------------------------------------------------------------------------------------------------------------------
Quality-Time Tradeoffs in Component-Specific Mapping: How to Train Your Dynamically Reconfigurable Array of Gates with Outrageous Network-delays. 85-94
Synchronization Constraints for Interconnect Synthesis. 95-104
Corolla: GPU-Accelerated FPGA Routing Based on Subgraph Dynamic Expansion.105-114
Architecture 架构
----------------------------------------------------------------------------------------------------------------------
Don't Forget the Memory: Automatic Block RAM Modelling, Optimization, and Architecture Exploration.115-124
Automatic Construction of Program-Optimized FPGA Memory Networks. 125-134
- Zhihong Huang, Xing Wei, Grace Zgheib, Wei Li, Yu Lin, Zhenghong Jiang, Kaihui Tu, Paolo Ienne, Haigang Yang:
NAND-NOR: A Compact, Fast, and Delay Balanced FPGA Logic Element. 135-140
120-core microAptiv MIPS Overlay for the Terasic DE5-NET FPGA board. 141-146
CAD Tools CAD工具
----------------------------------------------------------------------------------------------------------------------
A Parallelized Iterative Improvement Approach to Area Optimization for LUT-Based Technology Mapping.147-156
A Parallel Bandit-Based Approach for Autotuning FPGA Compilation. 157-166
Panel:FPGAs in the Cloud 专家组:云端的FPGA
----------------------------------------------------------------------------------------------------------------------
FPGAs in the Cloud. 167
High-Level Synthesis -- Tools and Applications 高级综合--工具和应用
----------------------------------------------------------------------------------------------------------------------
Hardware Synthesis of Weakly Consistent C Concurrency. 169-178
A New Approach to Automatic Memory Banking using Trace-Based Address Mining. 179-188
Dynamic Hazard Resolution for Pipelining Irregular Loops in High-Level Synthesis. 189-194
Accelerating Face Detection on Programmable SoC Using C-Based Synthesis. 195-200
Packet Matching on FPGAs Using HMC Memory: Towards One Million Rules. 201-206
Graph Processing Applications 图形处理应用
----------------------------------------------------------------------------------------------------------------------
Boosting the Performance of FPGA-based Graph Processor using Hybrid Memory Cube: A Case for Breadth First Search. 207-216
ForeGraph: Exploring Large-scale Graph Processing on Multi-FPGA Architecture. 217-226
FPGA-Accelerated Transactional Execution of Graph Workloads.227-236
Virtualization and Applications 虚拟化和应用
----------------------------------------------------------------------------------------------------------------------
Enabling Flexible Network FPGA Clusters in a Heterogeneous Cloud Data Center. 237-246
Energy Efficient Scientific Computing on FPGAs using OpenCL. 247-256
Secure Function Evaluation Using an FPGA Overlay Architecture. 257-266
Applications 应用
----------------------------------------------------------------------------------------------------------------------
FPGA Acceleration for Computational Glass-Free Displays. 267-274
- Sitao Huang, Gowthami Jayashri Manikandan, Anand Ramachandran, Kyle Rupnow, Wen-mei W. Hwu, Deming Chen:
Hardware Acceleration of the Pair-HMM Algorithm for DNA Variant Calling. 275-284
Poster Session 1 海报会议1
----------------------------------------------------------------------------------------------------------------------
Measuring the Power-Constrained Performance and Energy Gap between FPGAs and Processors (Abstract Only).285
A Mixed-Signal Data-Centric Reconfigurable Architecture enabled by RRAM Technology (Abstract Only). 285
A Framework for Iterative Stencil Algorithm Synthesis on FPGAs from OpenCL Programming Model (Abstract Only). 285-286
Scala Based FPGA Design Flow (Abstract Only). 286
Thermal Flattening in 3D FPGAs Using Embedded Cooling (Abstract Only). 286
A Machine Learning Framework for FPGA Placement (Abstract Only). 286
Precise Coincidence Detection on FPGAs: Three Case Studies (Abstract Only). 287
Towards Efficient Design Space Exploration of FPGA-based Accelerators for Streaming HPC Applications (Abstract Only). 287
Accurate and Efficient Hyperbolic Tangent Activation Function on FPGA using the DCT Interpolation Filter (Abstract Only). 287
An FPGA Overlay Architecture for Cost Effective Regular Expression Search (Abstract Only).287-288
Poster Session 2 海报会议2
----------------------------------------------------------------------------------------------------------------------
Using Vivado-HLS for Structural Design: a NoC Case Study (Abstract Only). 289
Automatic Generation of Hardware Sandboxes for Trojan Mitigation in Systems on Chip (Abstract Only). 289
- Haohuan Fu, Conghui He, Huabin Ruan, Itay Greenspon, Wayne Luk, Yongkang Zheng, Junfeng Liao, Qing Zhang, Guangwen Yang:
Accelerating Financial Market Server through Hybrid List Design (Abstract Only). 289-290
Joint Modulo Scheduling and Memory Partitioning with Multi-Bank Memory for High-Level Synthesis (Abstract Only). 290
A Batch Normalization Free Binarized Convolutional Deep Neural Network on an FPGA (Abstract Only). 290
A 7.663-TOPS 8.2-W Energy-efficient FPGA Accelerator for Binary Convolutional Neural Networks (Abstract Only). 290-291
CPU-FPGA Co-Optimization for Big Data Applications: A Case Study of In-Memory Samtool Sorting (Abstract Only). 291
Stochastic-Based Multi-stage Streaming Realization of a Deep Convolutional Neural Network (Abstract Only).291
fpgaConvNet: Automated Mapping of Convolutional Neural Networks on FPGAs (Abstract Only). 291-292
Poster Session 3 海报会议3
----------------------------------------------------------------------------------------------------------------------
FPGA-based Hardware Accelerator for Image Reconstruction in Magnetic Resonance Imaging (Abstract Only).293
Storage-Efficient Batching for Minimizing Bandwidth of Fully-Connected Neural Network Layers (Abstract Only).293
- Subho S. Banerjee, Mohamed El-Hadedy, Jong Bin Lim, Daniel Chen, Zbigniew T. Kalbarczyk, Deming Chen, Ravishankar K. Iyer:
ASAP: Accelerated Short Read Alignment on Programmable Hardware (Abstract Only).293-294
RxRE: Throughput Optimization for High-Level Synthesis using Resource-Aware Regularity Extraction (Abstract Only).294
GRT 2.0: An FPGA-based SDR Platform for Cognitive Radio Networks (Abstract Only).294-295
FPGA Implementation of Non-Uniform DFT for Accelerating Wireless Channel Simulations (Abstract Only).295
Learning Convolutional Neural Networks for Data-Flow Graph Mapping on Spatial Programmable Architectures (Abstract Only).295
Cache Timing Attacks from The SoCFPGA Coherency Port (Abstract Only).295-296
Dynamic Partitioning for Library based Placement on Heterogeneous FPGAs (Abstract Only).296
An Energy-Efficient Design-Time Scheduler for FPGAs Leveraging Dynamic Frequency Scaling Emulation (Abstract Only).296
FPGA2018会议论文
论文下载地址:https://dl.acm.org/citation.cfm?id=3174243&picked=prox
Special Session : Deep Learning 特别会议:深度学习
----------------------------------------------------------------------------------------------------------------------
CausaLearn: Automated Framework for Scalable Streaming-based Causal Bayesian Learning using FPGAs.1-10
C-LSTM: Enabling Efficient LSTM using Structured Compression Techniques on FPGAs.11-20
DeltaRNN: A Power-efficient Recurrent Neural Network Accelerator.21-30
A Lightweight YOLOv2: A Binarized CNN with A Parallel Support Vector Regression for an FPGA.31-40
Session 1:Architecture 会议1:架构
----------------------------------------------------------------------------------------------------------------------
Architecture and Circuit Design of an All-Spintronic FPGA.41-50
Liquid Silicon: A Data-Centric Reconfigurable Architecture Enabled by RRAM Technology.51-60
Improving FPGA Performance with a S44 LUT Structure.61-66
Session 2:CAD 会议2:CAD
----------------------------------------------------------------------------------------------------------------------
ParaDRo: A Parallel Deterministic Router Based on Spatial Partitioning and Scheduling.67-76
Routing Magic: Performing Computations Using Routing Networks and Voting Logic on Unary Encoded Data.77-86
A Full-System VM-HDL Co-Simulation Framework for Servers with PCIe-Connected FPGAs.87-96
Session 3:Deep Learning 会议3:深度学习
----------------------------------------------------------------------------------------------------------------------
Towards a Uniform Template-based Architecture for Accelerating 2D and 3D CNNs on FPGA.97-106
- Duncan J. M. Moss, Krishnan Srivatsan, Eriko Nurvitadhi, Piotr Ratuszniak, Chris Johnson, Jaewoong Sim, Asit K. Mishra, Debbie Marr, Suchit Subhaschandra, Philip Heng Wai Leong:
A Customizable Matrix Multiplication Framework for the Intel HARPv2 Xeon+FPGA Platform: A Deep Learning Case Study.107-116
A Framework for Generating High Throughput CNN Implementations on FPGAs.117-126
Session 4:High Level Synthesis 1 会议4:高级综合1
----------------------------------------------------------------------------------------------------------------------
Dynamically Scheduled High-level Synthesis.127-136
A Scalable Approach to Exact Resource-Constrained Scheduling Based on a Joint SDC and SAT Formulation.137-146
P4-Compatible High-Level Synthesis of Low Latency 100 Gb/s Streaming Packet Parsers in FPGAs.147-152
Session 5:Applications 1 会议5:应用1
----------------------------------------------------------------------------------------------------------------------
Combined Spatial and Temporal Blocking for High-Performance Stencil Computation on FPGAs Using OpenCL.153-162
A HOG-based Real-time and Multi-scale Pedestrian Detector Demonstration System on FPGA.163-172
Scalable Window Generation for the Intel Broadwell+Arria 10 and High-Bandwidth FPGA Systems.173-182
High-Performance QR Decomposition for FPGAs.183-188
Session 6:High Level Synthesis 2 会议6:高级综合2
----------------------------------------------------------------------------------------------------------------------
ADAM: Automated Design Analysis and Merging for Speeding up FPGA Development.189-198
Graph-Theoretically Optimal Memory Banking for Stencil-Based Computing Kernels.199-208
Architecture Exploration for HLS-Oriented FPGA Debug Overlays.209-218
Session 7:Circuits and Computation Engines 会议7:电路和计算引擎
----------------------------------------------------------------------------------------------------------------------
Memory-Efficient Fast Fourier Transform on Streaming Data by Fusing Permutations.219-228
Degree-aware Hybrid Graph Traversal on FPGA-HMC Platform.229-238
Accelerating Graph Analytics by Co-Optimizing Storage and Access on an FPGA-HMC Platform.239-248
Session 8:Applications 2 会议8:应用2
----------------------------------------------------------------------------------------------------------------------
Configurable FPGA Packet Parser for Terabit Networks with Guaranteed Wire-Speed Throughput.249-258
FASTCF: FPGA-based Accelerator for STochastic-Gradient-Descent-based Collaborative Filtering.259-268
- Yuan Zhou, Udit Gupta, Steve Dai, Ritchie Zhao, Nitish Kumar Srivastava, Hanchen Jin, Joseph Featherston, Yi-Hsiang Lai, Gai Liu, Gustavo Angarita Velasquez, Wenping Wang, Zhiru Zhang:
Rosetta: A Realistic High-Level Synthesis Benchmark Suite for Software Programmable FPGAs.269-278
FPGA Fastfood - A High Speed Systolic Implementation of a Large Scale Online Kernel Method.279-284
Poster Session 1 海报会议1
----------------------------------------------------------------------------------------------------------------------
Optimizations of Sequence Alignment on FPGA: A Case Study of Extended Sequence Alignment (Abstact Only).285
Automatic Optimising CNN with Depthwise Separable Convolution on FPGA: (Abstact Only).285
Continuous Skyline Computation Accelerator with Parallelizing Dominance Relation Calculations: (Abstract Only).285
FastTrack: Exploiting Fast FPGA Wiring for Implementing NoC Shortcuts (Abstract Only).286
An Optimal Microarchitecture for Stencil Computation with Data Reuse and Fine-Grained Parallelism: (Abstract Only).286
A FPGA Friendly Approximate Computing Framework with Hybrid Neural Networks: (Abstract Only).286
- Eriko Nurvitadhi, Jeffrey J. Cook, Asit K. Mishra, Debbie Marr, Kevin Nealis, Philip Colangelo, Andrew C. Ling, Davor Capalija, Utku Aydonat, Sergey Shumarayev, Aravind Dasu:
In-Package Domain-Specific ASICs for Intel® Stratix® 10 FPGAs: A Case Study of Accelerating Deep Learning Using TensorTile ASIC(Abstract Only).287
Evaluation of OpenCL Performance-oriented Optimizations for Streaming Kernels on the FPGA: (Abstract Only).287
K-Flow: A Programming and Scheduling Framework to Optimize Dataflow Execution on CPU-FPGA Platforms: (Abstract Only).287
FPGA-based LSTM Acceleration for Real-Time EEG Signal Processing: (Abstract Only).288
Understanding Performance Differences of FPGAs and GPUs: (Abtract Only).288
Poster Session 2 海报会议2
----------------------------------------------------------------------------------------------------------------------
Software/Hardware Co-design for Multichannel Scheduling in IEEE 802.11p MLME: (Abstract Only).289
Solving Satisfiability Problem on Quantum Annealer: A Lesson from FPGA CAD Tools: (Abstract Only).289
Domino: An Asynchronous and Energy-efficient Accelerator for Graph Processing: (Abstract Only).289
Towards Serial-Equivalent Parallel Routing for FPGAs: (Abstract Only).289
Performance Comparison of Multiple Approaches of Status Register for Medium Density Memory Suitable for Implementation of a Lossless Compression Dictionary: (Abstract Only).290
BoxPlacer: Force Directed-Based Timing-Driven Placement for Large-Scale FPGAs: (Abstract Only).290
DATuner: An Extensible Distributed Autotuning Framework for FPGA Design and Design Automation: (Abstract Only).290
Mapping Large-Scale DNNs on Asymmetric FPGAs: (Abstract Only).291
Software-Defined FPGA-Based Accelerator for Deep Convolutional Neural Networks: (Abstract Only).291
Design of an MTJ-Based Nonvolatile LUT Circuit with a Data-Update Minimized Shift Operation for an Ultra-Low-Power FPGA: (Abstract Only).291
High-Throughput Lossless Compression on Tightly Coupled CPU-FPGA Platforms: (Abstract Only).291
Poster Session 3 海报会议3
----------------------------------------------------------------------------------------------------------------------
HexCell: a Hexagonal Cell for Evolvable Systolic Arrays on FPGAs: (Abstract Only).293
Performance Comparison of Multiples and Target Detection with Imager-driven Processing Mode for Ultrafast-Imager: (Abstract Only).293
A Low-Power Deconvolutional Accelerator for Convolutional Neural Network Based Segmentation on FPGA: Abstract Only.293
FPGAs in the Datacenters: the Case of Parallel Hybrid Super Scalar String Sample Sort (pHS5)(Abstract Only).294
SIFT Keypoint Descriptor Matching Algorithm: A Fully Pipelined Accelerator on FPGA(Abstract Only).294
FGC: A Tool-flow for Generating and Configuring Custom FPGAs(Abstract Only).294
Exploration of Low Numeric Precision Deep Learning Inference Using Intel® FPGAs: (Abstract Only).294
- Andrea Guerrieri, Sahand Kashani-Akhavan, Mikhail Asiatici, Pasquale Lombardi, Bilel Belhadj, Paolo Ienne:
LEOSoC: An Open-Source Cross-Platform Embedded Linux Library for Managing Hardware Accelerators in Heterogeneous System-on-Chips(Abstract Only).295
A Self-adaptation Method of Fitting Convolutional Neural Network into FPGA: Abstract Only).295
FPL2017会议论文
论文下载地址:
"All programmable FPGA, providing hardware efficiency to software programmers".1-3
01Evaluating high-level design strategies on FPGAs for high-performance computing.1-4
02Toward a pixel-parallel architecture for graph cuts inference on FPGA.1-4
03A systematic approach to design and optimise streaming applications on FPGA using high-level synthesis.1-4
04REAPR: Reconfigurable engine for automata processing.1-8
05Evaluating high-level design strategies on FPGAs for high-performance computing.1-4
06In-network online data analytics with FPGAs.1-2
07Area-optimized montgomery multiplication on IGLOO 2 FPGAs.1-4
08Determine the carry bit of carry-sum generated by unsigned MBE multiplier without final addition.1-4
09Optimizing streaming stencil time-step designs via FPGA floorplanning.1-4
10A demonstration of the GUINNESS: A GUI based neural NEtwork SyntheSizer for an FPGA.1
11TAIGA: A new RISC-V soft-processor framework enabling high performance CPU architectural features.1-4
- Muhammad Abdul Wahab, Pascal Cotret, Mounir Nasr Allah, Guillaume Hiet, Vianney Lapotre, Guy Gogniat:
12ARMHEx: A hardware extension for DIFT on ARM-based SoCs.1-7
13Mapping of P4 match action tables to FPGA.1-2
14Broken-Karatsuba multiplication and its application to Montgomery modular multiplication.1-4
15PolyPC: Polymorphic parallel computing framework on embedded reconfigurable system.1-8