Publications

193 entries « 1 of 4 »

2020

Samuel Hsia, Udit Gupta, Mark Wilkening, Carole-Jean Wu, Gu-Yeon Wei, David Brooks

Cross-Stack Workload Characterization of Deep Recommendation Systems Journal Article

IEEE International Symposium on Workload Characterization, 2020.

Glenn G. Ko, Yuji Chai, Marco Donato, Paul N. Whatmough, Thierry Tambe, Rob A. Rutenbar, David Brooks, Gu-Yeon Wei

A Scalable Bayesian Inference Accelerator for Unsupervised Learning Conference

IEEE Hot Chips 31 Symposium, 2020.

Thierry Tambe, En-Yu Yang, Zishen Wan, Yuntian Deng, Vijay Janapa Reddi, Alexander M. Rush, David Brooks, Gu-Yeon Wei

Algorithm-Hardware Co-Design of Adaptive Floating-Point Encodings for Resilient Deep Learning Inference Conference

Design Automation Conference (DAC 2020), 2020, (Best paper award).

Paul N. Whatmough, Marco Donato, Glenn G. Ko, Sae Kyu Lee, David Brooks, Gu-Yeon Wei

CHIPKIT: An agile, reusable open-source framework for rapid test chip development Journal Article

IEEE MICRO, 2020.

Udit Gupta, Samuel Hsia, Vikram Saraph, Xiaodong Wang, Brandon Reagen, Gu-Yeon Wei, Hsien-Hsin S. Lee, David Brooks, Carole-Jean Wu

DeepRecSys: A System for Optimizing End-To-End At-scale Neural Recommendation Inference Conference

The 47th IEEE/ACM International Symposium on Computer Architecture (ISCA 2020), 2020.

Liu Ke, Udit Gupta, Carole-Jean Wu, Benjamin Youngjae Cho, Mark Hempstead, Brandon Reagen, Xuan Zhang, David Brooks, Vikas Chandra, Utku Diril, Amin Firoozshahian, Kim Hazelwood, Bill Jia, Hsien-Hsin S. Lee, Meng Li, Bert Maher, Dheevatsa Mudigere, Maxim Naumov, Martin Schatz, Mikhail Smelyanskiy, Xiaodong Wang

RecNMP: Accelerating Personalized Recommendation with Near-Memory Processing Conference

The 47th IEEE/ACM International Symposium on Computer Architecture (ISCA 2020), 2020.

Glenn G. Ko, Yuji Chai, Marco Donato, Paul N. Whatmough, Thierry Tambe, Rob A. Rutenbar, David Brooks, Gu-Yeon Wei

A 3mm² Programmable Bayesian Inference Accelerator for Unsupervised Machine Perception using Parallel Gibbs Sampling in 16nm Conference

IEEE Symposium on VLSI Circuits (VLSI), 2020.

Yu Emma Wang, Gu-Yeon Wei, David Brooks

A Systematic Methodology for Analysis of Deep Learning Hardware and Software Platforms Conference

Third Conference on Machine Learning and Systems (MLSys), 2020.

Udit Gupta, Carole-Jean Wu, Xiaodong Wang, Maxim Naumov, Brandon Reagen, David Brooks, Bradford Cottel, Kim Hazelwood, Bill Jia, Hsien-Hsin S. Lee, Andrey Malevich, Dheevatsa Mudigere, Mikhail Smelyanskiy, Liang Xiong, Xuan Zhang

The Architectural Implications of Facebook's DNN-based Personalized Recommendation Conference

The 26th IEEE International Symposium on High-Performance Computer Architecture, 2020.

Paul Whatmough, Marco Donato, Glenn Ko, David Brooks, Gu-Yeon Wei

CHIPKIT: An agile, reusable open-source framework for rapid test chip development Unpublished

2020.

2019

Lillian Pentecost, Udit Gupta, Elisa Ngan, Gu-Yeon Wei, David Brooks, Johanna Beyer, Michael Behrisch

CHAMPVis: Comparative Hierarchical Analysis of Microarchitectural Performance Workshop

ProTools workshop co-located with Supercomputing, 2019.

Lillian Pentecost, Marco Donato, Brandon Reagen, Udit Gupta, Siming Ma, Gu-Yeon Wei, David Brooks

MaxNVM: Maximizing DNN Storage Density and Inference Efficiency with Sparse Encoding and Error Mitigation Conference

IEEE/ACM International Symposium on Microarchitecture, 2019, ISBN: 978-1-4503-6938-1/19/10.

Marco Donato, Lillian Pentecost, David Brooks, Gu-Yeon Wei

MEMTI: Optimizing On-Chip Nonvolatile Storage for Visual Multitask Inference at the Edge Journal Article

IEEE MICRO, 2019.

Udit Gupta, Brandon Reagen, Lillian Pentecost, Marco Donato, Thierry Tambe, Alexander M. Rush, Gu-Yeon Wei, David Brooks

MASR: A Modular Accelerator for Sparse RNNs Conference

International Conference on Parallel Architectures and Compilation Techniques, 2019.

Glenn G. Ko, Yuji Chai, Rob A. Rutenbar, David Brooks, Gu-Yeon Wei

Accelerating Bayesian Inference on Structured Graphs Using Parallel Gibbs Sampling Proceeding

International Conference on Field-Programmable Logic and Applications, 2019.

Brian Plancher, Camelia D. Brumar, Iulian Brumar, Lillian Pentecost, Saketh Rama, David Brooks

Application of Approximate Matrix Multiplication to Neural Networks and Distributed SLAM Conference

IEEE High Performance Extreme Computing Conference (HPEC), 2019.

Sae Kyu Lee, Paul Whatmough, David Brooks, Gu-Yeon Wei

A 16-nm always-on DNN processor with adaptive clocking and multi-cycle banked SRAMs Journal Article

IEEE Journal of Solid-State Circuits, 2019.

Paul N. Whatmough; Sae Kyu Lee; Marco Donato; Hsea-Ching Hsueh; Sam Likun Xi; Udit Gupta; Lillian Pentecost; Glenn G. Ko; David Brooks; Gu-Yeon Wei

A 16nm 25mm² SoC with a 54.5x Flexibility-Efficiency Range from Dual-Core Arm Cortex-A53 to eFPGA and Cache-Coherent Accelerators Journal Article

Symposium on VLSI Circuits, 2019.

Yu Emma Wang; Yuhao Zhu; Glenn G. Ko; Brandon Reagen; Gu-Yeon Wei; David Brooks

Demystifying Bayesian Inference Workloads Proceeding

2019.

Yu Emma Wang; Victor Lee; Gu-Yeon Wei; David Brooks

Predicting New Workload or CPU Performance by Analyzing Public Datasets Journal Article

ACM Transactions on Architecture and Code Optimization (TACO), 15 (4), pp. 53:1–53:21, 2019.

2018

Sae Kyu Lee, Paul N Whatmough, Niamh Mulholland, Patrick Hansen, David Brooks, Gu-Yeon Wei

A wide dynamic range sparse FC-DNN processor with multi-cycle banked SRAM read and adaptive clocking in 16nm FinFET Journal Article

ESSCIRC 2018-IEEE 44th European Solid State Circuits Conference, 2018.

Paul N Whatmough, Sae Kyu Lee, David Brooks, Gu-Yeon Wei

DNN ENGINE: A 28-nm Timing-Error Tolerant Sparse Deep Neural Network Processor for IoT Applications Journal Article

IEEE Journal of Solid-State Circuits (JSSC), 2018.

Paul Whatmough, Sae Kyu Lee, Sam Xi, Udit Gupta, Lillian Pentecost, Marco Donato, Hsea-Ching Hsueh, David Brooks, Gu-Yeon Wei

SMIV: A 16nm SoC with Efficient and Flexible DNN Acceleration for Intelligent IoT Devices Journal Article

Hot Chips 30: A Symposium on High Performance Chips, 2018.

Brandon Reagen, Udit Gupta, Lillian Pentecost, Paul Whatmough, Sae Kyu Lee, Niamh Mulholland, David Brooks, Gu-Yeon Wei

Ares: a framework for quantifying the resilience of deep neural networks Conference

Design Automation Conference, 2018.

Marco Donato; Brandon Reagen; Lillian Pentecost; Udit Gupta; David Brooks; Gu-Yeon Wei

On-Chip Deep Neural Network Storage with Multi-Level eNVM Inproceedings

Design Automation Conference (DAC), 2018.

Brandon Reagen, Udit Gupta, Robert Adolf, Michael M. Mitzenmacher, Alexander M. Rush, Gu-Yeon Wei, David Brooks

Weightless: Lossy Weight Encoding For Deep Neural Network Compression Conference

International Conference on Machine Learning, 2018.

Mario Lok; Elizabeth Farrell Helbling; Xuan Zhang; Robert Wood; David Brooks; Gu-Yeon Wei

A Low Mass Power Electronics Unit to Drive Piezoelectric Actuators for Flying Microrobots Journal Article

IEEE Transactions on Power Electronics, 33 (4), pp. 3180–3191, 2018.

2017

Sreela Kodali; Patrick Hansen; Niamh Mulholland; Paul Whatmough; David Brooks; Gu-Yeon Wei

Applications of Deep Neural Networks for Ultra Low Power IoT Inproceedings

International Conference on Computer Design, 2017.

Paul Whatmough; Sae Kyu Lee; Gu-Yeon Wei; David Brooks

Sub-uJ Deep Neural Networks for Embedded Applications Inproceedings

IEEE 51st Asilomar Conference on Signals, Systems, and Computers, 2017.

Paul Whatmough; Sae Kyu Lee; Niamh Mulholland; Patrick Hansen; Sreela Kodali; David Brooks

DNN ENGINE: A 16nm Sub-uJ Deep Neural Network Inference Accelerator for the Embedded Masses Inproceedings

Hot Chips 29: A Symposium on High Performance Chips, 2017.

Brandon Reagen, Robert Adolf, Paul Whatmough, Gu-Yeon Wei, David Brooks

Deep Learning for Computer Architects Book

Morgan & Claypool Publishers, 2017.

Brandon Reagen; Jose Miguel Hernandez-Lobato; Robert Adolf; Michael Gelbart; Paul Whatmough; Gu-Yeon Wei; David Brooks

A Case for Efficient Accelerator Design Space Exploration via Bayesian Optimization Conference

International Symposium on Low Power Electronics and Design, 2017.

Xuan Zhang; Mario Lok; Tao Tong; Sae Kyu Lee; Brandon Reagen; Pierre-Emile J. Duhamel; Robert Wood; David Brooks; Gu-Yeon Wei

A Fully Integrated Battery-Powered System-on-Chip in 40-nm CMOS for Closed-Loop Control of Insect-Scale Pico-Aerial Vehicle Journal Article

IEEE Journal of Solid-State Circuits, 52 (9), 2017.

Svilen Kanev; Sam (Likun) Xi; Gu-Yeon Wei; David Brooks

Mallacc: Accelerating Memory Allocation Conference

International Symposium on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2017.

Sae Kyu Lee; Tao Tong; Xuan Zhang; David Brooks; Gu-Yeon Wei

A 16-Core Voltage-Stacked System With Adaptive Clocking and an Integrated Switched-Capacitor DC–DC Converter Journal Article

IEEE Transactions on VLSI, 25 (4), pp. 1271-1284, 2017.

Paul N. Whatmough; Sae Kyu Lee; Hyunkwang Lee; Saketh Rama; David Brooks; Gu-Yeon Wei

A 28nm SoC with a 1.2GHz 568nJ/Prediction Sparse Deep-Neural-Network Engine with >0.1 Timing Error Rate Tolerance for IoT Applications Inproceedings

International Solid-State Circuits Conference, 2017.

2016

Yakun Sophia Shao; Sam (Likun) Xi; Vijayalakshmi Srinivasan; Gu-Yeon Wei; David Brooks

Co-Designing Accelerators and SoC Interfaces using gem5-Aladdin Inproceedings

International Symposium on Microarchitecture (MICRO), 2016.

Robert Adolf; Saketh Rama; Brandon Reagen; Gu-Yeon Wei; David Brooks

Fathom: Reference Workloads for Modern Deep Learning Methods Inproceedings

IEEE International Symposium on Workload Characterization, 2016.

Tao Tong; Sae Kyu Lee; Xuan Zhang; David Brooks; Gu-Yeon Wei

A Fully Integrated Reconfigurable Switched-Capacitor DC-DC Converter With Four Stacked Output Channels for Voltage Stacking Applications Journal Article

IEEE Journal of Solid-State Circuits, 51 (9), pp. 2142–2152, 2016.

Brandon Reagen; Paul Whatmough; Robert Adolf; Saketh Rama; Hyunkwang Lee; Sae Kyu Lee; José Miguel Hernández-Lobato; Gu-Yeon Wei; David Brooks

Minerva: Enabling Low-Power, Highly-Accurate Deep Neural Network Accelerators Inproceedings

International Symposium on Computer Architecture (ISCA), 2016.

2015

Mario Lok; Xuan Zhang; Elizabeth Farrell Helbling; Robert Wood; David Brooks; Gu-Yeon Wei

A Power Electronics Unit to Drive Piezoelectric Actuators for Flying Microrobots Inproceedings

IEEE Custom Integrated Circuits Conference (CICC), 2015.

Xuan Zhang; Mario Lok; Tao Tong; Simon Chaput; Sae Kyu Lee; Brandon Reagen; Hyunkwang Lee; David Brooks; Gu-Yeon Wei

A Multi-Chip System Optimized for Insect-Scale Flapping-Wing Robots Inproceedings

IEEE Symposium on VLSI Circuits (VLSIC), 2015.

Sae Kyu Lee; Tao Tong; Xuan Zhang; David Brooks; Gu-Yeon Wei

A 16-Core Voltage-Stacked System with an Integrated Switched-Capacitor DC-DC Converter Inproceedings

IEEE Symposium on VLSI Circuits (VLSIC), 2015.

Paul N. Whatmough; George Smart; Shidhartha Das; Yiannis Andreopoulos; David M. Bull

A 0.6V All-Digital Body-Coupled Wakeup Transceiver for IoT Applications Inproceedings

IEEE Symposium on VLSI Circuits (VLSIC), 2015.

Svilen Kanev, Juan Pablo Darago, Kim Hazelwood, Parthasarathy Ranganathan, Tipp Moseley, Gu-Yeon Wei, David Brooks

Profiling a Warehouse-Scale Computer Inproceedings

International Symposium on Computer Architecture (ISCA), 2015.

Sam Xi; Hans Jacobson; Pradip Bose; Gu-Yeon Wei; David Brooks

Quantifying Sources of Error in McPAT and Potential Impacts on Architectural Studies Conference

International Symposium on High Performance Computer Architecture (HPCA), 2015.

Simone Campanoni; Glenn Holloway; Gu-Yeon Wei; David Brooks

HELIX-UP: Relaxing Program Semantics to Unleash Parallelization Conference

International Symposium on Code Generation and Optimization (CGO), 2015.

Yakun Sophia Shao; Sam Xi; Viji Srinivasan; Gu-Yeon Wei; David Brooks

Toward Cache-Friendly Hardware Accelerators Conference

HPCA Sensors and Cloud Architectures Workshop (SCAW), 2015.

Brandon Reagen; Robert Adolf; Gu-Yeon Wei; David Brooks

The MachSuite Benchmark Conference

Boston Area Architecture Workshop (BARC), 2015.

Brandon Reagen; Gu-Yeon Wei; David Brooks

How Hardware Accelerators Trade-Off Pipelining and Parallelism to Maximize Efficiency Conference

Boston Area Architecture Workshop (BARC), 2015.
