Software

Aladdin

Aladdin is our pre-RTL power and performance simulator for hardware accelerators. Aladdin code available on GitHub . Hardware Accelerators gem5-Aladdin Increasing demand for power-efficient, highperformance computing has spurred a growing number and...

Ares

Ares is a framework for quantifying the resilience of deep neural networks. Ares code available on GitHub.

BayesSuite

BayesSuite is a collection of Bayesian inference workloads written in Stan framework. BayesSuite code available on GitHub. Paper with more details and profiling results on BayesSuite. The repository is maintained by Emma Wang.

CHAMPVis

CHAMPVis, Comparative Hierarchical Analysis of Microarchitectural Performance Visualization. CHAMPVis code available on GitHub.

DeepRecSys

DeepRecSys provides an end-to-end infrastructure to study and optimize at-scale neural recommendation inference. The infrastructure is configurable across three main dimensions that represent different recommendation use cases: the load generator (query...

EdgeBERT

EdgeBERT is a HW/SW co-design enabling sentence-level energy optimizations for latency-aware multi-task NLP inference. In this repo, we provide both the software and hardware modelings. EdgeBERT code available on GitHub .

Fathom

Fathom is a collection of workloads for benchmarking modern machine learning techniques. Fathom source code available on GitHub. Fathom is maintained by Robert Adolf and Saketh Rama.

FlexASR

An AXI-programmable hardware accelerator for attention-based seq-to-seq networks. FlexASR can be configured to accelerate end-to-end RNN, GRU or LSTM models with attention mechanisms (e.g. Listen-Attend-and-Spell models). It was designed in Synthesizable...

gem5-Aladdin

gem5-Aladdin is an integration of the Aladdin accelerator simulator with the gem5 system simulator to enable simulation of end-to-end accelerated workloads on SoCs. You may download the code here. It is maintained by Sam Xi and Yakun Sophia Shao. gem5...

GoldenEye

GoldenEye is a functional simulator with fault injection capabilities for common and emerging numerical formats, implemented for the PyTorch deep learning framework. Goldeneye code available on GitHu. Goldeneye documentation available on Read the Docs.

ILDJIT

ILDJIT is our compilation framework using a high-level intermediate representation. ILDJIT code available on SourceForge. ILDJIT is maintained by Simone Campanoni.

LLVM-Tracer

LLVM-Tracer is an LLVM instrumentation pass to print out a dynamic LLVM IR execution trace, including dynamic values and memory addresses. Download LLVM-Tracer.

MachSuite

MachSuite is a benchmark suite for high-level synthesis and accelerator-centric architectures. Download MachSuite .

McPAT CPU Models

We have developed McPAT power models for a recent high-performance multicore CPU. Download McPAT power models.

NVMExplorer

NVMExplorer is a cross-stack design space exploration framework for evaluating and comparing on-chip memory solutions including emerging, embedded non-volatile memories. NVMExplorer code available on GitHub.

RecPipe

RecPipe provides an end-to-end system to study and jointly optimize recommendation models and hardware for at-scale inference. This repository provides the RecPipe infrastructure which is configuration across different models and hardware (general purpose...

Smaug

A deep learning framework that enables end-to-end simulation of DL models on custom SoCs with a variety of hardware accelerators. SMAUG is designed to enable DNN researchers to rapidly evaluate different accelerator and SoC designs and perform hardware...

WIICA

WIICA is the Workload ISA Independent Characterization for Applications tool. Download WIICA.

XIOSim

The HELIX Project The HELIX project is an automatic parallelization framework. It consists of four major components: HELIX, a parallelizing compiler that uncovers parallelism among loop iterations. ILDJIT, a compilation framework using a high-level...