TY - CONF T1 - A Systematic Methodology for Analysis of Deep Learning Hardware and Software Platforms Y1 - 2020 A1 - Wang Yu A1 - Gu Wei A1 - David Brooks KW - deep learning KW - DNN benchmarking and characterization KW - profiling AB - Training deep learning models is compute-intensive and there is an industry-wide trend towards hardware and software specialization to improve performance. To systematically compare deep learning systems, we introduce a methodology comprised of a set of analysis techniques and parameterized end-to-end models for fully connected, convolutional, and recurrent neural networks. This methodology can be applied to analyze various hardware and software systems, and is intended to complement traditional methods. We demonstrate its utility by comparing two generations of specialized platforms (Google's Cloud TPU v2/v3), three heterogeneous platforms (Google TPU, Nvidia GPU, and Intel CPU), and specialized software stacks (TensorFlow and CUDA). PB - Third Conference on Machine Learning and Systems (MLSys) UR - https://proceedings.mlsys.org/papers/2020/12 ER -