#  EdgeBERT 

 



EdgeBERT is a HW/SW co-design enabling sentence-level energy optimizations for latency-aware multi-task NLP inference. In this repo, we provide both the software and hardware modelings. [**EdgeBERT code available on GitHub**](https://github.com/harvard-acc/EdgeBERT)**.**