| Title: |
Breakthrough Low-Latency, High-Energy-Efficiency LLM Inference Performance Using NorthPole |
| Authors: |
Appuswamy, Rathinakumar; Debole, Michael V.; Taba, Brian; Esser, Steven K.; Cassidy, Andrew S.; Amir, Arnon; Andreopoulos, Alexander; Bablani, Deepika; Datta, Pallab; Kusnitz, Jeffrey A.; McClatchey, Nathaniel J.; McGlohon, Neil; McKinstry, Jeffrey L.; Nayak, Tapan K.; Smith, Daniel F.; Sousa, Rafael; Terrizzano, Ignacio; Akopyan, Filipp; Carlson, Peter J.; Gandhasri, Rajamohan; Garreau, Guillaume J.; Gonzalez, Nelson M.; Ito, Megumi; Klamo, Jennifer L.; Nakamura, Yutaka; Otero, Carlos Ortega; Risk, William P.; Sawada, Jun; Schleupen, Kai; Sivagnaname, Jay; Stallone, Matthew; Ueda, Takanori; Flickner, Myron D.; Arthur, John V.; Panda, Rameswar; Cox, David D.; Modha, Dharmendra S. |
| Source: |
2024 IEEE High Performance Extreme Computing Conference (HPEC) High Performance Extreme Computing Conference (HPEC), 2024 IEEE. :1-8 Sep, 2024 |
| Relation: |
2024 IEEE High Performance Extreme Computing Conference (HPEC) |
| Database: |
IEEE Xplore Digital Library |