Main Conference: List of Accepted Papers

1. Characterizing Soft-Error Resiliency in Arm's Ethos-U55 Embedded Machine Learning Accelerator
Abhishek Tyagi (University of Rochester), Reiley Jeyapaul (AMD), Chu Zhou (Arm Inc), Paul Whatmough (Qualcomm), Yuhao Zhu (University of Rochester)

2. Aiding Microprocessor Performance Validation with Machine Learning
Erick Carvajal Barboza (Universidad de Costa Rica), Mahesh Ketkar (Intel Corporation), Paul V. Gratz (Texas A&M University), Jiang Hu (Texas A&M University)

3. Muchisim: A Simulation Framework for Design Exploration of Multi-Chip Manycore Systems
Marcelo Orenes-Vera (Princeton University), Esin Tureci (Princeton University), Margaret Martonosi (Princeton University), David Wentzlaff (Princeton University)

4. Vision Transformer Computation and Resilience for Dynamic Inference
Kavya Sreedhar (Stanford University), Jason Clemons (NVIDIA), Rangharajan Venkatesan (NVIDIA), Stephen W. Keckler (NVIDIA), Mark Horowitz (Stanford University)

5. DNA Storage Toolkit: A Modular End-to-End DNA Data Storage Codec and Simulator
Puru Sharma (National University of Singapore), Gary Goh Yipeng (National University of Singapore), Bin Gao (National University of Singapore), Longshen Ou (National University of Singapore), Dehui Lin (National University of Singapore), Deepak Sharma (National University of Singapore), Djordje Jevdjic (National University of Singapore)

6. Bandwidth Characterization of DeepSpeed on Distributed Large Language Model Training
Bagus Hanindhito (University of Texas at Austin), Bhavesh Patel (Dell Technologies), Lizy K. John (The University of Texas at Austin)

7. CiMLoop: A Fast, Flexible, and Accurate Compute-In-Memory Modeling Framework
Tanner Andrulis (MIT), Joel Emer (MIT), Vivienne Sze (MIT)

8. LIBRA: Workload-aware Multi-dimensional Network Optimization for Large Model Distributed Training
William Won (Georgia Institute of Technology), Saeed Rashisi (Georgia Institute of Technology), Sudarshan Srinivasan (Intel), Tushar Krishna (Georgia Institute of Technology)

9. CiFlow: Dataflow Analysis and Optimization of Key Switching for Homomorphic Encryption
Negar Neda (New York University), Austin Ebel (New York University), Benedict Reynwar (Information Sciences Institute), Brandon Reagen (NYU)

10. SAP: Silicon Authentication Platform for System-on-Chip Supply Chain Vulnerabilities
Md Sami Ul Islam Sami (University of Florida), Jingbo Zhou (University of Florida), Sujan Kumar Saha (University of Florida), Fahim Rahman (University of Florida), Farimah Farahmandi (University of Florida), Mark Tehranipoor (University of Florida)

11. Characterizing In-Kernel Observability of Latency-Sensitive Request-level Metrics with eBPF
Mohammadreza Rezvani (University of California), Ali Jahanshahi (University of California), Daniel Wong (University of California)

12. Zatel: Prediction Methodology for Evaluating GPU Performance on Ray Tracing Workloads
Davit Grigoryan (University of British Columbia), Yuan Hsi Chou (University of British Columbia), Tor M. Aamodt (University of British Columbia)

13. Workload Characterization of Commercial Mobile Benchmark Suites
Victor Kariofillis (University of Toronto), Natalie Enright Jerger (University of Toronto)

14. BTBench: A Benchmark for Comprehensive Binary Translation Performance Evaluation
Xinyu Li (University of Chinese Academy of Sciences), Yanzhi Lan (University of Chinese Academy of Sciences), Gen Niu (University of Chinese Academy of Sciences), Feng Xue (University of Chinese Academy of Sciences), Fuxin Zhang (University of Chinese Academy of Sciences)

15. SimPoint-Based Microarchitectural Hotspot & Energy-Efficiency Analysis of RISC-V OoO CPUs
Odysseas Chatzopoulos (University of Athens), Maria Trakosa (University of Athens), George Papadimitriou (University of Athens), Wing Shek Wong (Intel), Dimitris Gizopoulos (University of Athens)

16. SwiftRL: Towards Efficient Reinforcement Learning on Real Processing-In-Memory Systems
Kailash gogineni (George Washington University), Sai Santosh Dayapule (George Washington University), Juan Gomez-Luna (ETH Zurich), Karthikeya Gogineni (Unaffiliated), Peng Wei (George Washington University), Tian Lan (George Washington University), Mohammad Sadrosadati (ETH Zurich), Onur Mutlu (ETH Zurich and Stanford), Guru Venkataramani (George Washington University)

17. Characterizing the Performance, Power Efficiency, and Programmability of AMD Matrix Cores
Gabin Schieffer (KTH Royal Institute of Technology), Daniel Medeiros (KTH Royal Institute of Technology), Jennifer Faj (KTH Royal Institute of Technology), Aniruddha Marathe (Lawrence Livermore National Laboratory), Ivy Peng (KTH Royal Institute of Technology)

18. Generative AI Beyond LLMs: System Implications of Multi-Modal Generation
Alicia Golden (Harvard University), Samuel Hsia (Harvard University), Fei Sun (Meta), Bilge Acun (FAIR Meta), Basil Hosmer (FAIR Meta), Yejin Lee (FAIR Meta), Zachary DeVito (FAIR Meta), Jeff Johnson (FAIR Meta), Gu-Yeon Wei (Harvard University), David Brooks (Harvard University), Carole-Jean Wu (FAIR Meta)

19. BZSim: Fast, Large-scale Microarchitectural Simulation With Detailed Interconnect Modeling
Panagiotis Strikos (Chalmers University of Technology), Ahsen Ejaz (Chalmers University of Technology), Ioannis Sourdis (Chalmers University of Technology)

20. Forward to the Past: An Alternative to Hybrid CPU Design
Anna Yue (University of Minnesota), Sanyam Mehta (HPE)

21. Userspace Networking in gem5
Johnson Umeike (University of Kansas), Siddharth Agarwal (UIUC), Nikita Lazarev (MIT), Mohammad Alian (University of Kansas)

22. Towards Cognitive AI Systems: Workload and Characterization of Neuro-Symbolic AI
Zishen Wan (Georgia Institute of Technology), Che-Kai Liu (Georgia Institute of Technology), Hanchen Yang (Georgia Institute of Technology), Ritik Raj (Georgia Institute of Technology), Chaojian Li (Georgia Institute of Technology), Haoran You (Georgia Institute of Technology), Yonggan Fu (Georgia Institute of Technology), Cheng Wan (Georgia Institute of Technology), Ananda Samajdar (IBM Research), Yingyan (Celine) Lin (Georgia Institute of Technology), Tushar Krishna (Georgia Institute of Technology), Arijit Raychowdhury (Georgia Institute of Technology)

23. RTune: Towards Automated and Coordinated Optimization of Computing and Computational Objectives of Parallel Iterative Applications
Yonghong Yan (University of North Carolina at Charlotte), Kewei Yan (University of North Carolina at Charlotte), Anjia Wang (University of North Carolina at Charlotte)

24. Scaling Down to Scale Up: A Cost-Benefit Analysis of Replacing OpenAI's GPT-4 with Self-Hosted Open Source SLMs in Production
Chandra Irugalbandara (Jaseci Labs), Ashish Mahendra (Jaseci Labs), Roland Daynauth (University of Michigan), Tharuka Kasthuri Arachchige (Jaseci Labs), Krisztian Flautner (University of Michigan), Lingjia Tang (University of Michigan), Yiping Kang (University of Michigan), Jason Mars (University of Michigan)




This website is maintained by the ISPASS-2024 Committee.

Contact Ahmad Alawneh if you have any questions or comments on this website.