Day 1 -- April 17

Workshops and Tutorials

See the Workshop/Tutorials tab for the full details

08:30 Start morning session

Building Online Power Models from Real Data. (Part 1 of 2)
Working At Scale: Representative Workloads and Experiments for Datacenters.
Benchmarking and Measuring Approximate Computing.

10:00 - 11:00 Coffee

12:00 - 13:30 Lunch (downstairs)

13:30 Start afternoon sessions

Building Online Power Models from Real Data. (Part 2 of 2)
Using Containers to Uniformly Package, Distribute, and Execute Scientific Applications Across Multiple Platforms.
Understanding Energy Efficiency in High-Performance and Embedded Computing.
Programming Distributed Shared Memory Using ArgoDSM and Developing the SPLASH-3 Benchmarks.

15:00 - 15:30 Coffee

17:05 Excursion to the Museum Gustaviaum. Meet on the conference floor.

18:30 Reception in the bar with snacks.

Day 2 -- April 18

08:30 Welcome & Keynote (session chair: Andreas Moshovos)

08:30-08:45 Welcome (General, Program Chairs)
08:45-09:45 Keynote I: Antonio Gonzalez, Director, ARCO Research Group, Cognitive Computers: The Next Wave of Computing Innovation

09:50-11:10 Session I: 3 papers -- Best Paper Candidates (session chair: Sally McKee)

09:50-10:10: Performance Analysis of Accelerated Biophysically-Meaningful Neuron Simulations,
Georgios Smaragdos (Erasmus Medical Center), Georgios Chatzikostantis, Sofia Nomikou, Dimitrios Rodopoulos (NTUA), Ioannis Sourdis Chalmers University of Technology), Dimitrios Soudris (NTUA), Chris I. De Zeeuw and Christos Strydis (Erasmus Medical Center)
10:10-10:30: DVFS Performance Prediction for Managed Multithreaded Applications,
Shoaib Akram (Ghent University), Jennifer B. Sartor (Ghent University & Vrije Universiteit Brussel), Lieven Eeckhout (Ghent University)
10:30-10:50: Addressing Service Interruptions in Memory with Thread-to-Rank Assignment,
Manjunath Shevgoor, Rajeev Balasubramonian (Utah), Niladrish Chatterjee (NVIDIA Research), Jung-Sik Kim (Samsung)

10:50-11:20 Coffee
11:20-12:20 Session II: System and Workload Characterization/Optimizations (session chair: Rajeev Balasubramonian)

11:20-11:40: Characterization and Bottleneck Analysis of a 64-bit ARMv8 Platform,
Michael Laurenzano (Michigan), Ananta Tiwari, Allyson Cauble-Chantrenne, Adam Jundt, Laura Carrington (EP Analytics),
11:40-12:00: Analyzing the Energy-Efficiency of Sparse Matrix Multiplication on Heterogeneous Systems: A Comparative Study of GPU, Xeon Phi and FPGA,
Heiner Giefers, Peter Staar, Costas Bekas, Christoph Hagleitner (IBM Research, Zurich)

12:00-12:20: FastCap: An Efficient and Fair Algorithm for Power Capping in Many-Core Systems,
Yanpei Liu (Facebook Inc., University of Wisconsin Madison), Guilherme Cox (Rutgers University), Qingyuan Deng (Facebook Inc.), Stark C. Draper (University of Toronto), Ricardo Bianchini (Microsoft Research, Rutgers University)

12:20-13:40 Lunch (downstairs) 13:40-14:40 Session III: Reliability (session chair: Pedro Trancoso)

13:40-14:00: Anatomy of Microarchitecture-Level Reliability Assessment: Throughput and Accuracy,
Athanasios Chatzidimitriou, Dimitris Gizopoulos (University of Athens)
14:00-14:20: EmerGPU: Understanding and Mitigating Resonance-Induced Voltage Noise in GPU Architectures,
Renji Thomas, Naser Sedaghati, Radu Teodorescu (The Ohio State University)
14:20-14:40: GUFI: a Framework for GPUs Reliability Assessment,
Sotiris Tselonis, Dimitris Gizopoulos (University of Athens)

14:40-15:10 Coffee Break 15:10-16:10 Session IV: Workloads (session chair: Drew Hilton)

15:10-15:30: Splash-3: A Properly Synchronized Benchmark Suite for Contemporary Research,
Christos Sakalis, Carl Leonardsson, Stefanos Kaxiras (Uppsala University), Alberto Ros (Universidad de Murcia)
15:30-15:50: Workload Characterization and Optimization of TPC-H Queries on Apache Spark,
Tatsuhiro Chiba, Tamiya Onodera (IBM Research Tokyo)
15:50-16:10: Demystifying Cloud Benchmarking,
Tapti Palit, Yongming Shen, Michael Ferdman (Stony Brook University)
16:10-16:30: Analysis of PARSEC Workload Scalability,
Gabriel Southern, Jose Renau (UC Santa Cruz)

16:10 - 17:00 Session V: poster presentations (5 min each) (session chair: Benjamin Lee)

HL-PCM: MLC PCM Main Memory with Accelerated Read,
Mohammad Arjomand, Amin Jadidi, Mahmut T. Kandemir, Anand Sivasubramaniam, Chita Das, The Pennsylvania State University

Characterization and Architectural Implications of Big Data Workloads,
Wanglei, RenRui, ZhanJianfeng, JiaZhen, ICT,CAS

Elastic Traces for Fast and Accurate System Performance Exploration,
Radhika Jagtap, Andreas Hansson, Stephan Diestelhorst, Radhika Jagtap, Stephan Diestelhorst, ARM Ltd.

CoolSim: Eliminating Traditional Cache Warming with Fast, Virtualized Profiling,
Nikos Nikoleris (Uppsala University), Andreas Sandberg (ARM Research), Erik Hagersten, Trevor E. Carlson (Uppsala University)

Composable Modeling of Coherence and NUMA Effects for Optimizing Thread and Data Placement,
Hao Luo (University of Rochester), Chencheng Ye (Huazhong University of Science and Technology), Pengcheng Li, Chen Ding (University of Rochester)

Characterizing Hadoop Applications on Microservers for Performance and Energy Efficiency Optimizations,
Maria Malik, Avesta Sasan (George Mason University), Rajiv Joshi (IBM T. J. Watson Research Center), Setareh Rafatirad (George Mason University), Houman Homayoun (George Mason University)

RTHpower: Accurate Fine-grained Power Models for Predicting Race-to-halt Effect on Ultra-low Power Embedded Systems,
Vi Tran (UiT The Arctic University of Norway), Brendan Barry (Movidius Ltd., Ireland), Phuong Ha (UiT The Arctic University of Norway)

Agave: a Benchmark Suite Addressing Android System Complexity,
Martin K. Brown, Zachary Yannes (Florida State University, Tallahassee), Mazdak Sanati (Chalmers University of Technology), Michael Lustig (Florida State University, Tallahassee), Alexey Sidelnikov, Sally A. McKee (Chalmers University of Technology), Gary S. Tyson (Florida State University, Tallahassee), Steven K. Reinhardt (AMD)

Storage Consolidation on SSDs: Not always a panacea, but can we ease the pain?
Narges Shahidi, Mohammad Arjomand, Anand Sivasubramaniam, Mahmut Kandemir, Chita Das (The Pennsylvania State University)

17:10 Welcome Reception and Poster Session (banquet room) 18:10-21:00 Banquet: Keynote II: Lieven Eeckhout, Ghent University, Essentially, All Models Are Wrong, but Some Are Useful

Day 3 -- April 19

08:30 Keynote

08:30 Keynote III: Josep Torrellas, University of Illinois Urbana-Champaign, Energy-Efficient Extreme-Scale Manycores

09:30-10:30 Session VI: Understanding CPU and GPU Integration and Systems (session chair: Trevor Carlson)

09:30-9:50: Observations and Opportunities in Architecting Shared Virtual Memory for Heterogenous Systems,
Jan Vesely (AMD Research/Rutgers University), Arkaprava Basu (AMD Research), Mark Oskin (AMD Research/University of Washington), Gabriel H. Loh (AMD Research), Abhishek Bhattacharjee (Rutgers University), Arkaprava Basu (AMD Research)
09:50-10:10: Characterizing the Sources of Memory Stalls for Tightly Coupled GPUs,
Johnathan Alsop, Matthew D. Sinclair (University of Illinois at Urbana-Champaign), Rakesh Komuravelli (Qualcomm), Sarita V. Adve (University of Illinois at Urbana-Champaign)
10:10-10:30: A Comprehensive Performance Analysis of HSA and OpenCL 2.0,
Saoni Mukherjee, Yifan Sun (Northeastern University), Paul Blinzer (AMD), Amir Kavyan Ziabari, David Kaeli (Northeastern University)

10:30-11:00 Coffee Break 11:00-12:20 Session VII: Designs and Design Generators (session chair: David-Black Schaffer)

11:00-11:20: OpenSoC Fabric: On-Chip Network Generator,
Farzad Fatollahi-Fard, David Donofrio, George Michelogiannakis, John Shalf, George Michelogiannakis (Lawrence Berkeley National Laboratory)
11:20-11:40: A Study of Mobile Device Utilization
C. Gao, A. Gutierrez, M. Rajan, R. Dreslinski, T. Mudge, C. Wu
11:35-11:55: Optimizing Rasterizer Performance and Energy in the NyuziProcessor Open Source GPU,
Jeff Bush, Khaled Z. Mahmoud, Mohammad A. Khasawneh, Timothy N. Miller (Binghamton University (SUNY))
11:40-12:00: AnyCore: A Synthesizable RTL Model for Exploring and Fabricating Adaptive Superscalar Cores,
Rangeen Basu Roy Chowdhury, Anil Kumar Kannepalli, Sungkwan Ku, Eric Rotenberg (North Carolina State University)
12:00-12:20: Performance Analysis of a Hardware Accelerator of Dependency Management for Task-based Dataflow Programming models,
X. TAN, J. Bosch, D. Jimenez-Gonzalez, C. Alvarez-Martinez, E. Ayguad'e, M. Valero (Barcelona Supercomputing Center, Universitat Politecnica de Catalunya)

12:20-13:40 Lunch 13:40-15:00 Session VIII: Mobile and Cloud (session chair: Radu Theodorescu)

13:40-14:00: Evaluating Asymmetric Multiprocessing for Mobile Applications,
Songchun Fan, Benjamin C. Lee (Duke University)
14:00-14:20: MofySim: A Mobile Full System Simulation Framework for Energy Consumption and Performance Analysis,
Minho Ju (Samsung Electronics & Korea Advanced Institute of Science and Technology), Hyeonggyu Kim, Soontae Kim (Korea Advanced Institute of Science and Technology) , Minho Ju (Samsung Electronics & Korea Advanced Institute of Science and Technology)
14:20-14:40: Mobile Benchmarking Done Right: Understanding the System Impact of Mobile GPUs,
Rene de Jong, Andreas Sandberg (ARM)
14:40-15:00: X-Mem: A Cross-Platform and Extensible Memory Characterization Tool for the Cloud,
Mark Gottscho (University of California, Los Angeles), Sriram Govindan, Bikash Sharma (Microsoft), Mohammed Shoaib (Microsoft Research), Puneet Gupta (University of California, Los Angeles)

15:00-15:30 Coffee Break 15:30-16:50 Session IX: Tools and Methodologies (session chair: Stefanos Kaxiras)

15:30-15:50: Interactive Visualization of Cross-Layer Performance Anomalies in Dynamic Task-Parallel Applications and Systems,
Andi Drebes, Antoniu Pop (The University of Manchester), Karine Heydemann (Universite Pierre et Marie Curie / LIP6), Albert Cohen (Ecole Normale Superieure / Inria)
15:50-16:10: JIT-Assisted Fast-Forward Embedding and Instrumentation to Enable Fast, Accurate, and Agile Simulation,
Berkin Ilbeyi, Christopher Batten (Cornell University)
16:10-16:30: TaskPoint: Sampled Simulation of Task-Based Programs,
Thomas Grass (Universitat Politicnica de Catalunya and Barcelona Supercomputing Center), Alejandro Rico (ARM), Marc Casas (Barcelona Supercomputing Center), Miquel Moreto (Universitat Politicnica de Catalunya and Barcelona Supercomputing Center), Eduard Ayguad'e (Universitat Politicnica de Catalunya and Barcelona Supercomputing Center)
16:30-16:50: An Automated Framework for Characterizing and Subsetting GPGPU Workloads,
Vignesh Adhinarayanan, Wu-chun Feng (Virginia Tech)

16:50 Closing

Keynote I

Keynote II

Keynote III