ACM/IEEE Intl. Conf. for High Perf. Computing, Networking, Storage and Analysis, SC 2016


Title/Authors Title Research Artifacts
[?] A research artifact is any by-product of a research project that is not directly included in the published research paper. In Computer Science research this is often source code and data sets, but it could also be media, documentation, inputs to proof assistants, shell-scripts to run experiments, etc.
Details

dCUDA: hardware supported overlap of computation and communication

Tobias Gysi, Jeremia Bär, Torsten Hoefler

dCUDA: hardware supported overlap of computation and communication

Details
Discussion Comments: 0
Verification: Authors have not verified information

A machine learning framework for performance coverage analysis of proxy applications

Tanzima Zerin Islam, Jayaraman J. Thiagarajan, Abhinav Bhatele, Martin Schulz, Todd Gamblin

A machine learning framework for performance coverage analysis of proxy applications

Details
Discussion Comments: 0
Verification: Authors have not verified information

The mont-blanc prototype: an alternative approach for HPC systems

Nikola Rajovic, Alejandro Rico, Filippo Mantovani, Daniel Ruiz, Josep Oriol Vilarrubi, Constantino Gomez, Luna Backes, Diego Nieto, Harald Servat, Xavier Martorell, Jesús Labarta, Eduard Ayguadé, Chris Adeniyi-Jones, Said Derradji, Hervé Gloaguen, Piero Lanucara, Nico Sanna, Jean-François Méhaut, Kevin Pouget, Brice Videau, Eric Boyer, Momme Allalen, Axel Auweter, David Brayford, Daniele Tafani, Volker Weinberg, Dirk Brömmel, René Halver, Jan H. Meinke, Ramón Beivide, Mariano Benito, Enrique Vallejo, Mateo Valero, Alex Ramírez

The mont-blanc prototype: an alternative approach for HPC systems

Details
Discussion Comments: 0
Verification: Authors have not verified information

A PCIe congestion-aware performance model for densely populated accelerator servers

Maxime Martinasso, Grzegorz Kwasniewski, Sadaf R. Alam, Thomas C. Schulthess, Torsten Hoefler

A PCIe congestion-aware performance model for densely populated accelerator servers

Details
Discussion Comments: 0
Verification: Authors have not verified information

PIPES: a language and compiler for task-based programming on distributed-memory clusters

Martin Kong, Louis-Noël Pouchet, P. Sadayappan, Vivek Sarkar

PIPES: a language and compiler for task-based programming on distributed-memory clusters

Details
Discussion Comments: 0
Verification: Authors have not verified information

Server-side log data analytics for I/O workload characterization and coordination on large shared storage systems

Yang Liu, Raghul Gunasekaran, Xiaosong Ma, Sudharshan S. Vazhkudai

Server-side log data analytics for I/O workload characterization and coordination on large shared storage systems

Details
Discussion Comments: 0
Verification: Authors have not verified information

Designing scalable b-Matching algorithms on distributed memory multiprocessors by approximation

Arif M. Khan, Alex Pothen, Md. Mostofa Ali Patwary, Mahantesh Halappanavar, Nadathur Rajagopalan Satish, Narayanan Sundaram, Pradeep Dubey

Designing scalable b-Matching algorithms on distributed memory multiprocessors by approximation

Details
Discussion Comments: 0
Verification: Authors have not verified information

Translating OpenMP device constructs to OpenCL using unnecessary data transfer elimination

Junghyun Kim, Yong-Jun Lee, Jung-Ho Park, Jaejin Lee

Translating OpenMP device constructs to OpenCL using unnecessary data transfer elimination

Details
Discussion Comments: 0
Verification: Authors have not verified information

PFEAST: a high performance sparse eigenvalue solver using distributed-memory linear solvers

James Kestyn, Vasileios Kalantzis, Eric Polizzi, Yousef Saad

PFEAST: a high performance sparse eigenvalue solver using distributed-memory linear solvers

Details
Discussion Comments: 0
Verification: Authors have not verified information

A multi-faceted approach to job placement for improved performance on extreme-scale systems

Christopher Zimmer, Saurabh Gupta, Scott Atchley, Sudharshan S. Vazhkudai, Carl Albing

A multi-faceted approach to job placement for improved performance on extreme-scale systems

Details
Discussion Comments: 0
Verification: Authors have not verified information

Measuring and understanding throughput of network topologies

Sangeetha Abdu Jyothi, Ankit Singla, Brighten Godfrey, Alexandra Kolla

Measuring and understanding throughput of network topologies

Details
Artifacts for some papers are reviewed by an artifact evaluation, reproducibility, or similarly named committee. This is one such paper that passed review.
Artifact evaluation badge awarded
Author Comments:
Discussion Comments: 0
Sharing: Research produced artifacts
Verification: Authors have verified information

Elastic multi-resource fairness: balancing fairness and efficiency in coupled CPU-GPU architectures

Shanjiang Tang, Bingsheng He, Shuhao Zhang, Zhaojie Niu

Elastic multi-resource fairness: balancing fairness and efficiency in coupled CPU-GPU architectures

Details
Discussion Comments: 0
Verification: Authors have not verified information

Caliper: performance introspection for HPC software stacks

David Böhme, Todd Gamblin, David Beckingsale, Peer-Timo Bremer, Alfredo Giménez, Matthew P. LeGendre, Olga Pearce, Martin Schulz

Caliper: performance introspection for HPC software stacks

Details
Discussion Comments: 0
Verification: Authors have not verified information

Graph colouring as a challenge problem for dynamic graph processing on distributed systems

Scott Sallinen, Keita Iwabuchi, Suraj Poudel, Maya Gokhale, Matei Ripeanu, Roger A. Pearce

Graph colouring as a challenge problem for dynamic graph processing on distributed systems

Details
Discussion Comments: 0
Verification: Authors have not verified information

Truenorth ecosystem for brain-inspired computing: scalable systems, software, and applications

Jun Sawada, Filipp Akopyan, Andrew S. Cassidy, Brian Taba, Michael V. DeBole, Pallab Datta, Rodrigo Alvarez-Icaza, Arnon Amir, John V. Arthur, Alexander Andreopoulos, Rathinakumar Appuswamy, Heinz Baier, Davis Barch, David J. Berg, Carmelo di Nolfo, Steven K. Esser, Myron Flickner, Thomas A. Horvath, Bryan L. Jackson, Jeff Kusnitz, Scott Lekuch, Michael Mastro, Timothy Melano, Paul A. Merolla, Steven E. Millman, Tapan K. Nayak, Norm Pass, Hartmut E. Penner, William P. Risk, Kai Schleupen, Benjamin Shaw, Hayley Wu, Brian Giera, Adam T. Moody, Nathan Mundhenk, Brian Van Essen, Eric X. Wang, David P. Widemann, Qing Wu, William E. Murphy, Jamie K. Infantolino, James A. Ross, Dale R. Shires, Manuel M. Vindiola, Raju Namburu, Dharmendra S. Modha

Truenorth ecosystem for brain-inspired computing: scalable systems, software, and applications

Details
Discussion Comments: 0
Verification: Authors have not verified information

Accelerating lattice QCD multigrid on GPUs using fine-grained parallelization

Michael A. Clark, Bálint Joó, Alexei Strelchenko, Michael Cheng, Arjun Singh Gambhir, Richard C. Brower

Accelerating lattice QCD multigrid on GPUs using fine-grained parallelization

Details
Discussion Comments: 0
Verification: Authors have not verified information

Evaluating and optimizing OpenCL kernels for high performance computing with FPGAs

Hamid Reza Zohouri, Naoya Maruyama, Aaron Smith, Motohiko Matsuda, Satoshi Matsuoka

Evaluating and optimizing OpenCL kernels for high performance computing with FPGAs

Details
Discussion Comments: 0
Verification: Authors have not verified information

Perilla: metadata-based optimizations of an asynchronous runtime for adaptive mesh refinement

Tan Nguyen, Didem Unat, Weiqun Zhang, Ann S. Almgren, Muhammed Nufail Farooqi, John Shalf

Perilla: metadata-based optimizations of an asynchronous runtime for adaptive mesh refinement

Details
Discussion Comments: 0
Verification: Authors have not verified information

High-frequency nonlinear earthquake simulations on petascale heterogeneous supercomputers

Daniel Roten, Yifeng Cui, Kim B. Olsen, Steven M. Day, Kyle Withers, William H. Savran, Peng Wang, Dawei Mu

High-frequency nonlinear earthquake simulations on petascale heterogeneous supercomputers

Details
Discussion Comments: 0
Verification: Authors have not verified information

Flexfly: enabling a reconfigurable dragonfly through silicon photonics

Ke Wen, Payman Samadi, Sébastien Rumley, Christine P. Chen, Yiwen Shen, Meisam Bahadori, Keren Bergman, Jeremiah J. Wilke

Flexfly: enabling a reconfigurable dragonfly through silicon photonics

Details
Discussion Comments: 0
Verification: Authors have not verified information

HARP: predictive transfer optimization based on historical analysis and real-time probing

Engin Arslan, Kemal Guner, Tevfik Kosar

HARP: predictive transfer optimization based on historical analysis and real-time probing

Details
Discussion Comments: 0
Verification: Authors have not verified information

Modeling dilute solutions using first-principles molecular dynamics: computing more than a million atoms with over a million cores

Jean-Luc Fattebert, Daniel Osei-Kuffuor, Erik W. Draeger, Tadashi Ogitsu, William D. Krauss

Modeling dilute solutions using first-principles molecular dynamics: computing more than a million atoms with over a million cores

Details
Discussion Comments: 0
Verification: Authors have not verified information

GreenLA: green linear algebra software for GPU-accelerated heterogeneous computing

Jieyang Chen, Li Tan, Panruo Wu, Dingwen Tao, Hongbo Li, Xin Liang, Sihuan Li, Rong Ge, Laxmi N. Bhuyan, Zizhong Chen

GreenLA: green linear algebra software for GPU-accelerated heterogeneous computing

Details
Discussion Comments: 0
Verification: Authors have not verified information

Reliable and efficient performance monitoring in linux

Maria Dimakopoulou, Stéphane Eranian, Nectarios Koziris, Nicholas Bambos

Reliable and efficient performance monitoring in linux

Details
Artifacts for some papers are reviewed by an artifact evaluation, reproducibility, or similarly named committee. This is one such paper that passed review.
Artifact evaluation badge awarded
Discussion Comments: 0
Verification: Authors have not verified information

A highly effective global surface wave numerical simulation with ultra-high resolution

Fang-Li Qiao, Wei Zhao, Xunqiang Yin, Xiaomeng Huang, Xin Liu, Qi Shu, Guansuo Wang, Zhenya Song, Xinfang Li, Haixing Liu, Guangwen Yang, Yeli Yuan

A highly effective global surface wave numerical simulation with ultra-high resolution

Details
Discussion Comments: 0
Verification: Authors have not verified information

MetaMorph: a library framework for interoperable kernels on multi- and many-core clusters

Ahmed E. Helal, Paul Sathre, Wu-chun Feng

MetaMorph: a library framework for interoperable kernels on multi- and many-core clusters

Details
Author Comments:
Discussion Comments: 0
Sharing: Research produced artifacts
Verification: Authors have verified information

FlipBack: automatic targeted protection against silent data corruption

Xiang Ni, Laxmikant V. Kalé

FlipBack: automatic targeted protection against silent data corruption

Details
Discussion Comments: 0
Verification: Authors have not verified information

Daino: a high-level framework for parallel and efficient AMR on GPUs

Mohamed Wahib, Naoya Maruyama, Takayuki Aoki

Daino: a high-level framework for parallel and efficient AMR on GPUs

Details
Discussion Comments: 0
Verification: Authors have not verified information

Strassen's algorithm reloaded

Jianyu Huang, Tyler M. Smith, Greg M. Henry, Robert A. van de Geijn

Strassen's algorithm reloaded

Details
Discussion Comments: 0
Verification: Authors have not verified information

Block iterative methods and recycling for improved scalability of linear solvers

Pierre Jolivet, Pierre-Henri Tournier

Block iterative methods and recycling for improved scalability of linear solvers

Details
Artifacts for some papers are reviewed by an artifact evaluation, reproducibility, or similarly named committee. This is one such paper that passed review.
Artifact evaluation badge awarded
Discussion Comments: 0
Verification: Authors have not verified information

Enhancing infiniband with openflow-style SDN capability

Jason Lee, Zhou Tong, Karthik Achalkar, Xin Yuan, Michael Lang

Enhancing infiniband with openflow-style SDN capability

Details
Discussion Comments: 0
Verification: Authors have not verified information

DCA: a DRAM-cache-aware DRAM controller

Cheng-Chieh Huang, Vijay Nagarajan, Arpit Joshi

DCA: a DRAM-cache-aware DRAM controller

Details
Discussion Comments: 0
Verification: Authors have not verified information

Granularity and the cost of error recovery in resilient AMR scientific applications

Anshu Dubey, Hajime Fujita, Daniel T. Graves, Andrew A. Chien, Devesh Tiwari

Granularity and the cost of error recovery in resilient AMR scientific applications

Details
Discussion Comments: 0
Verification: Authors have not verified information

High performance emulation of quantum circuits

Thomas Häner, Damian S. Steiger, Mikhail Smelyanskiy, Matthias Troyer

High performance emulation of quantum circuits

Details
Discussion Comments: 0
Verification: Authors have not verified information

LIBXSMM: accelerating small matrix multiplications by runtime code generation

Alexander Heinecke, Greg Henry, Maxwell Hutchinson, Hans Pabst

LIBXSMM: accelerating small matrix multiplications by runtime code generation

Details
Discussion Comments: 0
Verification: Authors have not verified information

Unprotected computing: a large-scale study of DRAM raw error rate on a supercomputer

Leonardo Bautista-Gomez, Ferad Zyulkyarov, Osman S. Unsal, Simon McIntosh-Smith

Unprotected computing: a large-scale study of DRAM raw error rate on a supercomputer

Details
Discussion Comments: 0
Verification: Authors have not verified information

The vectorization of the tersoff multi-body potential: an exercise in performance portability

Markus Höhnerbach, Ahmed E. Ismail, Paolo Bientinesi

The vectorization of the tersoff multi-body potential: an exercise in performance portability

Details
Artifacts for some papers are reviewed by an artifact evaluation, reproducibility, or similarly named committee. This is one such paper that passed review.
Artifact evaluation badge awarded
Author Comments: The results were replicated: Results Replicated Badge.
Discussion Comments: 0
Sharing: Research produced artifacts
Verification: Authors have verified information

Simulation and performance analysis of the ECMWF tape library system

Markus Mäsker, Lars Nagel, Tim Süß, André Brinkmann, Lennart Sorth

Simulation and performance analysis of the ECMWF tape library system

Details
Discussion Comments: 0
Verification: Authors have not verified information

Increasing molecular dynamics simulation rates with an 8-fold increase in electrical power efficiency

W. Michael Brown, Andrey Semin, Michael Hebenstreit, Sergey Khvostov, Karthik Raman, Steven J. Plimpton

Increasing molecular dynamics simulation rates with an 8-fold increase in electrical power efficiency

Details
Artifacts for some papers are reviewed by an artifact evaluation, reproducibility, or similarly named committee. This is one such paper that passed review.
Artifact evaluation badge awarded
Discussion Comments: 0
Verification: Authors have not verified information

Transient guarantees: maximizing the value of idle cloud capacity

Supreeth Shastri, Amr Rizk, David E. Irwin

Transient guarantees: maximizing the value of idle cloud capacity

Details
Discussion Comments: 0
Verification: Authors have not verified information

Improving application resilience to memory errors with lightweight compression

Scott Levy, Kurt B. Ferreira, Patrick G. Bridges

Improving application resilience to memory errors with lightweight compression

Details
Discussion Comments: 0
Verification: Authors have not verified information

Towards green aviation with python at petascale

Peter E. Vincent, Freddie D. Witherden, Brian C. Vermeire, Jin Seok Park, Arvind Iyer

Towards green aviation with python at petascale

Details
Discussion Comments: 0
Verification: Authors have not verified information

Enhanced MPSM3 for applications to quantum biological simulations

A. Pozdneev, Valéry Weber, Teodoro Laino, Constantine Bekas, Alessandro Curioni

Enhanced MPSM3 for applications to quantum biological simulations

Details
Discussion Comments: 0
Verification: Authors have not verified information

10M-core scalable fully-implicit solver for nonhydrostatic atmospheric dynamics

Chao Yang, Wei Xue, Haohuan Fu, Hongtao You, Xinliang Wang, Yulong Ao, Fangfang Liu, Lin Gan, Ping Xu, Lanning Wang, Guangwen Yang, Weimin Zheng

10M-core scalable fully-implicit solver for nonhydrostatic atmospheric dynamics

Details
Discussion Comments: 0
Verification: Authors have not verified information

Optimizing memory efficiency for deep convolutional neural networks on GPUs

Chao Li, Yi Yang, Min Feng, Srimat T. Chakradhar, Huiyang Zhou

Optimizing memory efficiency for deep convolutional neural networks on GPUs

Details
Discussion Comments: 0
Verification: Authors have not verified information

Watch out for the bully!: job interference study on dragonfly network

Xu Yang, John Jenkins, Misbah Mubarak, Robert B. Ross, Zhiling Lan

Watch out for the bully!: job interference study on dragonfly network

Details
Discussion Comments: 0
Verification: Authors have not verified information

Development effort estimation in HPC

Sandra Wienke, Julian Miller, Martin Schulz, Matthias S. Müller

Development effort estimation in HPC

Details
Discussion Comments: 0
Verification: Authors have not verified information

SERF: efficient scheduling for fast deep neural network serving via judicious parallelism

Feng Yan, Yuxiong He, Olatunji Ruwase, Evgenia Smirni

SERF: efficient scheduling for fast deep neural network serving via judicious parallelism

Details
Author Comments:
Discussion Comments: 0
Sharing: Not able to share produced artifacts
Verification: Authors have verified information

Pinpointing scale-dependent integer overflow bugs in large-scale parallel applications

Ignacio Laguna, Martin Schulz

Pinpointing scale-dependent integer overflow bugs in large-scale parallel applications

Details
Discussion Comments: 0
Verification: Authors have not verified information

Understanding error propagation in GPGPU applications

Guanpeng Li, Karthik Pattabiraman, Chen-Yong Cher, Pradip Bose

Understanding error propagation in GPGPU applications

Details
Author Comments:
Discussion Comments: 0
Sharing: Research produced artifacts
Verification: Authors have verified information

Designing MPI library with on-demand paging (ODP) of infiniband: challenges and benefits

Mingzhe Li, Khaled Hamidouche, Xiaoyi Lu, Hari Subramoni, Jie Zhang, Dhabaleswar K. Panda

Designing MPI library with on-demand paging (ODP) of infiniband: challenges and benefits

Details
Discussion Comments: 0
Verification: Authors have not verified information

A domain-specific compiler for a parallel multiresolution adaptive numerical simulation environment

Samyam Rajbhandari, Jinsung Kim, Sriram Krishnamoorthy, Louis-Noël Pouchet, Fabrice Rastello, Robert J. Harrison, P. Sadayappan

A domain-specific compiler for a parallel multiresolution adaptive numerical simulation environment

Details
Discussion Comments: 0
Verification: Authors have not verified information

Multi-resource fair sharing for datacenter jobs with placement constraints

Wei Wang, Baochun Li, Ben Liang, Jun Li

Multi-resource fair sharing for datacenter jobs with placement constraints

Details
Discussion Comments: 0
Verification: Authors have not verified information

Performance modeling of in situ rendering

Matthew Larsen, Cyrus Harrison, James Kress, David Pugmire, Jeremy S. Meredith, Hank Childs

Performance modeling of in situ rendering

Details
Discussion Comments: 0
Verification: Authors have not verified information

Simulations of below-ground dynamics of fungi: 1.184 pflops attained by automated generation and autotuning of temporal blocking codes

Takayuki Muranushi, Hideyuki Hotta, Junichiro Makino, Seiya Nishizawa, Hirofumi Tomita, Keigo Nitadori, Masaki Iwasawa, Natsuki Hosono, Yutaka Maruyama, Hikaru Inoue, Hisashi Yashiro, Yoshifumi Nakamura

Simulations of below-ground dynamics of fungi: 1.184 pflops attained by automated generation and autotuning of temporal blocking codes

Details
Discussion Comments: 0
Verification: Authors have not verified information

Scalable non-blocking preconditioned conjugate gradient methods

Paul R. Eller, William Gropp

Scalable non-blocking preconditioned conjugate gradient methods

Details
Discussion Comments: 0
Verification: Authors have not verified information

Refactoring and optimizing the community atmosphere model (CAM) on the sunway taihulight supercomputer

Haohuan Fu, Junfeng Liao, Wei Xue, Lanning Wang, Dexun Chen, Long Gu, Jinxiu Xu, Nan Ding, Xinliang Wang, Conghui He, Shizhen Xu, Yishuang Liang, Jiarui Fang, Yuanchao Xu, Weijie Zheng, Jingheng Xu, Zhen Zheng, Wanjing Wei, Xu Ji, He Zhang, Bingwei Chen, Kaiwei Li, Xiaomeng Huang, Wenguang Chen, Guangwen Yang

Refactoring and optimizing the community atmosphere model (CAM) on the sunway taihulight supercomputer

Details
Discussion Comments: 0
Verification: Authors have not verified information

Exploring the potentials of parallel garbage collection in SSDs for enterprise storage systems

Narges Shahidi, Mohammad Arjomand, Myoungsoo Jung, Mahmut T. Kandemir, Chita R. Das, Anand Sivasubramaniam

Exploring the potentials of parallel garbage collection in SSDs for enterprise storage systems

Details
Discussion Comments: 0
Verification: Authors have not verified information

MUSA: a multi-level simulation approach for next-generation HPC machines

Thomas Grass, César Allande, Adrià Armejach, Alejandro Rico, Eduard Ayguadé, Jesús Labarta, Mateo Valero, Marc Casas, Miquel Moretó

MUSA: a multi-level simulation approach for next-generation HPC machines

Details
Discussion Comments: 0
Verification: Authors have not verified information

G-store: high-performance graph store for trillion-edge processing

Pradeep Kumar, H. Howie Huang

G-store: high-performance graph store for trillion-edge processing

Details
Author Comments:
Discussion Comments: 0
Sharing: Research produced artifacts
Verification: Authors have verified information

Automating wavefront parallelization for sparse matrix computations

Anand Venkat, Mahdi Soltan Mohammadi, Jongsoo Park, Hongbo Rong, Rajkishore Barik, Michelle Mills Strout, Mary W. Hall

Automating wavefront parallelization for sparse matrix computations

Details
Discussion Comments: 0
Verification: Authors have not verified information

A parallel algorithm for finding all pairs k-mismatch maximal common substrings

Sriram P. Chockalingam, Sharma V. Thankachan, Srinivas Aluru

A parallel algorithm for finding all pairs k-mismatch maximal common substrings

Details
Discussion Comments: 0
Verification: Authors have not verified information

Extreme scale plasma turbulence simulations on top supercomputers worldwide

William M. Tang, Bei Wang, Stéphane Ethier, Grzegorz Kwasniewski, Torsten Hoefler, Khaled Z. Ibrahim, Kamesh Madduri, Samuel Williams, Leonid Oliker, Carlos Rosales-Fernandez, Timothy J. Williams

Extreme scale plasma turbulence simulations on top supercomputers worldwide

Details
Discussion Comments: 0
Verification: Authors have not verified information

An efficient and scalable algorithmic method for generating large: scale random graphs

Md. Maksudul Alam, Maleq Khan, Anil Vullikanti, Madhav V. Marathe

An efficient and scalable algorithmic method for generating large: scale random graphs

Details
Discussion Comments: 0
Verification: Authors have not verified information

An exploration of optimization algorithms for high performance tensor completion

Shaden Smith, Jongsoo Park, George Karypis

An exploration of optimization algorithms for high performance tensor completion

Details
Artifacts for some papers are reviewed by an artifact evaluation, reproducibility, or similarly named committee. This is one such paper that passed review.
Artifact evaluation badge awarded
Author Comments:
Discussion Comments: 0
Sharing: Research produced artifacts
Verification: Authors have verified information

Scalemine: scalable parallel frequent subgraph mining in a single large graph

Ehab Abdelhamid, Ibrahim Abdelaziz, Panos Kalnis, Zuhair Khayyat, Fuad Jamour

Scalemine: scalable parallel frequent subgraph mining in a single large graph

Details
Artifacts for some papers are reviewed by an artifact evaluation, reproducibility, or similarly named committee. This is one such paper that passed review.
Artifact evaluation badge awarded
Author Comments:
Discussion Comments: 0
Sharing: Research produced artifacts
Verification: Authors have verified information

Failure detection and propagation in HPC systems

George Bosilca, Aurelien Bouteiller, Amina Guermouche, Thomas Hérault, Yves Robert, Pierre Sens, Jack J. Dongarra

Failure detection and propagation in HPC systems

Details
Discussion Comments: 0
Verification: Authors have not verified information

Evaluating HPC networks via simulation of parallel workloads

Nikhil Jain, Abhinav Bhatele, Sam White, Todd Gamblin, Laxmikant V. Kalé

Evaluating HPC networks via simulation of parallel workloads

Details
Author Comments:
Discussion Comments: 0
Sharing: Research produced no artifacts
Verification: Authors have verified information

Extreme-scale phase field simulations of coarsening dynamics on the sunway taihulight supercomputer

Jian Zhang, Chunbao Zhou, Yangang Wang, Lili Ju, Qiang Du, Xuebin Chi, Dongsheng Xu, Dexun Chen, Yong Liu, Zhao Liu

Extreme-scale phase field simulations of coarsening dynamics on the sunway taihulight supercomputer

Details
Discussion Comments: 0
Verification: Authors have not verified information

Efficient delaunay tessellation through K-D tree decomposition

Dmitriy Morozov, Tom Peterka

Efficient delaunay tessellation through K-D tree decomposition

Details
Author Comments:
Discussion Comments: 0
Sharing: Research produced artifacts
Verification: Authors have verified information

Extended task queuing: active messages for heterogeneous systems

Michael LeBeane, Brandon Potter, Abhisek Pan, Alexandru Dutu, Vinay Agarwala, Wonchan Lee, Deepak Majeti, Bibek Ghimire, Eric Van Tassell, Samuel Wasmundt, Brad Benton, Mauricio Breternitz, Michael L. Chu, Mithuna Thottethodi, Lizy K. John, Steven K. Reinhardt

Extended task queuing: active messages for heterogeneous systems

Details
Discussion Comments: 0
Verification: Authors have not verified information

Merge-based parallel sparse matrix-vector multiplication

Duane Merrill, Michael Garland

Merge-based parallel sparse matrix-vector multiplication

Details
Artifacts for some papers are reviewed by an artifact evaluation, reproducibility, or similarly named committee. This is one such paper that passed review.
Artifact evaluation badge awarded
Discussion Comments: 0
Verification: Authors have not verified information

Performance analysis, design considerations, and applications of extreme-scale in situ infrastructures

Utkarsh Ayachit, Andrew C. Bauer, Earl P. N. Duque, Greg Eisenhauer, Nicola Ferrier, Junmin Gu, Kenneth E. Jansen, Burlen Loring, Zarija Lukic, Suresh Menon, Dmitriy Morozov, Patrick O'Leary, Reetesh Ranjan, Michel E. Rasquin, Christopher P. Stone, Venkatram Vishwanath, Gunther H. Weber, Brad Whitlock, Matthew Wolf, K. John Wu, E. Wes Bethel

Performance analysis, design considerations, and applications of extreme-scale in situ infrastructures

Details
Discussion Comments: 0
Verification: Authors have not verified information

Optimal execution of co-analysis for large-scale molecular dynamics simulations

Preeti Malakar, Venkatram Vishwanath, Christopher Knight, Todd S. Munson, Michael E. Papka

Optimal execution of co-analysis for large-scale molecular dynamics simulations

Details
Artifacts for some papers are reviewed by an artifact evaluation, reproducibility, or similarly named committee. This is one such paper that passed review.
Artifact evaluation badge awarded
Discussion Comments: 0
Verification: Authors have not verified information

A data driven scheduling approach for power management on HPC systems

Sean Wallace, Xu Yang, Venkatram Vishwanath, William E. Allcock, Susan Coghlan, Michael E. Papka, Zhiling Lan

A data driven scheduling approach for power management on HPC systems

Details
Discussion Comments: 0
Verification: Authors have not verified information

Compiler-directed lightweight checkpointing for fine-grained guaranteed soft error recovery

Qingrui Liu, Changhee Jung, Dongyoon Lee, Devesh Tiwari

Compiler-directed lightweight checkpointing for fine-grained guaranteed soft error recovery

Details
Discussion Comments: 0
Verification: Authors have not verified information

Scheduling-aware routing for supercomputers

Jens Domke, Torsten Hoefler

Scheduling-aware routing for supercomputers

Details
Discussion Comments: 0
Verification: Authors have not verified information

A parallel arbitrary-order accurate AMR algorithm for the scalar advection-diffusion equation

Arash Bakhtiari, Dhairya Malhotra, Amir Raoofy, Miriam Mehl, Hans-Joachim Bungartz, George Biros

A parallel arbitrary-order accurate AMR algorithm for the scalar advection-diffusion equation

Details
Discussion Comments: 0
Verification: Authors have not verified information

Enabling efficient preemption for SIMT architectures with lightweight context switching

Zhen Lin, Lars Nyland, Huiyang Zhou

Enabling efficient preemption for SIMT architectures with lightweight context switching

Details
Discussion Comments: 0
Verification: Authors have not verified information

Distributed-memory large deformation diffeomorphic 3D image registration

Andreas Mang, Amir Gholami, George Biros

Distributed-memory large deformation diffeomorphic 3D image registration

Details
Discussion Comments: 0
Verification: Authors have not verified information

Real-time synthesis of compression algorithms for scientific data

Martin Burtscher, Hari Mukka, Annie Yang, Farbod Hesaaraki

Real-time synthesis of compression algorithms for scientific data

Details
Discussion Comments: 0
Verification: Authors have not verified information

ZNNi: maximizing the inference throughput of 3D convolutional networks on CPUs and GPUs

Aleksandar Zlateski, Kisuk Lee, H. Sebastian Seung

ZNNi: maximizing the inference throughput of 3D convolutional networks on CPUs and GPUs

Details
Discussion Comments: 0
Verification: Authors have not verified information

An ephemeral burst-buffer file system for scientific applications

Teng Wang, Kathryn Mohror, Adam Moody, Kento Sato, Weikuan Yu

An ephemeral burst-buffer file system for scientific applications

Details
Discussion Comments: 0
Verification: Authors have not verified information

Týr: blob storage meets built-in transactions

Pierre Matri, Alexandru Costan, Gabriel Antoniu, Jesús Montes, María S. Pérez

Týr: blob storage meets built-in transactions

Details
Discussion Comments: 0
Verification: Authors have not verified information

Understanding performance interference in next-generation HPC systems

Oscar H. Mondragon, Patrick G. Bridges, Scott Levy, Kurt B. Ferreira, Patrick M. Widener

Understanding performance interference in next-generation HPC systems

Details
Discussion Comments: 0
Verification: Authors have not verified information

DAOS and friends: a proposal for an exascale storage system

Jay F. Lofstead, Ivo Jimenez, Carlos Maltzahn, Quincey Koziol, John Bent, Eric Barton

DAOS and friends: a proposal for an exascale storage system

Details
Author Comments:
Discussion Comments: 0
Sharing: Research produced artifacts
Verification: Authors have verified information

Characterizing parallel scientific applications on commodity clusters: an empirical study of a tapered fat-tree

Edgar A. León, Ian Karlin, Abhinav Bhatele, Steven H. Langer, Chris Chambreau, Louis H. Howell, Trent D'Hooge, Matthew L. Leininger

Characterizing parallel scientific applications on commodity clusters: an empirical study of a tapered fat-tree

Details
Discussion Comments: 0
Verification: Authors have not verified information