ACM/IEEE Intl. Conf. for High Perf. Computing, Networking, Storage and Analysis, SC 2014


A research artifact is any by-product of a research project that is not directly included in the published research paper. In Computer Science research this is often source code and data sets, but it could also be media, documentation, inputs to proof assistants, shell scripts to run experiments, etc.

High-Performance Computation of Distributed-Memory Parallel 3D Voronoi and Delaunay Tessellation

Tom Peterka, Dmitriy Morozov, Carolyn L. Phillips

Discussion Comments: 0
Verification: Authors have not verified information

Fence Scoping

Changhui Lin, Vijay Nagarajan, Rajiv Gupta

Discussion Comments: 0
Verification: Authors have not verified information

Oil and Water Can Mix: An Integration of Polyhedral and AST-Based Transformations

Jun Shirako, Louis-Noël Pouchet, Vivek Sarkar

Discussion Comments: 0
Verification: Authors have not verified information

Efficient I/O and Storage of Adaptive-Resolution Data

Sidharth Kumar, John Edwards, Peer-Timo Bremer, Aaron Knoll, Cameron Christensen, Venkatram Vishwanath, Philip H. Carns, John A. Schmidt, Valerio Pascucci

Discussion Comments: 0
Verification: Authors have not verified information

Faster Parallel Traversal of Scale Free Graphs at Extreme Scale with Vertex Delegates

Roger A. Pearce, Maya B. Gokhale, Nancy M. Amato

Discussion Comments: 0
Verification: Authors have not verified information

Scheduling Multi-tenant Cloud Workloads on Accelerator-Based Systems

Dipanjan Sengupta, Anshuman Goswami, Karsten Schwan, Krishna Pallavi

Discussion Comments: 0
Verification: Authors have not verified information

Maximizing Throughput on a Dragonfly Network

Nikhil Jain, Abhinav Bhatele, Xiang Ni, Nicholas J. Wright, Laxmikant V. Kalé

Discussion Comments: 0
Verification: Authors have not verified information

Dissecting On-Node Memory Access Performance: A Semantic Approach

Alfredo Giménez, Todd Gamblin, Barry Rountree, Abhinav Bhatele, Ilir Jusufi, Peer-Timo Bremer, Bernd Hamann

Discussion Comments: 0
Verification: Authors have not verified information

Managing DRAM Latency Divergence in Irregular GPGPU Applications

Niladrish Chatterjee, Mike O'Connor, Gabriel H. Loh, Nuwan Jayasena, Rajeev Balasubramonian

Discussion Comments: 0
Verification: Authors have not verified information

Fast Sparse Matrix-Vector Multiplication on GPUs for Graph Applications

Arash Ashari, Naser Sedaghati, John Eisenlohr, Srinivasan Parthasarathy, P. Sadayappan

Discussion Comments: 0
Verification: Authors have not verified information

Fail-in-Place Network Design: Interaction Between Topology, Routing Algorithm and Failures

Jens Domke, Torsten Hoefler, Satoshi Matsuoka

Discussion Comments: 0
Verification: Authors have not verified information

Pardicle: Parallel Approximate Density-Based Clustering

Md. Mostofa Ali Patwary, Nadathur Satish, Narayanan Sundaram, Fredrik Manne, Salman Habib, Pradeep Dubey

Discussion Comments: 0
Verification: Authors have not verified information

Orion: Scaling Genomic Sequence Matching with Fine-Grained Parallelization

Kanak Mahadik, Somali Chaterji, Bowen Zhou, Milind Kulkarni, Saurabh Bagchi

Discussion Comments: 0
Verification: Authors have not verified information

An Image-Based Approach to Extreme Scale in Situ Visualization and Analysis

James P. Ahrens, Sébastien Jourdain, Patrick O'Leary, John Patchett, David H. Rogers, Mark Petersen

Discussion Comments: 0
Verification: Authors have not verified information

A Communication-Optimal Framework for Contracting Distributed Tensors

Samyam Rajbhandari, Akshay Nikam, Pai-Wei Lai, Kevin Stock, Sriram Krishnamoorthy, P. Sadayappan

Discussion Comments: 0
Verification: Authors have not verified information

High-Productivity Framework on GPU-Rich Supercomputers for Operational Weather Prediction Code ASUCA

Takashi Shimokawabe, Takayuki Aoki, Naoyuki Onodera

Discussion Comments: 0
Verification: Authors have not verified information

Physics-Based Urban Earthquake Simulation Enhanced by 10.7 BlnDOF × 30 K Time-Step Unstructured FE Non-Linear Seismic Wave Simulation

Tsuyoshi Ichimura, Kohei Fujita, Seizo Tanaka, Muneo Hori, Wijerathne Maddegedara Lalith Lakshman, Yoshihisa Shizawa, Hiroshi Kobayashi

Discussion Comments: 0
Verification: Authors have not verified information

Parallelization of Reordering Algorithms for Bandwidth and Wavefront Reduction

Konstantinos I. Karantasis, Andrew Lenharth, Donald Nguyen, María Jesús Garzarán, Keshav Pingali

Discussion Comments: 0
Verification: Authors have not verified information

Lattice QCD with Domain Decomposition on Intel® Xeon Phi Co-Processors

Simon Heybrock, Bálint Joó, Dhiraj D. Kalamkar, Mikhail Smelyanskiy, Karthikeyan Vaidyanathan, Tilo Wettig, Pradeep Dubey

Discussion Comments: 0
Verification: Authors have not verified information

Omnisc'IO: A Grammar-Based Approach to Spatial and Temporal I/O Patterns Prediction

Matthieu Dorier, Shadi Ibrahim, Gabriel Antoniu, Robert B. Ross

Discussion Comments: 0
Verification: Authors have not verified information

DISC: A Domain-Interaction Based Programming Model with Support for Heterogeneous Execution

Mehmet Can Kurt, Gagan Agrawal

Discussion Comments: 0
Verification: Authors have not verified information

Petascale High Order Dynamic Rupture Earthquake Simulations on Heterogeneous Supercomputers

Alexander Heinecke, Alexander Breuer, Sebastian Rettenberger, Michael Bader, Alice-Agnes Gabriel, Christian Pelties, Arndt Bode, William Barth, Xiangke Liao, Karthikeyan Vaidyanathan, Mikhail Smelyanskiy, Pradeep Dubey

Discussion Comments: 0
Verification: Authors have not verified information

Scalable and High Performance Betweenness Centrality on the GPU

Adam McLaughlin, David A. Bader

Author Comments:
Discussion Comments: 0
Sharing: Research produced artifacts
Verification: Authors have verified information

A Study on Balancing Parallelism, Data Locality, and Recomputation in Existing PDE Solvers

Catherine Mills Olschanowsky, Michelle Mills Strout, Stephen M. Guzik, John Loffeld, Jeffrey Hittinger

Discussion Comments: 0
Verification: Authors have not verified information

FlexSlot: Moving Hadoop Into the Cloud with Flexible Slot Management

Yanfei Guo, Jia Rao, Changjun Jiang, Xiaobo Zhou

Discussion Comments: 0
Verification: Authors have not verified information

24.77 Pflops on a Gravitational Tree-Code to Simulate the Milky Way Galaxy with 18600 GPUs

Jeroen Bédorf, Evghenii Gaburov, Michiko S. Fujii, Keigo Nitadori, Tomoaki Ishiyama, Simon Portegies Zwart

Author Comments:
Discussion Comments: 0
Sharing: Research produced artifacts
Verification: Authors have verified information

MC-Checker: Detecting Memory Consistency Errors in MPI One-Sided Applications

Zhezhe Chen, James Dinan, Zhen Tang, Pavan Balaji, Hua Zhong, Jun Wei, Tao Huang, Feng Qin

Discussion Comments: 0
Verification: Authors have not verified information

Parallel Bayesian Network Structure Learning for Genome-Scale Gene Networks

Sanchit Misra, Md. Vasimuddin, Kiran Pamnany, Sriram P. Chockalingam, Yong Dong, Min Xie, Maneesha R. Aluru, Srinivas Aluru

Discussion Comments: 0
Verification: Authors have not verified information

Correctness Field Testing of Production and Decommissioned High Performance Computing Platforms at Los Alamos National Laboratory

Sarah Ellen Michalak, William N. Rust, John T. Daly, Andrew J. DuBois, David H. DuBois

Discussion Comments: 0
Verification: Authors have not verified information

Structure Slicing: Extending Logical Regions with Fields

Michael Bauer, Sean Treichler, Elliott Slaughter, Alex Aiken

Discussion Comments: 0
Verification: Authors have not verified information

Reciprocal Resource Fairness: Towards Cooperative Multiple-Resource Fair Sharing in IaaS Clouds

Haikun Liu, Bingsheng He

Discussion Comments: 0
Verification: Authors have not verified information

Efficient Sparse Matrix-Vector Multiplication on GPUs Using the CSR Storage Format

Joseph L. Greathouse, Mayank Daga

Discussion Comments: 0
Verification: Authors have not verified information

Optimized Scheduling Strategies for Hybrid Density Functional Theory Electronic Structure Calculations

William Dawson, François Gygi

Discussion Comments: 0
Verification: Authors have not verified information

IndexFS: Scaling File System Metadata Performance with Stateless Caching and Bulk Insertion

Kai Ren, Qing Zheng, Swapnil Patil, Garth A. Gibson

Discussion Comments: 0
Verification: Authors have not verified information

A User-Friendly Approach for Tuning Parallel File Operations

Robert T. McLay, Doug James, Si Liu, John Cazes, William L. Barth

Discussion Comments: 0
Verification: Authors have not verified information

Optimization of a Multilevel Checkpoint Model with Uncertain Execution Scales

Sheng Di, Leonardo Arturo Bautista-Gomez, Franck Cappello

Discussion Comments: 0
Verification: Authors have not verified information

pTatin3D: High-Performance Methods for Long-Term Lithospheric Dynamics

Dave A. May, Jed Brown, Laetitia Le Pourhiet

Discussion Comments: 0
Verification: Authors have not verified information

Optimizing Data Locality for Fork/Join Programs Using Constrained Work Stealing

Jonathan Lifflander, Sriram Krishnamoorthy, Laxmikant V. Kalé

Discussion Comments: 0
Verification: Authors have not verified information

RAHTM: Routing Algorithm Aware Hierarchical Task Mapping

Ahmed H. Abdel-Gawad, Mithuna Thottethodi, Abhinav Bhatele

Discussion Comments: 0
Verification: Authors have not verified information

ECC Parity: A Technique for Efficient Memory Error Resilience for Multi-Channel Memory Systems

Xun Jian, Rakesh Kumar

Discussion Comments: 0
Verification: Authors have not verified information

A System Software Approach to Proactive Memory-Error Avoidance

Carlos H. A. Costa, Yoonho Park, Bryan S. Rosenburg, Chen-Yong Cher, Kyung Dong Ryu

Discussion Comments: 0
Verification: Authors have not verified information

Parallel De Bruijn Graph Construction and Traversal for De Novo Genome Assembly

Evangelos Georganas, Aydin Buluç, Jarrod Chapman, Leonid Oliker, Daniel Rokhsar, Katherine A. Yelick

Discussion Comments: 0
Verification: Authors have not verified information

Enabling Efficient Multithreaded MPI Communication through a Library-Based Implementation of MPI Endpoints

Srinivas Sridharan, James Dinan, Dhiraj D. Kalamkar

Discussion Comments: 0
Verification: Authors have not verified information

Scalable Computation of Stream Surfaces on Large Scale Vector Fields

Kewei Lu, Han-Wei Shen, Tom Peterka

Discussion Comments: 0
Verification: Authors have not verified information

The DRIHM Project: A Flexible Approach to Integrate HPC, Grid and Cloud Resources for Hydro-Meteorological Research

Daniele D'Agostino, Andrea Clematis, Antonella Galizia, Alfonso Quarati, Emanuele Danovaro, Luca Roverelli, Gabriele Zereik, Dieter Kranzlmüller, Michael Schiffers, Nils gentschen Felde, Christian Straube, Olivier Caumont, Evelyne Richard, Luis Garrote, Quillon Harpham, H. R. A. Jagers, Vladimir Dimitrijevic, Ljiljana Dekic, Elisabetta Fiori, Fabio Delogu, Antonio Parodi

Discussion Comments: 0
Verification: Authors have not verified information

Mapping to Irregular Torus Topologies and Other Techniques for Petascale Biomolecular Simulation

James C. Phillips, Yanhua Sun, Nikhil Jain, Eric J. Bohm, Laxmikant V. Kalé

Author Comments:
Discussion Comments: 0
Sharing: Research produced artifacts
Verification: Authors have verified information

CYPRESS: Combining Static and Dynamic Analysis for Top-Down Communication Trace Compression

Jidong Zhai, Jianfei Hu, Xiongchao Tang, Xiaosong Ma, Wenguang Chen

Discussion Comments: 0
Verification: Authors have not verified information

Fault-Tolerant Dynamic Task Graph Scheduling

Mehmet Can Kurt, Sriram Krishnamoorthy, Kunal Agrawal, Gagan Agrawal

Discussion Comments: 0
Verification: Authors have not verified information

Anton 2: Raising the Bar for Performance and Programmability in a Special-Purpose Molecular Dynamics Supercomputer

David E. Shaw, J. P. Grossman, Joseph A. Bank, Brannon Batson, J. Adam Butts, Jack C. Chao, Martin M. Deneroff, Ron O. Dror, Amos Even, Christopher H. Fenton, Anthony Forte, Joseph Gagliardo, Gennette Gill, Brian Greskamp, Richard C. Ho, Douglas J. Ierardi, Lev Iserovich, Jeffrey Kuskin, Richard H. Larson, Timothy Layman, Li-Siang Lee, Adam K. Lerer, Chester Li, Daniel Killebrew, Kenneth M. Mackenzie, Shark Yeuk-Hai Mok, Mark A. Moraes, Rolf Mueller, Lawrence J. Nociolo, Jon L. Peticolas, Terry Quan, Daniel Ramot, John K. Salmon, Daniele Paolo Scarpazza, U. Ben Schafer, Naseer Siddique, Christopher W. Snyder, Jochen Spengler, Ping Tak Peter Tang, Michael Theobald, Horia Toma, Brian Towles, Benjamin Vitale, Stanley C. Wang, Cliff Young

Discussion Comments: 0
Verification: Authors have not verified information

Compiler Techniques for Massively Scalable Implicit Task Parallelism

Timothy G. Armstrong, Justin M. Wozniak, Michael Wilde, Ian T. Foster

Discussion Comments: 0
Verification: Authors have not verified information

MSL: A Synthesis Enabled Language for Distributed Implementations

Zhilei Xu, Shoaib Kamil, Armando Solar-Lezama

Discussion Comments: 0
Verification: Authors have not verified information

Fast Parallel Computation of Longest Common Prefixes

Julian Shun

Discussion Comments: 0
Verification: Author has not verified information

Quantitatively Modeling Application Resilience with the Data Vulnerability Factor

Li Yu, Dong Li, Sparsh Mittal, Jeffrey S. Vetter

Discussion Comments: 0
Verification: Authors have not verified information

Maximizing Throughput of Overprovisioned HPC Data Centers Under a Strict Power Budget

Osman Sarood, Akhil Langer, Abhishek Gupta, Laxmikant V. Kalé

Discussion Comments: 0
Verification: Authors have not verified information

Microbank: Architecting Through-Silicon Interposer-Based Main Memory Systems

Young Hoon Son, Seongil O, Hyunggyun Yang, Daejin Jung, Jung Ho Ahn, John Kim, Jangwoo Kim, Jae W. Lee

Discussion Comments: 0
Verification: Authors have not verified information

In-Situ Feature Extraction of Large Scale Combustion Simulations Using Segmented Merge Trees

Aaditya G. Landge, Valerio Pascucci, Attila Gyulassy, Janine Bennett, Hemanth Kolla, Jacqueline Chen, Peer-Timo Bremer

Discussion Comments: 0
Verification: Authors have not verified information

Practical Symbolic Race Checking of GPU Programs

Peng Li, Guodong Li, Ganesh Gopalakrishnan

Discussion Comments: 0
Verification: Authors have not verified information

Parallel Programming with Migratable Objects: Charm++ in Practice

Bilge Acun, Abhishek Gupta, Nikhil Jain, Akhil Langer, Harshitha Menon, Eric Mikida, Xiang Ni, Michael P. Robson, Yanhua Sun, Ehsan Totoni, Lukasz Wesolowski, Laxmikant V. Kalé

Discussion Comments: 0
Verification: Authors have not verified information

NUMARCK: Machine Learning Algorithm for Resiliency and Checkpointing

Zhengzhang Chen, Seung Woo Son, William Hendrix, Ankit Agrawal, Wei-keng Liao, Alok N. Choudhary

Discussion Comments: 0
Verification: Authors have not verified information

Understanding the Effects of Communication and Coordination on Checkpointing at Scale

Kurt B. Ferreira, Patrick M. Widener, Scott Levy, Dorian C. Arnold, Torsten Hoefler

Discussion Comments: 0
Verification: Authors have not verified information

Scalable Kernel Fusion for Memory-Bound GPU Applications

Mohamed Wahib, Naoya Maruyama

Discussion Comments: 0
Verification: Authors have not verified information

Nonblocking Epochs in MPI One-Sided Communication

Judicael A. Zounmevo, Xin Zhao, Pavan Balaji, William Gropp, Ahmad Afsahi

Discussion Comments: 0
Verification: Authors have not verified information

A Volume Integral Equation Stokes Solver for Problems with Variable Coefficients

Dhairya Malhotra, Amir Gholami, George Biros

Discussion Comments: 0
Verification: Authors have not verified information

Domain Decomposition Preconditioners for Communication-Avoiding Krylov Methods on a Hybrid CPU/GPU Cluster

Ichitaro Yamazaki, Sivasankaran Rajamanickam, Erik G. Boman, Mark Hoemmen, Michael A. Heroux, Stanimire Tomov

Discussion Comments: 0
Verification: Authors have not verified information

Real-Time Scalable Cortical Computing at 46 Giga-Synaptic OPS/Watt with ~100× Speedup in Time-to-Solution and ~100,000× Reduction in Energy-to-Solution

Andrew S. Cassidy, Rodrigo Alvarez-Icaza, Filipp Akopyan, Jun Sawada, John V. Arthur, Paul Merolla, Pallab Datta, Marc González Tallada, Brian Taba, Alexander Andreopoulos, Arnon Amir, Steven K. Esser, Jeff Kusnitz, Rathinakumar Appuswamy, Chuck Haymes, Bernard Brezzo, Roger Moussalli, Ralph Bellofatto, Christian W. Baks, Michael Mastro, Kai Schleupen, Charles E. Cox, Ken Inoue, Steven E. Millman, Nabil Imam, Emmett McQuinn, Yutaka Y. Nakamura, Ivan Vo, Chen Guok, Don Nguyen, Scott Lekuch, Sameh W. Asaad, Daniel J. Friedman, Bryan L. Jackson, Myron Flickner, William P. Risk, Rajit Manohar, Dharmendra S. Modha

Discussion Comments: 0
Verification: Authors have not verified information

Parallel Deep Neural Network Training for Big Data on Blue Gene/Q

I-Hsin Chung, Tara N. Sainath, Bhuvana Ramabhadran, Michael Picheny, John A. Gunnels, Vernon Austel, Upendra V. Chaudhari, Brian Kingsbury

Discussion Comments: 0
Verification: Authors have not verified information

Scaling MapReduce Vertically and Horizontally

Ismail El-Helw, Rutger F. H. Hofman, Henri E. Bal

Discussion Comments: 0
Verification: Authors have not verified information

Understanding Soft Error Resiliency of Blue Gene/Q Compute Chip through Hardware Proton Irradiation and Software Fault Injection

Chen-Yong Cher, Meeta Sharma Gupta, Pradip Bose, K. Paul Muller

Discussion Comments: 0
Verification: Authors have not verified information

A Unified Programming Model for Intra- and Inter-Node Offloading on Xeon Phi Clusters

Matthias Noack, Florian Wende, Thomas Steinke, Frank Cordes

Discussion Comments: 0
Verification: Authors have not verified information

Using an Adaptive HPC Runtime System to Reconfigure the Cache Hierarchy

Ehsan Totoni, Josep Torrellas, Laxmikant V. Kalé

Discussion Comments: 0
Verification: Authors have not verified information

A Computation- and Communication-Optimal Parallel Direct 3-Body Algorithm

Penporn Koanantakool, Katherine A. Yelick

Discussion Comments: 0
Verification: Authors have not verified information

Slim Fly: A Cost Effective Low-Diameter Network Topology

Maciej Besta, Torsten Hoefler

Discussion Comments: 0
Verification: Authors have not verified information

Efficient Implementation of Many-Body Quantum Chemical Methods on the Intel® Xeon Phi Coprocessor

Edoardo Aprà, Michael Klemm, Karol Kowalski

Discussion Comments: 0
Verification: Authors have not verified information

Exploring Automatic, Online Failure Recovery for Scientific Applications at Extreme Scales

Marc Gamell, Daniel S. Katz, Hemanth Kolla, Jacqueline Chen, Scott Klasky, Manish Parashar

Discussion Comments: 0
Verification: Authors have not verified information

Recycled Error Bits: Energy-Efficient Architectural Support for Floating Point Accuracy

Ralph Nathan, Bryan Anthonio, Shih-Lien Lu, Helia Naeimi, Daniel J. Sorin, Xiaobai Sun

Discussion Comments: 0
Verification: Authors have not verified information

Efficient Shared-Memory Implementation of High-Performance Conjugate Gradient Benchmark and its Application to Unstructured Matrices

Jongsoo Park, Mikhail Smelyanskiy, Karthikeyan Vaidyanathan, Alexander Heinecke, Dhiraj D. Kalamkar, Xing Liu, Md. Mostofa Ali Patwary, Yutong Lu, Pradeep Dubey

Discussion Comments: 0
Verification: Authors have not verified information

Fast Iterative Graph Computation: A Path Centric Approach

Pingpeng Yuan, Wenya Zhang, Changfeng Xie, Hai Jin, Ling Liu, Kisung Lee

Discussion Comments: 0
Verification: Authors have not verified information

Scaling the Power Wall: A Path to Exascale

Oreste Villa, Daniel R. Johnson, Mike O'Connor, Evgeny Bolotin, David W. Nellans, Justin Luitjens, Nikolai Sakharnykh, Peng Wang, Paulius Micikevicius, Anthony Scudiero, Stephen W. Keckler, William J. Dally

Discussion Comments: 0
Verification: Authors have not verified information

Best Practices and Lessons Learned from Deploying and Operating Large-Scale Data-Centric Parallel File Systems

Sarp Oral, James Simmons, Jason Hill, Dustin Leverman, Feiyi Wang, Matthew A. Ezell, Ross Miller, Douglas Fuller, Raghul Gunasekaran, Youngjae Kim, Saurabh Gupta, Devesh Tiwari, Sudharshan S. Vazhkudai, James H. Rogers, David Dillow, Galen M. Shipman, Arthur S. Bland

Discussion Comments: 0
Verification: Authors have not verified information

Pipelining Computational Stages of the Tomographic Reconstructor for Multi-Object Adaptive Optics on a Multi-GPU System

Ali Charara, Hatem Ltaief, Damien Gratadour, David E. Keyes, Arnaud Sevin, Ahmad Abdelfattah, Eric Gendron, Carine Morel, Fabrice Vidal

Discussion Comments: 0
Verification: Authors have not verified information

The Lightweight Distributed Metric Service: A Scalable Infrastructure for Continuous Monitoring of Large Scale Computing Systems and Applications

Anthony Agelastos, Benjamin A. Allan, Jim M. Brandt, Paul Cassella, Jeremy Enos, Joshi Fullop, Ann C. Gentile, Steve Monk, Nichamon Naksinehaboon, Jeff Ogden, Mahesh Rajan, Michael T. Showerman, Joel Stevenson, Narate Taerat, Thomas W. Tucker

Discussion Comments: 0
Verification: Authors have not verified information

Metascalable Quantum Molecular Dynamics Simulations of Hydrogen-on-Demand

Ken-ichi Nomura, Rajiv K. Kalia, Aiichiro Nakano, Priya Vashishta, Kohei Shimamura, Fuyuki Shimojo, Manaschai Kunaseth, Paul C. Messina, Nichols A. Romero

Discussion Comments: 0
Verification: Authors have not verified information

Application Centric Energy-Efficiency Study of Distributed Multi-Core and Hybrid CPU-GPU Systems

Ben Cumming, Gilles Fourestey, Oliver Fuhrer, Tobias Gysi, Massimiliano Fatica, Thomas C. Schulthess

Discussion Comments: 0
Verification: Authors have not verified information

Finding Constant from Change: Revisiting Network Performance Aware Optimizations on IaaS Clouds

Yifan Gong, Bingsheng He, Dan Li

Discussion Comments: 0
Verification: Authors have not verified information

FAST: Near Real-Time Searchable Data Analytics for the Cloud

Yu Hua, Hong Jiang, Dan Feng

Discussion Comments: 0
Verification: Authors have not verified information

Two-Choice Randomized Dynamic I/O Scheduler for Object Storage Systems

Dong Dai, Yong Chen, Dries Kimpe, Robert B. Ross

Discussion Comments: 0
Verification: Authors have not verified information