IEEE Intl. Parallel and Distributed Processing Symposium, IPDPS 2017


Title/Authors Title Research Artifacts
[?] A research artifact is any by-product of a research project that is not directly included in the published research paper. In Computer Science research this is often source code and data sets, but it could also be media, documentation, inputs to proof assistants, shell-scripts to run experiments, etc.
Details

Container-Based Cloud Platform for Mobile Computation Offloading

Song Wu, Chao Niu, Jia Rao, Hai Jin, Xiaohai Dai

Container-Based Cloud Platform for Mobile Computation Offloading

Details
Author Comments:
Discussion Comments: 0
Sharing: Research produced artifacts
Verification: Authors have verified information

General Purpose Task-Dependence Management Hardware for Task-Based Dataflow Programming Models

Xubin Tan, Jaume Bosch, Miquel Vidal, Carlos Álvarez, Daniel Jiménez-González, Eduard Ayguadé, Mateo Valero

General Purpose Task-Dependence Management Hardware for Task-Based Dataflow Programming Models

Details
Discussion Comments: 0
Verification: Authors have not verified information

Communication Optimization on GPU: A Case Study of Sequence Alignment Algorithms

Jie Wang, Xinfeng Xie, Jason Cong

Communication Optimization on GPU: A Case Study of Sequence Alignment Algorithms

Details
Discussion Comments: 0
Verification: Authors have not verified information

Large Scale Manycore-Aware PIC Simulation with Efficient Particle Binning

Hiroshi Nakashima, Yoshiki Summura, Keisuke Kikura, Yohei Miyake

Large Scale Manycore-Aware PIC Simulation with Efficient Particle Binning

Details
Discussion Comments: 0
Verification: Authors have not verified information

Autonomic Resource Management for Program Orchestration in Large-Scale Data Analysis

Masahiro Tanaka, Kenjiro Taura, Kentaro Torisawa

Autonomic Resource Management for Program Orchestration in Large-Scale Data Analysis

Details
Discussion Comments: 0
Verification: Authors have not verified information

Aces4: A Platform for Computational Chemistry Calculations with Extremely Large Block-Sparse Arrays

Beverly A. Sanders, Jason N. Byrd, Nakul Jindal, Victor F. Lotrich, Dmitry Lyakh, Ajith Perera, Rodney J. Bartlett

Aces4: A Platform for Computational Chemistry Calculations with Extremely Large Block-Sparse Arrays

Details
Discussion Comments: 0
Verification: Authors have not verified information

Clustering Throughput Optimization on the GPU

Michael G. Gowanlock, Cody M. Rude, David M. Blair, Justin D. Li, Victor Pankratius

Clustering Throughput Optimization on the GPU

Details
Discussion Comments: 0
Verification: Authors have not verified information

Power Efficient Sharing-Aware GPU Data Management

Abdulaziz Tabbakh, Murali Annavaram, Xuehai Qian

Power Efficient Sharing-Aware GPU Data Management

Details
Discussion Comments: 0
Verification: Authors have not verified information

DC2-MTCP: Light-Weight Coding for Efficient Multi-Path Transmission in Data Center Network

Jiyan Sun, Yan Zhang, Xin Wang, Shihan Xiao, Zhen Xu, Hongjing Wu, Xin Chen, Yanni Han

DC2-MTCP: Light-Weight Coding for Efficient Multi-Path Transmission in Data Center Network

Details
Discussion Comments: 0
Verification: Authors have not verified information

Data Centric Performance Measurement Techniques for Chapel Programs

Hui Zhang, Jeffrey K. Hollingsworth

Data Centric Performance Measurement Techniques for Chapel Programs

Details
Discussion Comments: 0
Verification: Authors have not verified information

Scalable Graph Traversal on Sunway TaihuLight with Ten Million Cores

Heng Lin, Xiongchao Tang, Bowen Yu, Youwei Zhuo, Wenguang Chen, Jidong Zhai, Wanwang Yin, Weimin Zheng

Scalable Graph Traversal on Sunway TaihuLight with Ten Million Cores

Details
Discussion Comments: 0
Verification: Authors have not verified information

26 PFLOPS Stencil Computations for Atmospheric Modeling on Sunway TaihuLight

Yulong Ao, Chao Yang, Xinliang Wang, Wei Xue, Haohuan Fu, Fangfang Liu, Lin Gan, Ping Xu, Wenjing Ma

26 PFLOPS Stencil Computations for Atmospheric Modeling on Sunway TaihuLight

Details
Discussion Comments: 0
Verification: Authors have not verified information

Approximation Proofs of a Fast and Efficient List Scheduling Algorithm for Task-Based Runtime Systems on Multicores and GPUs

Olivier Beaumont, Lionel Eyraud-Dubois, Suraj Kumar

Approximation Proofs of a Fast and Efficient List Scheduling Algorithm for Task-Based Runtime Systems on Multicores and GPUs

Details
Discussion Comments: 0
Verification: Authors have not verified information

One-Way Wave Equation Migration at Scale on GPUs Using Directive Based Programming

Kshitij Mehta, Maxime R. Hugues, Oscar R. Hernandez, David E. Bernholdt, Henri Calandra

One-Way Wave Equation Migration at Scale on GPUs Using Directive Based Programming

Details
Discussion Comments: 0
Verification: Authors have not verified information

Design and Implementation of Papyrus: Parallel Aggregate Persistent Storage

Jungwon Kim, Kittisak Sajjapongse, Seyong Lee, Jeffrey S. Vetter

Design and Implementation of Papyrus: Parallel Aggregate Persistent Storage

Details
Discussion Comments: 0
Verification: Authors have not verified information

Optimal Algorithms for a Mesh-Connected Computer with Limited Additional Global Bandwidth

Yujie An, Quentin F. Stout

Optimal Algorithms for a Mesh-Connected Computer with Limited Additional Global Bandwidth

Details
Discussion Comments: 0
Verification: Authors have not verified information

Accelerating Spark Datasets by Inlining Deserialization

Jan Wroblewski, Kazuaki Ishizaki, Hiroshi Inoue, Moriyoshi Ohara

Accelerating Spark Datasets by Inlining Deserialization

Details
Discussion Comments: 0
Verification: Authors have not verified information

Leader Election in a Smartphone Peer-to-Peer Network

Calvin Newport

Leader Election in a Smartphone Peer-to-Peer Network

Details
Discussion Comments: 0
Verification: Author has not verified information

A Scalable System Architecture to Addressing the Next Generation of Predictive Simulation Workflows with Coupled Compute and Data Intensive Applications

Mark Seager

A Scalable System Architecture to Addressing the Next Generation of Predictive Simulation Workflows with Coupled Compute and Data Intensive Applications

Details
Discussion Comments: 0
Verification: Author has not verified information

PaPar: A Parallel Data Partitioning Framework for Big Data Applications

Hao Wang, Jing Zhang, Da Zhang, Sarunya Pumma, Wu-chun Feng

PaPar: A Parallel Data Partitioning Framework for Big Data Applications

Details
Discussion Comments: 0
Verification: Authors have not verified information

MetaKV: A Key-Value Store for Metadata Management of Distributed Burst Buffers

Teng Wang, Adam Moody, Yue Zhu, Kathryn Mohror, Kento Sato, Tanzima Islam, Weikuan Yu

MetaKV: A Key-Value Store for Metadata Management of Distributed Burst Buffers

Details
Discussion Comments: 0
Verification: Authors have not verified information

MRapid: An Efficient Short Job Optimizer on Hadoop

Hong Zhang, Hai Huang, Liqiang Wang

MRapid: An Efficient Short Job Optimizer on Hadoop

Details
Discussion Comments: 0
Verification: Authors have not verified information

PhiOpenSSL: Using the Xeon Phi Coprocessor for Efficient Cryptographic Calculations

Shun Yao, Dantong Yu

PhiOpenSSL: Using the Xeon Phi Coprocessor for Efficient Cryptographic Calculations

Details
Discussion Comments: 0
Verification: Authors have not verified information

Directive-Based Partitioning and Pipelining for Graphics Processing Units

Xuewen Cui, Thomas R. W. Scogland, Bronis R. de Supinski, Wu-chun Feng

Directive-Based Partitioning and Pipelining for Graphics Processing Units

Details
Discussion Comments: 0
Verification: Authors have not verified information

ScalaIOExtrap: Elastic I/O Tracing and Extrapolation

Xiaoqing Luo, Frank Mueller, Philip H. Carns, Jonathan Jenkins, Robert Latham, Robert B. Ross, Shane Snyder

ScalaIOExtrap: Elastic I/O Tracing and Extrapolation

Details
Author Comments:
Discussion Comments: 0
Sharing: Research produced artifacts
Verification: Authors have verified information

Respin: Rethinking Near-Threshold Multiprocessor Design with Non-volatile Memory

Xiang Pan, Anys Bacha, Radu Teodorescu

Respin: Rethinking Near-Threshold Multiprocessor Design with Non-volatile Memory

Details
Discussion Comments: 0
Verification: Authors have not verified information

Fly-Over: A Light-Weight Distributed Power-Gating Mechanism for Energy-Efficient Networks-on-Chip

Rahul Boyapati, Jiayi Huang, Ningyuan Wang, Kyung Hoon Kim, Ki Hwan Yum, Eun Jung Kim

Fly-Over: A Light-Weight Distributed Power-Gating Mechanism for Energy-Efficient Networks-on-Chip

Details
Discussion Comments: 0
Verification: Authors have not verified information

SlimSell: A Vectorizable Graph Representation for Breadth-First Search

Maciej Besta, Florian Marending, Edgar Solomonik, Torsten Hoefler

SlimSell: A Vectorizable Graph Representation for Breadth-First Search

Details
Discussion Comments: 0
Verification: Authors have not verified information

A Robust Parallel Preconditioner for Indefinite Systems Using Hierarchical Matrices and Randomized Sampling

Pieter Ghysels, Xiaoye Sherry Li, Christopher Gorman, François-Henry Rouet

A Robust Parallel Preconditioner for Indefinite Systems Using Hierarchical Matrices and Randomized Sampling

Details
Discussion Comments: 0
Verification: Authors have not verified information

Leader Election in Asymmetric Labeled Unidirectional Rings

Karine Altisen, Ajoy K. Datta, Stéphane Devismes, Anaïs Durand, Lawrence L. Larmore

Leader Election in Asymmetric Labeled Unidirectional Rings

Details
Discussion Comments: 0
Verification: Authors have not verified information

E^2MC: Entropy Encoding Based Memory Compression for GPUs

Sohan Lal, Jan Lucas, Ben H. H. Juurlink

E^2MC: Entropy Encoding Based Memory Compression for GPUs

Details
Discussion Comments: 0
Verification: Authors have not verified information

Adaptive Software Caching for Efficient NVRAM Data Persistence

Pengcheng Li, Dhruva R. Chakrabarti, Chen Ding, Liang Yuan

Adaptive Software Caching for Efficient NVRAM Data Persistence

Details
Discussion Comments: 0
Verification: Authors have not verified information

MOCHA: Morphable Locality and Compression Aware Architecture for Convolutional Neural Networks

Syed Mohammad Asad Hassan Jafri, Ahmed Hemani, Kolin Paul, Naeem Abbas

MOCHA: Morphable Locality and Compression Aware Architecture for Convolutional Neural Networks

Details
Discussion Comments: 0
Verification: Authors have not verified information

Characterizing and Modeling Power and Energy for Extreme-Scale In-Situ Visualization

Vignesh Adhinarayanan, Wu-chun Feng, David H. Rogers, James P. Ahrens, Scott Pakin

Characterizing and Modeling Power and Energy for Extreme-Scale In-Situ Visualization

Details
Discussion Comments: 0
Verification: Authors have not verified information

HOMP: Automated Distribution of Parallel Loops and Data in Highly Parallel Accelerator-Based Systems

Yonghong Yan, Jiawen Liu, Kirk W. Cameron, Mariam Umar

HOMP: Automated Distribution of Parallel Loops and Data in Highly Parallel Accelerator-Based Systems

Details
Discussion Comments: 0
Verification: Authors have not verified information

DR-BW: Identifying Bandwidth Contention in NUMA Architectures with Supervised Learning

Hao Xu, Shasha Wen, Alfredo Giménez, Todd Gamblin, Xu Liu

DR-BW: Identifying Bandwidth Contention in NUMA Architectures with Supervised Learning

Details
Discussion Comments: 0
Verification: Authors have not verified information

Co-Run Scheduling with Power Cap on Integrated CPU-GPU Systems

Qi Zhu, Bo Wu, Xipeng Shen, Li Shen, Zhiying Wang

Co-Run Scheduling with Power Cap on Integrated CPU-GPU Systems

Details
Discussion Comments: 0
Verification: Authors have not verified information

Tight Load Balancing Via Randomized Local Search

Petra Berenbrink, Peter Kling, Christopher Liaw, Abbas Mehrabian

Tight Load Balancing Via Randomized Local Search

Details
Discussion Comments: 0
Verification: Authors have not verified information

NVIDIA Deep Learning Tutorial

Julie Bernauer

NVIDIA Deep Learning Tutorial

Details
Discussion Comments: 0
Verification: Author has not verified information

SimProf: A Sampling Framework for Data Analytic Workloads

Jen-Cheng Huang, Lifeng Nai, Pranith Kumar, Hyojong Kim, Hyesoon Kim

SimProf: A Sampling Framework for Data Analytic Workloads

Details
Discussion Comments: 0
Verification: Authors have not verified information

Accommodating Thread-Level Heterogeneity in Coupled Parallel Applications

Samuel K. Gutierrez, Kei Davis, Dorian C. Arnold, Randal S. Baker, Robert W. Robey, Patrick S. McCormick, Daniel Holladay, Jon A. Dahl, R. Joe Zerr, Florian Weik, Christoph Junghans

Accommodating Thread-Level Heterogeneity in Coupled Parallel Applications

Details
Author Comments:
Discussion Comments: 0
Sharing: Other
Verification: Authors have verified information

Addressing Performance Heterogeneity in MapReduce Clusters with Elastic Tasks

Wei Chen, Jia Rao, Xiaobo Zhou

Addressing Performance Heterogeneity in MapReduce Clusters with Elastic Tasks

Details
Discussion Comments: 0
Verification: Authors have not verified information

Communication-Avoiding Parallel Algorithms for Solving Triangular Systems of Linear Equations

Tobias Wicky, Edgar Solomonik, Torsten Hoefler

Communication-Avoiding Parallel Algorithms for Solving Triangular Systems of Linear Equations

Details
Author Comments:
Discussion Comments: 0
Sharing: Research produced artifacts
Verification: Authors have verified information

High-Performance Virtual Machine Migration Framework for MPI Applications on SR-IOV Enabled InfiniBand Clusters

Jie Zhang, Xiaoyi Lu, Dhabaleswar K. Panda

High-Performance Virtual Machine Migration Framework for MPI Applications on SR-IOV Enabled InfiniBand Clusters

Details
Discussion Comments: 0
Verification: Authors have not verified information

Monitoring Properties of Large, Distributed, Dynamic Graphs

Gal Yehuda, Daniel Keren, Islam Akaria

Monitoring Properties of Large, Distributed, Dynamic Graphs

Details
Discussion Comments: 0
Verification: Authors have not verified information

Fault-Tolerant Robot Gathering Problems on Graphs With Arbitrary Appearing Times

Sergio Rajsbaum, Armando Castañeda, David Flores-Peñaloza, Manuel Alcantara

Fault-Tolerant Robot Gathering Problems on Graphs With Arbitrary Appearing Times

Details
Author Comments:
Discussion Comments: 0
Sharing: Research produced no artifacts
Verification: Authors have verified information

Content-Aware Non-Volatile Cache Replacement

Qi Zeng, Jih-Kwon Peir

Content-Aware Non-Volatile Cache Replacement

Details
Discussion Comments: 0
Verification: Authors have not verified information

Parallelism and Garbage Collection Aware I/O Scheduler with Improved SSD Performance

Jiayang Guo, Yiming Hu, Bo Mao, Suzhen Wu

Parallelism and Garbage Collection Aware I/O Scheduler with Improved SSD Performance

Details
Discussion Comments: 0
Verification: Authors have not verified information

Cooling-Aware Job Scheduling and Node Allocation for Overprovisioned HPC Systems

Thang Cao, Wei Huang, Yuan He, Masaaki Kondo

Cooling-Aware Job Scheduling and Node Allocation for Overprovisioned HPC Systems

Details
Discussion Comments: 0
Verification: Authors have not verified information

O(log N)-Time Complete Visibility for Asynchronous Robots with Lights

Gokarna Sharma, Ramachandran Vaidyanathan, Jerry L. Trahan, Costas Busch, Suresh Rai

O(log N)-Time Complete Visibility for Asynchronous Robots with Lights

Details
Discussion Comments: 0
Verification: Authors have not verified information

Algorithms for Hierarchical and Semi-Partitioned Parallel Scheduling

Vincenzo Bonifaci, Gianlorenzo D'Angelo, Alberto Marchetti-Spaccamela

Algorithms for Hierarchical and Semi-Partitioned Parallel Scheduling

Details
Discussion Comments: 0
Verification: Authors have not verified information

The Reverse Cuthill-McKee Algorithm in Distributed-Memory

Ariful Azad, Mathias Jacquelin, Aydin Buluç, Esmond G. Ng

The Reverse Cuthill-McKee Algorithm in Distributed-Memory

Details
Discussion Comments: 0
Verification: Authors have not verified information

Enhancing Datacenter Resource Management through Temporal Logic Constraints

Hao He, Jiang Hu, Dilma Da Silva

Enhancing Datacenter Resource Management through Temporal Logic Constraints

Details
Discussion Comments: 0
Verification: Authors have not verified information

Bidiagonalization and R-Bidiagonalization: Parallel Tiled Algorithms, Critical Paths and Distributed-Memory Implementation

Mathieu Faverge, Julien Langou, Yves Robert, Jack J. Dongarra

Bidiagonalization and R-Bidiagonalization: Parallel Tiled Algorithms, Critical Paths and Distributed-Memory Implementation

Details
Discussion Comments: 0
Verification: Authors have not verified information

Similarity Search on Automata Processors

Vincent T. Lee, Justin Kotalik, Carlo C. del Mundo, Armin Alaghi, Luis Ceze, Mark Oskin

Similarity Search on Automata Processors

Details
Discussion Comments: 0
Verification: Authors have not verified information

Elastic-Cache: GPU Cache Architecture for Efficient Fine- and Coarse-Grained Cache-Line Management

Bingchao Li, Jizhou Sun, Murali Annavaram, Nam Sung Kim

Elastic-Cache: GPU Cache Architecture for Efficient Fine- and Coarse-Grained Cache-Line Management

Details
Discussion Comments: 0
Verification: Authors have not verified information

Relaxations for High-Performance Message Passing on Massively Parallel SIMT Processors

Benjamin Klenk, Holger Fröning, Hans Eberle, Larry Dennison

Relaxations for High-Performance Message Passing on Massively Parallel SIMT Processors

Details
Discussion Comments: 0
Verification: Authors have not verified information

Dynamic Adaptation in Wireless Networks Under Comprehensive Interference via Carrier Sense

Dongxiao Yu, Yuexuan Wang, Tigran Tonoyan, Magnús M. Halldórsson

Dynamic Adaptation in Wireless Networks Under Comprehensive Interference via Carrier Sense

Details
Discussion Comments: 0
Verification: Authors have not verified information

Optimization and Parallelization of B-Spline Based Orbital Evaluations in QMC on Multi/Many-Core Shared Memory Processors

Amrita Mathuriya, Ye Luo, Anouar Benali, Luke Shulenburger, Jeongnim Kim

Optimization and Parallelization of B-Spline Based Orbital Evaluations in QMC on Multi/Many-Core Shared Memory Processors

Details
Discussion Comments: 0
Verification: Authors have not verified information

A Work-Efficient Parallel Sparse Matrix-Sparse Vector Multiplication Algorithm

Ariful Azad, Aydin Buluç

A Work-Efficient Parallel Sparse Matrix-Sparse Vector Multiplication Algorithm

Details
Discussion Comments: 0
Verification: Authors have not verified information

On Optimizing Distributed Tucker Decomposition for Dense Tensors

Venkatesan T. Chakaravarthy, Jee W. Choi, Douglas J. Joseph, Xing Liu, Prakash Murali, Yogish Sabharwal, Dheeraj Sreedhar

On Optimizing Distributed Tucker Decomposition for Dense Tensors

Details
Author Comments:
Discussion Comments: 0
Sharing: Not able to share produced artifacts
Verification: Authors have verified information

Capability Models for Manycore Memory Systems: A Case-Study with Xeon Phi KNL

Sabela Ramos, Torsten Hoefler

Capability Models for Manycore Memory Systems: A Case-Study with Xeon Phi KNL

Details
Discussion Comments: 0
Verification: Authors have not verified information

Production Hardware Overprovisioning: Real-World Performance Optimization Using an Extensible Power-Aware Resource Management Framework

Ryuichi Sakamoto, Thang Cao, Masaaki Kondo, Koji Inoue, Masatsugu Ueda, Tapasya Patki, Daniel A. Ellsworth, Barry Rountree, Martin Schulz

Production Hardware Overprovisioning: Real-World Performance Optimization Using an Extensible Power-Aware Resource Management Framework

Details
Discussion Comments: 0
Verification: Authors have not verified information

Improving the Integration of Task Nesting and Dependencies in OpenMP

Josep M. Pérez, Vicenç Beltran, Jesús Labarta, Eduard Ayguadé

Improving the Integration of Task Nesting and Dependencies in OpenMP

Details
Discussion Comments: 0
Verification: Authors have not verified information

Elastic Data Compression with Improved Performance and Space Efficiency for Flash-Based Storage Systems

Bo Mao, Hong Jiang, Suzhen Wu, Yaodong Yang, Zaifa Xi

Elastic Data Compression with Improved Performance and Space Efficiency for Flash-Based Storage Systems

Details
Discussion Comments: 0
Verification: Authors have not verified information

FFQ: A Fast Single-Producer/Multiple-Consumer Concurrent FIFO Queue

Sergei Arnautov, Pascal Felber, Christof Fetzer, Bohdan Trach

FFQ: A Fast Single-Producer/Multiple-Consumer Concurrent FIFO Queue

Details
Author Comments:
Discussion Comments: 0
Sharing: Research produced artifacts
Verification: Authors have verified information

Corrected Gossip Algorithms for Fast Reliable Broadcast on Unreliable Systems

Torsten Hoefler, Amnon Barak, Amnon Shiloh, Zvi Drezner

Corrected Gossip Algorithms for Fast Reliable Broadcast on Unreliable Systems

Details
Discussion Comments: 0
Verification: Authors have not verified information

Computational Challenges in Constructing the Tree of Life

Tandy J. Warnow

Computational Challenges in Constructing the Tree of Life

Details
Discussion Comments: 0
Verification: Author has not verified information

Mimir: Memory-Efficient and Scalable MapReduce for Large Supercomputing Systems

Tao Gao, Yanfei Guo, Boyu Zhang, Pietro Cicotti, Yutong Lu, Pavan Balaji, Michela Taufer

Mimir: Memory-Efficient and Scalable MapReduce for Large Supercomputing Systems

Details
Discussion Comments: 0
Verification: Authors have not verified information

The SEPO Model of Computation to Enable Larger-Than-Memory Hash Tables for GPU-Accelerated Big Data Analytics

Reza Mokhtari, Michael Stumm

The SEPO Model of Computation to Enable Larger-Than-Memory Hash Tables for GPU-Accelerated Big Data Analytics

Details
Discussion Comments: 0
Verification: Authors have not verified information

A Parallel FastTrack Data Race Detector on Multi-core Systems

Young Wn Song, Yann-Hang Lee

A Parallel FastTrack Data Race Detector on Multi-core Systems

Details
Discussion Comments: 0
Verification: Authors have not verified information

Transparent Caching for RMA Systems

Salvatore Di Girolamo, Flavio Vella, Torsten Hoefler

Transparent Caching for RMA Systems

Details
Discussion Comments: 0
Verification: Authors have not verified information

Elastic Consistent Hashing for Distributed Storage Systems

Wei Xie, Yong Chen

Elastic Consistent Hashing for Distributed Storage Systems

Details
Discussion Comments: 0
Verification: Authors have not verified information

Memory Compression Techniques for Network Address Management in MPI

Yanfei Guo, Charles J. Archer, Michael Blocksome, Scott Parker, Wesley Bland, Ken Raffenetti, Pavan Balaji

Memory Compression Techniques for Network Address Management in MPI

Details
Discussion Comments: 0
Verification: Authors have not verified information

Bounded Reordering Allows Efficient Reliable Message Transmission

Keishla D. Ortiz-Lopez, Jennifer L. Welch

Bounded Reordering Allows Efficient Reliable Message Transmission

Details
Discussion Comments: 0
Verification: Authors have not verified information

Eliminating Irregularities of Protein Sequence Search on Multicore Architectures

Jing Zhang, Sanchit Misra, Hao Wang, Wu-chun Feng

Eliminating Irregularities of Protein Sequence Search on Multicore Architectures

Details
Discussion Comments: 0
Verification: Authors have not verified information

Generating Families of Practical Fast Matrix Multiplication Algorithms

Jianyu Huang, Leslie Rice, Devin A. Matthews, Robert A. van de Geijn

Generating Families of Practical Fast Matrix Multiplication Algorithms

Details
Discussion Comments: 0
Verification: Authors have not verified information

Accelerating Graph and Machine Learning Workloads Using a Shared Memory Multicore Architecture with Auxiliary Support for In-hardware Explicit Messaging

Halit Dogan, Farrukh Hijaz, Masab Ahmad, Brian Kahne, Peter Wilson, Omer Khan

Accelerating Graph and Machine Learning Workloads Using a Shared Memory Multicore Architecture with Auxiliary Support for In-hardware Explicit Messaging

Details
Discussion Comments: 0
Verification: Authors have not verified information

SWhybrid: A Hybrid-Parallel Framework for Large-Scale Protein Sequence Database Search

Haidong Lan, Weiguo Liu, Yongchao Liu, Bertil Schmidt

SWhybrid: A Hybrid-Parallel Framework for Large-Scale Protein Sequence Database Search

Details
Discussion Comments: 0
Verification: Authors have not verified information

Dynamic Memory-Aware Task-Tree Scheduling

Guillaume Aupy, Clement Brasseur, Loris Marchal

Dynamic Memory-Aware Task-Tree Scheduling

Details
Discussion Comments: 0
Verification: Authors have not verified information

Distributed Vehicle Routing Approximation

Akhil Krishnan, Mikhail Markov, Borzoo Bonakdarpour

Distributed Vehicle Routing Approximation

Details
Discussion Comments: 0
Verification: Authors have not verified information

Community Detection on the GPU

Md. Naim, Fredrik Manne, Mahantesh Halappanavar, Antonino Tumeo

Community Detection on the GPU

Details
Author Comments:
Discussion Comments: 0
Sharing: Research produced artifacts
Verification: Authors have verified information

Apollo: Reusable Models for Fast, Dynamic Tuning of Input-Dependent Code

David Beckingsale, Olga Pearce, Ignacio Laguna, Todd Gamblin

Apollo: Reusable Models for Fast, Dynamic Tuning of Input-Dependent Code

Details
Discussion Comments: 0
Verification: Authors have not verified information

ATM: Approximate Task Memoization in the Runtime System

Iulian Brumar, Marc Casas, Miquel Moretó, Mateo Valero, Gurindar S. Sohi

ATM: Approximate Task Memoization in the Runtime System

Details
Discussion Comments: 0
Verification: Authors have not verified information

Language-Based Optimizations for Persistence on Nonvolatile Main Memory Systems

Joel Edward Denny, Seyong Lee, Jeffrey S. Vetter

Language-Based Optimizations for Persistence on Nonvolatile Main Memory Systems

Details
Discussion Comments: 0
Verification: Authors have not verified information

Sparse Tensor Factorization on Many-Core Processors with High-Bandwidth Memory

Shaden Smith, Jongsoo Park, George Karypis

Sparse Tensor Factorization on Many-Core Processors with High-Bandwidth Memory

Details
Discussion Comments: 0
Verification: Authors have not verified information

Multigrain Parallelism: Bridging Coarse-Grain Parallel Programs and Fine-Grain Event-Driven Multithreading

Jaime Arteaga Molina, Stéphane Zuckerman, Guang R. Gao

Multigrain Parallelism: Bridging Coarse-Grain Parallel Programs and Fine-Grain Event-Driven Multithreading

Details
Author Comments:
Discussion Comments: 0
Sharing: Research produced no artifacts
Verification: Authors have verified information

DEFT-Cache: A Cost-Effective and Highly Reliable SSD Cache for RAID Storage

Jiguang Wan, Wei Wu, Ling Zhan, Qing Yang, Xiaoyang Qu, Changsheng Xie

DEFT-Cache: A Cost-Effective and Highly Reliable SSD Cache for RAID Storage

Details
Discussion Comments: 0
Verification: Authors have not verified information

Significantly Improving Lossy Compression for Scientific Data Sets Based on Multidimensional Prediction and Error-Controlled Quantization

Dingwen Tao, Sheng Di, Zizhong Chen, Franck Cappello

Significantly Improving Lossy Compression for Scientific Data Sets Based on Multidimensional Prediction and Error-Controlled Quantization

Details
Discussion Comments: 0
Verification: Authors have not verified information

RCube: A Power Efficient and Highly Available Network for Data Centers

Zhenhua Li, Yuanyuan Yang

RCube: A Power Efficient and Highly Available Network for Data Centers

Details
Discussion Comments: 0
Verification: Authors have not verified information

Automatic-Signal Monitors with Multi-object Synchronization

Wei-Lun Hung, Vijay K. Garg

Automatic-Signal Monitors with Multi-object Synchronization

Details
Discussion Comments: 0
Verification: Authors have not verified information

Proximity-Aware Balanced Allocations in Cache Networks

Ali Pourmiri, Mahdi Jafari Siavoshani, Seyed Pooya Shariatpanahi

Proximity-Aware Balanced Allocations in Cache Networks

Details
Discussion Comments: 0
Verification: Authors have not verified information

Exploring DataVortex Systems for Irregular Applications

Roberto Gioiosa, Antonino Tumeo, Jian Yin, Thomas Warfel, David J. Haglin, Santiago Betelú

Exploring DataVortex Systems for Irregular Applications

Details
Discussion Comments: 0
Verification: Authors have not verified information

swDNN: A Library for Accelerating Deep Learning Applications on Sunway TaihuLight

Jiarui Fang, Haohuan Fu, Wenlai Zhao, Bingwei Chen, Weijie Zheng, Guangwen Yang

swDNN: A Library for Accelerating Deep Learning Applications on Sunway TaihuLight

Details
Discussion Comments: 0
Verification: Authors have not verified information

Parallel Construction of Suffix Trees and the All-Nearest-Smaller-Values Problem

Patrick Flick, Srinivas Aluru

Parallel Construction of Suffix Trees and the All-Nearest-Smaller-Values Problem

Details
Discussion Comments: 0
Verification: Authors have not verified information

Application Level Reordering of Remote Direct Memory Access Operations

Wim Lavrijsen, Costin Iancu

Application Level Reordering of Remote Direct Memory Access Operations

Details
Discussion Comments: 0
Verification: Authors have not verified information

PUNAS: A Parallel Ungapped-Alignment-Featured Seed Verification Algorithm for Next-Generation Sequencing Read Alignment

Yuandong Chan, Kai Xu, Haidong Lan, Weiguo Liu, Yongchao Liu, Bertil Schmidt

PUNAS: A Parallel Ungapped-Alignment-Featured Seed Verification Algorithm for Next-Generation Sequencing Read Alignment

Details
Discussion Comments: 0
Verification: Authors have not verified information

An Adaptive Core-Specific Runtime for Energy Efficiency

Sridutt Bhalachandra, Allan Porterfield, Stephen L. Olivier, Jan F. Prins

An Adaptive Core-Specific Runtime for Energy Efficiency

Details
Discussion Comments: 0
Verification: Authors have not verified information

Model-Driven Sparse CP Decomposition for Higher-Order Tensors

Jiajia Li, Jee Choi, Ioakeim Perros, Jimeng Sun, Richard W. Vuduc

Model-Driven Sparse CP Decomposition for Higher-Order Tensors

Details
Discussion Comments: 0
Verification: Authors have not verified information

Partitioning Low-Diameter Networks to Eliminate Inter-Job Interference

Nikhil Jain, Abhinav Bhatele, Xiang Ni, Todd Gamblin, Laxmikant V. Kalé

Partitioning Low-Diameter Networks to Eliminate Inter-Job Interference

Details
Discussion Comments: 0
Verification: Authors have not verified information

Automatic Collapsing of Non-Rectangular Loops

Philippe Clauss, Ervin Altintas, Matthieu Kuhn

Automatic Collapsing of Non-Rectangular Loops

Details
Discussion Comments: 0
Verification: Authors have not verified information

Rational Fair Consensus in the Gossip Model

Andrea E. F. Clementi, Luciano Gualà, Guido Proietti, Giacomo Scornavacca

Rational Fair Consensus in the Gossip Model

Details
Discussion Comments: 0
Verification: Authors have not verified information

Generating Performance Models for Irregular Applications

Ryan D. Friese, Nathan R. Tallent, Abhinav Vishnu, Darren J. Kerbyson, Adolfy Hoisie

Generating Performance Models for Irregular Applications

Details
Discussion Comments: 0
Verification: Authors have not verified information

When Neurons Fail

El Mahdi El Mhamdi, Rachid Guerraoui

When Neurons Fail

Details
Discussion Comments: 0
Verification: Authors have not verified information

A Scalable and Resilient Microarchitecture Based on Multiport Binding for High-Radix Router Design

Yi Dai, Kefei Wang, Gang Qu, Liquan Xiao, Dezun Dong, Xingyun Qi

A Scalable and Resilient Microarchitecture Based on Multiport Binding for High-Radix Router Design

Details
Discussion Comments: 0
Verification: Authors have not verified information

Runtime Aware Architectures

Mateo Valero

Runtime Aware Architectures

Details
Discussion Comments: 0
Verification: Author has not verified information

Toucan - A Translator for Communication Tolerant MPI Applications

Sergio M. Martin, Marsha J. Berger, Scott B. Baden

Toucan - A Translator for Communication Tolerant MPI Applications

Details
Discussion Comments: 0
Verification: Authors have not verified information

Reducing Pagerank Communication via Propagation Blocking

Scott Beamer, Krste Asanovic, David A. Patterson

Reducing Pagerank Communication via Propagation Blocking

Details
Discussion Comments: 0
Verification: Authors have not verified information

Towards Highly scalable Ab Initio Molecular Dynamics (AIMD) Simulations on the Intel Knights Landing Manycore Processor

Mathias Jacquelin, Wibe A. de Jong, Eric J. Bylaska

Towards Highly scalable Ab Initio Molecular Dynamics (AIMD) Simulations on the Intel Knights Landing Manycore Processor

Details
Discussion Comments: 0
Verification: Authors have not verified information

Scalable Lock-Free Vector with Combining

Ivan Walulya, Philippas Tsigas

Scalable Lock-Free Vector with Combining

Details
Discussion Comments: 0
Verification: Authors have not verified information

Multi-GPU Graph Analytics

Yuechao Pan, Yangzihao Wang, Yuduo Wu, Carl Yang, John D. Owens

Multi-GPU Graph Analytics

Details
Author Comments: The Gunrock code base can be found at https://github.com/gunrock/gunrock (with documentation at https://gunrock.github.io/).
Discussion Comments: 0
Sharing: Research produced artifacts
Verification: Authors have verified information

Argo NodeOS: Toward Unified Resource Management for Exascale

Swann Perarnau, Judicael A. Zounmevo, Matthieu Dreher, Brian C. Van Essen, Roberto Gioiosa, Kamil Iskra, Maya B. Gokhale, Kazutomo Yoshii, Peter H. Beckman

Argo NodeOS: Toward Unified Resource Management for Exascale

Details
Discussion Comments: 0
Verification: Authors have not verified information

Localized Fault Recovery for Nested Fork-Join Programs

Gokcen Kestor, Sriram Krishnamoorthy, Wenjing Ma

Localized Fault Recovery for Nested Fork-Join Programs

Details
Discussion Comments: 0
Verification: Authors have not verified information

Fault-Tolerant Online Packet Scheduling on Parallel Channels

Pawel Garncarek, Tomasz Jurdzinski, Krzysztof Lorys

Fault-Tolerant Online Packet Scheduling on Parallel Channels

Details
Discussion Comments: 0
Verification: Authors have not verified information

Image-Domain Gridding on Graphics Processors

Bram Veenboer, Matthias Petschow, John W. Romein

Image-Domain Gridding on Graphics Processors

Details
Author Comments:
Discussion Comments: 0
Sharing: Research produced no artifacts
Verification: Authors have verified information

Partitioning Trillion-Edge Graphs in Minutes

George M. Slota, Sivasankaran Rajamanickam, Karen D. Devine, Kamesh Madduri

Partitioning Trillion-Edge Graphs in Minutes

Details
Discussion Comments: 0
Verification: Authors have not verified information

An N log N Parallel Fast Direct Solver for Kernel Matrices

Chenhan D. Yu, William B. March, George Biros

An N log N Parallel Fast Direct Solver for Kernel Matrices

Details
Discussion Comments: 0
Verification: Authors have not verified information

FlexVC: Flexible Virtual Channel Management in Low-Diameter Networks

Pablo Fuentes, Enrique Vallejo, Ramón Beivide, Cyriel Minkenberg, Mateo Valero

FlexVC: Flexible Virtual Channel Management in Low-Diameter Networks

Details
Discussion Comments: 0
Verification: Authors have not verified information

Efficient and Deterministic Scheduling for Parallel State Machine Replication

Odorico Machado Mendizabal, Ruda S. T. De Moura, Fernando Luís Dotti, Fernando Pedone

Efficient and Deterministic Scheduling for Parallel State Machine Replication

Details
Discussion Comments: 0
Verification: Authors have not verified information

Autotuning Stencil Computations with Structural Ordinal Regression Learning

Biagio Cosenza, Juan J. Durillo, Stefano Ermon, Ben H. H. Juurlink

Autotuning Stencil Computations with Structural Ordinal Regression Learning

Details
Discussion Comments: 0
Verification: Authors have not verified information