IEEE International Conference on Parallel Processing, ICPP 2017


Title/Authors Title Research Artifacts
[?] A research artifact is any by-product of a research project that is not directly included in the published research paper. In Computer Science research this is often source code and data sets, but it could also be media, documentation, inputs to proof assistants, shell-scripts to run experiments, etc.
Details

Multiple Pattern Matching for Network Security Applications: Acceleration through Vectorization

Charalampos Stylianopoulos, Magnus Almgren, Olaf Landsiedel, Marina Papatriantafilou

Multiple Pattern Matching for Network Security Applications: Acceleration through Vectorization

Details
Discussion Comments: 0
Verification: Authors have not verified information

Parallel Algorithm for Single-Source Earliest-Arrival Problem in Temporal Graphs

Peng Ni, Masatoshi Hanai, Wen Jun Tan, Chen Wang, Wentong Cai

Parallel Algorithm for Single-Source Earliest-Arrival Problem in Temporal Graphs

Details
Discussion Comments: 0
Verification: Authors have not verified information

Boosting the Efficiency of HPCG and Graph500 with Near-Data Processing

Erik Vermij, Leandro Fiorin, Christoph Hagleitner, Koen Bertels

Boosting the Efficiency of HPCG and Graph500 with Near-Data Processing

Details
Discussion Comments: 0
Verification: Authors have not verified information

ES2: Aiming at an Optimal Virtual I/O Event Path

Xiaokang Hu, Wang Zhang, Jian Li, Ruhui Ma, Feng Wu, Haibing Guan

ES2: Aiming at an Optimal Virtual I/O Event Path

Details
Discussion Comments: 0
Verification: Authors have not verified information

The Cloud as an OpenMP Offloading Device

Hervé Yviquel, Guido Araujo

The Cloud as an OpenMP Offloading Device

Details
Author Comments:
Discussion Comments: 0
Sharing: Research produced artifacts
Verification: Authors have verified information

A Parallel TSP-Based Algorithm for Balanced Graph Partitioning

Harshvardhan Das, Subodh Kumar

A Parallel TSP-Based Algorithm for Balanced Graph Partitioning

Details
Discussion Comments: 0
Verification: Authors have not verified information

Network Aware Multi-User Computation Partitioning in Mobile Edge Clouds

Lei Yang, Jiannong Cao, Zhenyu Wang, Weigang Wu

Network Aware Multi-User Computation Partitioning in Mobile Edge Clouds

Details
Discussion Comments: 0
Verification: Authors have not verified information

Performance Analysis and Optimization of Sparse Matrix-Vector Multiplication on Modern Multi- and Many-Core Processors

Athena Elafrou, Georgios I. Goumas, Nectarios Koziris

Performance Analysis and Optimization of Sparse Matrix-Vector Multiplication on Modern Multi- and Many-Core Processors

Details
Discussion Comments: 0
Verification: Authors have not verified information

An Efficient, Distributed Stochastic Gradient Descent Algorithm for Deep-Learning Applications

Guojing Cong, Onkar Bhardwaj, Minwei Feng

An Efficient, Distributed Stochastic Gradient Descent Algorithm for Deep-Learning Applications

Details
Discussion Comments: 0
Verification: Authors have not verified information

Parallel Space-Time Kernel Density Estimation

Erik Saule, Dinesh Panchananam, Alexander Hohl, Wenwu Tang, Eric Delmelle

Parallel Space-Time Kernel Density Estimation

Details
Discussion Comments: 0
Verification: Authors have not verified information

Overlapping Data Transfers with Computation on GPU with Tiles

Burak Bastem, Didem Unat, Weiqun Zhang, Ann S. Almgren, John Shalf

Overlapping Data Transfers with Computation on GPU with Tiles

Details
Discussion Comments: 0
Verification: Authors have not verified information

Simple and Fast Parallel Algorithms for the Voronoi Map and the Euclidean Distance Map, with GPU Implementations

Takumi Honda, Shinnosuke Yamamoto, Hiroaki Honda, Koji Nakano, Yasuaki Ito

Simple and Fast Parallel Algorithms for the Voronoi Map and the Euclidean Distance Map, with GPU Implementations

Details
Discussion Comments: 0
Verification: Authors have not verified information

Parallel Algorithms for the Computation of Cycles in Relative Neighborhood Graphs

Hari Sundar, Parmeshwar Khurd

Parallel Algorithms for the Computation of Cycles in Relative Neighborhood Graphs

Details
Discussion Comments: 0
Verification: Authors have not verified information

Towards Highly Efficient DGEMM on the Emerging SW26010 Many-Core Processor

Lijuan Jiang, Chao Yang, Yulong Ao, Wanwang Yin, Wenjing Ma, Qiao Sun, Fangfang Liu, Rongfen Lin, Peng Zhang

Towards Highly Efficient DGEMM on the Emerging SW26010 Many-Core Processor

Details
Discussion Comments: 0
Verification: Authors have not verified information

Non-Sequential Striping for Distributed Storage Systems with Different Redundancy Schemes

Yanwen Xie, Dan Feng, Fang Wang

Non-Sequential Striping for Distributed Storage Systems with Different Redundancy Schemes

Details
Author Comments:
Discussion Comments: 0
Sharing: Research produced no artifacts
Verification: Authors have verified information

Constrained Tensor Factorization with Accelerated AO-ADMM

Shaden Smith, Alec Beri, George Karypis

Constrained Tensor Factorization with Accelerated AO-ADMM

Details
Discussion Comments: 0
Verification: Authors have not verified information

E-Storm: Replication-Based State Management in Distributed Stream Processing Systems

Xunyun Liu, Aaron Harwood, Shanika Karunasekera, Benjamin I. P. Rubinstein, Rajkumar Buyya

E-Storm: Replication-Based State Management in Distributed Stream Processing Systems

Details
Discussion Comments: 0
Verification: Authors have not verified information

A Coflow-Based Co-Optimization Framework for High-Performance Data Analytics

Long Cheng, Ying Wang, Yulong Pei, Dick H. J. Epema

A Coflow-Based Co-Optimization Framework for High-Performance Data Analytics

Details
Discussion Comments: 0
Verification: Authors have not verified information

Large-Scale Parallelization of Smoothed Particle Hydrodynamics Method on Heterogeneous Cluster

Yingrui Wang, Leisheng Li, Rong Tian

Large-Scale Parallelization of Smoothed Particle Hydrodynamics Method on Heterogeneous Cluster

Details
Discussion Comments: 0
Verification: Authors have not verified information

Optimizations of Two Compute-Bound Scientific Kernels on the SW26010 Many-Core Processor

James Lin, Zhigeng Xu, Akira Nukada, Naoya Maruyama, Satoshi Matsuoka

Optimizations of Two Compute-Bound Scientific Kernels on the SW26010 Many-Core Processor

Details
Discussion Comments: 0
Verification: Authors have not verified information

Preparing HPC Applications for the Exascale Era: A Decoupling Strategy

Ivy Bo Peng, Roberto Gioiosa, Gokcen Kestor, Erwin Laure, Stefano Markidis

Preparing HPC Applications for the Exascale Era: A Decoupling Strategy

Details
Discussion Comments: 0
Verification: Authors have not verified information

High-Performance Recommender System Training Using Co-Clustering on CPU/GPU Clusters

Kubilay Atasu, Thomas P. Parnell, Celestine Dünner, Michail Vlachos, Haralampos Pozidis

High-Performance Recommender System Training Using Co-Clustering on CPU/GPU Clusters

Details
Discussion Comments: 0
Verification: Authors have not verified information

HyPPI NoC: Bringing Hybrid Plasmonics to an Opto-Electronic Network-on-Chip

Vikram K. Narayana, Shuai Sun, Armin Mehrabian, Volker J. Sorger, Tarek A. El-Ghazawi

HyPPI NoC: Bringing Hybrid Plasmonics to an Opto-Electronic Network-on-Chip

Details
Discussion Comments: 0
Verification: Authors have not verified information

Autotuning GPU Kernels via Static and Predictive Analysis

Robert V. Lim, Boyana Norris, Allen D. Malony

Autotuning GPU Kernels via Static and Predictive Analysis

Details
Discussion Comments: 0
Verification: Authors have not verified information

Bitslice Vectors: A Software Approach to Customizable Data Precision on Processors with SIMD Extensions

Shixiong Xu, David Gregg

Bitslice Vectors: A Software Approach to Customizable Data Precision on Processors with SIMD Extensions

Details
Discussion Comments: 0
Verification: Authors have not verified information

Predicting Response Latency Percentiles for Cloud Object Storage Systems

Yi Su, Dan Feng, Yu Hua, Zhan Shi

Predicting Response Latency Percentiles for Cloud Object Storage Systems

Details
Discussion Comments: 0
Verification: Authors have not verified information

WA-Dataspaces: Exploring the Data Staging Abstractions for Wide-Area Distributed Scientific Workflows

Mehmet Fatih Aktas, Javier Diaz Montes, Ivan Rodero, Manish Parashar

WA-Dataspaces: Exploring the Data Staging Abstractions for Wide-Area Distributed Scientific Workflows

Details
Discussion Comments: 0
Verification: Authors have not verified information

Application-Aware Power Coordination on Power Bounded NUMA Multicore Systems

Rong Ge, Pengfei Zou, Xizhou Feng

Application-Aware Power Coordination on Power Bounded NUMA Multicore Systems

Details
Discussion Comments: 0
Verification: Authors have not verified information

Practical Experience with Transactional Lock Elision

Tingzhe Zhou, Pantea Zardoshti, Michael F. Spear

Practical Experience with Transactional Lock Elision

Details
Discussion Comments: 0
Verification: Authors have not verified information

Favorable Block First: A Comprehensive Cache Scheme to Accelerate Partial Stripe Recovery of Triple Disk Failure Tolerant Arrays

Luyu Li, Houxiang Ji, Chentao Wu, Jie Li, Minyi Guo

Favorable Block First: A Comprehensive Cache Scheme to Accelerate Partial Stripe Recovery of Triple Disk Failure Tolerant Arrays

Details
Discussion Comments: 0
Verification: Authors have not verified information

Variable-Size Batched LU for Small Matrices and Its Integration into Block-Jacobi Preconditioning

Hartwig Anzt, Jack J. Dongarra, Goran Flegar, Enrique S. Quintana-Ortí

Variable-Size Batched LU for Small Matrices and Its Integration into Block-Jacobi Preconditioning

Details
Discussion Comments: 0
Verification: Authors have not verified information

Parallel Construction of Simultaneous Deterministic Finite Automata on Shared-Memory Multicores

Minyoung Jung, Jinwoo Park, Johann Blieberger, Bernd Burgstaller

Parallel Construction of Simultaneous Deterministic Finite Automata on Shared-Memory Multicores

Details
Discussion Comments: 0
Verification: Authors have not verified information

Efficient and Scalable Multi-Source Streaming Broadcast on GPU Clusters for Deep Learning

Ching-Hsiang Chu, Xiaoyi Lu, Ammar Ahmad Awan, Hari Subramoni, Jahanzeb Maqbool Hashmi, Bracy Elton, Dhabaleswar K. Panda

Efficient and Scalable Multi-Source Streaming Broadcast on GPU Clusters for Deep Learning

Details
Author Comments:
Discussion Comments: 0
Sharing: Research produced artifacts
Verification: Authors have verified information

Data Caching in Next Generation Mobile Cloud Services, Online vs. Off-Line

Yang Wang, Shuibing He, Xiaopeng Fan, Chengzhong Xu, Joseph Culberson, Joseph Horton

Data Caching in Next Generation Mobile Cloud Services, Online vs. Off-Line

Details
Discussion Comments: 0
Verification: Authors have not verified information

High-Performance and Memory-Saving Sparse General Matrix-Matrix Multiplication for NVIDIA Pascal GPU

Yusuke Nagasaka, Akira Nukada, Satoshi Matsuoka

High-Performance and Memory-Saving Sparse General Matrix-Matrix Multiplication for NVIDIA Pascal GPU

Details
Discussion Comments: 0
Verification: Authors have not verified information

Nearly Balanced Work Partitioning for Heterogeneous Algorithms

Mallipeddi Hardhik, Dip Sankar Banerjee, Kiran Raj Ramamoorthy, Kishore Kothapalli, Kannan Srinathan

Nearly Balanced Work Partitioning for Heterogeneous Algorithms

Details
Discussion Comments: 0
Verification: Authors have not verified information

Scalable Write Allocation in the WAFL File System

Matthew Curtis-Maury, Ram Kesavan, Mrinal K. Bhattacharjee

Scalable Write Allocation in the WAFL File System

Details
Discussion Comments: 0
Verification: Authors have not verified information

MPI-GDS: High Performance MPI Designs with GPUDirect-aSync for CPU-GPU Control Flow Decoupling

Akshay Venkatesh, Khaled Hamidouche, Sreeram Potluri, Davide Rossetti, Ching-Hsiang Chu, Dhabaleswar K. Panda

MPI-GDS: High Performance MPI Designs with GPUDirect-aSync for CPU-GPU Control Flow Decoupling

Details
Discussion Comments: 0
Verification: Authors have not verified information

A Novel Minimum Time Parallel 2-D Discrete Wavelet Transform Algorithm for General Purpose Processors

Eduardo Moscoso Rubino, Alberto Jose Alvares, Raul Marin Prades, Pedro Sanz Valero

A Novel Minimum Time Parallel 2-D Discrete Wavelet Transform Algorithm for General Purpose Processors

Details
Discussion Comments: 0
Verification: Authors have not verified information

Fading-Resistant Link Scheduling in Wireless Networks

Chenxi Qiu, Haiying Shen

Fading-Resistant Link Scheduling in Wireless Networks

Details
Discussion Comments: 0
Verification: Authors have not verified information

CELIA: Cost-Time Performance of Elastic Applications on Cloud

Sunimal Rathnayake, Dumitrel Loghin, Yong Meng Teo

CELIA: Cost-Time Performance of Elastic Applications on Cloud

Details
Discussion Comments: 0
Verification: Authors have not verified information

GLTO: On the Adequacy of Lightweight Thread Approaches for OpenMP Implementations

Adrián Castelló, Sangmin Seo, Rafael Mayo, Pavan Balaji, Enrique S. Quintana-Ortí, Antonio J. Peña

GLTO: On the Adequacy of Lightweight Thread Approaches for OpenMP Implementations

Details
Discussion Comments: 0
Verification: Authors have not verified information

Greed Is Good: Parallel Algorithms for Bipartite-Graph Partial Coloring on Multicore Architectures

Mustafa Kemal Tas, Kamer Kaya, Erik Saule

Greed Is Good: Parallel Algorithms for Bipartite-Graph Partial Coloring on Multicore Architectures

Details
Discussion Comments: 0
Verification: Authors have not verified information

A Machine Learning Approach for Efficient Parallel Simulation of Beam Dynamics on GPUs

Kamesh Arumugam, Desh Ranjan, Mohammad Zubair, Balsa Terzic, Alexander Godunov, Tunazzina Islam

A Machine Learning Approach for Efficient Parallel Simulation of Beam Dynamics on GPUs

Details
Discussion Comments: 0
Verification: Authors have not verified information

PDS: An I/O-Efficient Scaling Scheme for Parity Declustered Data Layout

Zhipeng Li, Yinlong Xu, Yongkun Li, Chengjin Tian, Youhui Bai

PDS: An I/O-Efficient Scaling Scheme for Parity Declustered Data Layout

Details
Discussion Comments: 0
Verification: Authors have not verified information

Scheduling Independent Tasks in Parallel under Power Constraints

Ayham Kassab, Jean-Marc Nicod, Laurent Philippe, Veronika Rehn-Sonigo

Scheduling Independent Tasks in Parallel under Power Constraints

Details
Discussion Comments: 0
Verification: Authors have not verified information

Order/Radix Problem: Towards Low End-to-End Latency Interconnection Networks

Ryota Yasudo, Michihiro Koibuchi, Koji Nakano, Hiroki Matsutani, Hideharu Amano

Order/Radix Problem: Towards Low End-to-End Latency Interconnection Networks

Details
Discussion Comments: 0
Verification: Authors have not verified information

Efficient Data Sharing on Heterogeneous Systems

Victor Garcia-Flores, Eduard Ayguadé, Antonio J. Peña

Efficient Data Sharing on Heterogeneous Systems

Details
Discussion Comments: 0
Verification: Authors have not verified information

Exploiting GPUs for Fast Force-Directed Visualization of Large-Scale Networks

Govert G. Brinkmann, Kristian F. D. Rietveld, Frank W. Takes

Exploiting GPUs for Fast Force-Directed Visualization of Large-Scale Networks

Details
Discussion Comments: 0
Verification: Authors have not verified information

A Scalable Hierarchical Semi-Separable Library for Heterogeneous Clusters

Isuru Dilanka Fernando, Sanath Jayasena, Milinda Fernando, Hari Sundar

A Scalable Hierarchical Semi-Separable Library for Heterogeneous Clusters

Details
Discussion Comments: 0
Verification: Authors have not verified information

Resilience for Stencil Computations with Latent Errors

Aiman Fang, Aurélien Cavelan, Yves Robert, Andrew A. Chien

Resilience for Stencil Computations with Latent Errors

Details
Discussion Comments: 0
Verification: Authors have not verified information

A Dynamic Resource Controller for a Lambda Architecture

MohammadReza HoseinyFarahabady, Javid Taheri, Zahir Tari, Albert Y. Zomaya

A Dynamic Resource Controller for a Lambda Architecture

Details
Discussion Comments: 0
Verification: Authors have not verified information

A Pareto Framework for Data Analytics on Heterogeneous Systems: Implications for Green Energy Usage and Performance

Aniket Chakrabarti, Srinivasan Parthasarathy, Christopher Stewart

A Pareto Framework for Data Analytics on Heterogeneous Systems: Implications for Green Energy Usage and Performance

Details
Discussion Comments: 0
Verification: Authors have not verified information

OptiMatch: Enabling an Optimal Match between Green Power and Various Workloads for Renewable-Energy Powered Storage Systems

Xiaoyang Qu, Jiguang Wan, Fengguang Song, Xiaozhao Zhuang, Fei Wu, Changsheng Xie

OptiMatch: Enabling an Optimal Match between Green Power and Various Workloads for Renewable-Energy Powered Storage Systems

Details
Discussion Comments: 0
Verification: Authors have not verified information

GCN: GPU-Based Cube CNN Framework for Hyperspectral Image Classification

Han Dong, Tao Li, Jiabing Leng, Lingyan Kong, Gang Bai

GCN: GPU-Based Cube CNN Framework for Hyperspectral Image Classification

Details
Discussion Comments: 0
Verification: Authors have not verified information

Parallel Reconstruction of Three Dimensional Magnetohydrodynamic Equilibria in Plasma Confinement Devices

Sudip K. Seal, Mark R. Cianciosa, Steven P. Hirshman, Andreas Wingen, Robert S. Wilcox, Ezekial A. Unterberg

Parallel Reconstruction of Three Dimensional Magnetohydrodynamic Equilibria in Plasma Confinement Devices

Details
Discussion Comments: 0
Verification: Authors have not verified information

Locality-Aware Dynamic Task Graph Scheduling

Jordyn Maglalang, Sriram Krishnamoorthy, Kunal Agrawal

Locality-Aware Dynamic Task Graph Scheduling

Details
Discussion Comments: 0
Verification: Authors have not verified information

High Performance Query Processing for Web Scale RDF Data using BSP Style Communication and Balanced Distribution

Minho Bae, Junho Eum, Donghoon Kim, Sangyoon Oh

High Performance Query Processing for Web Scale RDF Data using BSP Style Communication and Balanced Distribution

Details
Discussion Comments: 0
Verification: Authors have not verified information

Runtime Data Layout Scheduling for Machine Learning Dataset

Yang You, James Demmel

Runtime Data Layout Scheduling for Machine Learning Dataset

Details
Discussion Comments: 0
Verification: Authors have not verified information

Accelerating Graph Analytics by Utilising the Memory Locality of Graph Partitioning

Jiawen Sun, Hans Vandierendonck, Dimitrios S. Nikolopoulos

Accelerating Graph Analytics by Utilising the Memory Locality of Graph Partitioning

Details
Discussion Comments: 0
Verification: Authors have not verified information