


IPDPS 2014: Phoenix, AZ, USA - Workshops
- 2014 IEEE International Parallel & Distributed Processing Symposium Workshops, Phoenix, AZ, USA, May 19-23, 2014. IEEE Computer Society 2014, ISBN 978-0-7695-5208-8

Workshop 1: HCW - Heterogeneity in Computing Workshop
- Behrooz A. Shirazi, Uwe Schwiegelshohn: HCW Introduction. 1-2
- Behrooz A. Shirazi: Message from the HCW Steering Committee Chair. 3
- Uwe Schwiegelshohn: Message from the HCW General Chair. 4
- Shoukat Ali: Message from the HCW Program Chair. 5
- David Abramson: HCW 2014 Keynote Talk. 6
HCW Session 1: Heterogeneous Environments for Basic Linear Algebra
- Dimitar Lukarski, Hartwig Anzt, Stanimire Tomov, Jack J. Dongarra: Hybrid Multi-elimination ILU Preconditioners on GPUs. 7-16
- Ashley M. DeFlumere, Alexey L. Lastovetsky: Searching for the Optimal Data Partitioning Shape for Parallel Matrix Matrix Multiplication on 3 Heterogeneous Processors. 17-28
- Xavier Lacoste, Mathieu Faverge, George Bosilca, Pierre Ramet, Samuel Thibault: Taking Advantage of Hybrid Systems for Sparse Direct Solvers via Task-Based Runtimes. 29-38
- Tania Malik, Vladimir Rychkov, Alexey L. Lastovetsky, Jean-Noël Quintin: Topology-Aware Optimization of Communications for Parallel Matrix Multiplication on Hierarchical Heterogeneous HPC Platform. 39-47
HCW Session 2: Scheduling and Resource Allocation
- Linchuan Chen, Xin Huo, Gagan Agrawal: Scheduling Methods for Accelerating Applications on Architectures with Heterogeneous Cores. 48-57
- Bhavesh Khemka, Ryan D. Friese, Sudeep Pasricha, Anthony A. Maciejewski, Howard Jay Siegel, Gregory A. Koenig, Sarah Powers, Marcia Hilton, Rajendra Rambharos, Steve Poole: Utility Driven Dynamic Resource Management in an Oversubscribed Energy-Constrained Heterogeneous System. 58-67
- Adel Essafi, Denis Trystram, Zied Zaidi: An Efficient Algorithm for Scheduling Jobs in Volunteer Computing Platforms. 68-76
HCW Session 3: Resource-Related Performance Optimization
- Jens Gustedt, Stéphane Vialle, Patrick P. Mercier: Resource Centered Computing Delivering High Parallel Performance. 77-88
- Lionel Eyraud-Dubois, Przemyslaw Uznanski: Point-to-Point and Congestion Bandwidth Estimation: Experimental Evaluation on PlanetLab Data. 89-96
- Ayman Tarakji, Niels Ole Salscheider: Runtime Behavior Comparison of Modern Accelerators and Coprocessors. 97-108
Workshop 2: RAW - Reconfigurable Architectures Workshop
- Jürgen Becker, Ramachandran Vaidyanathan, Marco D. Santambrogio, Jim Tørresen, Ron Sass, Philip Heng Wai Leong: RAW Introduction and Committees. 109-110
- Joshua D. Walstrom, Maya B. Gokhale: RAW 2014 Keynotes. 111
RAW Session 1: Compilers and Binary Translation for Reconfigurable Architectures
- Doug Gallatin, Aaron W. Keen, Chris Lupo, John Y. Oliver: Twill: A Hybrid Microcontroller-FPGA Framework for Parallelizing Single-Threaded C Programs. 112-121
- Ali Mustafa Zaidi, David J. Greaves: A New Dataflow Compiler IR for Accelerating Control-Intensive Code in Spatial Hardware. 122-131
- Toan X. Mai, Jongeun Lee: Efficient Software-Based Runtime Binary Translation for Coarse-Grained Reconfigurable Architectures. 132-140
RAW Session 2: New Reconfigurable Architectures
- Georgios Smaragdos, Danish Anis Khan, Ioannis Sourdis, Christos Strydis, Alirad Malek, Stavros Tzilis: A Dependable Coarse-Grain Reconfigurable Multicore Array. 141-150
- Cuong Pham-Quoc, Zaid Al-Ars, Koen Bertels: Automated Hybrid Interconnect Design for FPGA Accelerators Using Data Communication Profiling. 151-160
- Anil Kumar Sistla, Xiaozhong Luo, Mukund Malladi, Marc Reisner, Rajasekhar Ganduri, Gayatri Mehta: SmartBricks: A Visual Environment to Design and Explore Novel Custom Domain-Specific Architectures. 161-169
RAW Session 3: ViPES Papers
- Harry Sidiropoulos, Kostas Siozios, Dimitrios Soudris: A Framework for Mapping Dynamic Virtual Kernels onto Heterogeneous Reconfigurable Platforms. 170-175
- Andreas Emeretlis, George Theodoridis, Panayiotis Alefragis, Nikolaos S. Voros: A Hybrid ILP-CP Model for Mapping Directed Acyclic Task Graphs to Multicore Architectures. 176-182
- Kostas Siozios, Dimitrios Soudris, Michael Hübner: A Framework for Customizing Virtual 3-D Reconfigurable Platforms at Run-Time. 183-188
RAW Session 4: Circuit-Level Applications
- Rui Policarpo Duarte, Christos-Savvas Bouganis: Over-clocking of Linear Projection Designs through Device Specific Optimisations. 189-198
- Michael Raitza, Markus Vogt, Christian Hochberger, Thilo Pionteck: Influence of Magnetic Fields and X-Radiation on Ring Oscillators in FPGAs. 199-204
- Takumi Fujimori, Minoru Watanabe: Radiation Tolerance of Color Configuration on an Optically Reconfigurable Gate Array. 205-210
RAW Session 5: Numerical Reconfigurable Computing Applications
- Esti Stein, Yosi Ben-Asher: Adaptive Booth Algorithm for Three-Integers Multiplication for Reconfigurable Mesh. 211-219
- Xinying Wang, Joseph Zambreno: An FPGA Implementation of the Hestenes-Jacobi Algorithm for Singular Value Decomposition. 220-227
RAW Session 6: Applications of Reconfigurable Computing
- Osama G. Attia, Tyler Johnson, Kevin Townsend, Phillip H. Jones, Joseph Zambreno: CyGraph: A Reconfigurable Architecture for Parallel Breadth-First Search. 228-235
- Gianluca Durelli, Fabrizio Spada, Riccardo Cattaneo, Christian Pilato, Danilo Pau, Marco D. Santambrogio: Adaptive Raytracing Implementation Using Partial Dynamic Reconfiguration. 236-242
- Riccardo Cattaneo, Riccardo Bellini, Gianluca Durelli, Christian Pilato, Marco D. Santambrogio, Donatella Sciuto: PaRA-Sched: A Reconfiguration-Aware Scheduler for Reconfigurable Architectures. 243-250
RAW Poster Session 1
- Hiroki Nishiyama, Masato Inagi, Shin'ichi Wakabayashi, Shinobu Nagayama, Keisuke Inoue, Mineo Kaneko: An ILP-Based Optimal Circuit Mapping Method for PLDs. 251-256
- Cristiano Bacelar de Oliveira, João M. P. Cardoso, Eduardo Marques: High-Level Synthesis from C vs. a DSL-Based Approach. 257-262
- Zhang Zhang, Swamy D. Ponpandi, Akhilesh Tyagi: An Evaluation of User Satisfaction Driven Scheduling in a Polymorphic Embedded System. 263-268
- Georgios Tzimpragos, Christoforos Kachris, Dimitrios Soudris, Ioannis Tomkos: A Low-Latency Algorithm and FPGA Design for the Min-Search of LDPC Decoders. 269-274
- Jahanzeb Anwer, Marco Platzner, Sebastian Meisner: FPGA Redundancy Configurations: An Automated Design Space Exploration. 275-280
RAW Poster Session 2
- Chen Mei, Peng Cao, Yang Zhang, Bo Liu, Leibo Liu: Hierarchical Pipeline Optimization of Coarse Grained Reconfigurable Processor for Multimedia Applications. 281-286
- Alexander Wold, Andreas Agne, Jim Tørresen: Module Placement Using Constraint Programming in Run-Time Reconfigurable Systems. 287-292
- Hasan Erdem Yantir, Arda Yurdakul: An Efficient Heterogeneous Register File Implementation for FPGAs. 293-298
- Bernhard Schmidt, Daniel Ziener, Jürgen Teich: Minimizing Scrubbing Effort through Automatic Netlist Partitioning and Floorplanning. 299-304
- Viet Vu Duy, Timo Sandmann, Steffen Baehr, Oliver Sander, Jürgen Becker: Virtualization Support for FPGA-Based Coprocessors Connected via PCI Express to an Intel Multicore Platform. 305-310
Workshop 3: HIPS - Workshop on High-Level Parallel Programming Models and Supportive Environments
- John Cavazos: HIPS Introduction and Committees. 311
HIPS Session 1: System Support
- Mads Ruben Burgdorff Kristensen, Simon Andreas Frimann Lund, Troels Blum, Kenneth Skovhede, Brian Vinter: Bohrium: A Virtual Machine Approach to Portable Parallelism. 312-321
- Juan Carlos Martínez Santos, Yunsi Fei: HATI: Hardware Assisted Thread Isolation for Concurrent C/C++ Programs. 322-331
- Tatsuya Abe, Toshiyuki Maeda: A General Model Checking Framework for Various Memory Consistency Models. 332-341
HIPS Session 2: Optimization
- Lai Wei, John M. Mellor-Crummey: Autotuning Tensor Transposition. 342-351
- Weifeng Liu, Isaías A. Comprés Ureña, Michael Gerndt, Bin Gong: Automatic MPI-IO Tuning with the Periscope Tuning Framework. 352-360
- Jithin Jose, Khaled Hamidouche, Jie Zhang, Akshay Venkatesh, Dhabaleswar K. Panda: Optimizing Collective Communication in UPC. 361-370
HIPS Session 3: Effective Communication
- Simon Pickartz, Pablo Reble, Carsten Clauss, Stefan Lankes: SWIFT: A Transparent and Flexible Communication Layer for PCIe-Coupled Accelerators and (Co-)Processors. 371-380
- Christopher Boelmann, Lorenz Schwittmann, Torben Weis: Deterministic Synchronization of Multi-threaded Programs with Operational Transformation. 381-390
- Sai Charan Koduru, Keval Vora, Rajiv Gupta: ABC2: Adaptively Balancing Computation and Communication in a DSM Cluster of Multicores for Irregular Applications. 391-400
Workshop 4: NIDISC - Workshop on Nature Inspired Distributed Computing
- Pascal Bouvry, Franciszek Seredynski, El-Ghazali Talbi: NIDISC Introduction and Committees. 401
NIDISC Session 1: Applications of Bio-Inspired Algorithms
- Theodore P. Pavlic: Using Physical Stigmergy in Decentralized Optimization under Multiple Non-separable Constraints: Formal Methods and an Intelligent Lighting Example. 402-411
- Amir Nakib, El-Ghazali Talbi, A. Fuser: Hybrid Metaheuristic for Annual Hydropower Generation Optimization. 412-419
- Fatima Adly, Paul D. Yoo, Sami Muhaidat, Yousof Al-Hammadi: Machine-Learning-Based Identification of Defect Patterns in Semiconductor Wafer Maps: An Overview and Proposal. 420-429
- Alain Fuser, Florent Fontaine, Jack Copper: Data Quality, Consistency, and Interpretation Management for Wind Farms by Using Neural Networks. 430-438
NIDISC Session 2: Wireless Networks and Mobility Management
- Antonina Tretyakova, Franciszek Seredynski, Pascal Bouvry: Graph-Based Cellular Automata Approach to Maximum Lifetime Coverage Problem in Wireless Sensor Networks. 439-447
- Sankha Baran Dutta, Robert D. McLeod, Marcia R. Friesen: GPU Accelerated Nature Inspired Methods for Modelling Large Scale Bi-directional Pedestrian Movement. 448-456
- Marcin Seredynski, Patricia Ruiz, Krzysztof Szczypiorski, Djamel Khadraoui: Improving Bus Ride Comfort Using GLOSA-Based Dynamic Speed Optimisation. 457-463
- Huang Cheng, Xin Fei, Azzedine Boukerche, Mohammed Almulla: A Genetic Algorithm-Based Sparse Coverage over Urban VANETs. 464-469
NIDISC Session 3: Multi-objective Optimization
- Jakub Gasior, Franciszek Seredynski: A Game-Theoretic Approach to Multiobjective Job Scheduling in Cloud Computing Systems. 470-479
- Yacine Kessaci, Nouredine Melab, El-Ghazali Talbi: Multi-level and Multi-objective Survey on Cloud Scheduling. 480-488
- Benoît Bertholon, Sébastien Varrette, Pascal Bouvry: Comparison of Multi-objective Optimization Algorithms for the JShadObf JavaScript Obfuscator. 489-496
Workshop 5: HiCOMB - Workshop on High Performance Computational Biology
- Alba Cristina Magalhaes Alves de Melo, Srinivas Aluru, David A. Bader: HiCOMB Introduction and Committees. 497-498
- Stephen Larson, Ümit V. Çatalyürek, Ananth Kalyanaraman: HiCOMB Keynote and Invited Talks. 499
HiCOMB Session 1: Parallel Algorithms for Biological Sequence Analysis
- Jaroslaw Zola: Constructing Similarity Graphs from Large-Scale Biological Sequence Collections. 500-507
- Yi Wang, Gagan Agrawal, Hatice Gulcin Ozer, Kun Huang: Removing Sequential Bottlenecks in Analysis of Next-Generation Sequencing Data. 508-517
HiCOMB Session 2: Parallel/Distributed Architectures for Biological Applications
- Alexey M. Kozlov, Christian Goll, Alexandros Stamatakis: Efficient Computation of the Phylogenetic Likelihood Function on the Intel MIC Architecture. 518-527
- Jie Li, Amin Salighehdar, Narayan Ganesan: Process Simulation of Complex Biochemical Pathways in Explicit 3D Space Enabled by Heterogeneous Computing Platform. 528-535
- Kary A. C. S. Ocaña, Silvia Benza, Daniel de Oliveira, Jonas Dias, Marta Mattoso: Exploring Large Scale Receptor-Ligand Pairs in Molecular Docking Workflows in HPC Clouds. 536-545
- Natasha Pavlovikj, Kevin Begcy, Sairam Behera, Malachy Campbell, Harkamal Walia, Jitender S. Deogun: A Comparison of a Campus Cluster and Open Science Grid Platforms for Protein-Guided Assembly Using Pegasus Workflow Management System. 546-555
HiCOMB Session 3: Metagenomics and Assembly
- Sasha Ames, Jonathan E. Allen, David A. Hysom, G. Scott Lloyd, Maya B. Gokhale: Design and Optimization of a Metagenomics Analysis Workflow for NVRAM. 556-565
- Vipin Sachdeva, Chang Sik Kim, Kirk E. Jordan, Martyn D. Winn: Parallelization of the Trinity Pipeline for De Novo Transcriptome Assembly. 566-575
- Xiaohui Duan, Kun Zhao, Weiguo Liu: HiPGA: A High Performance Genome Assembler for Short Read Sequence Data. 576-584
Workshop 6: APDCM - Advances in Parallel and Distributed Computing Models
- Oscar H. Ibarra: APDCM Introduction and Committees. 585
APDCM Session 1
- Kazuya Tani, Daisuke Takafuji, Koji Nakano, Yasuaki Ito: Bulk Execution of Oblivious Algorithms on the Unified Memory Machine, with GPU Implementation. 586-595
- Mario Alberto Chapa Martell, Hiroyuki Sato: A Linear Performance-Breakdown Model for GPU Programming Optimization Guidance. 596-603
- Guangping Tang, Kenli Li, Keqin Li, Hang Chen, Jiayi Du: A Hybrid Parallel Tridiagonal Solver on Multi-core Architectures. 604-613
- Atsushi Koike, Kunihiko Sadakane: A Novel Computational Model for GPUs with Application to I/O Optimal Sorting Algorithms. 614-623
- Munara Tolubaeva, Yonghong Yan, Barbara M. Chapman: Predicting Cache Contention for Multithread Applications at Compile Time. 624-631
APDCM Session 2
- Guyue Wang, Shinichi Yamagiwa, Koichi Wada: Parallelism Extraction Algorithm from Stream-Based Processing Flow Applying Spanning Tree. 632-641
- Quan Chen, Long Zheng, Minyi Guo, Zhiyi Huang: EEWA: Energy-Efficient Workload-Aware Task Scheduling in Multi-core Architectures. 642-651
- Chunyan Wang, Shoichi Hirasawa, Hiroyuki Takizawa, Hiroaki Kobayashi: A Platform-Specific Code Smell Alert System for High Performance Computing Applications. 652-661
- Anne Benoit, Jean-Marc Nicod, Veronika Rehn-Sonigo: Optimizing Buffer Sizes for Pipeline Workflow Scheduling with Setup Times. 662-670
- Hatem M. El-Boghdadi: WECPAR: List Ranking Algorithm and Relative Computational Power. 671-678
APDCM Session 3
- George Bosilca, Aurélien Bouteiller, Thomas Hérault, Yves Robert, Jack J. Dongarra: Assessing the Impact of ABFT and Checkpoint Composite Strategies. 679-688
- Julien Herrmann, Loris Marchal, Yves Robert: Memory-Aware List Scheduling for Hybrid Platforms. 689-698
- Jocelyne Faddoul, Wendy MacCaull: A Parallel Framework for Handling Non-determinism with Expressive Description Logics. 699-708
- Martti Forsell, Jussi Roivainen, Ville Leppänen: Prototyping the MBTAC Processor for the REPLICA CMP. 709-716
- Jens Breitbart, Mareike Schmidtobreick, Vincent Heuveline: Evaluation of the Global Address Space Programming Interface (GASPI). 717-726
APDCM Session 4
- Chong Li, Gaétan Hains: GPS: Towards Simplified Communication on SGL Model. 727-736
- Gokarna Sharma, Hari Krishnan, Costas Busch, Steven R. Brandt: Near-Optimal Location Tracking Using Sensor Networks. 737-746
- Yihua Ding, James Zijun Wang, Pradip K. Srimani: Self-Stabilizing Algorithm for Maximal 2-Packing with Safe Convergence in an Arbitrary Graph. 747-754
- Satoshi Fujita: Minimum Set Cover of Sparsely Distributed Sensor Nodes by a Collection of Unit Disks. 755-761
- Xin Zhou, Yasuaki Ito, Koji Nakano: An Efficient Implementation of the Gradient-Based Hough Transform Using DSP Slices and Block RAMs on the FPGA. 762-770
Workshop 7: HPPAC - High-Performance, Power-Aware Computing
- Dong Li, Robert J. Fowler: HPPAC Introduction and Committees. 771-772
HPPAC Session 1: Power and Energy Analysis and Profiling
- Edgar A. León, Ian Karlin: Characterizing the Impact of Program Optimizations on Power and Energy for Explicit Hydrodynamics. 773-781
- Chung-Hsing Hsu, Jacob Combs, Jolie Nazor, Fabian Santiago, Rachelle Thysell, Suzanne Rivoire, Stephen W. Poole: Application Power Signature Analysis. 782-789
- Ryan E. Grant, Stephen L. Olivier, James H. Laros III, Ron Brightwell, Allan Porterfield: Metrics for Evaluating Energy Saving Techniques for Resilient HPC Systems. 790-797
HPPAC Session 2: Power-Efficient Hardware
- Ehsan Atoofian: Reducing Static and Dynamic Power of L1 Data Caches in GPGPUs. 798-804
- Gilbert Netzer, S. Lennart Johnsson, Daniel Ahlin, Eric Stotzer, Pekka Varis, Erwin Laure: Exploiting DMA for Performance and Energy Optimized STREAM on a DSP. 805-814
- Nico Reissmann, Jan Christian Meyer, Magnus Jahre: A Study of Energy and Locality Effects Using Space-Filling Curves. 815-822
HPPAC Session 3: Large Scale Power Management
- Ashkan Paya, Dan C. Marinescu: Energy-Aware Load Balancing Policies for the Cloud Ecosystem. 823-832
- George Terzopoulos, Helen D. Karatza: Bag-of-Task Scheduling on Power-Aware Clusters Using a DVFS-Based Mechanism. 833-840
- Haibo Zhang, Wenting Han, Feng Li, Songtao He, Yichao Cheng, Hong An, Zhitao Chen: A Criticality-Aware DVFS Runtime Utility for Optimizing Power Efficiency of Multithreaded Applications. 841-848
Workshop 8: HPGC - High-Performance Grid and Cloud Computing Workshop
- Eric E. Aubanel, Virendrakumar C. Bhavsar, Michael A. Frumkin: HPGC Introduction and Committees. 849
- Rajkumar Buyya, Derek Murray: HPGC Keynotes. 850-851
HPGC Session 1
- Andrew J. Younge, John Paul Walters, Stephen P. Crago, Geoffrey Charles Fox: Evaluating GPU Passthrough in Xen for High Performance Cloud Computing. 852-859
- Teng Long, Il-Chul Yoon, Alan Sussman, Adam A. Porter, Atif M. Memon: Scalable System Environment Caching and Sharing for Distributed Virtual Machines. 860-867
- Hangwei Qian, Michael Rabinovich: Mega Data Center for Elastic Internet Applications. 868-874
HPGC Session 2
- Ashkan Paya, Dan C. Marinescu: Cloud-Based Simulation of a Smart Power Grid. 875-884
- Seung-Hwan Lim, Gautam S. Thakur, James L. Horey: Analyzing Reliability of Virtual Machine Instances with Dynamic Pricing in the Public Cloud. 885-893
- Mohammad Ahmadian, Ashkan Paya, Dan C. Marinescu: Security of Applications Involving Multiple Organizations and Order Preserving Encryption in Hybrid Cloud Environments. 894-903
Workshop 9: AsHES - Accelerators and Hybrid Exascale Systems
- Yunquan Zhang: AsHES Introduction and Committees. 904-906
- Jeffrey S. Vetter: AsHES Keynote. 907
AsHES Session 1: Programming Model and Performance Optimizations
- Felix Schmitt, Robert Dietrich, Guido Juckeland: Scalable Critical Path Analysis for Hybrid MPI-CUDA Applications. 908-915
- Shuai Che, Jiayuan Meng, Kevin Skadron: Dymaxion++: A Directive-Based API to Optimize Data Layout and Memory Mapping for Heterogeneous Systems. 916-924
- Chenggang Lai, Zhijun Hao, Miaoqing Huang, Xuan Shi, Haihang You: Comparison of Parallel Programming Models on Intel MIC Computer Cluster. 925-932
- Marco Maggioni, Tanya Y. Berger-Wolf: CoAdELL: Adaptivity and Compression for Improving Sparse Matrix-Vector Multiplication on GPUs. 933-940
AsHES Session 2: Accelerating Applications
- Hartwig Anzt, William B. Sawyer, Stanimire Tomov, Piotr Luszczek, Ichitaro Yamazaki, Jack J. Dongarra: Optimizing Krylov Subspace Solvers on Graphics Processing Units. 941-949
- Lipeng Wang, Yuandong Chan, Xiaohui Duan, Haidong Lan, Xiangxu Meng, Weiguo Liu: XSW: Accelerating Biological Database Search on Xeon Phi. 950-957
- Simplice Donfack, Stanimire Tomov, Jack J. Dongarra: Dynamically Balanced Synchronization-Avoiding LU Factorization with Multicore and GPUs. 958-965
- Qi Hu, Nail A. Gumerov, Rio Yokota, Lorena A. Barba, Ramani Duraiswami: Scalable Fast Multipole Accelerated Vortex Methods. 966-975
AsHES Session 3: Emerging Hybrid Systems
- Lena Oden, Holger Fröning, Franz-Josef Pfreundt: Infiniband-Verbs on GPU: A Case Study of Controlling an Infiniband Network Device from the GPU. 976-983
- Anish Varghese, Bob Edwards, Gaurav Mitra, Alistair P. Rendell: Programming the Adapteva Epiphany 64-Core Network-on-Chip Coprocessor. 984-992
- Jianting Zhang, Dali Wang: High-Performance Zonal Histogramming on Large-Scale Geospatial Rasters Using GPUs and GPU-Accelerated Clusters. 993-1000
Workshop 10: PLC - Programming Models, Languages, and Compilers Workshop for Manycore and Heterogeneous Architectures
- Barbara M. Chapman: PLC Introduction and Committees. 1001
PLC Session 1: Programming and Compilation Techniques for GPUs
- Troels Blum, Mads Ruben Burgdorff Kristensen, Brian Vinter: Transparent GPU Execution of NumPy Applications. 1002-1010
- Dmitry Mikushin, Nikolay Likhogrud, Eddy Z. Zhang, Christopher Bergstrom: KernelGen - The Design and Implementation of a Next Generation Compiler Platform for Accelerating Numerical Models on GPUs. 1011-1020
- Wei Ding, Ligang Lu, Mauricio Araya-Polo, Amik St.-Cyr, Detlef Hohl, Barbara M. Chapman: Using GPU Shared Memory with a Directive-Based Approach. 1021-1028
PLC Session 2: Libraries and Optimization Frameworks
- Jagan Jayaraj, Pei-Hung Lin, Paul R. Woodward, Pen-Chung Yew: CFD Builder: A Library Builder for Computational Fluid Dynamics. 1029-1038
- Benjamin Ranft, Oliver Denninger, Philip Pfaffe: A Stream Processing Framework for On-Line Optimization of Performance and Energy Efficiency on Heterogeneous Systems. 1039-1048
PLC Session 3: Tools and Performance Evaluation
- Ahmad Qawasmeh, Abid Muslim Malik, Barbara M. Chapman: OpenMP Task Scheduling Analysis via OpenMP Runtime API and Tool Visualization. 1049-1058
- Pavel Zaichenkov, Bert Gijsbers, Clemens Grelck, Olga Tveretina, Alex Shafarenko: A Case Study in Coordination Programming: Performance Evaluation of S-Net vs Intel's Concurrent Collections. 1059-1067
Workshop 11: EduPar-NSF/TCPP Workshop on Parallel and Distributed Computing Education
- Sushil K. Prasad: EduPar Introduction and Committees. 1068-1069
- Randy H. Katz: EduPar Keynote. 1070
EduPar Session: Introductory Course and Across Curriculum
- Steven Bogaerts: Limited Time and Experience: Parallelism in CS1. 1071-1078
- Victor P. Gergel, Alexey Liniov, Iosif B. Meyerov, Alexander Sysoyev: NSF/IEEE-TCPP Curriculum Implementation at the State University of Nizhni Novgorod. 1079-1084
- David J. John, Stan J. Thomas: Parallel and Distributed Computing across the Computer Science Curriculum. 1085-1090
- Yinong Chen, Zhizheng Zhou: Service-Oriented Computing and Software Integration in Computing Curriculum. 1091-1098
- Nasser Giacaman, Oliver Sinnen: EA: Research-Infused Teaching of Parallel Programming Concepts for Undergraduate Software Engineering Students. 1099-1105
- Clayton Ferner, Barry Wilkinson, Barbara Heath: Using Patterns to Teach Parallel Computing. 1106-1113
EduPar Session: Miscellaneous
- Linh Bao Ngo, Edward B. Duffy, Amy W. Apon: Teaching HDFS/MapReduce Systems Concepts to Undergraduates. 1114-1121
- H. Martin Bücker, M. Ali Rostami: Interactively Exploring the Connection between Nested Dissection Orderings for Parallel Cholesky Factorization and Vertex Separators. 1122-1129
- David Toth: A Portable Cluster for Each Student. 1130-1134
Workshop 12: GABB - Graph Algorithms Building Blocks
- Tim Mattson, David A. Bader, Aydin Buluç, John R. Gilbert, Joseph Gonzalez, Jeremy Kepner: GABB Introduction. 1135-1137
Workshop 13: PDSEC - Workshop on Parallel and Distributed Scientific and Engineering Computing
- Peter E. Strazdins, Raphaël Couturier, Michelle Mills Strout, Keita Teranishi, Thomas Rauber, Gudula Rünger, Laurence T. Yang: PDSEC Introduction and Committees. 1138-1139
PDSEC Session 1: Best Papers
- William A. Magato, Philip A. Wilsey: llamaOS: A Solution for Virtualized High-Performance Computing Clusters. 1140-1149
- Azzam Haidar, Piotr Luszczek, Jack J. Dongarra: New Algorithm for Computing Eigenvectors of the Symmetric Eigenvalue Problem. 1150-1159
PDSEC Session 2: Algorithms (I)
- Davide Barbieri, Valeria Cardellini, Salvatore Filippone: Exhaustive Key Search on Clusters of GPUs. 1160-1168
- Md. Mohsin Ali, James Southern, Peter E. Strazdins, Brendan Harding: Application Level Fault Recovery: Using Fault-Tolerant Open MPI in a PDE Solver. 1169-1178
- Sudip K. Seal, Srikanth B. Yoginath, Michael K. Miller: Nanoscale Cluster Detection in Massive Atom Probe Tomography Data. 1179-1188
- Angel Gonzalez Mendez, Graciela Román-Alonso, Fernando Rojas-González, Miguel Alfonso Castro-García, Miguel Aguilar Cornejo, Salomón Cordero-Sánchez: Construction of Porous Networks Subjected to Geometric Restrictions by Using OpenMP. 1189-1197
PDSEC Session 3: Systems and Performance Analysis
- Daniel Espling, Per-Olov Östberg, Erik Elmroth: Integration and Evaluation of Decentralized Fairshare Prioritization (Aequus). 1198-1207
- Jeremiah J. Wilke: Coordination Languages and MPI Perturbation Theory: The FOX Tuple Space Framework for Resilience. 1208-1217
- Tyson Kendon, Jörg Denzinger: DisSLib: CC: A Library for Distributed Search with a Central Common Search State. 1218-1227
- Hongbo Zou, Yongen Yu, Wei Tang, Hsuanwei Michelle Chen: Improving I/O Performance with Adaptive Data Compression for Big Data Applications. 1228-1237
- Bertrand Putigny, Benoit Ruelle, Brice Goglin: Analysis of MPI Shared-Memory Communication Performance from a Cache Coherence Perspective. 1238-1247
PDSEC Session 4: Algorithms (II)
- Andrew A. Haigh, Eric C. McCreath: Acceleration of GPU-Based Ultrasound Simulation via Data Compression. 1248-1255
- Klaus Kofler, Dominik Steinhauser, Biagio Cosenza, Ivan Grasso, Sabine Schindler, Thomas Fahringer: Kd-Tree Based N-Body Simulations with Volume-Mass Heuristic on the GPU. 1256-1265
- Norihisa Fujita, Hideo Nuga, Taisuke Boku, Yasuhiro Idomura: Nuclear Fusion Simulation Code Optimization and Performance Evaluation on GPU Cluster. 1266-1274
- Zhe Weng, Peter E. Strazdins: Acceleration of a Python-Based Tsunami Modelling Application via CUDA and OpenHMPP. 1275-1284
- Roksana Hossain, Sebastian Magierowski, Geoffrey G. Messier: GPU Enhanced Path Finding for an Unmanned Aerial Vehicle. 1285-1293
Workshop 14: DPDNS - Dependable Parallel, Distributed, and Network-Centric Systems
- Dimiter Avresky, Erik Maehle, Salvatore Distefano: DPDNS Introduction and Committees. 1294-1295
- Edgar Nett: DPDNS Keynote. 1296
DPDNS Session: Applications
- Timo Lindhorst, Burkhard Weseloh, Edgar Nett: Maintaining Dependable Communication Service for Mobile Stations in Wireless Mesh Networks by Tracking Capacity Demands. 1297-1305
- Ammar Amory, Thomas Tosik, Erik Maehle: A Load Balancing Behavior for Underwater Robot Swarms to Increase Mission Time and Fault Tolerance. 1306-1313
- Andreas Dittrich, Stefan Wanja, Miroslaw Malek: ExCovery - A Framework for Distributed System Experiments and a Case Study of Service Discovery. 1314-1323
- Mohamed Mohamedin, Roberto Palmieri, Binoy Ravindran: Managing Soft-Errors in Transactional Systems. 1324-1329
DPDNS Session: Theoretical Aspects
- Salvatore Distefano: Standby System Reliability through DRBD. 1330-1337
- Yingxu Lai, Qiuyue Pan, Zenghui Liu, Yinong Chen, Zhizheng Zhou: Trust-Based Security for the Spanning Tree Protocol. 1338-1343
- Emil Vassev, Mike Hinchey: Autonomy Requirements Engineering for Self-Adaptive Science Clouds. 1344-1353
Workshop 15: MTAAP - Workshop on Multi-threaded Architectures and Applications
- Luiz DeRose: MTAAP Introduction and Committees. 1354
MTAAP Session: Algorithms and Position Papers
- Siddharth Gupta, Diana Palsetia, Md. Mostofa Ali Patwary, Ankit Agrawal, Alok N. Choudhary: A New Parallel Algorithm for Two-Pass Connected Component Labeling. 1355-1362
- Jaime Arteaga, Stéphane Zuckerman, Elkin Garcia, Guang R. Gao: Position Paper: Locality-Driven Scheduling of Tasks for Data-Dependent Multithreading. 1363-1367
- Walid J. Ghandour, Nadine J. Ghandour: Position Paper: Leveraging Strength-Based Dynamic Slicing to Identify Control Reconvergence Instructions. 1368-1373
MTAAP Session: Graph Analytics
- Hao Lu, Mahantesh Halappanavar, Ananth Kalyanaraman, Sutanay Choudhury: Parallel Heuristics for Scalable Community Detection. 1374-1385
- Ahmet Erdem Sariyüce, Erik Saule, Kamer Kaya, Ümit V. Çatalyürek: Hardware/Software Vectorization for Closeness Centrality on Multi-/Many-Core Architectures. 1386-1395
- Adam McLaughlin, David A. Bader: Revisiting Edge and Node Parallelism for Dynamic GPU Graph Analytics. 1396-1406
MTAAP Session: Accelerators
- Cheng Wang, Rengan Xu, Sunita Chandrasekaran, Barbara M. Chapman, Oscar R. Hernandez: A Validation Testsuite for OpenACC 1.0. 1407-1416
- Anas Abu-Doleh, Kamer Kaya, Mohamed Abouelhoda, Ümit V. Çatalyürek: Extracting Maximal Exact Matches on GPU. 1417-1426
- B. Neelima, G. Ram Mohana Reddy, Prakash S. Raghavendra: Predicting an Optimal Sparse Matrix Format for SpMV Computation on GPU. 1427-1436
Workshop 16: LSPP - Workshop on Large-Scale Parallel Processing
- Darren J. Kerbyson, Ram Rajamony, Charles C. Weems: LSPP Introduction and Committees. 1437
LSPP Session 1: Performance Analysis and Optimization
- Arash Shamaei, Bella Bose, Mary Flahive: Higher Dimensional Gaussian Networks. 1438-1447
LSPP Session 2: Modeling Performance for Scaling
- Bo Li, Hung-Ching Chang, Shuaiwen Song, Chun-Yi Su, Timmy Meyer, John Mooring, Kirk W. Cameron: The Power-Performance Tradeoffs of the Intel Xeon Phi on HPC Applications. 1448-1456
- Ying-Chieh Wang, Che-Rung Lee, Yeh-Ching Chung, I-Hsin Chung, Michael Perrone: Performance Modeling for Hardware Thread-Level Speculation. 1457-1464
- John D. Leidel, Yong Chen: HMC-Sim: A Simulation Framework for Hybrid Memory Cube Devices. 1465-1474
LSPP Session 3: Large-Scale Systems
- Roberto Gioiosa, Gokcen Kestor, Darren J. Kerbyson: Online Monitoring System for Performance Fault Detection. 1475-1484
LSPP Session 4: Scheduling
- Paul T. Lin, Matthew T. Bettencourt, Stefan Domino, Travis Fisher, Mark Hoemmen, Jonathan J. Hu, Eric T. Phipps, Andrey Prokopenko, Sivasankaran Rajamanickam, Christopher M. Siefert, Eric C. Cyr, Stephen Kennon: Towards Extreme-Scale Simulations with Next-Generation Trilinos: A Low Mach Fluid Application Case Study. 1485-1494
- Ichitaro Yamazaki, Jakub Kurzak, Piotr Luszczek, Jack J. Dongarra: Design and Implementation of a Large Scale Tree-Based QR Decomposition Using a 3D Virtual Systolic Array and a Lightweight Runtime. 1495-1504
- Michael Sevilla, Ike Nassi, Kleoni Ioannidou, Scott A. Brandt, Carlos Maltzahn: SupMR: Circumventing Disk and Memory Bandwidth Bottlenecks for Scale-up MapReduce. 1505-1514
Workshop 17: PCO - Parallel Computing and Optimization
- Didier El Baz: PCO Introduction and Committees. 1515
PCO Session 1: Optimization Techniques for Parallel or Distributed Architectures
- Congfeng Jiang, Jian Wan, Christophe Cérin, Paolo Gianessi, Yanik Ngoko: Towards Energy Efficient Allocation for Applications in Volunteer Cloud. 1516-1525
- Karl-Eduard Berger, François Galea, Bertrand Le Cun, Renaud Sirdey: Fast Generation of Large Task Network Mappings. 1526-1530
PCO Session 2: Parallel Optimization Algorithms
- Tarek Menouer, Bertrand Le Cun: Adaptive N to P Portfolio for Solving Constraint Programming Problems on Top of the Parallel Bobpp Framework. 1531-1540
- Yves Caniou, Philippe Codognet: Dependent Walks in Parallel Local Search. 1541-1546
- Mhand Hifi, Stéphane Nègre, Toufik Saadi, Sagvan Saleh, Lei Wu: A Parallel Large Neighborhood Search-Based Heuristic for the Disjunctively Constrained Knapsack Problem. 1547-1551
- Yuji Shinano, Tobias Achterberg, Timo Berthold, Stefan Heinz, Thorsten Koch, Michael Winkler: Solving Hard MIPLIB2003 Problems with ParaSCIP on Supercomputers: An Update. 1552-1561
PCO Session 3: Task Scheduling and Miscellaneous
- Shuli Wang, Kenli Li, Jing Mei, Keqin Li, Yan Wang:

A Task Scheduling Algorithm Based on Replication for Maximizing Reliability on Heterogeneous Computing Systems. 1562-1571 - Si Zheng, Yunhuai Liu, Tian He, Shanshan Li, Xiangke Liao:

SkewControl: Gini Out of the Bottle. 1572-1580 - Yuri Alexeev, Sheri A. Mickelson, Sven Leyffer, Robert L. Jacob, Anthony P. Craig:
The Heuristic Static Load-Balancing Algorithm Applied to the Community Earth System Model. 1581-1590 - Didier El Baz, Benoît Piranda, Julien Bourgeois:
A Distributed Algorithm for a Reconfigurable Modular Surface. 1591-1598
Workshop 18: ParLearning - Workshop on Parallel and Distributed Computing for Large Scale Machine Learning and Big Data Analytics
- Abhinav Vishnu, Yinglong Xia:

ParLearning Introduction and Committees. 1599-1600 - Eric P. Xing:

ParLearning Keynote. 1601
ParLearning Session 1
- Hsuan-Yi Chu, Yinglong Xia, Anand V. Panangadan, Viktor K. Prasanna:

Wait-Free Primitives for Initializing Bayesian Network Structure Learning on Multicore Processors. 1602-1611 - Karl Jansson, Håkan Sundell, Henrik Boström:

gpuRF and gpuERT: Efficient and Scalable GPU Algorithms for Decision Tree Ensembles. 1612-1621 - Lei Jin, Zhaokang Wang, Rong Gu, Chunfeng Yuan, Yihua Huang:

Training Large Scale Deep Neural Networks on the Intel Xeon Phi Many-Core Coprocessor. 1622-1630 - Xiujuan Qian, Yongli Wang, Xiaohui Jiang:

Parallel Bayesian Network Modelling for Pervasive Health Monitoring System. 1631-1637
ParLearning Session 2
- Nitin Sukhija, Brandon M. Malone, Srishti Srivastava, Ioana Banicescu, Florina M. Ciorba:
Portfolio-Based Selection of Robust Dynamic Loop Scheduling Algorithms Using Machine Learning. 1638-1647 - Wei Wang, Guisong Yang, Naixue Xiong, Xingyu He, Wenzhong Guo:

A General P2P Scheme for Constructing Large-Scale Virtual Environments. 1648-1655
ParLearning Session 3
- Peter D. Kirchner, Matthias Böhm, Berthold Reinwald, Daby M. Sow, J. Michael Schmidt, Deepak S. Turaga, Alain Biem:

Large Scale Discriminative Metric Learning. 1656-1663 - Hongjian Qiu, Rong Gu, Chunfeng Yuan, Yihua Huang:

YAFIM: A Parallel Frequent Itemset Mining Algorithm with Spark. 1664-1671 - Yang Bo, Naixue Xiong, Wenzhong Guo:

The Empirical Research of Virtual Enterprise Knowledge Transfer's Effectiveness Faced to the Independent Innovation Ability. 1672-1679 - Naixue Xiong, Guoxiang Tong, Wenzhong Guo, Jian Tan, Guanning Wu:

A Distributed Speech Algorithm for Large Scale Data Communication Systems. 1680-1687
Workshop 19: HPDIC - High Performance Data Intensive Computing
- Christophe Cérin, Congfeng Jiang:

HPDIC Introduction and Committees. 1688
HPDIC Session 1: Memory, I/O, and Performance Enhancement
- Vishwanath Venkatesan, Mohamad Chaarawi, Quincey Koziol, Edgar Gabriel:

Compactor: Optimization Framework at Staging I/O Nodes. 1689-1697 - Keita Iwabuchi, Hitoshi Sato, Ryo Mizote, Yuichiro Yasui, Katsuki Fujisawa, Satoshi Matsuoka:
Hybrid BFS Approach Using Semi-external Memory. 1698-1707 - Jialin Liu, Surendra Byna, Bin Dong, Kesheng Wu, Yong Chen:
Model-Driven Data Layout Selection for Improving Read Performance. 1708-1716
HPDIC Session 2: Clustering, Data Management, and Applications
- Stephane Martin, Tomasz Buchert, Pierric Willemet, Olivier Richard, Emmanuel Jeanvoine, Lucas Nussbaum:

Scalable and Reliable Data Broadcast with Kascade. 1717-1726 - Tugdual Sarazin, Hanane Azzag, Mustapha Lebbah:

SOM Clustering Using Spark-MapReduce. 1727-1734 - Liang Li, Dixin Tang, Taoying Liu, Hong Liu, Wei Li, Chenzhou Cui:
Optimizing the Join Operation on Hive to Accelerate Cross-Matching in Astronomy. 1735-1745
Workshop 20: JSSPP - Workshop on Job Scheduling Strategies for Parallel Processing
- Walfredo Cirne, Narayan Desai:

JSSPP Introduction and Committees. 1746
Workshop 21: CHIUW - Chapel Implementers and Users Workshop
- Brad Chamberlain:
CHIUW Introduction and Committees. 1747-1749
