


IPDPS 2016: Chicago, IL, USA - Workshops
- 2016 IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPS Workshops 2016, Chicago, IL, USA, May 23-27, 2016. IEEE Computer Society 2016, ISBN 978-1-5090-3682-0
Workshop 1-HCW - Heterogeneity in Computing Workshop
- Denis Trystram, Erik Saule:
HCW Introduction. 1-2
- Behrooz A. Shirazi:
Message from the HCW Steering Committee Chair. 3
- Denis Trystram:
Message from the HCW General Chair. 4
- Erik Saule:
Message from the HCW Program Committee Chair. 5
- Mahmut T. Kandemir:
HCW 2016 Keynote Talk. 6
Session 1: Heterogeneity in the Cloud
- Julio Proaño, Carmen Carrión, María Blanca Caminero:
Towards a Green, QoS-Enabled Heterogeneous Cloud Infrastructure. 7-16
- Rekha Singhal, Abhishek Verma:
Predicting Job Completion Time in Heterogeneous MapReduce Environments. 17-27
- Fouad Hanna, Loris Marchal, Jean-Marc Nicod, Laurent Philippe, Veronika Rehn-Sonigo, Hala Sabbah:
Minimizing Rental Cost for Multiple Recipe Applications in the Cloud. 28-37
Session 2: Heterogeneity in Single Node Systems
- Saeid Barati, Hank Hoffmann:
Providing Fairness in Heterogeneous Multicores with a Predictive, Adaptive Scheduler. 38-49
- Jeremy Bottleson, SungYe Kim, Jeff Andrews, Preeti Bindu, Deepak N. Murthy, Jingyi Jin:
clCaffe: OpenCL Accelerated Caffe for Convolutional Neural Networks. 50-57
- Bahareh Goodarzi, Martin Burtscher, Dhrubajyoti Goswami:
Parallel Graph Partitioning on a CPU-GPU Architecture. 58-66
Session 3: Heterogeneity and Energy
- Dylan Machovec, Bhavesh Khemka, Sudeep Pasricha, Anthony A. Maciejewski, Howard Jay Siegel, Gregory A. Koenig, Michael Wright, Marcia Hilton, Rajendra Rambharos, Neena Imam:
Dynamic Resource Management for Parallel Tasks in an Oversubscribed Energy-Constrained Heterogeneous Environment. 67-78
- JeeWhan Choi, Richard W. Vuduc:
Analyzing the Energy Efficiency of the Fast Multipole Method Using a DVFS-Aware Energy Model. 79-88
- John E. Stone, Michael J. Hallock, James C. Phillips, Joseph R. Peterson, Zaida Luthey-Schulten, Klaus Schulten:
Evaluation of Emerging Energy-Efficient Heterogeneous Computing Platforms for Biomolecular and Cellular Simulation Workloads. 89-100
Workshop 2-RAW - Reconfigurable Architectures Workshop
- Marco D. Santambrogio, Ramachandran Vaidyanathan, Diana Goehringer, Steven J. E. Wilton:
RAW Introduction and Committees. 101-102
- H. Peter Hofstee, Patrick Lysaght, Dirk van den Heuvel:
RAW 2016 Keynotes. 103-104
Session 1: Application Mapping and Design Space Exploration
- Lester Kalms, Diana Göhringer:
Clustering and Mapping Algorithm for Application Distribution on a Scalable FPGA Cluster. 105-113
- Syed Waqar Nabi, Wim Vanderbauwhede:
A Fast and Accurate Cost Model for FPGA Design Space Exploration in HPC Applications. 114-123
- Hyunsuk Nam, Roman Lysecky:
Latency, Power, and Security Optimization in Distributed Reconfigurable Embedded Systems. 124-131
Session 2: Applications
- Daniel Llamocca, Daniel N. Aloi:
A Reconfigurable Fixed-Point Architecture for Adaptive Beamforming. 132-138
- Aaron Mills, Phillip H. Jones, Joseph Zambreno:
Parameterizable FPGA-Based Kalman Filter Coprocessor Using Piecewise Affine Modeling. 139-147
- Chi Zhang, Ren Chen, Viktor K. Prasanna:
High Throughput Large Scale Sorting on a CPU-FPGA Heterogeneous Platform. 148-155
- Juan Andrés Pérez-Celis, José Martínez-Carranza, Alicia Morales-Reyes, Claudia Feregrino Uribe, René Cumplido:
An FPGA Architecture to Accelerate the Burrows Wheeler Transform by Using a Linear Sorter. 156-161
Session 3: Processor Architectures
- Mohamed El-Hadedy, Hristina Mihajloska, Danilo Gligoroski, Amit Kulkarni, Dirk Stroobandt, Kevin Skadron:
A 16-Bit Reconfigurable Encryption Processor for p-Cipher. 162-171
- Stephan Nolting, Guillermo Payá Vayá, Florian Giesemann, Holger Blume, Sebastian Niemann, Christian Müller-Schloer:
Dynamic Self-Reconfiguration of a MIPS-Based Soft-Processor Architecture. 172-180
- Steffen Vaas, Marc Reichenbach, Dietmar Fey:
An Application-Specific Instruction Set Processor for Power Quality Monitoring. 181-188
Session 4: Scheduler and Runtime Systems
- Andrea Purgato, Davide Tantillo, Marco Rabozzi, Donatella Sciuto, Marco D. Santambrogio:
Resource-Efficient Scheduling for Partially-Reconfigurable FPGA-Based Systems. 189-197
- Tajas Ruschke, Lukas Johannes Jung, Dennis Wolf, Christian Hochberger:
Scheduler for Inhomogeneous and Irregular CGRAs with Support for Complex Control Flow. 198-207
- Jens Rettkowski, Philipp Wehner, Evgheni Cutiscev, Diana Göhringer:
LinROS: A Linux-Based Runtime System for Reconfigurable MPSoCs. 208-216
Session 5: High Level Synthesis and Object-Oriented Programming
- Emanuele Del Sozzo, Andrea Solazzo, Antonio Miele, Marco D. Santambrogio:
On the Automation of High Level Synthesis of Convolutional Neural Networks. 217-224
- Gianluca C. Durelli, Fabrizio Spada, Christian Pilato, Marco D. Santambrogio:
Scala-Based Domain-Specific Language for Creating Accelerator-Based SoCs. 225-232
- Hongyuan Ding, Sen Ma, Miaoqing Huang, David Andrews:
OOGen: An Automated Generation Tool for Custom MPSoC Architectures Based on Object-Oriented Programming Methods. 233-240
Short Papers
- Benedikt Janßen, Moataz Naserddin, Michael Hübner:
A Hardware/Software Co-Design Approach for Control Applications with Static Real-Time Reallocation. 241-246
- Giulia Guidi, Enrico Reggiani, Lorenzo Di Tucci, Gianluca Durelli, Michaela Blott, Marco D. Santambrogio:
On How to Improve FPGA-Based Systems Design Productivity via SDAccel. 247-252
- Jones Yudi Mori, André Werner, Florian Fricke, Michael Hübner:
A Rapid Prototyping Method to Reduce the Design Time in Commercial High-Level Synthesis Tools. 253-258
- Salma Hesham, Diana Göhringer, Mohamed A. Abd El Ghany:
ARTNoCs: An Evaluation Framework for Hardware Architectures of Real-Time NoCs. 259-264
- Amit Kulkarni, Elias Vansteenkiste, Dirk Stroobandt, Andreas Brokalakis, Antonis Nikitakis:
A Fully Parameterized Virtual Coarse Grained Reconfigurable Array for High Performance Computing Applications. 265-270
- Anita Tino, Kaamran Raahemifar:
Assessing Multi-task Placement Algorithms in RCUs. 271-276
- Alexandra Kourfali, Dirk Stroobandt:
Efficient Hardware Debugging Using Parameterized FPGA Reconfiguration. 277-282
- Fynn Schwiegelshohn, Florian Kastner, Michael Hübner:
Enabling Dynamic Reconfiguration of Numerical Methods for the Robotic Motion Control Task. 283-288
- Martín Letras, Raudel Hernández-León, René Cumplido:
Hardware Architectures for Frequent Itemset Mining Based on Equivalence Classes Partitioning. 289-294
- Fabiola Casasopra, Gea Bianchi, Gianluca C. Durelli, Marco D. Santambrogio:
Parallel Protein Identification Using an FPGA-Based Solution. 295-299
- Nikolaos Stekas, Dirk van den Heuvel:
Face Recognition Using Local Binary Patterns Histograms (LBPH) on an FPGA-Based System on Chip (SoC). 300-304
Workshop 3-HIPS - High-Level Parallel Programming Models and Supportive Environments
- David Böhme, Xu Liu:
HIPS Introduction and Committees. 305-306
- Tim Mattson:
HIPS 2016 Keynote. 307
Session 1: Debugging and Optimization
- Faheem Ullah, Thomas R. Gross:
Detecting Anomalies in Concurrent Programs Based on Dynamic Control Flow Changes. 308-317
- Marc Sergent, David Goudin, Samuel Thibault, Olivier Aumage:
Controlling the Memory Subscription of Distributed Applications with a Task-Based Runtime System. 318-327
- Shingo Okuno, Tasuku Hiraishi, Hiroshi Nakashima, Masahiro Yasugi, Jun Sese:
Reducing Redundant Search in Parallel Graph Mining Using Exceptions. 328-337
Session 2: Heterogeneous Computing
- Matt Martineau, Simon McIntosh-Smith, Wayne P. Gaudin:
Evaluating OpenMP 4.0's Effectiveness as a Heterogeneous Parallel Programming Model. 338-347
- Ebad Salehi, Ahmad Lashgar, Amirali Baniasadi:
Employing Compression Solutions under OpenACC. 348-356
- Craig Edward Rasmussen, Matthew J. Sottile, Søren Rasmussen, Daniel Nagle, William Dumas:
CAFe: Coarray Fortran Extensions for Heterogeneous Computing. 357-365
Session 3: Parallel Algorithms and Systems
- Peter Mills, Clinton Jeffery:
Embedding Concurrent Generators. 366-375
- Josef Weidendorfer, Jens Breitbart:
The Case for Binary Rewriting at Runtime for Efficient Implementation of High-Level Programming Models in HPC. 376-385
- Seyed Hessam Mirsadeghi, Ahmad Afsahi:
PTRAM: A Parallel Topology-and Routing-Aware Mapping Framework for Large-Scale HPC Systems. 386-396
- Joshua Dennis Booth, Kyungjoo Kim, Sivasankaran Rajamanickam:
A Comparison of High-Level Programming Choices for Incomplete Sparse Factorization Across Different Architectures. 397-406
Workshop 4-HiCOMB - High Performance Computational Biology
- Srinivas Aluru, David A. Bader, Ananth Kalyanaraman, Jaroslaw Zola:
HiCOMB Introduction and Committees. 407
Session I
- Constantin Scholl, Kassian Kobert, Tomás Flouri, Alexandros Stamatakis:
The Divisible Load Balance Problem with Shared Cost and Its Application to Phylogenetic Inference. 408-417
- Nikolaos Alachiotis, Doru-Thom Popovici, Tze Meng Low:
Efficient Computation of Linkage Disequilibria as Dense Linear Algebra Operations. 418-427
- Michael J. Hallock, Zaida Luthey-Schulten:
Improving Reaction Kernel Performance in Lattice Microbes: Particle-Wise Propensities and Run-Time Generated Code. 428-434
Session II
- Amir Bahmani, Alexander B. Sibley, Mahmoud Parsian, Kouros Owzar, Frank Mueller:
SparkScore: Leveraging Apache Spark for Distributed Genomic Inference. 435-442
- Shayan Shams, Nayong Kim, Xiandong Meng, Ming Tai Ha, Shantenu Jha, Zhong Wang, Joohyun Kim:
A Scalable Pipeline for Transcriptome Profiling Tasks with On-Demand Computing Clouds. 443-452
- Vipin Sachdeva, Srinivas Aluru, David A. Bader:
A Memory and Time Scalable Parallelization of the Reptile Error-Correction Code. 453-462
Session III
- Nuttiiya Seekhao, Caroline Shung, Joseph F. JáJá, Luc Mongeau, Nicole Y. K. Li-Jessen:
Real-Time Agent-Based Modeling Simulation with in-Situ Visualization of Complex Biological Systems: A Case Study on Vocal Fold Inflammation and Healing. 463-472
- M. Ali Mirzaei, Francesco Crescioli, Sebastien Viret, William Tromeur, Giovanni Calderini, Giovanni Marchiori, Guillaume Baulieu, Geoffrey Galbit:
A Novel Associative Memory Based Architecture for Sequence Alignment. 473-478
Workshop 5-APDCM - Advances in Parallel and Distributed Computational Models
- Oscar H. Ibarra, Koji Nakano, Akihiro Fujiwara, Susumu Matsumae:
APDCM Introduction and Committees. 479
Session 1: Graph Algorithms
- Jie Wu:
Stable Matching Beyond Bipartite Graphs. 480-488
- Paula Aguilera, Dong Ping Zhang, Nam Sung Kim, Nuwan Jayasena:
Fine-Grained Task Migration for Graph Algorithms Using Processing in Memory. 489-498
Session 2: Wireless Networks and Distributed Computing
- Wei Chen, Liang Hong, Sachin Shetty, Dan Chia-Tien Lo, Reginald Cooper:
Cross-Layered Security Approach with Compromised Nodes Detection in Cooperative Sensor Networks. 499-508
- Hideharu Kojima, Yuta Nagashima, Tatsuhiro Tsuchiya:
Model Checking Techniques for State Space Reduction in MANET Protocol Verification. 509-516
- Feng Luo, Pradip K. Srimani:
New Biology Inspired Anonymous Distributed Algorithms to Compute Dominating and Total Dominating Sets in Network Graphs. 517-524
Session 3: Distributed Computing and Models
- Ta Yuan Hsu, Ajay D. Kshemkalyani:
Performance of Causal Consistency Algorithms for Partially Replicated Systems. 525-534
- Hassan Nawaz, Gideon Juve, Rafael Ferreira da Silva, Ewa Deelman:
Performance Analysis of an I/O-Intensive Workflow Executing on Google Cloud and Amazon Web Services. 535-544
- Travis S. Humble, Alexander J. McCaskey, Jonathan Schrock, Hadayat Seddiqi, Keith A. Britt, Neena Imam:
Performance Models for Split-Execution Computing Systems. 545-554
- Ernesto Gomez, Keith E. Schubert, Zongqi Ritchie Cai:
A Model for Entropy of Parallel Execution. 555-560
Session 4: Parallel Computing
- James Alexander Edwards, Uzi Vishkin:
FFT on XMT: Case Study of a Bandwidth-Intensive Regular Algorithm on a Highly-Parallel Many Core. 561-569
- Makoto Nakayama, Kenichi Yamazaki, Satoshi Tanaka:
Parallelization of Recursive Preorder Traversal Based on Building and Winding Call Stacks. 570-579
- P. B. Jayaraj, K. Rahamathulla, G. Gopakumar:
A GPU Based Maximum Common Subgraph Algorithm for Drug Discovery Applications. 580-588
- Toru Fujita, Koji Nakano, Yasuaki Ito:
Bitwise Parallel Bulk Computation on the GPU, with Application to the CKY Parsing for Context-Free Grammars. 589-598
- Xin Zhou, Yasuaki Ito, Koji Nakano:
An Efficient Implementation of LZW Decompression in the FPGA. 599-607
Workshop 6-ASHES - Accelerators and Hybrid Exascale Systems
- James Dinan:
AsHES Introduction and Committees. 608-609
- Wen-mei W. Hwu:
AsHES 2016 Keynote. 610
Session 1: Programming Models and Tools
- Chris J. Newburn, Gaurav Bansal, Michael Wood, Luis Crivelli, Judit Planas, Alejandro Duran, Paulo Souza, Leonardo Borges, Piotr Luszczek, Stanimire Tomov, Jack J. Dongarra, Hartwig Anzt, Mark Gates, Azzam Haidar, Yulu Jia, Khairul Kabir, Ichitaro Yamazaki, Jesús Labarta:
Heterogeneous Streaming. 611-620
- John D. Leidel, Yong Chen:
HMC-Sim-2.0: A Simulation Platform for Exploring Custom Memory Cube Operations. 621-630
- Erik Zenker, Benjamin Worpitz, René Widera, Axel Huebl, Guido Juckeland, Andreas Knüpfer, Wolfgang E. Nagel, Michael Bussmann:
Alpaka - An Abstraction Library for Parallel Kernel Acceleration. 631-640
- Souley Madougou, Ana Lucia Varbanescu, Cees de Laat, Rob van Nieuwpoort:
A Tool for Bottleneck Analysis and Performance Prediction for GPU-Accelerated Applications. 641-652
Session 2: Algorithms and Applications
- Yulu Jia, Piotr Luszczek, Jack J. Dongarra:
Hessenberg Reduction with Transient Error Resilience on GPU-Based Hybrid Architectures. 653-662
- Ryan Eberhardt, Mark Hoemmen:
Optimization of Block Sparse Matrix-Vector Multiplication on Shared-Memory Parallel Architectures. 663-672
- Joshua Dennis Booth, Sivasankaran Rajamanickam, Heidi Thornquist:
Basker: A Threaded Sparse LU Factorization Utilizing Hierarchical Parallelism and Data Layouts. 673-682
- Hartwig Anzt, Jack J. Dongarra, Moritz Kreutzer, Gerhard Wellein, Martin Koehler:
Efficiency of General Krylov Methods on GPUs - An Experimental Study. 683-691
Session 3: Workload Scheduling
- Luis Costero, Francisco D. Igual, Katzalin Olcoz, Sandra Catalán, Rafael Rodríguez-Sánchez, Enrique S. Quintana-Ortí:
Refactoring Conventional Task Schedulers to Exploit Asymmetric ARM big.LITTLE Architectures in Dense Linear Algebra. 692-701
- Valeria Cardellini, Alessandro Fanfarillo, Salvatore Filippone:
Heterogeneous CAF-Based Load Balancing on Intel Xeon Phi. 702-711
- Iman Faraji, Seyed Hessam Mirsadeghi, Ahmad Afsahi:
Topology-Aware GPU Selection on Multi-GPU Nodes. 712-720
Workshop 7-PCO - Parallel Computing and Optimization
- Didier El Baz, Bora Uçar:
PCO Introduction and Committees. 721
Session I: Parallel Computing and Optimization
- Kevin Ryan, Deepak Rajan, Shabbir Ahmed:
Scenario Decomposition for 0-1 Stochastic Programs: Improvements and Asynchronous Implementation. 722-729
- Lluís-Miquel Munguía, Geoffrey Oxberry, Deepak Rajan:
PIPS-SBB: A Parallel Distributed-Memory Branch-and-Bound Algorithm for Stochastic Mixed-Integer Programs. 730-739
- Adam Polak:
Counting Triangles in Large Graphs on GPU. 740-746
- Adel Dabah, Ahcène Bendjoudi, Didier El Baz, Abdelhakim AitZai:
GPU-Based Two Level Parallel B&B for the Blocking Job Shop Scheduling Problem. 747-755
Session II: Parallel Algorithms for Scheduling Problems
- Yumei Huo, Jun Xiong Huang:
Parallel Ant Colony Optimization for Flow Shop Scheduling Subject to Limited Machine Availability. 756-765
- Abhishek Awasthi, Jörg Lässig, Jens Leuschner, Thomas Weise:
GPGPU-Based Parallel Algorithms for Scheduling Against Due Date. 766-775
- Ali Al Buhussain, Robson Eduardo De Grande, Azzedine Boukerche:
Performance Analysis of Bio-Inspired Scheduling Algorithms for Cloud Environments. 776-785
Session III: Parallel Heuristics and Metaheuristics
- José-Matías Cutillas-Lozano, Domingo Giménez, Luis-Pedro García:
Optimizing Metaheuristics and Hyperheuristics through Multi-level Parallelism on a Many-Core System. 786-795
- Didier El Baz, Mhand Hifi, Lei Wu, Xiaochuan Shi:
A Parallel Ant Colony Optimization for the Maximum-Weight Clique Problem. 796-800
- Giovanni Cammarata, Antonella Di Stefano, Giovanni Morana, Daniele Zito:
Evaluating the Performance of A4SDN on Various Network Topologies. 801-808
- Ania Kaci, Huy-Nam Nguyen, Amir Nakib, Patrick Siarry:
Hybrid Heuristics for Mapping Task Problem on Large Scale Heterogeneous Platforms. 809-816
- Karl-Eduard Berger, François Galea, Bertrand Le Cun, Renaud Sirdey:
A Semi-Greedy Heuristic for the Mapping of Large Task Graphs. 817-824
Session IV: Combinatorial Scientific Computing
- Yu Jin, Joseph F. JáJá:
A High Performance Implementation of Spectral Clustering on CPU-GPU Platforms. 825-834
- Ning Hao, AmirReza Oghbaee, Mohammad Rostami, Nate Derbinsky, José Bento:
Testing Fine-Grained Parallelism for the ADMM on a Factor-Graph. 835-844
- Pingfan Li, Xuhao Chen, Zhe Quan, Jianbin Fang, Huayou Su, Tao Tang, Canqun Yang:
High Performance Parallel Graph Coloring on GPGPUs. 845-854
Workshop 8-GABB - Graph Algorithms Building Blocks
- Tim Mattson:
GABB Introduction and Committees. 855
- David A. Bader:
GABB 2016 Keynote. 856
- Mark Tullsen, Matthew J. Sottile:
Array Types for a Graph Processing Language. 857-866
- Jiahao Chen, Weijian Zhang:
The Right Way to Search Evolving Graphs. 867-876
- E. Jason Riedy:
Updating PageRank for Streaming Graphs. 877-884
- Sriram Srinivasan, Sanjukta Bhowmick, Sajal K. Das:
Application of Graph Sparsification in Developing Parallel Algorithms for Updating Connected Components. 885-891
- Keita Iwabuchi, Scott Sallinen, Roger A. Pearce, Brian Van Essen, Maya B. Gokhale, Satoshi Matsuoka:
Towards a Distributed Large-Scale Dynamic Graph Data Store. 892-901
- Brendan Gavin, Vijay Gadepally, Jeremy Kepner:
Enforced Sparse Non-negative Matrix Factorization. 902-911