default search action
IPDPS 2013: Cambridge, MA, USA
- 27th IEEE International Symposium on Parallel and Distributed Processing, IPDPS 2013, Cambridge, MA, USA, May 20-24, 2013. IEEE Computer Society 2013, ISBN 978-1-4673-6066-1
Keynote 1
- Shekhar Borkar:
Exascale Computing - A Fact or a Fiction? 3
Session 1: Checkpointing
- Itthichok Jangjaimon, Nian-Feng Tzeng:
Adaptive Incremental Checkpointing via Delta Compression for Networked Multicore Systems. 7-18 - Bogdan Nicolae:
Towards Scalable Checkpoint Restart: A Collective Inline Memory Contents Deduplication Proposal. 19-28 - Sudarsun Kannan, Ada Gavrilovska, Karsten Schwan, Dejan S. Milojicic:
Optimizing Checkpoints Using NVM as Virtual Memory. 29-40 - Aditya Dhoke, Binoy Ravindran, Bo Zhang:
On Closed Nesting and Checkpointing in Fault-Tolerant Distributed Transactional Memory. 41-52
Session 2: Cloud Computing
- Olivier Beaumont, Lionel Eyraud-Dubois, Hubert Larchevêque:
Reliable Service Allocation in Clouds. 55-66 - Ming Mao, Marty Humphrey:
Scaling and Scheduling to Maximize Application Performance within Budget Constraints in Cloud Workflows. 67-78 - Lionel Eyraud-Dubois, Hubert Larchevêque:
Optimizing Resource allocation while handling SLA violations in Cloud Computing platforms. 79-87 - Yanfei Guo, Palden Lama, Jia Rao, Xiaobo Zhou:
V-Cache: Towards Flexible Resource Provisioning for Multi-tier Applications in IaaS Clouds. 88-99
Session 3: Hybrid Systems
- George Teodoro, Tony Pan, Tahsin M. Kurç, Jun Kong, Lee A. D. Cooper, Norbert Podhorszki, Scott Klasky, Joel H. Saltz:
High-throughput Analysis of Large Microscopy Image Datasets on CPU-GPU Cluster Platforms. 103-114 - Jing Wu, Joseph F. JáJá:
High Performance FFT Based Poisson Solver on a CPU-GPU Heterogeneous Platform. 115-125 - Alexander Heinecke, Karthikeyan Vaidyanathan, Mikhail Smelyanskiy, Alexander Kobotov, Roman Dubtsov, Greg Henry, Aniruddha G. Shet, George Chrysos, Pradeep Dubey:
Design and Implementation of the Linpack Benchmark for Single and Multi-node Systems Based on Intel® Xeon Phi Coprocessor. 126-137 - Judit Planas, Rosa M. Badia, Eduard Ayguadé, Jesús Labarta:
Self-Adaptive OmpSs Tasks in Heterogeneous Environments. 138-149
Session 4: Networks
- Lizhong Chen, Kai Hwang, Timothy Mark Pinkston:
RAIR: Interference Reduction in Regionalized Networks-on-Chip. 153-164 - Ruisheng Wang, Lizhong Chen, Timothy Mark Pinkston:
An Analytical Performance Model for Partitioning Off-Chip Memory Bandwidth. 165-176 - Lei Wang, Jagadish Jayabalan, Minseon Ahn, Haiyin Gu, Ki Hwan Yum, Eun Jung Kim:
A Case for Handshake in Nanophotonic Interconnects. 177-188 - David Whelihan, Jeffrey J. Hughes, Scott M. Sawyer, Eric Robinson, Michael M. Wolf, Sanjeev Mohindra, Julie Mullen, Anna Klein, Michelle S. Beard, Nadya T. Bliss, Johnnie Chan, Robert Hendry, Keren Bergman, Luca P. Carloni:
P-sync: A Photonically Enabled Architecture for Efficient Non-local Data Access. 189-200
Session 5: Graph Algorithms
- Mark Redekopp, Yogesh Simmhan, Viktor K. Prasanna:
Optimizations and Analysis of BSP Graph Processing Models on Public Clouds. 203-214 - Peter Sanders, Lawrence Mandow:
Parallel Label-Setting Multi-objective Shortest Path Search. 215-224 - Dominique Lasalle, George Karypis:
Multi-threaded Graph Partitioning. 225-236 - Aydin Buluç, Erika Duriakova, Armando Fox, John R. Gilbert, Shoaib Kamil, Adam Lugowski, Leonid Oliker, Samuel Williams:
High-Productivity and High-Performance Analysis of Filtered Semantic Graphs. 237-248
Session 6: Numerical Analysis
- Jakub Kurzak, Piotr Luszczek, Mark Gates, Ichitaro Yamazaki, Jack J. Dongarra:
Virtual Systolic Array for QR Decomposition. 251-260 - James Demmel, David Eliahu, Armando Fox, Shoaib Kamil, Benjamin Lipshitz, Oded Schwartz, Omer Spillinger:
Communication-Optimal Parallel Recursive Rectangular Matrix Multiplication. 261-272 - Theodoros Gkountouvas, Vasileios Karakasis, Kornilios Kourtis, Georgios I. Goumas, Nectarios Koziris:
Improving the Performance of the Symmetric Sparse Matrix-Vector Multiplication in Multicore. 273-283 - Yingchong Situ, Ye Wang, Zhiyuan Li:
Automated Rapid Prototyping of Regular Grid-Based Numerical Applications Using Generalized Elemental Subroutines. 284-294
Session 7: Parallel I/O and Server Software
- Yongen Yu, Jingjin Wu, Zhiling Lan, Douglas H. Rudd, Nickolay Y. Gnedin, Andrey V. Kravtsov:
A Transparent Collective I/O Implementation. 297-307 - Carmen Sigovan, Chris Muelder, Kwan-Liu Ma, Jason Cope, Kamil Iskra, Robert B. Ross:
A Visual Network Analysis Method for Large-Scale Parallel I/O Systems. 308-319 - Fang Zheng, Hongbo Zou, Greg Eisenhauer, Karsten Schwan, Matthew Wolf, Jai Dayal, Tuan-Anh Nguyen, Jianting Cao, Hasan Abbasi, Scott Klasky, Norbert Podhorszki, Hongfeng Yu:
FlexIO: I/O Middleware for Location-Flexible Scientific Data Analytics. 320-331 - Zhaoyi Luo, Zhuzhong Qian:
Burstiness-aware Server Consolidation via Queuing Theory Approach in a Computing Cloud. 332-341
Session 8: Parallel I/O and File Systems
- Yanlong Yin, Jibing Li, Jun He, Xian-He Sun, Rajeev Thakur:
Pattern-Direct and Layout-Aware Replication Scheme for Parallel I/O Systems. 345-356 - Ramya Prabhakar, Mahmut T. Kandemir, Myoungsoo Jung:
Disk-Cache and Parallelism Aware I/O Scheduling to Improve Storage System Performance. 357-368 - Dong H. Ahn, Michael J. Brim, Bronis R. de Supinski, Todd Gamblin, Gregory L. Lee, Matthew P. LeGendre, Barton P. Miller, Adam Moody, Martin Schulz:
Efficient and Scalable Retrieval Techniques for Global File Properties. 369-380 - Xuechen Zhang, Ke Liu, Kei Davis, Song Jiang:
iBridge: Improving Unaligned Parallel File Access with Solid-State Drives. 381-392
Session 9: Potpourri Algorithms 1
- Chen Avin, Bernhard Haeupler, Zvi Lotker, Christian Scheideler, Stefan Schmid:
Locally Self-Adjusting Tree Networks. 395-406 - Adam Hackett, Deepak Ajwani, Shoukat Ali, Steve Kirkland, John P. Morrison:
A Network Configuration Algorithm Based on Optimization of Kirchhoff Index. 407-417 - Patrick Flick, Peter Sanders, Jochen Speck:
Malleable Sorting. 418-426 - Mehdi Chitchian, Alexander S. van Amesfoort, Andrea Simonetto, Tamás Keviczky, Henk J. Sips:
Adapting Particle Filter Algorithms to Many-Core Architectures. 427-438
Session 10: GPU Scheduling
- Jianmin Chen, Xi Tao, Zhen Yang, Jih-Kwon Peir, Xiaoyuan Li, Shih-Lien Lu:
Guided Region-Based GPU Scheduling: Utilizing Multi-thread Parallelism to Hide Memory Latency. 441-451 - Wai Teng Tang, Wen Jun Tan, Ratna Krishnamoorthy, Yi Wen Wong, Shyh-Hao Kuo, Rick Siow Mong Goh, Stephen John Turner, Weng-Fai Wong:
Optimizing and Auto-Tuning Iterative Stencil Loops for GPUs with the In-Plane Method. 452-462 - Rupesh Nasre, Martin Burtscher, Keshav Pingali:
Data-Driven Versus Topology-driven Irregular Computations on GPUs. 463-474 - Ayse Yilmazer, David R. Kaeli:
HQL: A Scalable Synchronization Mechanism for GPUs. 475-486
Session 11: Fault Tolerance and Contention Resolution
- Keun Soo Yim, Zbigniew Kalbarczyk, Ravishankar K. Iyer:
Pluggable Watchdog: Transparent Failure Detection for MPI Programs. 489-500 - Mohamed-Slim Bouguerra, Ana Gainaru, Leonardo Arturo Bautista-Gomez, Franck Cappello, Satoshi Matsuoka, Naoya Maruyama:
Improving the Computing Efficiency of HPC Systems Using a Combination of Proactive and Preventive Checkpointing. 501-512 - Konstantina Mitropoulou, Vasileios Porpodas, Marcelo Cintra:
CASTED: Core-Adaptive Software Transient Error Detection for Tightly Coupled Cores. 513-524 - Gianluca De Marco, Dariusz R. Kowalski:
Contention Resolution in a Non-synchronized Multiple Access Channel. 525-533
Session 12: Communication and Routing 1
- Bogdan Prisacari, Germán Rodríguez, Cyriel Minkenberg:
Generalized Hierarchical All-to-All Exchange Patterns. 537-547 - Edgar Solomonik, Aydin Buluç, James Demmel:
Minimizing Communication in All-Pairs Shortest Paths. 548-559 - Jan Ciesko, Javier Bueno, Nikola Puzovic, Alex Ramírez, Rosa M. Badia, Jesús Labarta:
Programmable and Scalable Reductions on Clusters. 560-568 - Yandong Wang, Cong Xu, Xiaobing Li, Weikuan Yu:
JVM-Bypass for Efficient Hadoop Shuffling. 569-578
Symposium Tutorial
- Pradeep Padala:
Resource Management in VMware Powered Cloud: Concepts and Techniques. 581
Keynote 2
- James Demmel:
Communication-Avoiding Algorithms for Linear Algebra and Beyond. 585
Session 13: Data Centers
- Zhiyang Guo, Jun Duan, Yuanyuan Yang:
Oversubscription Bounded Multicast Scheduling in Fat-Tree Data Center Networks. 589-600 - Shachar Raindel, Yitzhak Birk:
Replicate and Bundle (RnB) - A Mechanism for Relieving Bottlenecks in Data Centers. 601-610 - Shuo Liu, Shaolei Ren, Gang Quan, Ming Zhao, Shangping Ren:
Profit Aware Load Balancing for Distributed Cloud Data Centers. 611-622 - Hao Jin, Tosmate Cheocherngngarn, Dmita Levy, Alex Smith, Deng Pan, Jason Liu, Niki Pissinou:
Joint Host-Network Optimization for Energy-Efficient Data Center Networking. 623-634
Session 14: Energy Modeling and Scheduling
- Zhihui Du, Hongyang Sun, Yuxiong He, Yu He, David A. Bader, Huazhe Zhang:
Energy-Efficient Scheduling for Best-Effort Interactive Services to Achieve High Response Quality. 637-648 - James Demmel, Andrew Gearhart, Benjamin Lipshitz, Oded Schwartz:
Perfect Strong Scaling Using No Additional Energy. 649-660 - JeeWhan Choi, Daniel Bedard, Robert J. Fowler, Richard W. Vuduc:
A Roofline Model of Energy. 661-672 - Shuaiwen Song, Chun-Yi Su, Barry Rountree, Kirk W. Cameron:
A Simplified and Accurate Model of Power-Performance Efficiency on Emergent GPU Architectures. 673-686
Session 15: Communication and Routing 2
- Sameer Kumar, Yanhua Sun, Laximant V. Kalé:
Acceleration of an Asynchronous Message Driven Programming Paradigm on IBM Blue Gene/Q. 689-699 - Matthias Diener, Eduardo Henrique Molina da Cruz, Philippe Olivier Alexandre Navaux:
Communication-Based Mapping Using Shared Pages. 700-711 - Sanjay Chatterjee, Sagnak Tasirlar, Zoran Budimlic, Vincent Cavé, Milind Chabbi, Max Grossman, Vivek Sarkar, Yonghong Yan:
Integrating Asynchronous Task Parallelism with MPI. 712-725 - Kang Chen, Haiying Shen:
DTN-FLOW: Inter-Landmark Data Flow for High-Throughput Routing in DTNs. 726-737
Session 16: Peer to Peer Systems
- Antoine Boutet, Davide Frey, Rachid Guerraoui, Arnaud Jégou, Anne-Marie Kermarrec:
WHATSUP: A Decentralized Instant News Recommender. 741-752 - Ioannis Boutsis, Vana Kalogeraki:
Crowdsourcing under Real-Time Constraints. 753-764 - Weixiong Rao, Chao Chen, Pan Hui, Sasu Tarkoma:
Replication-Based Load Balancing in Distributed Content-Based Publish/Subscribe. 765-774 - Tonglin Li, Xiaobing Zhou, Kevin Brandstatter, Dongfang Zhao, Ke Wang, Anupam Rajendran, Zhao Zhang, Ioan Raicu:
ZHT: A Light-Weight Reliable Persistent Dynamic Scalable Zero-Hop Distributed Hash Table. 775-787
Session 17: Programming Framework
- Kenneth Czechowski, Richard W. Vuduc:
A Theoretical Framework for Algorithm-Architecture Co-design. 791-802 - Martin Wimmer:
Wait-free Hyperobjects for Task-Parallel Programming Systems. 803-812 - Edgar Solomonik, Devin Matthews, Jeff R. Hammond, James Demmel:
Cyclops Tensor Framework: Reducing Communication and Eliminating Load Imbalance in Massively Parallel Contractions. 813-824 - Roger A. Pearce, Maya B. Gokhale, Nancy M. Amato:
Scaling Techniques for Massive Scale-Free Graphs in Distributed (External) Memory. 825-836
Session Scheduling 1
- Loris Marchal, Oliver Sinnen, Frédéric Vivien:
Scheduling Tree-Shaped Task Graphs to Minimize Memory and Makespan. 839-850 - Abdullah Gharaibeh, Lauro Beltrão Costa, Elizeu Santos-Neto, Matei Ripeanu:
On Graphs, GPUs, and Blind Dating: A Workload to Processor Matchmaking Quest. 851-862 - Olivier Beaumont, Hubert Larchevêque, Loris Marchal:
Non Linear Divisible Loads: There is No Free Lunch. 863-873 - Nakul Jindal, Victor Lotrich, Erik Deumens, Beverly A. Sanders:
SIPMaP: A Tool for Modeling Irregular Parallel Computations in the Super Instruction Architecture. 874-884
Symposium Panel
- Raghu Ramakrishnan:
Big Data in 10 Years. 887
Keynote 3
- Josh Simons:
HPC Cloud Bad; HPC in the Cloud Good. 891
Plenary Session: Best Papers
- Grey Ballard, Dulceneia Becker, James Demmel, Jack J. Dongarra, Alex Druinsky, Inon Peled, Oded Schwartz, Sivan Toledo, Ichitaro Yamazaki:
Implementing a Blocked Aasen's Algorithm with a Dynamic Scheduler on Multicore Architectures. 895-907 - Abdul Rahman Abdurrab, Tao Xie, Wei Wang:
DLOOP: A Flash Translation Layer Exploiting Plane-Level Parallelism. 908-918 - Ian Karlin, Abhinav Bhatele, Jeff Keasler, Bradford L. Chamberlain, Jonathan D. Cohen, Zachary DeVito, Riyaz Haque, Dan Laney, Edward Luke, Felix Wang, David F. Richards, Martin Schulz, Charles H. Still:
Exploring Traditional and Emerging Parallel Programming Models Using a Proxy Application. 919-932 - Daniele Paolo Scarpazza, Douglas J. Ierardi, Adam K. Lerer, Kenneth M. Mackenzie, Albert C. Pan, Joseph A. Bank, Edmond Chow, Ron O. Dror, J. P. Grossman, Daniel Killebrew, Mark A. Moraes, Cristian Predescu, John K. Salmon, David E. Shaw:
Extending the Generality of Molecular Dynamics Simulations on a Special-Purpose Machine. 933-945
Session 19: Scheduling 2
- Koyel Mukherjee, Samir Khuller, Amol Deshpande:
Algorithms for the Thermal Scheduling Problem. 949-960 - Pooja Aggarwal, Smruti R. Sarangi:
Lock-Free and Wait-Free Slot Scheduling Algorithms. 961-972 - Venkatesan T. Chakaravarthy, Anamitra R. Choudhury, Sambuddha Roy, Yogish Sabharwal:
Distributed Algorithms for Scheduling on Line and Tree Networks with Non-uniform Bandwidths. 973-984 - Richard Cole, Vijaya Ramachandran:
Analysis of Randomized Work Stealing with False Sharing. 985-998
Session 20: GPU Software
- Sreeram Potluri, Devendar Bureddy, Hao Wang, Hari Subramoni, Dhabaleswar K. Panda:
Extending OpenSHMEM for GPU Computing. 1001-1012 - Da Li, Michela Becchi:
Deploying Graph Algorithms on GPUs: An Adaptive Solution. 1013-1024 - Shay Berkovich, Borzoo Bonakdarpour, Sebastian Fischmeister:
GPU-based Runtime Verification. 1025-1036 - Nicholas Moore, Miriam Leeser, Laurie A. Smith King:
Kernel Specialization for Improved Adaptability and Performance on Graphics Processing Units (GPUs). 1037-1048
Session 21: Scientific Computing
- Mohsen Zohrevandi, Rida A. Bazzi:
The Bounded Data Reuse Problem in Scientific Workflows. 1051-1062 - Amanda Peters Randles, Vivek Kale, Jeff R. Hammond, William Gropp, Efthimios Kaxiras:
Performance Analysis of the Lattice Boltzmann Model Beyond Navier-Stokes. 1063-1074 - Michael B. Driscoll, Evangelos Georganas, Penporn Koanantakool, Edgar Solomonik, Katherine A. Yelick:
A Communication-Optimal N-Body Algorithm for Direct Interactions. 1075-1084 - Simon J. Pennycook, Christopher J. Hughes, Mikhail Smelyanskiy, Stephen A. Jarvis:
Exploring SIMD for Molecular Dynamics, Using Intel® Xeon® Processors and Intel® Xeon Phi Coprocessors. 1085-1097
Session 22: Wireless and Sensor Systems
- Cong Wang, Ji Li, Fan Ye, Yuanyuan Yang:
Multi-vehicle Coordination for Wireless Energy Replenishment in Sensor Networks. 1101-1111 - Xiaowei Mei, Donggang Liu, Kun Sun, Dingbang Xu:
On Feasibility of Fingerprinting Wireless Sensor Nodes Using Physical Properties. 1112-1121 - Dawei Gong, Yuanyuan Yang:
Distributed Algorithms for Joint Routing and Frame Aggregation in 802.11n Wireless Mesh Networks. 1122-1132 - Christopher Mutschler, Michael Philippsen:
Distributed Low-Latency Out-of-Order Event Processing for High Data Rate Sensor Streams. 1133-1144
Session 23: Potpourri Algorithms 2
- Armando Castañeda, Sergio Rajsbaum, Michel Raynal:
Agreement via Symmetry Breaking: On the Structure of Weak Subconsensus Tasks. 1147-1158 - Kun Huang, Jie Zhang, Dafang Zhang, Gaogang Xie, Kavé Salamatian, Alex X. Liu, Wei Li:
A Multi-partitioning Approach to Building Fast and Accurate Counting Bloom Filters. 1159-1170 - Vincent Gramoli, Rachid Guerraoui, Mihai Letia:
Composing Relaxed Transactions. 1171-1182 - Junliang Chen, Bing Bing Zhou, Chen Wang, Peng Lu, Penghao Wang, Albert Y. Zomaya:
Throughput Enhancement through Selective Time Sharing and Dynamic Grouping. 1183-1192
Session 24: Potpourri Applications
- Alexandros Stamatakis, Andre J. Aberer:
Novel Parallelization Schemes for Large-Scale Likelihood-based Phylogenetic Inference. 1195-1204 - Tekin Bicer, Jian Yin, David Chiu, Gagan Agrawal, Karen Schuchardt:
Integrating Online Compression to Accelerate Large-Scale Data Analytics Applications. 1205-1216 - Amanda Peters Randles, David G. Rand, Christopher Lee, Greg Morrisett, Jayanta Sircar, Martin A. Nowak, Hanspeter Pfister:
Massively Parallel Model of Extended Memory Use in Evolutionary Game Dynamics. 1217-1228 - Vitali A. Morozov, Kalyan Kumaran, Venkatram Vishwanath, Jiayuan Meng, Michael E. Papka:
Early Experience on the Blue Gene/Q Supercomputing System. 1229-1240
Session 25: Potpourri Systems
- Saurabh Gupta, Hongliang Gao, Huiyang Zhou:
Adaptive Cache Bypassing for Inclusive Last Level Caches. 1243-1253 - Kubilay Atasu, Florian Dörfler, Jan van Lunteren, Christoph Hagleitner:
Hardware-Accelerated Regular Expression Matching with Overlap Handling on IBM PowerEN Processor. 1254-1265 - Vesna Smiljkovic, Martin Nowack, Neboja Miletic, Tim Harris, Osman S. Ünsal, Adrián Cristal, Mateo Valero:
TM-dietlibc: A TM-aware Real-World System Library. 1266-1274 - Balaji Palanisamy, Aameek Singh, Ling Liu, Bryan Langston:
Cura: A Cost-Optimized Model for MapReduce in a Cloud. 1275-1286
Session 26: Programing Frameworks
- Martin Burtscher, Hassan Rabeti:
A Scalable Heterogeneous Parallelization Framework for Iterative Local Searches. 1289-1298 - Thierry Gautier, João V. F. Lima, Nicolas Maillard, Bruno Raffin:
XKaapi: A Runtime System for Data-Flow Task Programming on Heterogeneous Architectures. 1299-1308 - Daniel Cederman, Bapi Chatterjee, Nhan Nguyen Dang, Yiannis Nikolakopoulos, Marina Papatriantafilou, Philippas Tsigas:
A Study of the Behavior of Synchronization Methods in Commonly Used Languages and Systems. 1309-1320 - Chaoran Yang, Karthik Murthy, John M. Mellor-Crummey:
Managing Asynchronous Operations in Coarray Fortran 2.0. 1321-1332
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.