IPDPS 2010: Atlanta, Georgia, USA
24th IEEE International Symposium on Parallel and Distributed Processing, IPDPS 2010, Atlanta, Georgia, USA, 19-23 April 2010 - Conference Proceedings. IEEE 2010
Niloofar Fazlollahi, David Starobinski: Distributed advance network reservation with delay guarantees. 1-12
Iain A. Stewart: A general algorithm for detecting faults under the comparison diagnosis model. 1-9
Olivier Beaumont, Hejer Rejeb: On the importance of bandwidth control mechanisms for scheduling on large scale heterogeneous platforms. 1-12
Olivier Beaumont, Lionel Eyraud-Dubois, Shailesh Kumar Agrawal: Broadcasting on large scale heterogeneous platforms under the bounded multi-port model. 1-11
Michela Taufer, Omar Padron, Philip Saponaro, Sandeep Patel: Improving numerical reproducibility and stability in large-scale numerical simulations on GPUs. 1-9
Weirong Jiang, Yi-Hua Edward Yang, Viktor K. Prasanna: Scalable multi-pipeline architecture for high performance multi-pattern string matching. 1-12
Suneil Mohan, Amitava Biswas, Aalap Tripathy, Jagannath Panigrahy, Rabi N. Mahapatra: A parallel architecture for meaning comparison. 1-10
Amitabha Bagchi: Sparse power-efficient topologies for wireless ad hoc sensor networks. 1-10
Henry M. Monti, Ali Raza Butt, Sudharshan S. Vazhkudai: Reconciling scratch space consumption, exposure, and volatility to achieve timely staging of job input data. 1-12
Patrick P. C. Lee, Tian Bu, Girish P. Chandranmenon: A lock-free, cache-efficient multi-core synchronization mechanism for line-rate network traffic monitoring. 1-12
Jiuxing Liu: Evaluating standard-based self-virtualizing devices: A performance study on 10 GbE NICs with SR-IOV support. 1-12
Sriram Ramabhadran, Joseph Pasquale: Analysis of durability in replicated distributed storage systems. 1-12
Qingbo Yuan, Jianbo Zhao, Mingyu Chen, Ninghui Sun: GenerOS: An asymmetric operating system kernel for multi-core systems. 1-10
Ke Pan, Wentong Cai, Xueyan Tang, Suiping Zhou, Stephen John Turner: A hybrid Interest Management mechanism for peer-to-peer Networked Virtual Environments. 1-12
Bo Mao, Hong Jiang, Dan Feng, Suzhen Wu, Jianxi Chen, Lingfang Zeng, Lei Tian: HPDA: A hybrid parity-based disk array for enhanced performance and reliability. 1-12
Hiroki Yanagisawa: A multi-source label-correcting algorithm for the all-pairs shortest paths problem. 1-10
Shih-Hsin Lo, Ying-Cherng Lan, Hsin-Hsien Yeh, Wen-Chung Tsai, Yu Hen Hu, Sao-Jie Chen: QoS aware BiNoC architecture. 1-10
Konstantis Daloukas, Christos D. Antonopoulos, Nikolaos Bellas, Sek M. Chai: Fisheye lens distortion correction on multicore and hardware accelerator platforms. 1-10
Jun Zhu, Wei Dong, Zhefu Jiang, Xiaogang Shi, Zhen Xiao, Xiaoming Li: Improving the performance of hypervisor-based fault tolerance. 1-10
Jorge González-Domínguez, Guillermo L. Taboada, Basilio B. Fraguela, María J. Martín, Juan Touriño: Servet: A benchmark suite for autotuning on multicore clusters. 1-9
Mark Stillwell, Frédéric Vivien, Henri Casanova: Dynamic fractional resource scheduling for HPC workloads. 1-12
Toshio Endo, Akira Nukada, Satoshi Matsuoka, Naoya Maruyama: Linpack evaluation on a supercomputer with heterogeneous accelerators. 1-8
Rezaul Alam Chowdhury, Francesco Silvestri, Brandon Blakeley, Vijaya Ramachandran: Oblivious algorithms for multicores and network of processors. 1-12
Manoj Gupta, Fermín Sánchez, Josep Llosa: A low cost split-issue technique to improve performance of SMT clustered VLIW processors. 1-12
Kamil Kedzierski, Miquel Moretó, Francisco J. Cazorla, Mateo Valero: Adapting cache partitioning algorithms to pseudo-LRU replacement policies. 1-12
Bastian Degener, Barbara Kempkes, Peter Pietrzyk: A local, distributed constant-factor approximation algorithm for the dynamic facility location problem. 1-10
Germán Llort, Juan Gonzalez, Harald Servat, Judit Gimenez, Jesús Labarta: On-line detection of large-scale parallel application's structure. 1-10
Venkatesan T. Chakaravarthy, Vinayaka Pandit, Yogish Sabharwal, Deva P. Seetharam: Varying bandwidth resource allocation problem with bag constraints. 1-10
Simplice Donfack, Laura Grigori, Alok Kumar Gupta: Adapting communication-avoiding LU and QR factorizations to multicore architectures. 1-10
Daniel Delling, Bastian Katz, Thomas Pajor: Parallel computation of best connections in public transportation networks. 1-12
Kunal Agrawal, Anne Benoit, Loic Magnan, Yves Robert: Scheduling algorithms for linear workflow optimization. 1-12
Ali Jannesari, Walter F. Tichy: Identifying ad-hoc synchronization for enhanced race detection. 1-10
Che-Rung Lee, I-Hsin Chung, Zhaojun Bai: Parallelization of DQMC simulation for strongly correlated electron systems. 1-9
Manuel Holtgrewe, Peter Sanders, Christian Schulz: Engineering a scalable high quality graph partitioner. 1-12
John R. Lange, Kevin T. Pedretti, Trammell Hudson, Peter A. Dinda, Zheng Cui, Lei Xia, Patrick G. Bridges, Andy Gocke, Steven Jaconette, Michael Levenhagen, Ron Brightwell: Palacios and Kitten: New high performance operating systems for scalable virtualized and native supercomputing. 1-12
Anne Benoit, Paul Renaud-Goud, Yves Robert: Performance and energy optimization of concurrent pipelined applications. 1-12
Yong Fu, Chenyang Lu, Hongan Wang: Robust control-theoretic thermal balancing for server clusters. 1-11
Guochun Shi, Volodymyr V. Kindratenko, Ivan S. Ufimtsev, Todd J. Martinez: Direct self-consistent field computations on GPU clusters. 1-8
Frédéric de Mesmay, Yevgen Voronenko, Markus Püschel: Offline library adaptation using automatically generated heuristics. 1-10

Wei Tang, Narayan Desai, Daniel Buettner, Zhiling Lan: Analyzing and adjusting user runtime estimates to improve job scheduling on the Blue Gene/P. 1-11
Emmanuel Agullo, Camille Coti, Jack Dongarra, Thomas Hérault, Julien Langou: QR factorization of tall and skinny matrices in a grid computing environment. 1-11
Naoya Maruyama, Akira Nukada, Satoshi Matsuoka: A high-performance fault-tolerant software framework for memory on commodity GPUs. 1-12
Valentin Kravtsov, Pavel Bar, David Carmeli, Assaf Schuster, Martin T. Swain: A scheduling framework for large-scale, parallel, and topology-aware applications. 1-12
Tianming Yang, Hong Jiang, Dan Feng, Zhongying Niu, Ke Zhou, Yaping Wan: DEBAR: A scalable high-performance de-duplication storage system for backup and archiving. 1-12
Jiayuan Meng, Anand Raghunathan, Srimat T. Chakradhar, Surendra Byna: Exploiting the forgiving nature of applications for scalable parallel execution. 1-12
Jiayuan Meng, Jeremy W. Sheaffer, Kevin Skadron: Exploiting inter-thread temporal locality for chip multithreading. 1-12
Polychronis Xekalakis, Nikolas Ioannou, Salman Khan, Marcelo Cintra: Profitability-based power allocation for speculative multithreaded systems. 1-11
Dong Li, Bronis R. de Supinski, Martin Schulz, Kirk W. Cameron, Dimitrios S. Nikolopoulos: Hybrid MPI/OpenMP power-aware computing. 1-12
Dong Li, Dimitrios S. Nikolopoulos, Kirk W. Cameron, Bronis R. de Supinski, Martin Schulz: Power-aware MPI task aggregation prediction for high-end computing systems. 1-12
King Tin Lam, Yang Luo, Cho-Li Wang: Adaptive sampling-based profiling techniques for optimizing the distributed JVM runtime. 1-11
Tekin Bicer, Wei Jiang, Gagan Agrawal: Supporting fault tolerance in a data-intensive computing middleware. 1-12
Albert Hartono, Muthu Manikandan Baskaran, J. Ramanujam, Ponnuswamy Sadayappan: DynTile: Parametric tiled loop generation for parallel execution on multicore processors. 1-12
Christos Kotselidis, Mikel Luján, Mohammad Ansari, Konstantinos Malakasis, Behram Khan, Chris C. Kirkham, Ian Watson: Clustering JVMs with software transactional memory support. 1-12
Risat Mahmud Pathan, Jan Jonsson: Load regulating algorithm for static-priority task scheduling on multiprocessors. 1-12
Zheng Wei, Joseph JáJá: Optimization of linked list prefix computations on multithreaded GPUs using CUDA. 1-8
Deng Pan, Kia Makki, Niki Pissinou: Achieve constant performance guarantees using asynchronous crossbar scheduling without speedup. 1-12
Dong Yuan, Yun Yang, Xiao Liu, Jinjun Chen: A cost-effective strategy for intermediate data storage in scientific cloud workflow systems. 1-12
Fang Zheng, Hasan Abbasi, Ciprian Docan, Jay F. Lofstead, Qing Liu, Scott Klasky, Manish Parashar, Norbert Podhorszki, Karsten Schwan, Matthew Wolf: PreDatA - preparatory data analytics on peta-scale machines. 1-12
Ping Zhou, Yu Du, Youtao Zhang, Jun Yang: Fine-grained QoS scheduling for PCM-based main memory systems. 1-12
Han Zhao, Xinxin Liu, Xiaolin Li: Hypergraph-based task-bundle scheduling towards efficiency and fairness in heterogeneous distributed systems. 1-12
Haichuan Wang, Qiming Teng, Xiao Zhong, Peter F. Sweeney: Using the middle tier to understand cross-tier delay in a multi-tier application. 1-9
Swann Perarnau, Guillaume Huard: KRASH: Reproducible CPU load generation on many-core machines. 1-10
Xiaochun Ye, Dongrui Fan, Wei Lin, Nan Yuan, Paolo Ienne: High performance comparison-based sorting algorithm on many-core GPUs. 1-10
Annette Bieniusa, Thomas Fuhrmann: Consistency in hindsight: A fully decentralized STM algorithm. 1-12
Bilel Hadri, Hatem Ltaief, Emmanuel Agullo, Jack Dongarra: Tile QR factorization with parallel panel processing for multicore architectures. 1-10
Dongyuan Zhan, Hong Jiang, Sharad C. Seth: Exploiting set-level non-uniformity of capacity demand to enhance CMP cooperative caching. 1-10
François Broquedis, Olivier Aumage, Brice Goglin, Samuel Thibault, Pierre-André Wacrenier, Raymond Namyst: Structuring the execution of OpenMP applications for multicore architectures. 1-10
Giorgos Georgiadis, Marina Papatriantafilou: Overlays with preferences: Approximation algorithms for matching with preference lists. 1-10
Justin Luitjens, Martin Berzins: Improving the performance of Uintah: A large-scale adaptive meshing computational framework. 1-10
Seetharami Seelam, I-Hsin Chung, John Bauer, Hui-Fang Wen: Masking I/O latency using application level I/O caching and prefetching on Blue Gene systems. 1-12
Bogdan Nicolae, Diana Moise, Gabriel Antoniu, Luc Bougé, Matthieu Dorier: BlobSeer: Bringing high throughput under heavy concurrency to Hadoop Map-Reduce applications. 1-11
Costin Iancu, Steven A. Hofmeyr, Filip Blagojevic, Yili Zheng: Oversubscription on multicore processors. 1-11
Bradley J. Barnes, Jeonifer Garren, David K. Lowenthal, Jaxk Reeves, Bronis R. de Supinski, Martin Schulz, Barry Rountree: Using focused regression for accurate time-constrained scaling of scientific applications. 1-12
Dorian C. Arnold, Barton P. Miller: Scalable failure recovery for high-performance data aggregation. 1-11
Yongpeng Zhang, Frank Mueller, Xiaohui Cui, Thomas E. Potok: Large-scale multi-dimensional document clustering on GPU clusters. 1-10
Arash Deshmeh, Jacob Machina, Angela C. Sodan: ADEPT scalability predictor in support of adaptive resource allocation. 1-12
Devesh Tiwari, Sanghoon Lee, James Tuck, Yan Solihin: MMT: Exploiting fine-grained parallelism in dynamic memory management. 1-12
Yi Guo, Jisheng Zhao, Vincent Cavé, Vivek Sarkar: SLAW: A scalable locality-aware adaptive work-stealing scheduler. 1-12
Zhe Wang, Sanjay Ranka: A simple thermal model for multi-core processors and its application to slack allocation. 1-11
Giridhar Sreenivasa Murthy, Mahesh Ravishankar, Muthu Manikandan Baskaran, Ponnuswamy Sadayappan: Optimal loop unrolling for GPGPU programs. 1-11
Andrew Uselton, Mark Howison, Nicholas J. Wright, David Skinner, Noel Keen, John Shalf, Karen L. Karavanic, Leonid Oliker: Parallel I/O performance: From events to ensembles. 1-11
Shoaib Kamil, Cy Chan, Leonid Oliker, John Shalf, Samuel Williams: An auto-tuning framework for parallel multicore stencil computations. 1-12
Louis-Claude Canon, Emmanuel Jeannot, Jon B. Weissman: A dynamic approach for characterizing collusion in desktop grids. 1-12
Ernst Gunnar Gran, Magne Eimot, Sven-Arne Reinemo, Tor Skeie, Olav Lysne, Lars Paul Huse, Gilad Shainer: First experiences with congestion control in InfiniBand hardware. 1-12
Jaehwan Lee, Peter J. Keleher, Alan Sussman: Decentralized resource management for multi-core desktop grids. 1-11
Hormozd Gahvari, William Gropp: An introductory exascale feasibility study for FFTs and multigrid. 1-9
Jie Li, Marty Humphrey, Deborah A. Agarwal, Keith R. Jackson, Catharine van Ingen, Youngryel Ryu: eScience in the cloud: A MODIS satellite data reprojection and reduction pipeline in the Windows Azure platform. 1-10
Aparna Chandramowlishwaran, Samuel Williams, Leonid Oliker, Ilya Lashuk, George Biros, Richard W. Vuduc: Optimizing and tuning the fast multipole method for state-of-the-art multicore architectures. 1-12
Sai Prashanth Muralidhara, Mahmut T. Kandemir, Padma Raghavan: Intra-application cache partitioning. 1-12
Long Chen, Oreste Villa, Sriram Krishnamoorthy, Guang R. Gao: Dynamic load balancing on single- and multi-GPU systems. 1-12
Jun Shirako, Vivek Sarkar: Hierarchical phasers for scalable synchronization and reductions in dynamic parallelism. 1-12
Konrad Malkowski, Padma Raghavan, Mahmut T. Kandemir: Analyzing the soft error resilience of linear solvers on multicore multiprocessors. 1-12
Tarun Bansal, Neeraj Mittal: A scalable algorithm for maintaining perpetual system connectivity in dynamic distributed systems. 1-12
Antonio Fernández Anta, Chryssis Georgiou, Miguel A. Mosteiro: Algorithmic mechanisms for internet-based master-worker computing with untrusted and selfish workers. 1-11
João Nuno Silva, Paulo Ferreira, Luís Veiga: Service and resource discovery in cycle-sharing environments with a utility algebra. 1-11
Sameer Kumar, Philip Heidelberger, Dong Chen, Michael Hines: Optimization of applications with non-blocking neighborhood collectives via multisends on the Blue Gene/P supercomputer. 1-11
Stefan Rührup, Ivan Stojmenovic: Contention-based georouting with guaranteed delivery, minimal communication overhead, and shorter paths in wireless sensor networks. 1-9
Guojin He, Antonia Zhai: Improving the performance of program monitors with compiler support in multi-core environment. 1-12

Aparna Chandramowlishwaran, Kathleen Knobe, Richard W. Vuduc: Performance evaluation of concurrent collections on high-performance multicore computing systems. 1-12
Zhengyu He, Bo Hong: Dynamically tuned push-relabel algorithm for the maximum flow problem on CPU-GPU-Hybrid platforms. 1-10
Lifan Xu, Michela Taufer, Stuart Collins, Dionisios G. Vlachos: Parallelization of tau-leap coarse-grained Monte Carlo simulations on GPUs. 1-9
Robert Hood, Haoqiang Jin, Piyush Mehrotra, Johnny Chang, M. Jahed Djomehri, Sharad Gavali, Dennis C. Jespersen, Kenichi Taylor, Rupak Biswas: Performance impact of resource contention in multicore systems. 1-12
Shuangshuang Jin, Zhenyu Huang, Yousu Chen, Daniel G. Chavarría-Miranda, John Feo, Pak Chung Wong: A novel application of parallel betweenness centrality to power grid contingency analysis. 1-7
Benjamin G. Jackson, Matthew Regennitter, Xiao Yang, Patrick S. Schnable, Srinivas Aluru: Parallel de novo assembly of large genomes from high-throughput short reads. 1-10
Seetharami Seelam, Liana L. Fong, Asser N. Tantawi, John Lewars, John Divirgilio, Kevin Gildea: Extreme scale computing: Modeling the impact of system noise in multicore clustered systems. 1-12
Kaiqi Xiong: Power-aware resource provisioning in cluster computing. 1-11
Yi-Hua E. Yang, Viktor K. Prasanna, Chenqian Jiang: Head-body partitioned string matching for Deep Packet Inspection with scalable and attack-resilient performance. 1-11
Bo Zhang, Binoy Ravindran: Dynamic analysis of the relay cache-coherence protocol for distributed transactional memory. 1-11
Everett H. Phillips, Massimiliano Fatica: Implementing the Himeno benchmark with CUDA on GPU clusters. 1-10
Dario Bruneo, Salvatore Distefano, Francesco Longo, Marco Scarpa: QoS assessment of WS-BPEL processes through non-Markovian stochastic Petri nets. 1-12
Arnab Sinha, Sharad Malik: Runtime checking of serializability in software transactional memory. 1-12
Xue Wang, Fasheng Qiu, Sushil K. Prasad, Guantao Chen: Efficient parallel algorithms for maximum-density segment problem. 1-9
Burton Smith: Operating system resource management. 1
Kunle Olukotun: Chip multiprocessor architecture: A programmability-driven approach. 1
Peter Sanders: Algorithm engineering for scalable parallel external sorting. 1-3
David A. Bader: Message from general chair. 1-2
Cynthia A. Phillips: Message from the program chair. 1-2
Viktor K. Prasanna: Message from steering co-chairs. 1
Richard W. Vuduc: Unconventional wisdom in multicore computing. 1
Milind A. Bhandarkar: MapReduce programming with Apache Hadoop. 1
Michael Garland: Parallel computing with CUDA. 1