ICS 2010:
Tsukuba,
Ibaraki,
Japan
Taisuke Boku, Hiroshi Nakashima, Avi Mendelson (Eds.):
Proceedings of the 24th International Conference on Supercomputing, 2010, Tsukuba, Ibaraki, Japan, June 2-4, 2010.
ACM 2010, ISBN 978-1-4503-0018-6
Keynotes
- Stephen S. Pawlowski:
Exascale science: the next frontier in high performance computing.
1
- William J. Dally:
Throughput computing.
2
- Kimihiko Hirao:
The next-generation supercomputer project and a plan for the advanced institute for computational science.
3
MPI
- Vladimir Marjanovic, Jesús Labarta, Eduard Ayguadé, Mateo Valero:
Overlapping communication and computation by using a hybrid MPI/SMPSs approach.
5-16
- Sreeram Potluri, Ping Lai, Karen A. Tomko, Sayantan Sur, Yifeng Cui, Mahidhar Tatineni, Karl W. Schulz, William L. Barth, Amitava Majumdar, Dhabaleswar K. Panda:
Quantifying performance benefits of overlap using MPI-2 in a seismic modeling application.
17-25
- Nikhil Jain, Yogish Sabharwal:
Optimal bucket algorithms for large MPI collectives on torus interconnects.
27-36
Cache and transaction memory
Applications (1)
- Atabak Mahram, Martin C. Herbordt:
Fast and accurate NCBI BLASTP: acceleration with multiphase FPGA-based prefiltering.
73-82
- Narges Bani Asadi, Christopher W. Fletcher, Greg Gibeling, John Wawrzynek, Wing H. Wong, Garry P. Nolan:
ParaLearn: a massively parallel, scalable system for learning interaction networks on FPGAs.
83-94
- Michael D. Linderman, Robert Bruggner, Vivek Athalye, Teresa H. Y. Meng, Narges Bani Asadi, Garry P. Nolan:
High-throughput Bayesian network learning using heterogeneous multicore computers.
95-104
- Chi Ching Chi, Ben H. H. Juurlink, Cor Meenderinck:
Evaluation of parallel H.264 decoding strategies for the Cell Broadband Engine.
105-114
GPGPU and accelerators (1)
Architecture
- Ramon Bertran, Marc González, Xavier Martorell, Nacho Navarro, Eduard Ayguadé:
Decomposable and responsive power models for multicore processors using performance counters.
147-158
- Lixin Zhang, Evan Speight, Ramakrishnan Rajamony, Jiang Lin:
Enigma: architectural and operating system support for reducing the impact of address translation.
159-168
- Huaiyu Zhu, Yong Chen, Xian-He Sun:
Timing local streams: improving timeliness in data prefetching.
169-178
- Chunyang Gou, Georgi Kuzmanov, Georgi Gaydadjiev:
SAMS multi-layout memory: providing multiple views of data to boost SIMD performance.
179-188
System and IO issues
Applications (2)
- Keith R. Bisset, Jiangzhuo Chen, Xizhou Feng, Yifei Ma, Madhav V. Marathe:
Indemics: an interactive data intensive framework for high performance epidemic simulation.
233-242
- Todd Gamblin, Bronis R. de Supinski, Martin Schulz, Robert J. Fowler, Daniel A. Reed:
Clustering performance data efficiently at massive scales.
243-252
- Jaewook Shin, Mary W. Hall, Jacqueline Chame, Chun Chen, Paul F. Fischer, Paul D. Hovland:
Speeding up Nek5000 with autotuning and specialization.
253-262
Compilers
GPGPU and accelerators (2)
- Liang Gu, Xiaoming Li, Jakob Siegel:
An empirically tuned 2D and 3D FFT library on CUDA GPU.
305-314
- Yifeng Chen, Xiang Cui, Hong Mei:
Large-scale FFT on GPU clusters.
315-324
- Yong Dou, Yuanwu Lei, Guiming Wu, Song Guo, Jie Zhou, Li Shen:
FPGA accelerating double/quad-double high precision floating-point applications for ExaScale computing.
325-336
- Jamin Naghmouchi, Daniele Paolo Scarpazza, Mladen Berekovic:
Small-ruleset regular expression matching on GPGPUs: quantitative performance analysis and optimization.
337-348
Last update Fri May 25 08:21:13 2012
CET by the DBLP Team —
Data released under the ODC-BY 1.0 license — See also our legal information page