


default search action
14th ICS 2000: Santa Fe, New Mexico, USA
- John Reynders, Alexander V. Veidenbaum:

Proceedings of the 14th international conference on Supercomputing, ICS 2000, Santa Fe, NM, USA, May 8-11, 2000. ACM 2000, ISBN 1-58113-270-0
Java Compilation and Performance
- Pedro V. Artigas, Manish Gupta, Samuel P. Midkiff

, José E. Moreira:
Automatic loop transformations and parallelization for Java. 1-10 - Renato Ferreira, Gagan Agrawal, Joel H. Saltz:

Compiling object-oriented data intensive applications. 11-21 - Tao Li, Lizy Kurian John, Narayanan Vijaykrishnan, Anand Sivasubramaniam, Jyotsna Sabarinathan, Anupama Murthy:

Using complete system simulation to characterize SPECjvm98 benchmarks. 22-33
Interconnection Networks/Network Processors
- José Flich

, Manuel P. Malumbres
, Pedro López, José Duato:
Performance evaluation of a new routing strategy for irregular networks with source routing. 34-43 - Valentin Puente, Cruz Izu, José A. Gregorio

, Ramón Beivide, J. M. Prellezo, Fernando Vallejo:
Improving parallel system performance by changing the arrangement of the network links. 44-53 - Patrick Crowley, Marc E. Fiuczynski, Jean-Loup Baer, Brian N. Bershad:

Characterizing processor architectures for programmable network interfaces. 54-65
Sparse Compilation Techniques
- Hao Yu, Lawrence Rauchwerger:

Adaptive reduction parallelization techniques. 66-77 - Eladio Gutiérrez, Oscar G. Plata, Emilio L. Zapata:

A compiler method for the parallel execution of irregular reductions in scalable shared memory multiprocessors. 78-87 - Nikolay Mateev, Keshav Pingali, Paul Stodghill, Vladimir Kotlyar:

Next-generation generic programming and its application to sparse matrix computations. 88-99
MP Scheduling, Load Balancing, Memmory Management
- Yanyong Zhang, Anand Sivasubramaniam, José E. Moreira, Hubertus Franke:

A simulation-based study of scheduling mechanisms for a dynamic cluster environment. 100-109 - Karen D. Devine, Bruce Hendrickson, Erik G. Boman, Matthew St. John, Courtenay T. Vaughan:

Design of dynamic load-balancing tools for parallel applications. 110-118 - Dimitrios S. Nikolopoulos, Theodore S. Papatheodorou, Constantine D. Polychronopoulos, Jesús Labarta, Eduard Ayguadé:

A case for use-level dynamic page migration. 119-130
Compilation I
- Ken Kennedy:

Fast greedy weighted fusion. 131-140 - Nawaaz Ahmed, Nikolay Mateev, Keshav Pingali:

Synthesizing transformations for locality enhancement of imperfectly-nested loop nests. 141-152 - Vivek Sarkar:

Optimized unrolling of nested loops. 153-166
Memory Hierarchy
- Chengqiang Zhang, Sally A. McKee:

Hardware-only stream prefetching and dynamic access ordering. 167-175 - Chia-Lin Yang, Alvin R. Lebeck:

Push vs. pull: data movement for linked data structures. 176-186 - Cheol Ho Park, JaeWoong Chung, Byeong Hag Seong, Yangwoo Roh, Daeyeon Park:

Boosting superpage utilization with the shadow memory and the partial-subblock TLB. 187-195
Micro-Architecture
- Toshinori Sato, Itsujiro Arita:

Table size reduction for data value predictors by exploiting narrow width values. 196-205 - Srinivas Mantripragada, Alexandru Nicolau:

Using profiling to reduce branch misprediction costs on a dynamically scheduled processor. 206-214
Applications
- Dragan Mirkovic, Rishad Mahasoom, S. Lennart Johnsson:

An adaptive software library for fast Fourier transforms. 215-224 - Yun He, Chris H. Q. Ding:

Using accurate arithmetics to improve numerical reproducibility and stability in parallel applications. 225-234
Performance Evaluation and Modeling
- Patrick H. Worley:

Performance evaluation of the IBM SP and the Compaq AlphaServer SC. 235-244 - Jeffrey S. Vetter:

Performance analysis of distributed applications using automatic classification of communication inefficiencies. 245-254 - Mark M. Mathis, Nancy M. Amato, Marvin L. Adams:

A general performance model for parallel sweeps on orthogonal grids for particle transport calculations. 255-263
MP Potpouri
- Marius Pirvu, Laxmi N. Bhuyan:

Hardware spatial forwarding for widely shared data. 264-273 - Xiaohui Shen, Wei-keng Liao, Alok N. Choudhary, Gokhan Memik, Mahmut T. Kandemir, Sachin More, George K. Thiruvathukal, Arti Singh:

A novel application development environment for large-scale scientific computations. 274-283 - Junpei Niwa, Takashi Matsumoto, Kei Hiraki:

Comparative study of page-based and segment-based software DSM through compiler optimization. 284-295
Compilation II
- Suhyun Kim, Soo-Mook Moon, Jinpyo Park, Kemal Ebcioglu:

Unroll-based register coalescing. 296-305 - Gary M. Zoppetti, Gagan Agrawal, Lori L. Pollock, José Nelson Amaral, Xinan Tang, Guang R. Gao:

Automatic compiler techniques for thread coarsening for multithreaded architectures. 306-315 - Somnath Ghosh, Margaret Martonosi, Sharad Malik

:
Automated cache optimizations using CME driven diagnosis. 316-326
Instruction-Level Parallelism
- Ramon Canal, Antonio González:

A low-complexity issue logic. 327-335 - Michael Gschwind, Kemal Ebcioglu, Erik R. Altman, Sumedh W. Sathaye:

Binary translation and architecture convergence issues for IBM system/390. 336-347

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














