


31st Euro-Par 2025: Dresden, Germany - Part III
- Wolfgang E. Nagel, Diana Goehringer, Pedro C. Diniz:
Euro-Par 2025: Parallel Processing - 31st European Conference on Parallel and Distributed Processing, Dresden, Germany, August 25-29, 2025, Proceedings, Part III. Lecture Notes in Computer Science 15902, Springer 2026, ISBN 978-3-031-99871-3
Theory and Algorithms
- Jeffrey Spaan, Kuan-Hsun Chen, David A. Bader, Ana Lucia Varbanescu:
Wedge-Parallel Triangle Counting for GPUs. 3-17
- Spyros Angelopoulos, Loris Marchal, Adrien Obrecht, Bertrand Simon:
Cache Management for Mixture-of-Experts LLMs. 18-32
- Chryssis Georgiou, Manaswini Piduguralla, Sathya Peri:
Byzantine-Tolerant Consensus in GPU-Inspired Shared Memory. 33-47
- John Augustine, Christian Scheideler, Julian Werthmann:
Supervised Distributed Computing. 48-62
- Atte Torri, Przemyslaw Dominikowski, Brice Pointal, Oguz Kaya, Laércio Lima Pilla, Olivier Coulaud:
Near-Optimal Contraction Strategies for the Scalar Product in the Tensor-Train Format. 63-77
- Anne Benoit, Thomas Hérault, Yves Robert, Alix Tremodeux:
Partial Detectors Versus Replication to Cope with Silent Errors. 78-92
- Thomas Koopman, Sven-Bodo Scholz, Bernard van Gastel:
Partitioning In-Place on Massively Parallel Architectures. 93-106
Multidisciplinary, Domain-Specific and Applied Parallel and Distributed Computing
- Jiale Zhang, Xilong Che, Yuzhe Fan, Juncheng Hu:
Quantum Delta Encoding: Optimizing Data Storage on Quantum Computers with Resource Efficiency. 109-123
- Florian Willich, Henning Meyerhenke:
ScaleRunner: A Fast MPI-Based Random Walk Engine for Multi-CPU Systems. 124-138
- Abhijeet Sahu, Andaluri S. P. V. M. Aditya, G. Ramakrishna, Malleti Sai Nikhil, Kishore Kothapalli, Dip Sankar Banerjee:
External GPU Biconnected Components. 139-153
- Massimiliano Meneghin, Ahmed H. Mahmoud:
Disaggregated Design for GPU-Based Volumetric Data Structures. 154-168
- Xinrui Yang, Shaohuai Shi:
SQ-DeAR: Sparsified and Quantized Gradient Compression for Distributed Training. 169-182
- Lifeng Yan, Zekun Yin, Qixin Chang, Tong Zhang, Zhisong Wang, Xiaohui Duan, Bertil Schmidt, Weiguo Liu:
SWBWA: A Highly Efficient NGS Aligner on the New Sunway Architecture. 183-196
- Yi-Hua Chung, Shui Jiang, Wan-Luan Lee, Yanqing Zhang, Haoxing Ren, Tsung-Yi Ho, Tsung-Wei Huang:
SimPart: A Simple Yet Effective Replication-Aided Partitioning Algorithm for Logic Simulation on GPU. 197-210
- Soumyajit Chatterjee, Rahul Utkoor, Uppu Eshwar, Sathya Peri, V. Krishna Nandivada:
Efficient Task Graph Scheduling for Parallel QR Factorization in SLSQP. 211-224
- John W. Romein:
Breaking the I/O Barrier: 1.2 Tb/s Ethernet Packet Processing on a GPU. 225-238
- Tianyu Wan, Shijia Gong, Yangyang Hu, Jianxi Chen:
GECKO: A Write-Optimized Adaptive Radix Tree for Disaggregated Memory. 239-253
- Jhonatan Cléto, Guilherme Valarini, Márcio Machado Pereira, Guido Araujo, Hervé Yviquel:
Scalable OpenMP Remote Offloading via Asynchronous MPI and Coroutine-Driven Communication. 254-267
- Apurv Deepak Kulkarni, Siavash Ghiasvand:
SProBench: Stream Processing Benchmark for High Performance Computing Infrastructure. 268-282
- Yisu Wang, Xinjiao Li, Ruilong Wu, Huangxun Chen, Dirk Kutscher:
NetSenseML: Network-Adaptive Compression for Efficient Distributed Machine Learning. 283-297
- Marie Reinbigler, Rishi Sharma, Rafael Pires, Elisabeth Brunet, Anne-Marie Kermarrec, Catalin I. Fetita:
Efficient Pyramidal Analysis of Gigapixel Images on a Decentralized Modest Computer Cluster. 298-312
- Samuel Wiggins, Nikunj Gupta, Grace Zgheib, Mahesh A. Iyer, Viktor K. Prasanna:
Accelerating Independent Multi-Agent Reinforcement Learning on Multi-GPU Platforms. 313-326
- Wenxiang Lin, Xinglin Pan, Shaohuai Shi, Xuan Wang, Xiaowen Chu:
ScheInfer: Efficient Inference of Large Language Models with Task Scheduling on Moderate GPUs. 327-340
- Chao Wang, Junshi Chen, Longsheng Song, Haijie Hou, Dongdong Tan, Yueqiang He, Wentiao Wu, Sihan Lu, Hong An:
Uniform Dense Blocking for Efficient Sparse LU Factorization in First-Principles Materials Simulation. 341-354















