Pedro Valero-Lara
Journal Articles
- 2024
- [j18] William F. Godoy, Pedro Valero-Lara, Keita Teranishi, Prasanna Balaprakash, Jeffrey S. Vetter:
Large language model evaluation for high-performance computing software development. Concurr. Comput. Pract. Exp. 36(26) (2024)
- 2022
- [j17] Marc Jordà, Pedro Valero-Lara, Antonio J. Peña:
cuConv: CUDA implementation of convolution for CNN inference. Clust. Comput. 25(2): 1459-1473 (2022)
- [j16] John Gounley, Madhurima Vardhan, Erik W. Draeger, Pedro Valero-Lara, Shirley V. Moore, Amanda Randles:
Propagation Pattern for Moment Representation of the Lattice Boltzmann Method. IEEE Trans. Parallel Distributed Syst. 33(3): 642-653 (2022)
- 2020
- [j15] Pedro Valero-Lara, Sandra Catalán, Xavier Martorell, Tetsuzo Usui, Jesús Labarta:
sLASs: A fully automatic auto-tuned linear algebra library based on OpenMP extensions implemented in OmpSs (LASs Library). J. Parallel Distributed Comput. 138: 153-171 (2020)
- 2019
- [j14] Pedro Valero-Lara, Diego Andrade, Raül Sirvent, Jesús Labarta, Basilio B. Fraguela, Ramon Doallo:
A Fast Solver for Large Tridiagonal Systems on Multi-Core Processors (Lass Library). IEEE Access 7: 23365-23378 (2019)
- [j13] Marc Jordà, Pedro Valero-Lara, Antonio J. Peña:
Performance Evaluation of cuDNN Convolution Algorithms on NVIDIA Volta GPUs. IEEE Access 7: 70461-70473 (2019)
- [j12] Pedro Valero-Lara, Raül Sirvent, Antonio J. Peña, Jesús Labarta:
MPI+OpenMP tasking scalability for multi-morphology simulations of the human brain. Parallel Comput. 84: 50-61 (2019)
- 2018
- [j11] Pedro Valero-Lara, Ivan Martínez-Pérez, Raül Sirvent, Xavier Martorell, Antonio J. Peña:
cuThomasBatch and cuThomasVBatch, CUDA Routines to compute batch of tridiagonal systems on NVIDIA GPUs. Concurr. Comput. Pract. Exp. 30(24) (2018)
- 2017
- [j10] Pedro Valero-Lara, Johan Jansson:
Heterogeneous CPU+GPU approaches for mesh refinement over Lattice-Boltzmann simulations. Concurr. Comput. Pract. Exp. 29(7) (2017)
- [j9] Pedro Valero-Lara:
Reducing memory requirements for large size LBM simulations on GPUs. Concurr. Comput. Pract. Exp. 29(24) (2017)
- [j8] Pedro Valero-Lara, Ezhilmathi Krishnasamy, Johan Jansson:
Towards HPC-Embedded. Case Study: Kalray and Message-Passing on NoC. Scalable Comput. Pract. Exp. 18(2): 151-160 (2017)
- [j7] Pedro Valero-Lara, Mawussi Zounon, Maksims Abalenkovs, Fernando L. Pelayo:
Introduction to the Special Issue on High Performance Computing Solutions for Complex Problems. Scalable Comput. Pract. Exp. 18(2): iii-iv (2017)
- 2016
- [j6] Pedro Valero-Lara, Poornima Nookala, Fernando L. Pelayo, Johan Jansson, Serapheim Dimitropoulos, Ioan Raicu:
Many-Task Computing on Many-Core Architectures. Scalable Comput. Pract. Exp. 17(1): 32-46 (2016)
- [j5] Pedro Valero-Lara, Fernando L. Pelayo, Johan Jansson:
Introduction to the Special Issue on High Performance Computing Solutions for Complex Problems. Scalable Comput. Pract. Exp. 17(1): iii-iv (2016)
- 2015
- [j4] Pedro Valero-Lara, Francisco D. Igual, Manuel Prieto-Matías, Alfredo Pinelli, Julien Favier:
Accelerating fluid-solid simulations (Lattice-Boltzmann & Immersed-Boundary) on heterogeneous architectures. J. Comput. Sci. 10: 249-261 (2015)
- 2014
- [j3] Pedro Valero-Lara, Alfredo Pinelli, Manuel Prieto-Matías:
Fast finite difference Poisson solvers on heterogeneous architectures. Comput. Phys. Commun. 185(4): 1265-1272 (2014)
- [j2] Pedro Valero-Lara:
Accelerating solid-fluid interaction based on the immersed boundary method on multicore and GPU architectures. J. Supercomput. 70(2): 799-815 (2014)
- 2011
- [j1] Pedro Valero-Lara, José L. Sánchez, Diego Cazorla, Enrique Arias:
A GPU-based implementation of the MRF algorithm in ITK package. J. Supercomput. 58(3): 403-410 (2011)
Conference and Workshop Papers
- 2024
- [c48] Pedro Valero-Lara, Seyong Lee, Joel E. Denny, Keita Teranishi, Jeffrey S. Vetter, Marc González Tallada:
sKokkos: Enabling Kokkos with Transparent Device Selection on Heterogeneous Systems using OpenACC. HPC Asia 2024: 23-34
- [c47] Marc González Tallada, Joel E. Denny, Pedro Valero-Lara, Seyong Lee, Keita Teranishi, Jeffrey S. Vetter:
eCC++ : A Compiler Construction Framework for Embedded Domain-Specific Languages. IPDPS (Workshops) 2024: 667-677
- [c46] Pedro Valero-Lara:
9th IEEE International Workshop on Automatic Performance Tuning (iWAPT 2024). IPDPS (Workshops) 2024: 738-739
- [c45] Narasinga Rao Miniskar, Seyong Lee, Beau Johnston, Aaron R. Young, Mohammad Alaul Haque Monil, Pedro Valero-Lara, Jeffrey S. Vetter:
IRIS Reimagined: Advancements in Intelligent Runtime System for Task-Based Programming. WAMTA 2024: 46-58
- [c44] Mohammad Alaul Haque Monil, Narasinga Rao Miniskar, Pedro Valero-Lara, Keita Teranishi, Jeffrey S. Vetter:
MatRIS: Addressing the Challenges for Portability and Heterogeneity Using Tasking for Matrix Decomposition (Cholesky). WAMTA 2024: 59-70
- 2023
- [c43] Narasinga Rao Miniskar, Mohammad Alaul Haque Monil, Pedro Valero-Lara, Frank Y. Liu, Jeffrey S. Vetter:
IRIS-DMEM: Efficient Memory Management for Heterogeneous Computing. HPEC 2023: 1-7
- [c42] William F. Godoy, Pedro Valero-Lara, Keita Teranishi, Prasanna Balaprakash, Jeffrey S. Vetter:
Evaluation of OpenAI Codex for HPC Parallel Programming Models Kernel Generation. ICPP Workshops 2023: 136-144
- [c41] William F. Godoy, Pedro Valero-Lara, T. Elise Dettling, Christian Trefftz, Ian Jorquera, Thomas Sheehy, Ross G. Miller, Marc González Tallada, Jeffrey S. Vetter, Valentin Churavy:
Evaluating performance and portability of high-level programming models: Julia, Python/Numba, and Kokkos on exascale nodes. IPDPS Workshops 2023: 373-382
- [c40] Pedro Valero-Lara:
(AsHES) 2023 Keynote Speaker Agnostic Programing: "Less is More". IPDPS Workshops 2023: 563
- [c39] Pedro Valero-Lara, Jeffrey S. Vetter:
A MultiGPU Performance-Portable Solution for Array Programming Based on Kokkos. ARRAY@PLDI 2023: 1-12
- [c38] Narasinga Rao Miniskar, Mohammad Alaul Haque Monil, Pedro Valero-Lara, Frank Liu, Jeffrey S. Vetter:
Tiling Framework for Heterogeneous Computing of Matrix based Tiled Algorithms. ExHET@PPoPP 2023: 1:1-1:6
- [c37] Pedro Valero-Lara, Ian Jorquera, Frank Liu, Jeffrey S. Vetter:
Mixed-Precision S/DGEMM Using the TF32 and TF64 Frameworks on Low-Precision AI Tensor Cores. SC Workshops 2023: 177-186
- [c36] Mohammad Alaul Haque Monil, Narasinga Rao Miniskar, Keita Teranishi, Jeffrey S. Vetter, Pedro Valero-Lara:
MatRIS: Multi-level Math Library Abstraction for Heterogeneity and Performance Portability using IRIS Runtime. SC Workshops 2023: 1081-1092
- [c35] Pedro Valero-Lara, Jeffrey S. Vetter, John Gounley, Amanda Randles:
Moment Representation of Regularized Lattice Boltzmann Methods on NVIDIA and AMD GPUs. SC Workshops 2023: 1697-1704
- [c34] William F. Godoy, Pedro Valero-Lara, Caira Anderson, Katrina W. Lee, Ana Gainaru, Rafael Ferreira da Silva, Jeffrey S. Vetter:
Julia as a unifying end-to-end workflow language on the Frontier exascale system. SC Workshops 2023: 1989-1999
- 2022
- [c33] Pedro Valero-Lara, Jungwon Kim, Jeffrey S. Vetter:
A Portable and Heterogeneous LU Factorization on IRIS. Euro-Par Workshops 2022: 17-31
- [c32] Narasinga Rao Miniskar, Mohammad Alaul Haque Monil, Pedro Valero-Lara, Frank Liu, Jeffrey S. Vetter:
IRIS-BLAS: Towards a Performance Portable and Heterogeneous BLAS Library. HIPC 2022: 256-261
- [c31] Mohammad Alaul Haque Monil, Narasinga Rao Miniskar, Frank Y. Liu, Jeffrey S. Vetter, Pedro Valero-Lara:
LaRIS: Targeting Portability and Productivity for LAPACK Codes on Extreme Heterogeneous Systems by Using IRIS. RSDHA@SC 2022: 12-21
- [c30] Pedro Valero-Lara, Cameron Greenwalt, Jeffrey S. Vetter:
SparseLU, A Novel Algorithm and Math Library for Sparse LU Factorization. IA3@SC 2022: 25-31
- [c29] Pedro Valero-Lara, Seyong Lee, Marc González Tallada, Joel E. Denny, Jeffrey S. Vetter:
KokkACC: Enhancing Kokkos with OpenACC. WACCPD@SC 2022: 32-42
- 2021
- [c28] Pedro Valero-Lara, Jungwon Kim, Oscar R. Hernandez, Jeffrey S. Vetter:
OpenMP Target Task: Tasking and Target Offloading on Heterogeneous Systems. Euro-Par Workshops 2021: 445-455
- [c27] Leonel Toledo, Pedro Valero-Lara, Jeffrey S. Vetter, Antonio J. Peña:
Static Graphs for Coding Productivity in OpenACC. HiPC 2021: 364-369
- 2020
- [c26] Sandra Catalán, Tetsuzo Usui, Leonel Toledo, Xavier Martorell, Jesús Labarta, Pedro Valero-Lara:
Towards an Auto-Tuned and Task-Based SpMV (LASs Library). IWOMP 2020: 115-129
- 2019
- [c25] Sandra Catalán, Xavier Martorell, Jesús Labarta, Tetsuzo Usui, Leonel Antonio Toledo Díaz, Pedro Valero-Lara:
Accelerating Conjugate Gradient using OmpSs. PDCAT 2019: 121-126
- [c24] Leonel Toledo, Antonio J. Peña, Sandra Catalán, Pedro Valero-Lara:
Tasking in Accelerators: Performance Evaluation. PDCAT 2019: 127-132
- [c23] Pedro Valero-Lara, Sandra Catalán, Xavier Martorell, Jesús Labarta:
BLAS-3 Optimized by OmpSs Regions (LASs Library). PDP 2019: 25-32
- 2018
- [c22] Pedro Valero-Lara, Ivan Martínez-Pérez, Sergi Mateo, Raül Sirvent, Vicenç Beltran, Xavier Martorell, Jesús Labarta:
Variable Batched DGEMM. PDP 2018: 363-367
- [c21] Pedro Valero-Lara, Raül Sirvent, Antonio J. Peña, Xavier Martorell, Jesús Labarta:
MPI+OpenMP Tasking Scalability for the Simulation of the Human Brain: Human Brain Project. EuroMPI 2018: 5:1-5:8
- 2017
- [c20] Jack J. Dongarra, Sven Hammarling, Nicholas J. Higham, Samuel D. Relton, Pedro Valero-Lara, Mawussi Zounon:
The Design and Performance of Batched BLAS on Modern High-Performance Computing Systems. ICCS 2017: 495-504
- [c19] Pedro Valero-Lara, Ivan Martínez-Pérez, Antonio J. Peña, Xavier Martorell, Raül Sirvent, Jesús Labarta:
cuHinesBatch: Solving Multiple Hines systems on GPUs Human Brain Project*. ICCS 2017: 566-575
- [c18] Fernando López Pelayo, Fernando Cuartero Gómez, Diego Cazorla, Pedro Valero-Lara, Mercedes G. Merayo:
Heuristics for ROSA's LTS Searching. IWANN (2) 2017: 427-437
- [c17] Pedro Valero-Lara, Ivan Martínez-Pérez, Raül Sirvent, Xavier Martorell, Antonio J. Peña:
NVIDIA GPUs Scalability to Solve Multiple (Batch) Tridiagonal Systems Implementation of cuThomasBatch. PPAM (1) 2017: 243-253
- 2016
- [c16] Pedro Valero-Lara:
Leveraging the Performance of LBM-HPC for Large Sizes on GPUs Using Ghost Cells. ICA3PP 2016: 417-430
- 2015
- [c15] Pedro Valero-Lara, Johan Jansson:
LBM-HPC - An Open-Source Tool for Fluid Simulations. Case Study: Unified Parallel C (UPC-PGAS). CLUSTER 2015: 318-321
- [c14] Pedro Valero-Lara, Johan Jansson:
Multi-domain Grid Refinement for Lattice-Boltzmann Simulations on Heterogeneous Platforms. CSE 2015: 1-8
- [c13] Pedro Valero-Lara, Johan Jansson:
A Non-uniform Staggered Cartesian Grid Approach for Lattice-boltzmann Method. ICCS 2015: 296-305
- 2014
- [c12] Pedro Valero-Lara:
Multi-GPU acceleration of DARTEL (early detection of Alzheimer). CLUSTER 2014: 346-354
- [c11] Pedro Valero-Lara, Alfredo Pinelli, Manuel Prieto-Matías:
Accelerating Solid-fluid Interaction using Lattice-boltzmann and Immersed Boundary Coupled Simulations on Heterogeneous Platforms. ICCS 2014: 50-61
- [c10] Pedro Valero-Lara:
hLCS. A Hybrid GPGPU Approach for Solving Multiple Short and Unbalanced LCS Problems. ICCSA (6) 2014: 102-115
- 2013
- [c9] Pedro Valero-Lara, Fernando L. Pelayo:
Analysis in performance and new model for multiple kernels executions on many-core architectures. ICCI*CC 2013: 189-194
- [c8] Raúl Pardo, Fernando L. Pelayo, Pedro Valero-Lara:
GPU Powered ROSA Analyzer. ICPP 2013: 901-908
- [c7] Pedro Valero-Lara:
A GPU approach for accelerating 3D deformable registration (DARTEL) on brain biomedical images. EuroMPI 2013: 187-192
- 2012
- [c6] Roberto Uribe Paredes, Enrique Arias, José L. Sánchez, Diego Cazorla, Pedro Valero-Lara:
Improving the Performance for the Range Search on Metric Spaces Using a Multi-GPU Platform. DEXA (2) 2012: 442-449
- [c5] Pedro Valero-Lara:
MRF Satellite Image Classification on GPU. ICPP Workshops 2012: 149-156
- [c4] Pedro Valero-Lara, Alfredo Pinelli, Julien Favier, Manuel Prieto-Matías:
Block Tridiagonal Solvers on Heterogeneous Architectures. ISPA 2012: 609-616
- 2011
- [c3] Pedro Valero-Lara, Fernando L. Pelayo:
Towards a More Efficient Use of GPUs. ICCSA Workshops 2011: 3-9
- [c2] Roberto Uribe Paredes, Pedro Valero-Lara, Enrique Arias, José L. Sánchez, Diego Cazorla:
A GPU-Based Implementation for Range Queries on Spaghettis Data Structure. ICCSA (1) 2011: 615-629
- [c1] Roberto Uribe Paredes, Pedro Valero-Lara, Enrique Arias, José L. Sánchez, Diego Cazorla:
Similarity search implementations for multi-core and many-core processors. HPCS 2011: 656-663
Editorship
- 2024
- [e5] Patrick Diehl, Joseph Schuchart, Pedro Valero-Lara, George Bosilca:
Asynchronous Many-Task Systems and Applications - Second International Workshop, WAMTA 2024, Knoxville, TN, USA, February 14-16, 2024, Proceedings. Lecture Notes in Computer Science 14626, Springer 2024, ISBN 978-3-031-61762-1
- 2023
- [e4] Pedro Valero-Lara, Seyong Lee, Gokcen Kestor, Mohammad Alaul Haque Monil, Marc González:
Proceedings of the 2nd International Workshop on Extreme Heterogeneity Solutions, ExHET 2023, Montreal, QC, Canada, 25 February 2023. ACM 2023
- 2022
- [e3] Pedro Valero-Lara, Seyong Lee, Gokcen Kestor:
ExHET@PPoPP 2022: Proceedings of the 1st International Workshop on Extreme Heterogeneity Solutions, Seoul Republic of Korea, 2 April 2022. ACM 2022, ISBN 978-1-4503-9344-7
- 2019
- [e2] Ioannis Sourdis, Christos-Savvas Bouganis, Carlos Álvarez, Leonel Antonio Toledo Díaz, Pedro Valero-Lara, Xavier Martorell:
29th International Conference on Field Programmable Logic and Applications, FPL 2019, Barcelona, Spain, September 8-12, 2019. IEEE 2019, ISBN 978-1-7281-4884-7
- 2018
- [e1] Bronis R. de Supinski, Pedro Valero-Lara, Xavier Martorell, Sergi Mateo Bellido, Jesús Labarta:
Evolving OpenMP for Evolving Architectures - 14th International Workshop on OpenMP, IWOMP 2018, Barcelona, Spain, September 26-28, 2018, Proceedings. Lecture Notes in Computer Science 11128, Springer 2018, ISBN 978-3-319-98520-6
Informal and Other Publications
- 2023
- [i6] William F. Godoy, Pedro Valero-Lara, T. Elise Dettling, Christian Trefftz, Ian Jorquera, Thomas Sheehy, Ross G. Miller, Marc González Tallada, Jeffrey S. Vetter, Valentin Churavy:
Evaluating performance and portability of high-level programming models: Julia, Python/Numba, and Kokkos on exascale nodes. CoRR abs/2303.06195 (2023)
- [i5] William F. Godoy, Pedro Valero-Lara, Keita Teranishi, Prasanna Balaprakash, Jeffrey S. Vetter:
Evaluation of OpenAI Codex for HPC Parallel Programming Models Kernel Generation. CoRR abs/2306.15121 (2023)
- [i4] Pedro Valero-Lara, Alexis Huante, Mustafa Al-Lail, William F. Godoy, Keita Teranishi, Prasanna Balaprakash, Jeffrey S. Vetter:
Comparing Llama-2 and GPT-3 LLMs for HPC kernels generation. CoRR abs/2309.07103 (2023)
- [i3] William F. Godoy, Pedro Valero-Lara, Caira Anderson, Katrina W. Lee, Ana Gainaru, Rafael Ferreira da Silva, Jeffrey S. Vetter:
Julia as a unifying end-to-end workflow language on the Frontier exascale system. CoRR abs/2309.10292 (2023)
- 2021
- [i2] Marc Jordà, Pedro Valero-Lara, Antonio J. Peña:
cuConv: A CUDA Implementation of Convolution for CNN Inference. CoRR abs/2103.16234 (2021)
- 2020
- [i1] Pedro Valero-Lara, Raül Sirvent, Antonio J. Peña, Jesús Labarta:
MPI+OpenMP Tasking Scalability for Multi-Morphology Simulations of the Human Brain. CoRR abs/2005.06332 (2020)
last updated on 2024-11-08 21:27 CET by the dblp team
all metadata released as open data under CC0 1.0 license