


ACM SIGMOD Conference 2025: Berlin, Germany - Companion Volume
- Volker Markl, Joseph M. Hellerstein, Azza Abouzied:
Companion of the 2025 International Conference on Management of Data, SIGMOD/PODS 2025, Berlin, Germany, June 22-27, 2025. ACM 2025, ISBN 979-8-4007-1564-8
Keynote Talk Abstracts
- Philip A. Bernstein:
Fifty Years of Transaction Processing Research. 1-2
- Christos H. Papadimitriou:
How to Build A Brain. 3-4
- Margo I. Seltzer:
The Case for Collaboration. 5-6
Demo Short Papers
- Pratyush Agnihotri, Carsten Binnig:
Demonstrating PDSP-Bench: A Benchmarking System for Parallel and Distributed Stream Processing. 7-10
- Ashwin Alaparthi, Paul Loh, Ryan Marcus:
ScaleLLM: A Technique for Scalable LLM-augmented Data Systems. 11-14
- Angelos-Christos G. Anadiotis, Muhammad Ghufran Khan, Ioana Manolescu:
Catching up with Disorder: Dynamic Graphs with Out-of-Order Updates. 15-18
- Arman Ashkari, El Kindi Rezig:
CausalExplain: Causal Explanations of Black-box Models with Training Data Subsets. 19-22
- Teona Bagashvili, Tarikul Islam Papon, Manos Athanassoulis:
ACE-in-Action: A Smart DBMS Bufferpool for SSDs. 23-26
- Wenchao Bai, Wenfei Fan, Jiahui Jin, Daji Li, Jian Li, Shuhao Liu, Mingliang Ouyang, Qiang Yuan:
MiniClean: A Single-Machine System for Cleaning Big Graphs. 27-30
- Björn Bamberg, Denis Hirn, Torsten Grust:
How DuckDB is USING KEY to Unlock Recursive Query Performance. 31-34
- Kaustubh Beedkar, Aurélien Bertrand, Haralampos Gavriilidis, Augusto José Fonseca, Zoi Kaoudi, Mingxi Liu, Volker Markl, Juri Petersen, Fábio Porto, Víctor Ribeiro, Mads Sejer Pedersen, Lucas Giusti Tavares, Michalis Vargiamis, Chen Xu:
Apache Wayang in Action: Enabling Data Systems Integration via a Unified Data Analytics Framework. 35-38
- Lennart Behme, Leonard Geißler, Pratham Agrawal, Emil Badura, Benjamin Ueber, Kaustubh Beedkar, Volker Markl:
Finding What You're Looking For: A Distribution-Aware Dataset Search Engine in Action. 39-42
- Hadar Ben-Efraim, Susan B. Davidson, Amit Somech:
PY-SHARQ: A Holistic Python Library for Explaining Association Rules on Relational Data. 43-46
- Kyle Bossonney, Nicolás Buzeta, Vicente Calisto, Juan-Eduardo López, Cristian Riveros, Stijn Vansummeren:
CORE+: A Complex Event Recognition Engine in C++. 47-50
- Mohamed Bouadi, Arta Alavi, Salima Benbernou, Mourad Ouziri:
DANTE: Hybrid AI System for Context-Aware Interpretable Feature Engineering. 51-54
- Felix S. Campbell, Yuval Moskovitch:
Locator: Local Stability for Rankings. 55-58
- Jeffery Cao, Lampros Flokas, Yujian Xu, Eugene Wu, Xu Chu, Cong Yu:
Prompt Editor: A Taxonomy-driven System for Guided LLM Prompt Development in Enterprise Settings. 59-62
- Tsz Nam Chan, Bojian Zhu, Dingming Wu, Yun Peng, Leong Hou U, Wei Tu, Ruisheng Wang:
A Fast Line Density Visualization Plugin for Geographic Information Systems. 63-66
- Kasidis Chanthatrojwong, Sourav S. Bhowmick, Byron Choi:
PASCAL: A Theory-Informed Visual Interface for Property Graph Schema Visualization. 67-70
- Kaiwen Chen, Yueting Chen, Nick Koudas, Xiaohui Yu:
RTS+: Reliable Text to SQL. 71-74
- Noam Chen, Anna Zeng, Michael J. Cafarella, Batya Kenig, Markos Markakis, Oren Mishali, Brit Youngmann, Babak Salimi:
CausaLens: A System for Summarizing Causal DAGs. 75-78
- Mariana M. Garcez Duarte, Dwi P. A. Nugroho, Georges Tod, Evert Bevernage, Pieter Moelans, Emine Tas, Esteban Zimányi, Mahmoud Sakr, Steffen Zeuch, Volker Markl:
Mobility Stream Processing on NebulaStream and MEOS. 79-82
- Yael Einy, Guy Dar, Slava Novgorodov, Tova Milo:
Sentence to Model: Cost-Effective Data Collection LLM Agent. 83-86
- Saeed Fathollahzadeh, Essam Mansour, Matthias Boehm:
Demonstrating CatDB: LLM-based Generation of Data-centric ML Pipelines. 87-90
- Yannis Foufoulas, Theoni Palaiologou, Alkis Simitsis:
UDFBench: A Tool for Benchmarking UDF Queries on SQL Engines. 91-94
- Victor Giannakouris, Immanuel Trummer:
SwellDB: Dynamic Query-Driven Table Generation with Large Language Models. 95-98
- Amir Gilad, Tova Milo, Kathy Razmadze, Ron Zadicario:
Demonstration of DPClustX: Differentially Private Explanations for Clusters. 99-102
- Justin Breese, Vijayan Prabhakaran, Martin Grund, Stefania Leone, Amit Shukla, Michael Armbrust, Reynold Xin, Matei Zaharia, Lennart Kats, Sung Chiu, Tatiana Romanova, Philip Nord, Mitchell Webster, Chris Munson, Bo Pang, David Ma:
Blink Twice - Automatic Workload Pinning and Regression Detection for Versionless Apache Spark using Retries. 103-106
- Suchit Gupte, John Paparrizos:
ShapX Engine: A Demonstration of Shapley Value Approximations. 107-110
- Eldar Hacohen, Yuval Moskovitch, Amit Somech:
OmniTune: A Universal Framework for Query Refinement via LLMs. 111-114
- Yuto Hayamizu, Ryoji Kawamichi, Tsuyoshi Ozawa, Masaru Kitsuregawa, Kazuo Goda:
anagodb: Offering Massive Parallelism for Database Engine. 115-118
- Shiyi He, Alexandra Meliou, Anna Fariha:
ChARLES: Change-Aware Recovery of Latent Evolution Semantics in Relational Data. 119-122
- Jeffrey Heer, Dominik Moritz, Ron Pechunk:
Mosaic: An Architecture for Linking Databases and Scalable Interactive Visualizations. 123-126
- Kaiyuan Hu, Jiongli Zhu, Boris Glavic, Babak Salimi:
Zorro: Quantifying Uncertainty in Models & Predictions Arising from Dirty Data. 127-130
- Xuhua Huang, Zirui Hu, Siyang Weng, Rong Zhang, Chengcheng Yang, Xuan Zhou, Weining Qian, Chuanhui Yang, Quanqing Xu:
A Query-Aware Enormous Database Generator For System Performance Evaluation. 131-134
- Tharushi Jayasekara, Immanuel Trummer:
Demonstrating CEDAR: A System for Cost-Efficient Data-Driven Claim Verification. 135-138
- Michael Jungmair:
LingoDB-CT: Understanding LingoDB's Inner Workings. 139-142
- Eugenie Y. Lai, Inbal Croitoru, Noam Bitton, Ariel Shalem, Brit Youngmann, Sainyam Galhotra, El Kindi Rezig, Michael J. Cafarella:
SeerCuts: Explainable Attribute Discretization. 143-146
- Longbin Lai, Changwei Luo, Yunkai Lou, Mingchen Ju, Zhengyi Yang:
Graphy'our Data: Towards End-to-End Modeling, Exploring and Generating Report from Raw Data. 147-150
- Jiale Lao, Immanuel Trummer:
Demonstrating SQLBarber: Leveraging Large Language Models to Generate Customized and Realistic SQL Workloads. 151-154
- Yu Lei, Xinle Jiang, Hua Lu, Christian S. Jensen, Bo Tang, Huan Li:
TEQ: An Open and Developer-friendly Testbed for Edge-based Query Processing Algorithms. 155-158
- Nativ Levy, Michael J. Cafarella, Amir Gilad, Sudeepa Roy, Brit Youngmann:
CauSumX: Summarized Causal Explanations For Group-By-Average Queries. 159-162
- Peizheng Li, Chaoyi Chen, Hao Yuan, Zhenbo Fu, Hang Shen, Xinbo Yang, Qiange Wang, Xin Ai, Yanfeng Zhang, Yingyou Wen, Ge Yu:
NeutronRAG: Towards Understanding the Effectiveness of RAG from a Data Retrieval Perspective. 163-166
- Zhaoheng Li, Supawit Chockchowwat, Hanxi Fang, Yongjoo Park:
Demo of Kishu: Time-Traveling for Computational Notebooks. 167-170
- Jiangneng Li, Haitao Yuan, Jie Wang, Ziting Wang, Han Mao Kiah, Gao Cong:
Demonstrating MAST: An Efficient System for Point Cloud Data Analytics. 171-174
- Zhiyu Liang, Dongrui Cai, Chenyuan Zhang, Zheng Liang, Chen Liang, Bo Zheng, Shi Qiu, Jin Wang, Hongzhi Wang:
KDSelector: A Knowledge-Enhanced and Data-Efficient Model Selector Learning Framework for Time Series Anomaly Detection. 175-178
- Tim Littau, Rihan Hai:
Qymera: Simulating Quantum Circuits using RDBMS. 179-182
- Chunwei Liu, Gerardo Vitagliano, Brandon Rose, Matthew Printz, David Andrew Samson, Michael J. Cafarella:
PalimpChat: Declarative and Interactive AI analytics. 183-186
- Christoph Mayer, Haozhe Zhang, Mahmoud Abo Khamis, Dan Olteanu, Dan Suciu:
LpBound in Action: Cardinality Estimation with One-Sided Guarantees. 187-190
- Amin Meghrazi, Pranav Maneriker, Swati Padhee, Srinivasan Parthasarathy:
Interactive Fairness Auditing: Leveraging AVOIR for Dynamic Evaluation and Mitigation. 191-194
- Adrian Michalke, Aljoscha P. Lepping, Volker Markl, Ricardo Martinez, Nils L. Schubert, Lukas Schwerdtfeger, Taha Tekdogan, Steffen Zeuch, Ariane Ziehn, Christoph Falkensteiner, Kyle Krüger, Alexander Meyer, Tobias Röschl, Svea Wilkending:
NebulaStream: An Extensible, High-Performance Streaming Engine for Multi-Modal Edge Applications. 195-198
- Amedeo Pachera, Angela Bonifati, Andrea Mauri:
Grafixer: Enabling User-Centric Repairs for Property Graphs. 199-202
- Marcel Parciak, Brecht Vandevoort, Frank Neven, Liesbet M. Peeters, Stijn Vansummeren:
LLM-Matcher: A Name-Based Schema Matching Tool using Large Language Models. 203-206
- Alok Pareek, Bhushan Khaladkar, Sanket Malde, Vamshi Saggurthi:
Real Time Sentinel: An LLM Based PII Detector - A Streaming Integration and Intelligence Platform. 207-210
- Sophie Pfister, Alberto Lerner, Abishek Ramdas, Philippe Cudré-Mauroux:
Alpha Demo: A Hardware-Accelerated Data Model for Ad-Hoc Manipulation of Point Clouds. 211-214
- Shaikh Quader, Ghadeer Abuoda, Yonis Abokar, Marin Litoiu, Manos Papagelis:
Demo of LearnedWMP: Workload Memory Prediction Using Deep Query Template Representations. 215-218
- Florens Rohde, Victor Christen, Erhard Rahm:
SecUREmatch: Integrating Clerical Review in Privacy-Preserving Record Linkage. 219-222
- Gianluca Rossi, Riccardo Tommasini, Angela Bonifati:
TD-Join: Leveraging Temporal Dependencies in Time Series Joins. 223-226
- Diandre Miguel Sabale, Wolfgang Gatterbauer:
PatternVis: A Tool for Relational Pattern Visualization. 227-230
- Wenbo Sun, Ziyu Li, Rihan Hai:
Database as Runtime: Compiling LLMs to SQL for In-database Model Serving. 231-234
- Zhaoyan Sun, Xuanhe Zhou, Jianming Wu, Wei Zhou, Guoliang Li:
D-Bot: An LLM-Powered DBA Copilot. 235-238
- Govind Venkatraman Krishnan, Eduardo Ramirez, Drew Koszewnik, Yujia (Cynthia) Xie, Tej Vepa, Bernardo Gomez Palacio:
Introducing RAW Hollow: An In-Memory, Co-Located, Compressed Object Store with Opt-In Strong Consistency. 239-242
- Pengyi Wang, Sibei Chen, Ju Fan, Bin Wu, Nan Tang, Jian Tan:
Andromeda: Debugging Database Performance Issues with Retrieval-Augmented Large Language Models. 243-246
- Patrick Wang, Wan Shen Lim, William Zhang, Samuel Arch, Andrew Pavlo:
Automated Database Tuning vs. Human-Based Tuning in a Simulated Stressful Work Environment: A Demonstration of the Database Gym. 247-250
- Haixin Wang, Cheng Xu, Ce Zhang, Haibo Hu, Shikun Tian, Shenglong Chen, Ying Yan, Jianliang Xu:
Authenticating Multi-Chain Queries: Verifiable Virtual Filesystem Is All You Need. 251-254
- Zixin Wei, Jun Han, Xiaolin Han, Chenhao Ma:
SemExplorer: A User Interface for Semantic Approach to Customized Dataset Search. 255-258
- Jingzhe Xu, Yuhao Deng, Chengliang Chai, Zequn Li, Yuping Wang, Lei Cao:
OIE: An Interpretable System for Outlier Explanation and Summarization. 259-262
- Mike Xydas, Anna Mitsopoulou, George Katsogiannis-Meimarakis, Christos Tsapelas, Stavroula Eleftherakis, Antonis Mandamadiotis, Georgia Koutrika:
DataDazzle: Intelligent Data Exploration through Natural Language. 263-266
- Yansha Jia, Zhengxin You, Yujie Wang, Qiaomu Shen, Bo Tang:
VQLens: A Demonstration of Vector Query Execution Analysis. 267-270
- Geoffrey X. Yu, Ziniu Wu, Ferdi Kossmann, Tianyu Li, Markos Markakis, Amadou Ngom, Sophie Zhang, Tim Kraska, Samuel Madden:
Virtualizing Cloud Data Infrastructures with BRAD. 271-274
- Yuanhao Zhong, Yuhao Deng, Chengliang Chai, Ruixin Gu, Ye Yuan, Guoren Wang, Lei Cao:
Doctopus: A System for Budget-aware Structural Data Extraction from Unstructured Documents. 275-278
- Jun-Peng Zhu, Peng Cai, Kai Xu, Li Li, Yishen Sun, Shuai Zhou, Haihuang Su, Liu Tang, Qi Liu:
UNITQA: A Unified Automated Tabular Question Answering System with Multi-Agent Large Language Models. 279-282
Industry Papers
- Molham Aref, Paolo Guagliardo, George Kastrinis, Leonid Libkin, Victor Marsault, Wim Martens, Mary McGrath, Filip Murlak, Nathaniel Nystrom, Liat Peterfreund, Allison Rogers, Cristina Sirangelo, Domagoj Vrgoc, David Zhao, Abdul Zreika:
Rel: A Programming Language for Relational Data. 283-296
- Nicolas Bruno, César A. Galindo-Legaria, Milind Joshi:
Query Decorrelation in the Fabric Data Warehouse. 297-309
- Ramesh Chandra, Haogang Chen, Ray Matharu, Sarah Cai, Jeff Chen, Priyam Dutta, Bogdan Ghita, Todd Greenstein, Gopal Holla, Peng Huang, Yuchen Huo, Adrian Ionescu, Adriana Ispas, Tim Januschowski, Vihang Karajgaonkar, Stefania Leone, David Lewis, Andrew Li, Nong Li, Cheng Lian, Stephen Link, Qing Lu, Yesheng Ma, Chris Pettitt, Vijayan Prabhakaran, Bogdan Raducanu, Kyle Rong, Paul Roome, Samarth Shetty, Sean Smith, Xiaotong Sun, Yuyuan Tang, Weitao Wen, Lei Xia, Junlin Zeng, Ben Zhang, Reynold Xin, Matei Zaharia:
Unity Catalog: Open and Universal Governance for the Lakehouse and Beyond. 310-322
- Zihao Chen, Jiazhi Jiang, Jiangang Liu, Chao Zhang, Yuqi Diao, Yang Li, Hanmei Luo, Peng Chen:
Oceanus: Enable SLO-Aware Vertical Autoscaling for Cloud-Native Streaming Services in Tencent. 323-335
- Zongzhi Chen, Xinjun Yang, Mo Sha, Feifei Li, Kang Wang, Zheyu Miao, Jie Xu, Jianfeng Wang, Sheng Wang:
CloudJump II: Optimizing Cloud Databases for Shared Storage. 336-349
- Zihao Chen, Chenyang Zhang, Chen Xu, Zhao Zhang, Jiaqiang Wang, Weining Qian, Aoying Zhou:
Scheduling Data Processing Pipelines for Incremental Training on MLP-based Recommendation Models. 350-363
- Yangshen Deng, Zhengxin You, Long Xiang, Qilong Li, Peiqi Yuan, Zhaoyang Hong, Yitao Zheng, Wanting Li, Runzhong Li, Haotian Liu, Kyriakos Mouratidis, Man Lung Yiu, Huan Li, Qiaomu Shen, Rui Mao, Bo Tang:
AlayaDB: The Data Foundation for Efficient and Effective Long-context LLM Inference. 364-377
- Chenguang Fang, Chen Qian, Qi Yang, Zeyu Wang, Zhenkun Yang, Fanyu Kong, Quanqing Xu, Hui Cao, Fusheng Han, Chuanhui Yang:
MaLT: A Framework for Managing Large Transactions in OceanBase. 378-390
- Sen Gao, Jianwen Zhao, Hao Zhang, Shixuan Sun, Chen Liang, Gongye Chen, Wenliang Zhang, Bo Ren, Chao Liu, Chenyi Zhang, Quan Chen, Chao Li, Jingwen Leng, Minyi Guo:
GES: High-Performance Graph Processing Engine and Service in Huawei. 391-403
- Anja Gruenheid, Jesús Camacho-Rodríguez, Carlo Curino, Raghu Ramakrishnan, Stanislav Pak, Sumedh Sakdeo, Lenisha Gandhi, Sandeep K. Singhal, Pooja Nilangekar, Daniel J. Abadi:
AutoComp: Automated Data Compaction for Log-Structured Tables in Data Lakes. 404-417
- Martin Grund, Stefania Leone, Herman Van Hövell, Sven Wagner-Boysen, Sebastian Hillig, Hyukjin Kwon, David Lewis, Jakob Mund, Polo-Francois Poli, Lionel Montrieux, Othon Crelier, Xiao Li, Reynold Xin, Matei Zaharia, Michalis Petropoulos, Thanos Papathanasiou:
Databricks Lakeguard: Supporting Fine-grained Access Control and Multi-user Capabilities for Apache Spark Workloads. 418-430
- Shashank Gugnani, Zhen Hua Liu, Hui J. Chang, Beda Christoph Hammerschmidt, Srinivas Kareenhalli, Kishy Kumar, Tirthankar Lahiri, Ying Lu, Douglas McMahon, Ajit Mylavarapu, Sukhada Pendse, Ananth Raghavan:
JSON Relational Duality: A Revolutionary Combination of Document, Object, and Relational Models. 431-443
- Benjamin Hilprecht, Nico Mürdter, Arthur Arnold, Kristijan Ziza, Franz Färber, Wolfgang Lehner:
Scalable Execution of Application Logic within Everest BusinessStore. 444-456
- Gabriela Jacques-Silva, Evangelia Kalyvianaki, Katriel Cohn-Gordon, Adham Meguid, Huy Nguyen, Danny Ben-David, Carl Nayak, Varun Saravagi, George Stasa, Ioannis Papagiannis, David Taïeb, Kalkidan Tamirat, Haiyang Wu, Bo Xi, Taining Zhang, Qi Zhou:
Unified Lineage System: Tracking Data Provenance at Scale. 457-470
- Rong Kang, Yanbin Chen, Ye Liu, Fuxin Jiang, Qingshuo Li, Miao Ma, Jian Liu, Guangliang Zhao, Tieying Zhang, Jianjun Chen, Lei Zhang:
ABase: the Multi-Tenant NoSQL Serverless Database for Diverse and Dynamic Workloads in Large-scale Cloud Environments. 471-484
- Kihong Kim, Hyunwook Kim, Jinsu Lee, Taehyung Lee, Alexander Böhm, Norman May, Guido Moerkotte, Daniel Ritter, Ralf Dentzer, Heiko Gerwens, Irena Kofman, Mihnea Andrei:
Enterprise Application-Database Co-Innovation for Hybrid Transactional/Analytical Processing: A Virtual Data Model and Its Query Optimization Needs. 485-498
- Lukas Landgraf, Florian Wolf, Wolfgang Lehner:
Experimental Evaluation of Optimizing Memory Consumption in SAP HANA using PEOopt. 499-511
- Zeyan Li, Jie Song, Tieying Zhang, Tao Yang, Xiongjun Ou, Yingjie Ye, Pengfei Duan, Muchen Lin, Jianjun Chen:
Adaptive and Efficient Log Parsing as a Cloud Service. 512-524
- Ji You Li, Jiachi Zhang, Yuhang Liu, Wenchao Zhou, Xin Zhou, Fangyuan Zhou, Feifei Li:
Eigen+: Memory Over-Subscription for Alibaba Cloud Databases. 525-538
- Wei Li, Jiachi Zhang, Ye Yin, Yan Li, Zhanyang Zhu, Yuhao Li, Zhencan Peng, Lan Lu, Wenchao Zhou, Liang Lin, Feifei Li:
Flux: Unifying Heterogeneous Infrastructure for Alibaba AnalyticDB. 539-552
- Shige Liu, Zhifang Zeng, Li Chen, Adil Ainihaer, Arun Ramasami, Songting Chen, Yu Xu, Mingxi Wu, Jianguo Wang:
TigerVector: Supporting Vector Search in Graph Databases for Advanced RAGs. 553-565
- Bingqing Lyu, Xiaoli Zhou, Longbin Lai, Yufan Yang, Yunkai Lou, Wenyuan Yu, Ying Zhang, Jingren Zhou:
A Modular Graph-Native Query Optimization Framework. 566-579
- Norman May, Alexander Böhm, Daniel Ritter, Frank Renkes, Mihnea Andrei, Wolfgang Lehner:
SAP HANA Cloud: Data Management for Modern Enterprise Applications. 580-592
- Norifumi Nishikawa, Akira Shimizu, Akira Ito, Shinji Fujiwara, Yuto Hayamizu, Masaru Kitsuregawa, Kazuo Goda:
Dynamic Pruning for Recursive Joins. 593-607
- Jeffrey Pound, Floris Chabert, Arjun Bhushan, Ankur Goswami, Anil Pacaci, Shihabur Rahman Chowdhury:
MicroNN: An On-device Disk-resident Updatable Vector Database. 608-621
- Daniel Sotolongo, Daniel Mills, Tyler Akidau, Anirudh Santhiar, Attila-Péter Tóth, Botong Huang, Boyuan Zhang, Igor Belianski, Ling Geng, Matt Uhlar, Nikhil Shah, Olivia Zhou, Saras Nowak, Sasha Lionheart, Vlad Lifliand, Wendy Grus, Yiwen Zhu, Ankur Sharma, Dzmitry Pauliukevich, Enrico Sartorello, Ilaria Battiston, Ivan Kalev, Lawrence Benson, Leon Papke, Niklas Semmler, Till Merker, Yi Huang:
Streaming Democratized: Ease Across the Latency Spectrum with Delayed View Semantics and Snowflake Dynamic Tables. 622-634
- V. Srinivasan, Andrew Gooding, Sunil Sayyaparaju, Thomas Lopatic, Kevin Porter, Ashish Krishnadeo Shinde, Sri Varun Poluri, B. Narendran, Daudkhan Pathan, Srinivasan Seshadri:
Asynchronous Replication Strategies for a Real-Time DBMS. 635-647
- Jeff Swenson, Andy Kimball, Raphael 'kena' Poss, Rebecca Taft, Jay Lim, Adam Storm, Sumeer Bhola, Paul Bulkley-Logston, Pj Tatlow, Rachael Harding, Rafi Shamim, Aditya Maru, Irfan Sharif:
CockroachDB Serverless: Sub-second Scaling from Zero with Multi-region Cluster Virtualization. 648-661
- Vishal Vyas, Andrei Paduroiu, Srikanth Kandula, Hari Ohm Prasath Rajagopal, Mukesh Punhani, Marco Manzo, Ankur Goyal, Santosh Chandrachood, Rick Sears, Joseph Marques, Sushant Majithia:
Managed Resource Scaling in Amazon EMR. 662-674
- Donghui Wang, Yuxing Chen, Chengyao Jiang, Anqun Pan, Wei Jiang, Songli Wang, Hailin Lei, Chong Zhu, Lixiong Zheng, Wei Lu, Yunpeng Chai, Feng Zhang, Xiaoyong Du:
TXSQL: Lock Optimizations Towards High Contented Workloads. 675-688
- Xinjun Yang, Yingqiang Zhang, Hao Chen, Feifei Li, Gerry Fan, Yang Kong, Bo Wang, Jing Fang, Yuhui Wang, Tao Huang, Wenpu Hu, Jim Kao, Jianping Jiang:
Unlocking the Potential of CXL for Disaggregated Memory in Cloud-Native Databases. 689-702
- Tim Zeyl, Qi Cheng, Reza Pournaghi, Jason Lam, Weicheng Wang, Calvin Wong, Chong Chen, Per-Åke Larson:
Including Bloom Filters in Bottom-up Optimization. 703-715
- Shihao Zhou, Qi Mao, Yi Cheng, Hongcheng Qi, Yilun Huang, Peng Cai, Jun-Peng Zhu:
RedTAO: A Trillion-edge High-throughput Graph Store. 716-728
- Xuanhe Zhou, Wei Zhou, Liguo Qi, Hao Zhang, Dihao Chen, Bingsheng He, Mian Lu, Guoliang Li, Fan Wu, Yuqiang Chen:
OpenMLDB: A Real-Time Relational Data Feature Computation System for Online ML. 729-742
- Yiwen Zhu, Rathijit Sen, Brian Kroth, Sergiy Matusevych, Andreas C. Mueller, Tengfei Huang, Rahul Challapalli, Weihan Tang, Xin He, Mo Liu, Estera Kot, Sule Kahraman, Arshdeep Sekhon, Dario Bernal, Aditya Lakra, Shaily Fozdar, Dhruv Relwani, Rui Fang, Long Tian, Karuna Sagar Krishna, Ashit Gosalia, Carlo Curino, Subru Krishnan:
Rockhopper: A Robust Optimizer for Spark Configuration Tuning in Production Environment. 743-756
- Andreas Zimmerer, Damien Dam, Jan Kossmann, Juliane Waack, Ismail Oukid, Andreas Kipf:
Pruning in Snowflake: Working Smarter, Not Harder. 757-770
Panel Summaries
- Carsten Binnig, Danica Porobic:
Panel on AI for Future Databases: A New Beginning or a Boulevard of Broken Dreams? 771
- Eugene Wu, Raul Castro Fernandez:
Where Does Academic Database Research Go From Here? 772-774
Tutorial Papers
- Daniel Alabi, Sainyam Galhotra, Shagufta Mehnaz, Zeyu Song, Eugene Wu:
Privacy and Security in Distributed Data Markets. 775-787
- Abdullah Al-Mamun, Jianguo Wang, Walid G. Aref:
Learned Indexes From the One-dimensional to the Multi-dimensional Spaces: Challenges, Techniques, and Opportunities. 788-796
- Rico Bergmann, Dirk Habich:
Reproducible Prototyping of Query Optimizer Components. 797-804
- Daokun Hu, Quanqing Xu, Chuanghui Yang:
OLTP Engines on Modern Storage Architectures. 805-812
- Bojan Karlas, Babak Salimi, Sebastian Schelter:
Navigating Data Errors in Machine Learning Pipelines: Identify, Debug, and Learn. 813-820
- Brian Kroth, Sergiy Matusevych, Yiwen Zhu:
Autotuning Systems: Techniques, Challenges, and Opportunities. 821-828
- Rodrigo Laigner, George Christodoulou, Kyriakos Psarakis, Asterios Katsifodimos, Yongluan Zhou:
Transactional Cloud Applications: Status Quo, Challenges, and Opportunities. 829-836
- Guoliang Li, Jiayi Wang, Chenyang Zhang, Jiannan Wang:
Data+AI: LLM4Data and Data4LLM. 837-843
- Ningyi Liao, Siqiang Luo, Xiaokui Xiao, Reynold Cheng:
Advances in Designing Scalable Graph Neural Networks: The Perspective of Graph Data Management. 844-850
- Vidya Setlur:
Supporting Human-Centric Data Exploration Through Semantics and Natural Language Interaction. 851-854
- Utku Sirin, Stratos Idreos:
Data Storage and Management for Image AI Pipelines. 855-863
Workshop Summaries
- Akhil Arora, Stefania Dumbrava:
Eighth Joint Workshop on Graph Data Management Experiences & Systems (GRADES) and Network Data Analytics (NDA). 864-865
- Tanja Auge, Seokki Lee:
ProvenanceWeek2025. 866-867
- Carsten Binnig, Eric Sedlar:
21st International Workshop on Data Management on New Hardware (DaMoN). 868-869
- Renata Borovica-Gajic, Manisha Luthra, Ryan Marcus, Rajesh Bordawekar, Oded Shmueli:
Eighth International Workshop on Exploiting Artificial Intelligence Techniques for Data Management (aiDM). 870-871
- Faiza Allah Bukhsh, Paolo Ceravolo, Xu Chu, Samira Maghool, Eugene Wu, Cong Yu:
LLM-DPM - Workshop on Large Language Models for Data Process Management. 872-873
- Remco Chang, Kexin Rong, Roee Shraga:
Ninth Workshop on Human-In-the-Loop Data Analytics (HILDA). 874-875
- Avrilia Floratou, Jignesh M. Patel, Subru Krishnan:
First Workshop Connecting Academia and Industry on Modern Integrated Database and AI Systems (MIDAS). 876-877
- Stefan Grafberger, Madelon Hulsebos, Matteo Interlandi, Shreya Shankar:
Ninth Workshop on Data Management for End-to-End Machine Learning (DEEM). 878-879
- Michael Liut, Sourav S. Bhowmick, Abdussalam Alawini:
Fourth International Workshop on Data Systems Education (DataEd'25). 880-881
- Ibrahim Sabek, Immanuel Trummer:
Second Workshop on Quantum Computing and Quantum-Inspired Technology for Data-Intensive Systems and Applications (Q-Data). 882-883
- Amir Shaikhha, Torsten Grust:
The 19th International Symposium on Database Programming Languages (DBPL). 884-885
- Gerardo Vitagliano, Chunwei Liu, Lei Cao, Huan Sun, Paolo Papotti:
First Workshop on Novel Optimizations for Visionary AI Systems (NOVAS). 886-887
