default search action
ACM SIGMOD Conference 2019: Amsterdam, The Netherlands
- Peter A. Boncz, Stefan Manegold, Anastasia Ailamaki, Amol Deshpande, Tim Kraska:
Proceedings of the 2019 International Conference on Management of Data, SIGMOD Conference 2019, Amsterdam, The Netherlands, June 30 - July 5, 2019. ACM 2019, ISBN 978-1-4503-5643-5
SIGMOD Keynote 1
- Lise Getoor:
Responsible Data Science. 1
Research 1: Query Processing & Optimization 1 -- sponsored by Tableau
- Immanuel Trummer:
Exact Cardinality Query Optimization with Bounded Execution Cost. 2-17 - Walter Cai, Magdalena Balazinska, Dan Suciu:
Pessimistic Cardinality Estimation: Tighter Upper Bounds for Intermediate Join Cardinalities. 18-35 - Peter Van Sandt, Yannis Chronis, Jignesh M. Patel:
Efficiently Searching In-Memory Sorted Arrays: Revenge of the Interpolation Search? 36-53 - Kisung Park, Hojin Seo, Mostofa Kamal Rasel, Young-Koo Lee, Chanho Jeong, Sung Yeol Lee, Chungmin Lee, Dong-Hun Lee:
Iterative Query Processing based on Unified Optimization Techniques. 54-68 - Daniel Ting:
Approximate Distinct Counts for Billions of Datasets. 69-86 - Martin Perdacher, Claudia Plant, Christian Böhm:
Cache-oblivious High-performance Similarity Join. 87-104
Research 2: Privacy/Blockchain
- Ankur Sharma, Felix Martin Schuhknecht, Divya Agrawal, Jens Dittrich:
Blurring the Lines between Blockchains and Database Systems: the Case of Hyperledger Fabric. 105-122 - Hung Dang, Tien Tuan Anh Dinh, Dumitrel Loghin, Ee-Chien Chang, Qian Lin, Beng Chin Ooi:
Towards Scaling Blockchain Systems via Sharding. 123-140 - Cheng Xu, Ce Zhang, Jianliang Xu:
vChain: Enabling Verifiable Boolean Range Queries over Blockchain Databases. 141-158 - Tianhao Wang, Bolin Ding, Jingren Zhou, Cheng Hong, Zhicong Huang, Ninghui Li, Somesh Jha:
Answering Multi-Dimensional Analytical Queries under Local Differential Privacy. 159-176 - Chang Ge, Xi He, Ihab F. Ilyas, Ashwin Machanavajjhala:
APEx: Accuracy-Aware Differentially Private Data Exploration. 177-194 - Kun Xie, Xiaocan Li, Xin Wang, Gaogang Xie, Jigang Wen, Dafang Zhang:
Active Sparse Mobile Crowd Sensing Based on Matrix Completion. 195-210
Research 3: Information Extraction
- Sheng Hu, Chuan Xiao, Jianbin Qin, Yoshiharu Ishikawa, Qiang Ma:
Autocompletion for Prefix-Abbreviated Input. 211-228 - Pei Wang, Ryan Shea, Jiannan Wang, Eugene Wu:
Progressive Deep Web Crawling Through Keyword Queries For Data Enrichment. 229-246 - Ritesh Sarkhel, Arnab Nandi:
Visual Segmentation for Information Extraction from Heterogeneous Visually Rich Documents. 247-262 - Abolfazl Asudeh, Azade Nazi, Nan Zhang, Gautam Das, H. V. Jagadish:
RRR: Rank-Regret Representative. 263-280 - Min Xie, Raymond Chi-Wing Wong, Ashwin Lall:
Strongly Truthful Interactive Regret Minimization. 281-298 - Saehan Jo, Immanuel Trummer, Weicheng Yu, Xuezhi Wang, Cong Yu, Daniel Liu, Niyati Mehta:
Verifying Text Summaries of Relational Data Sets. 299-316
Industry 1: Data Applications
- Rui Ding, Shi Han, Yong Xu, Haidong Zhang, Dongmei Zhang:
QuickInsights: Quick and Automatic Discovery of Insights from Multi-Dimensional Data. 317-332 - Vimalkumar Jeyakumar, Omid Madani, Ali Parandeh, Ashutosh Kulshreshtha, Weifei Zeng, Navindra Yadav:
ExplainIt! - A Declarative Root-cause Analysis Engine for Time Series Data. 333-348 - Flip Korn, Xuezhi Wang, You Wu, Cong Yu:
Automatically Generating Interesting Facts from Wikipedia Tables. 349-361 - Stephen H. Bach, Daniel Rodriguez, Yintao Liu, Chong Luo, Haidong Shao, Cassandra Xia, Souvik Sen, Alexander Ratner, Braden Hancock, Houman Alborzi, Rahul Kuchhal, Christopher Ré, Rob Malkin:
Snorkel DryBell: A Case Study in Deploying Weak Supervision at Industrial Scale. 362-375 - Zhipeng Zhang, Bin Cui, Yingxia Shao, Lele Yu, Jiawei Jiang, Xupeng Miao:
PS2: Parameter Server on Spark. 376-388 - Yash Govind, Pradap Konda, Paul Suganthan G. C., Philip Martinkus, Palaniappan Nagarajan, Han Li, Aravind Soundararajan, Sidharth Mudgal, Jeffrey R. Ballard, Haojun Zhang, Adel Ardalan, Sanjib Das, Derek Paulsen, Amanpreet Singh Saini, Erik Paulson, Youngchoon Park, Marshall Carter, Mingju Sun, Glenn Moo Fung, AnHai Doan:
Entity Matching Meets Data Science: A Progress Report from the Magellan Project. 389-403
SIGMOD Keynote 2
- C. Mohan:
State of Public and Private Blockchains: Myths and Reality. 404-411
Panel
- H. V. Jagadish, Francesco Bonchi, Tina Eliassi-Rad, Lise Getoor, Krishna P. Gummadi, Julia Stoyanovich:
The Responsibility Challenge for Data. 412-414
Research 4: Distributed Data Management
- Ji Zhang, Yu Liu, Ke Zhou, Guoliang Li, Zhili Xiao, Bin Cheng, Jiashu Xing, Yangtao Wang, Tianheng Cheng, Li Liu, Minwei Ran, Zekang Li:
An End-to-End Automatic Cloud Database Tuning System Using Deep Reinforcement Learning. 415-432 - Alex Shamis, Matthew Renzelmann, Stanko Novakovic, Georgios Chatzopoulos, Aleksandar Dragojevic, Dushyanth Narayanan, Miguel Castro:
Fast General Distributed Transactions with Opacity. 433-448 - Niv Dayan, Stratos Idreos:
The Log-Structured Merge-Bush & the Wacky Continuum. 449-466 - Jiaqi Gu, Yugo H. Watanabe, William A. Mazza, Alexander Shkapsky, Mohan Yang, Ling Ding, Carlo Zaniolo:
RaSQL: Greater Power and Performance for Big Data Analytics with Recursive-aggregate-SQL on Spark. 467-484
Research 5: Provenance
- Zhengjie Miao, Qitian Zeng, Boris Glavic, Sudeepa Roy:
Going Beyond Provenance: Explaining Query Answers with Pattern-based Counterbalances. 485-502 - Zhengjie Miao, Sudeepa Roy, Jun Yang:
Explaining Wrong Queries Using Small Examples. 503-520 - Vicky Papavasileiou, Ken Yocum, Alin Deutsch:
Ariadne: Online Provenance for Big Graph Analytics. 521-536 - Daniel Deutch, Yuval Moskovitch, Noam Rinetzky:
Hypothetical Reasoning via Provenance Abstraction. 537-554
Research 6: Streams
- Olga Poppe, Chuan Lei, Elke A. Rundensteiner, David Maier:
Event Trend Aggregation Under Rich Event Matching Semantics. 555-572 - Li Wang, Tom Z. J. Fu, Richard T. B. Ma, Marianne Winslett, Zhenjie Zhang:
Elasticutor: Rapid Elasticity for Realtime Stateful Stream Processing. 573-588 - Ilya Kolchinsky, Assaf Schuster:
Real-Time Multi-Pattern Detection over Event Streams. 589-606 - Jeyhun Karimov, Tilmann Rabl, Volker Markl:
AStream: Ad-hoc Shared Stream Processing. 607-622
Industry 2: Storage & Indexing
- Andrew Carter, Andrew Rodriguez, Yiming Yang, Scott Meyer:
Nanosecond Indexing of Graph Data With Hash Maps and VLists. 623-635 - Misha Tyulenev, Andy Schwerin, Asya Kamsky, Randolph Tan, Alyson Cabral, Jack Mulrow:
Implementation of Cluster-wide Logical Clock and Causal Consistency in MongoDB. 636-650 - Gui Huang, Xuntao Cheng, Jianying Wang, Yujie Wang, Dengcheng He, Tieying Zhang, Feifei Li, Sheng Wang, Wei Cao, Qiang Li:
X-Engine: An Optimized Storage Engine for Large-scale E-commerce Transaction Processing. 651-665 - Sudipto Das, Miroslav Grbic, Igor Ilic, Isidora Jovandic, Andrija Jovanovic, Vivek R. Narasayya, Miodrag Radulovic, Maja Stikic, Gaoxiang Xu, Surajit Chaudhuri:
Automatically Indexing Millions of Databases in Microsoft Azure SQL Database. 666-679
Research 7: Modern Hardware
- Guna Prasaad, Badrish Chandramouli, Donald Kossmann:
Concurrent Prefix Recovery: Performing CPR on a Database. 687-704 - Shuhao Zhang, Jiong He, Amelie Chi Zhou, Bingsheng He:
BriskStream: Scaling Data Stream Processing on Shared-Memory Multicore Architectures. 705-722 - Jong-Bin Kim, Hyeongwon Jang, Seohui Son, Hyuck Han, Sooyong Kang, Hyungsoo Jung:
Border-Collie: A Wait-free, Read-optimal Algorithm for Database Logging on Multicore Hardware. 723-740 - Tobias Ziegler, Sumukha Tumkur Vani, Carsten Binnig, Rodrigo Fonseca, Tim Kraska:
Designing Distributed Tree-based Index Structures for Fast RDMA-capable Networks. 741-758 - Donghyoung Han, Yoon-Min Nam, Jihye Lee, Kyongseok Park, Hyunwoo Kim, Min-Soo Kim:
DistME: A Fast and Elastic Distributed Matrix Computation Engine using GPUs. 759-774 - Mo Sha, Yuchen Li, Kian-Lee Tan:
GPU-based Graph Traversal on Compressed Graphs. 775-792
Research 8: Data Integration/Cleaning
- Babak Salimi, Luke Rodriguez, Bill Howe, Dan Suciu:
Interventional Fairness: Causal Database Repair for Algorithmic Fairness. 793-810 - Pei Wang, Yeye He:
Uni-Detect: A Unified Approach to Automated Error Detection in Tables. 811-828 - Alireza Heidari, Joshua McGrath, Ihab F. Ilyas, Theodoros Rekatsinas:
HoloDetect: Few-Shot Learning for Error Detection. 829-846 - Erkang Zhu, Dong Deng, Fatemeh Nargesian, Renée J. Miller:
JOSIE: Overlap Set Similarity Search for Finding Joinable Tables in Data Lakes. 847-864 - Mohammad Mahdavi, Ziawasch Abedjan, Raul Castro Fernandez, Samuel Madden, Mourad Ouzzani, Michael Stonebraker, Nan Tang:
Raha: A Configuration-Free Error Detection System. 865-882 - Chang Ge, Yinan Li, Eric Eilebrecht, Badrish Chandramouli, Donald Kossmann:
Speculative Distributed CSV Data Parsing for Big Data Analytics. 883-899
Research 9: Query Processing & Optimization 2
- Kai Huang, Huey-Eng Chua, Sourav S. Bhowmick, Byron Choi, Shuigeng Zhou:
CATAPULT: Data-driven Selection of Canned Patterns for Efficient Visual Graph Query Formulation. 900-917 - Prajakta Kalmegh, Shivnath Babu, Sudeepa Roy:
iQCAR: inter-Query Contention Analyzer for Data Analytics Frameworks. 918-935 - Immanuel Trummer, Yicheng Wang, Saketh Mahankali:
A Holistic Approach for Query Evaluation andResult Vocalization in Voice-Based OLAP. 936-953 - Yifan Li, Xiaohui Yu, Nick Koudas:
Top-k Queries over Digital Traces. 954-971 - Brandon Haynes, Amrita Mazumdar, Magdalena Balazinska, Luis Ceze, Alvin Cheung:
Visual Road: A Video Data Management Benchmark. 972-987 - Qianrui Zhang, Haoci Zhang, Thibault Sellam, Eugene Wu:
Mining Precision Interfaces From Query Logs. 988-1005
Research 10: Graphs 1
- Francesco Bonchi, Arijit Khan, Lorenzo Severini:
Distance-generalized Core Decomposition. 1006-1023 - Yikai Zhang, Jeffrey Xu Yu:
Unboundedness and Efficiency of Truss Maintenance in Evolving Graphs. 1024-1041 - Zhewei Wei, Xiaodong He, Xiaokui Xiao, Sibo Wang, Yu Liu, Xiaoyong Du, Ji-Rong Wen:
PRSim: Sublinear Time SimRank Computation on Large Power-Law Graphs. 1042-1059 - Wentao Li, Miao Qiao, Lu Qin, Ying Zhang, Lijun Chang, Xuemin Lin:
Scaling Distance Labeling on Small-World Networks. 1060-1077 - Prithu Banerjee, Wei Chen, Laks V. S. Lakshmanan:
Maximizing Welfare in Social Networks under A Utility Driven Influence Diffusion model. 1078-1095 - Jing Tang, Keke Huang, Xiaokui Xiao, Laks V. S. Lakshmanan, Xueyan Tang, Aixin Sun, Andrew Lim:
Efficient Approximation Algorithms for Adaptive Seed Minimization. 1096-1113
Award Talks
- Joy Arulraj:
Data Management on Non-Volatile Memory. 1114 - Bas Ketsman:
Formal Approaches to Querying Big Data in Shared-Nothing Systems. 1115-1116
Research 11: Systems & Machine Learning
- Thibault Sellam, Kevin Lin, Ian Yiran Huang, Michelle Yang, Carl Vondrick, Eugene Wu:
DeepBase: Deep Inspection of Neural Networks. 1117-1134 - Yongjoo Park, Jingyi Qing, Xiaoyang Shen, Barzan Mozafari:
BlinkML: Efficient Maximum Likelihood Estimation with Probabilistic Guarantees. 1135-1152 - Immanuel Trummer, Junxiong Wang, Deepak Maram, Samuel Moseley, Saehan Jo, Joseph Antonakakis:
SkinnerDB: Regret-Bounded Query Evaluation via Reinforcement Learning. 1153-1170 - Zeyuan Shang, Emanuel Zgraggen, Benedetto Buratti, Ferdinand Kossmann, Philipp Eichmann, Yeounoh Chung, Carsten Binnig, Eli Upfal, Tim Kraska:
Democratizing Data Science through Interactive Curation of ML Pipelines. 1171-1188
Research 12: Indexing
- Alex Galakatos, Michael Markovitch, Carsten Binnig, Rodrigo Fonseca, Tim Kraska:
FITing-Tree: A Data-aware Index Structure. 1189-1206 - Markus Mäsker, Tim Süß, Lars Nagel, Lingfang Zeng, André Brinkmann:
Hyperion: Building the Largest In-memory Search Tree. 1207-1222 - Yingjun Wu, Jia Yu, Yuanyuan Tian, Richard Sidle, Ronald Barber:
Designing Succinct Secondary Indexing Mechanism by Exploiting Column Correlations. 1223-1240 - Bailu Ding, Sudipto Das, Ryan Marcus, Wentao Wu, Surajit Chaudhuri, Vivek R. Narasayya:
AI Meets AI: Leveraging Query Executions to Improve Index Recommendations. 1241-1258
Research 13: Fairness, Uncertainty
- Abolfazl Asudeh, H. V. Jagadish, Julia Stoyanovich, Gautam Das:
Designing Fair Ranking Schemes. 1259-1276 - Mangesh Bendre, Tana Wattanawaroon, Kelly Mack, Kevin Chang, Aditya G. Parameswaran:
Anti-Freeze for Large and Complex Spreadsheets: Asynchronous Formula Computation. 1277-1294 - Maarten Van den Heuvel, Peter Ivanov, Wolfgang Gatterbauer, Floris Geerts, Martin Theobald:
Anytime Approximation in Probabilistic Databases via Scaled Dissociations. 1295-1312 - Su Feng, Aaron Huber, Boris Glavic, Oliver Kennedy:
Uncertainty Annotated Databases - A Lightweight Approach for Approximating Certain Answers. 1313-1330
Research 14: Graphs 2
- Renchi Yang, Xiaokui Xiao, Zhewei Wei, Sourav S. Bhowmick, Jun Zhao, Rong-Hua Li:
Efficient Estimation of Heat Kernel PageRank for Local Clustering. 1339-1356 - Vinícius Vitor dos Santos Dias, Carlos H. C. Teixeira, Dorgival O. Guedes, Wagner Meira Jr., Srinivasan Parthasarathy:
Fractal: A General-Purpose Graph Pattern Mining System. 1357-1374 - Anil Pacaci, M. Tamer Özsu:
Experimental Analysis of Streaming Algorithms for Graph Partitioning. 1375-1392 - Yufei Tao, Yuanbing Li, Guoliang Li:
Interactive Graph Search. 1393-1410
Research 15: Graphs 3
- Qizhen Zhang, Akash Acharya, Hongzhi Chen, Simran Arora, Ang Chen, Vincent Liu, Boon Thau Loo:
Optimizing Declarative Graph Queries at Large Scale. 1411-1428 - Myoungji Han, Hyunjoon Kim, Geonmo Gu, Kunsoo Park, Wook-Shin Han:
Efficient Subgraph Matching: Harmonizing Dynamic Programming, Adaptive Matching Order, and Failing Set Together. 1429-1446 - Bibek Bhattarai, Hang Liu, H. Howie Huang:
CECI: Compact Embedding Cluster Index for Scalable Subgraph Matching. 1447-1462 - Sarisht Wadhwa, Anagh Prasad, Sayan Ranu, Amitabha Bagchi, Srikanta Bedathur:
Efficiently Answering Regular Simple Path Queries on Large Labeled Networks. 1463-1480 - Mohammad Hossein Namaki, Qi Song, Yinghui Wu, Shengqi Yang:
Answering Why-questions by Exemplars in Attributed Graphs. 1481-1498 - Theofilos Mailis, Yannis Kotidis, Vaggelis Nikolopoulos, Evgeny Kharlamov, Ian Horrocks, Yannis E. Ioannidis:
An Efficient Index for RDF Query Containment. 1499-1516
Research 16: Machine Learning
- Fengan Li, Lingjiao Chen, Yijing Zeng, Arun Kumar, Xi Wu, Jeffrey F. Naughton, Jignesh M. Patel:
Tuple-oriented Compression for Large-scale Mini-batch Stochastic Gradient Descent. 1517-1534 - Lingjiao Chen, Paraschos Koutris, Arun Kumar:
Towards Model-based Pricing for Machine Learning in a Data Marketplace. 1535-1552 - Qingzhi Ma, Peter Triantafillou:
DBEst: Revisiting Approximate Query Processing Engines with Machine Learning Models. 1553-1570 - Side Li, Lingjiao Chen, Arun Kumar:
Enabling and Optimizing Non-linear Feature Interactions in Factorized Linear Algebra. 1571-1588 - Supun Nakandala, Arun Kumar, Yannis Papakonstantinou:
Incremental and Approximate Inference for Faster Occlusion-based Deep CNN Explanations. 1589-1606 - Johanna Sommer, Matthias Boehm, Alexandre V. Evfimievski, Berthold Reinwald, Peter J. Haas:
MNC: Structure-Exploiting Sparsity Estimation for Matrix Expressions. 1607-1623
Research 17: Scalability
- Daniel Kocher, Nikolaus Augsten:
A Scalable Index for Top-k Subtree Similarity Queries. 1624-1641 - Maximilian Schleich, Dan Olteanu, Mahmoud Abo Khamis, Hung Q. Ngo, XuanLong Nguyen:
A Layered Aggregate Engine for Analytics Workloads. 1642-1659 - Rana Alotaibi, Damian Bursztyn, Alin Deutsch, Ioana Manolescu, Stamatis Zampetakis:
Towards Scalable Hybrid Stores: Constraint-Based Rewriting to the Rescue. 1660-1677 - Prajakta Kalmegh, Shivnath Babu:
MIFO: A Query-Semantic Aware Resource Allocation Policy. 1678-1695 - Ailidani Ailijiang, Aleksey Charapko, Murat Demirbas:
Dissecting the Performance of Strongly-Consistent Replication Protocols. 1696-1710 - Dong Xie, Badrish Chandramouli, Yinan Li, Donald Kossmann:
FishStore: Faster Ingestion with Subset Hashing. 1711-1728
Industry 3: Data Platforms
- Haifeng Liu, Wei Ding, Yuan Chen, Weilong Guo, Shuoran Liu, Tianpeng Li, Mofei Zhang, Jianxing Zhao, Hongyin Zhu, Zhengyi Zhu:
CFS: A Distributed File System for Large Scale Container Platforms. 1729-1742 - Panagiotis Antonopoulos, Alex Budovski, Cristian Diaconu, Alejandro Hernandez Saenz, Jack Hu, Hanuma Kodavalla, Donald Kossmann, Sandeep Lingam, Umar Farooq Minhas, Naveen Prakash, Vijendra Purohit, Hugh Qu, Chaitanya Sreenivas Ravella, Krystyna Reisteter, Sheetal Shrotri, Dixin Tang, Vikram Wakade:
Socrates: The New SQL Server in the Cloud. 1743-1756 - Edmon Begoli, Tyler Akidau, Fabian Hueske, Julian Hyde, Kathryn Knight, Kenneth L. Knowles:
One SQL to Rule Them All - an Efficient and Syntactically Idiomatic Approach to Management of Streams and Tables. 1757-1772 - Jesús Camacho-Rodríguez, Ashutosh Chauhan, Alan Gates, Eugene Koifman, Owen O'Malley, Vineet Garg, Zoltan Haindrich, Sergey Shelukhin, Prasanth Jayachandran, Siddharth Seth, Deepak Jaiswal, Slim Bouguerra, Nishant Bangarwa, Sankar Hariappan, Anishek Agarwal, Jason Dere, Daniel Dai, Thejas Nair, Nita Dembla, Gopal Vijayaraghavan, Günther Hagleitner:
Apache Hive: From MapReduce to Enterprise-grade Big Data Warehousing. 1773-1786 - Christos Chrysafis, Ben Collins, Scott Dugas, Jay Dunkelberger, Moussa Ehsan, Scott Gray, Alec Grieser, Ori Herrnstadt, Kfir Lev-Ari, Tao Lin, Mike McMahon, Nicholas Schiefer, Alexander Shraer:
FoundationDB Record Layer: A Multi-Tenant Structured Datastore. 1787-1802 - Pulkit Agrawal, Rajat Arya, Aanchal Bindal, Sandeep Bhatia, Anupriya Gagneja, Joseph Godlewski, Yucheng Low, Timothy Muss, Mudit Manu Paliwal, Sethu Raman, Vishrut Shah, Bochao Shen, Laura Sugden, Kaiyu Zhao, Ming-Chuan Wu:
Data Platform for Machine Learning. 1803-1816
Student Abstracts
- Altan Birler:
Scalable Reservoir Sampling on Many-Core CPUs. 1817-1819 - Ali Davoudian:
Helios: An Adaptive and Query Workload-driven Partitioning Framework for Distributed Graph Stores. 1820-1822 - Akhil A. Dixit:
CAvSAT: A System for Query Answering over Inconsistent Databases. 1823-1825 - Saheli Ghosh:
Interactive Visualization For Big Spatial Data. 1826-1828 - Varun Jain, James Lennon, Harshita Gupta:
LSM-Trees and B-Trees: The Best of Both Worlds. 1829-1831