ACM SIGMOD Conference 2018: Houston, TX, USA
- Gautam Das, Christopher M. Jermaine, Philip A. Bernstein:
Proceedings of the 2018 International Conference on Management of Data, SIGMOD Conference 2018, Houston, TX, USA, June 10-15, 2018. ACM 2018
Keynote 1
- Eric A. Brewer:
Kubernetes and the New Cloud. 1
Research 1: Data Integration & Cleaning
- Sainyam Galhotra, Donatella Firmani, Barna Saha, Divesh Srivastava:
Robust Entity Resolution using Random Graphs. 3-18
- Sidharth Mudgal, Han Li, Theodoros Rekatsinas, AnHai Doan, Youngchoon Park, Ganesh Krishnan, Rohit Deep, Esteban Arcaute, Vijay Raghavendra:
Deep Learning for Entity Matching: A Design Space Exploration. 19-34
- Cong Yan, Yeye He:
Synthesizing Type-Detection Logic for Rich Semantic Data Types using Open-source Code. 35-50
- Jian Dai, Meihui Zhang, Gang Chen, Ju Fan, Kee Yuan Ngiam, Beng Chin Ooi:
Fine-grained Concept Linking using Neural Networks in Healthcare. 51-66
- Disheng Qiu, Luciano Barbosa, Valter Crescenzi, Paolo Merialdo, Divesh Srivastava:
Big Data Linkage for Product Specification Pages. 67-81
Research 2: Usability and Security/Privacy
- Ben McCamish, Vahid Ghadakchi, Arash Termehchy, Behrouz Touri, Liang Huang:
The Data Interaction Game. 83-98
- Yinjun Wu, Abdussalam Alawini, Susan B. Davidson, Gianmaria Silvello:
Data Citation: Giving Credit Where Credit is Due. 99-114
- Dan Zhang, Ryan McKenna, Ios Kotsogiannis, Michael Hay, Ashwin Machanavajjhala, Gerome Miklau:
EKTELO: A Framework for Defining Differentially-Private Computations. 115-130
- Graham Cormode, Tejas Kulkarni, Divesh Srivastava:
Marginal Release Under Local Differential Privacy. 131-146
- Cheng Xu, Jianliang Xu, Haibo Hu, Man Ho Au:
When Query Authentication Meets Fine-Grained Access Control: A Zero-Knowledge Approach. 147-162
- Florian Hahn, Nicolas Loza, Florian Kerschbaum:
Practical and Secure Substring Search. 163-176
Industry 1: Adaptive Query Processing
- Adam Dziedzic, Jingjing Wang, Sudipto Das, Bolin Ding, Vivek R. Narasayya, Manoj Syamala:
Columnstore and B+ tree - Are Hybrid Physical Designs Important? 177-190
- Alekh Jindal, Shi Qiao, Hiren Patel, Zhicheng Yin, Jieming Di, Malay Bag, Marc T. Friedman, Yifung Lin, Konstantinos Karanasos, Sriram Rao:
Computation Reuse in Analytics Job Service at Microsoft. 191-203
- Rebecca Taft, Nosayba El-Sayed, Marco Serafini, Yu Lu, Ashraf Aboulnaga, Michael Stonebraker, Ricardo Mayerhofer, Francisco Jose Andrade:
P-Store: An Elastic Database System with Predictive Provisioning. 205-219
- Edmon Begoli, Jesús Camacho-Rodríguez, Julian Hyde, Michael J. Mior, Daniel Lemire:
Apache Calcite: A Foundational Framework for Optimized Query Processing Over Heterogeneous Data Sources. 221-230
Research 3: Transactions and Indexing
- Xinan Yan, Linguan Yang, Hongbo Zhang, Xiayue Charles Lin, Bernard Wong, Kenneth Salem, Tim Brecht:
Carousel: Low-Latency Transaction Processing for Globally-Distributed Data. 231-243
- Ankur Sharma, Felix Martin Schuhknecht, Jens Dittrich:
Accelerating Analytical Processing in MVCC using Fine-Granular High-Frequency Virtual Snapshotting. 245-258
- Vivek Shah, Marcos Antonio Vaz Salles:
Reactors: A Case for Predictable, Virtualized Actor Database Systems. 259-274
- Badrish Chandramouli, Guna Prasaad, Donald Kossmann, Justin J. Levandoski, James Hunter, Mike Barnett:
FASTER: A Concurrent Key-Value Store with In-Place Updates. 275-290
- Mustafa Korkmaz, Martin Karsten, Kenneth Salem, Semih Salihoglu:
Workload-Aware CPU Performance Scaling for Transactional Database Systems. 291-306
Research 4: Query Processing
- Ruby Y. Tahboub, Grégory M. Essertel, Tiark Rompf:
How to Architect a Query Compiler, Revisited. 307-322
- Huanchen Zhang, Hyeontaek Lim, Viktor Leis, David G. Andersen, Michael Kaminsky, Kimberly Keeton, Andrew Pavlo:
SuRF: Practical Range Query Filtering with Fast Succinct Tries. 323-336
- Dmitri V. Kalashnikov, Laks V. S. Lakshmanan, Divesh Srivastava:
FastQRE: Fast Query Reverse Engineering. 337-350
- Thomas Kissinger, Dirk Habich, Wolfgang Lehner:
Adaptive Energy-Control for In-Memory Database Systems. 351-364
- Milos Nikolic, Dan Olteanu:
Incremental View Maintenance with Triple Lock Factorization Benefits. 365-380
Research 5: Graph Data Management
- Wenfei Fan, Xueli Liu, Ping Lu, Chao Tian:
Catching Numeric Inconsistencies in Graphs. 381-393
- Seongyun Ko, Wook-Shin Han:
TurboGraph++: A Scalable and Fast Graph Analytics System. 395-410
- Kyoungmin Kim, In Seo, Wook-Shin Han, Jeong-Hoon Lee, Sungpack Hong, Hassan Chafi, Hyungyu Shin, Geonhwa Jeong:
TurboFlux: A Fast Continuous Subgraph Matching System for Streaming Graph Data. 411-426
- Wenfei Fan, Chunming Hu, Xueli Liu, Ping Lu:
Discovering Graph Functional Dependencies. 427-439
- Zhewei Wei, Xiaodong He, Xiaokui Xiao, Sibo Wang, Shuo Shang, Ji-Rong Wen:
TopPPR: Top-k Personalized PageRank Queries with Precision Guarantees on Large Graphs. 441-456
- Rong-Hua Li, Lu Qin, Fanghua Ye, Jeffrey Xu Yu, Xiaokui Xiao, Nong Xiao, Zibin Zheng:
Skyline Community Search in Multi-valued Networks. 457-472
Research 6: Storage & Indexing
- Ziqi Wang, Andrew Pavlo, Hyeontaek Lim, Viktor Leis, Huanchen Zhang, Michael Kaminsky, David G. Andersen:
Building a Bw-Tree Takes More Than Just Buzz Words. 473-488
- Tim Kraska, Alex Beutel, Ed H. Chi, Jeffrey Dean, Neoklis Polyzotis:
The Case for Learned Index Structures. 489-504
- Niv Dayan, Stratos Idreos:
Dostoevsky: Better Space-Time Trade-Offs for LSM-Tree Based Key-Value Stores via Adaptive Removal of Superfluous Merging. 505-520
- Robert Binna, Eva Zangerle, Martin Pichl, Günther Specht, Viktor Leis:
HOT: A Height Optimized Trie Index for Main-Memory Database Systems. 521-534
- Stratos Idreos, Kostas Zoumpatianos, Brian Hentschel, Michael S. Kester, Demi Guo:
The Data Calculator: Data Structure Design and Cost Synthesis from First Principles and Learned Cost Models. 535-550
- Mohiuddin Abdul Qader, Shiwen Cheng, Vagelis Hristidis:
A Comparative Study of Secondary Indexing Techniques in LSM-based NoSQL Databases. 551-566
- Tao Guo, Kaiyu Feng, Gao Cong, Zhifeng Bao:
Efficient Selection of Geospatial Data on Maps for Interactive and Visualized Exploration. 567-582
Industry 2: Real-time Analytics
- Jean-François Im, Kishore Gopalakrishna, Subbu Subramaniam, Mayank Shrivastava, Adwait Tumbde, Xiaotian Jiang, Jennifer Dai, Seunghyun Lee, Neha Pawar, Jialiang Li, Ravi Aringunram:
Pinot: Realtime OLAP for 530 Million Users. 583-594
- Peilin Yang, Srikanth Thiagarajan, Jimmy Lin:
Robust, Scalable, Real-Time Event Time Series Aggregation at Twitter. 595-599
- Michael Armbrust, Tathagata Das, Joseph Torres, Burak Yavuz, Shixiong Zhu, Reynold Xin, Ali Ghodsi, Ion Stoica, Matei Zaharia:
Structured Streaming: A Declarative API for Real-Time Applications in Apache Spark. 601-613
- Wei Cao, Yusong Gao, Bingchen Lin, Xiaojie Feng, Yu Xie, Xiao Lou, Peng Wang:
TcpRT: Instrument and Diagnostic Analysis System for Service Quality of Cloud Databases at Massive Scale in Real-time. 615-627
Keynote 2
- Pedro M. Domingos:
Machine Learning for Data Management: Problems and Solutions. 629
Research 7: Tuning, Monitoring & Query Optimization
- Lin Ma, Dana Van Aken, Ahmed Hefny, Gustavo Mezerhane, Andrew Pavlo, Geoffrey J. Gordon:
Query-based Workload Forecasting for Self-Driving Database Management Systems. 631-645
- Zechao Shang, Jeffrey Xu Yu, Aaron J. Elmore:
RushMon: Real-time Isolation Anomalies Monitoring. 647-662
- Florian Wolf, Norman May, Paul R. Willems, Kai-Uwe Sattler:
On the Calculation of Optimality Ranges for Relational Query Execution Plans. 663-675
- Thomas Neumann, Bernhard Radke:
Adaptive Optimization of Very Large Join Queries. 677-692
- TaiNing Wang, Chee-Yong Chan:
Improving Join Reorderability with Compensation Operators. 693-708
Research 8: Spatial Data & Streams
- Dian Ouyang, Lu Qin, Lijun Chang, Xuemin Lin, Ying Zhang, Qing Zhu:
When Hierarchy Meets 2-Hop-Labeling: Efficient Shortest Distance Queries on Road Networks. 709-724
- Zeyuan Shang, Guoliang Li, Zhifeng Bao:
DITA: Distributed In-Memory Trajectory Analytics. 725-740
- Yang Zhou, Tong Yang, Jie Jiang, Bin Cui, Minlan Yu, Xiaoming Li, Steve Uhlig:
Cold Filter: A Meta-Framework for Faster and More Accurate Stream Processing. 741-756
- Kai Sheng Tai, Vatsal Sharan, Peter Bailis, Gregory Valiant:
Sketching Linear Classifiers over Data Streams. 757-772
- Yongxin Tong, Libin Wang, Zimu Zhou, Lei Chen, Bowen Du, Jieping Ye:
Dynamic Pricing in Spatial Crowdsourcing: A Matching-Based Approach. 773-788
Industry 3: DB Systems in the Cloud and Open Source
- Alexandre Verbitski, Anurag Gupta, Debanjan Saha, James Corey, Kamal Gupta, Murali Brahmadesam, Raman Mittal, Sailesh Krishnamurthy, Sandor Maurice, Tengiz Kharatishvili, Xiaofeng Bao:
Amazon Aurora: On Avoiding Distributed Consensus for I/Os, Commits, and Membership Changes. 789-796
- Ben Vandiver, Shreya Prasad, Pratibha Rana, Eden Zik, Amin Saeidi, Pratyush Parimal, Styliani Pantela, Jaimin Dave:
Eon Mode: Bringing the Vertica Columnar Database to the Cloud. 797-809
- Jose Picado, Willis Lang, Edward C. Thayer:
Survivability of Cloud Databases - Factors and Prediction. 811-823
- Lyublena Antova, Derrick Bryant, Tuan Cao, Michael Duller, Mohamed A. Soliman, Florian M. Waas:
Rapid Adoption of Cloud Data Warehouse Technology Using Datometry Hyper-Q. 825-839
Research 9: Similarity Queries & Estimation
- Ildar Absalyamov, Michael J. Carey, Vassilis J. Tsotras:
Lightweight Cardinality Estimation in LSM-based Systems. 841-855
- Brian Hentschel, Michael S. Kester, Stratos Idreos:
Column Sketches: A Scan Accelerator for Rapid and Robust Predicate Evaluation. 857-872
- Wenhai Li, Lingfeng Deng, Yang Li, Chen Li:
ZigZag: Supporting Similarity Queries on Vector Space Models. 873-888
- Yiqiu Wang, Anshumali Shrivastava, Jonathan Wang, Junghee Ryu:
Randomized Algorithms Accelerated over CPU-GPU for Ultra-High Dimensional Similarity Search. 889-903
- Dong Deng, Yufei Tao, Guoliang Li:
Overlap Set Similarity Joins with Theoretical Guarantees. 905-920
Research 10: Analytical Queries
- Yinglong Song, Huey-Eng Chua, Sourav S. Bhowmick, Byron Choi, Shuigeng Zhou:
BOOMER: Blending Visual Formulation and Processing of P-Homomorphic Queries on Large Networks. 927-942
- Yihan Gao, Silu Huang, Aditya G. Parameswaran:
Navigating the Data Lake with DATAMARAN: Automatically Extracting Structure from Log Datasets. 943-958
- Min Xie, Raymond Chi-Wing Wong, Jian Li, Cheng Long, Ashwin Lall:
Efficient k-Regret Query Algorithm with Restriction-free Bound for any Dimensionality. 959-974
- Kaiyu Li, Xiaohang Zhang, Guoliang Li:
A Rating-Ranking Method for Crowdsourced Top-k Computation. 975-990
- Jing Tang, Xueyan Tang, Xiaokui Xiao, Junsong Yuan:
Online Processing Algorithms for Influence Maximization. 991-1005
- Eyal Dushkin, Tova Milo:
Top-k Sorting Under Partial Order Information. 1007-1019
Research 11: Data Mining
- Babak Salimi, Johannes Gehrke, Dan Suciu:
Bias in OLAP Queries: Detection, Explanation, and Removal. 1021-1035
- Yanqing Peng, Jinwei Guo, Feifei Li, Weining Qian, Aoying Zhou:
Persistent Bloom Filter: Membership Testing for the Entire History. 1037-1052
- Michele Linardi, Yan Zhu, Themis Palpanas, Eamonn J. Keogh:
Matrix Profile X: VALMOD - Scalable Discovery of Variable-Length Motifs in Data Series. 1053-1066
- Junhao Gan, Yufei Tao:
Fast Euclidean OPTICS with Bounded Precision in Low Dimensional Space. 1067-1082
- Matthias Ruhl, Mukund Sundararajan, Qiqi Yan:
The Cascading Analysts Algorithm. 1083-1096
- Xiangyu Ke, Arijit Khan, Gao Cong:
Finding Seeds and Relevant Tags Jointly: For Targeted Influence Maximization in Social Networks. 1097-1111
- Sibo Wang, Yufei Tao:
Efficient Algorithms for Finding Approximate Heavy Hitters in Personalized PageRanks. 1113-1127
- Daniel Ting:
Data Sketches for Disaggregated Subset Sum and Frequent Item Estimation. 1129-1140
Research 12: Distributed and Parallel Databases
- Wenfei Fan, Ping Lu, Xiaojian Luo, Jingbo Xu, Qiang Yin, Wenyuan Yu, Ruiqi Xu:
Adaptive Asynchronous Parallelization of Graph Algorithms. 1141-1156
- Raul Castro Fernandez, William Culhane, Pijika Watcharapichat, Matthias Weidlich, Victoria Lopez Morales, Peter R. Pietzuch:
Meta-Dataflows: Efficient Exploratory Dataflow Jobs. 1157-1172
- Hwanjun Song, Jae-Gil Lee:
RP-DBSCAN: A Superfast Parallel DBSCAN Algorithm Based on Random Partitioning. 1173-1187
- Jia Zou, R. Matthew Barnett, Tania Lorido-Botran, Shangyu Luo, Carlos Monroy, Sourav Sikdar, Kia Teymourian, Binhang Yuan, Chris Jermaine:
PlinyCompute: A Platform for High-Performance, Distributed, Data-Intensive Tool Development. 1189-1204
- Maaz Bin Safeer Ahmad, Alvin Cheung:
Automatically Leveraging MapReduce Frameworks for Data-Intensive Applications. 1205-1220
- Faisal Nawab, Divyakant Agrawal, Amr El Abbadi:
DPaxos: Managing Data Closer to Users for Low-Latency and Mobile Applications. 1221-1236
- Rundong Li, Mirek Riedewald, Xinyan Deng:
Submodularity of Distributed Join Computation. 1237-1252
- Ryan Marcus, Olga Papaemmanouil, Sofiya Semenova, Solomon Garber:
NashDB: An End-to-End Economic Method for Elastic Database Fragmentation, Replication, and Provisioning. 1253-1267
Research 13: Machine Learning & Knowledge-base Construction
- Jiawei Jiang, Fangcheng Fu, Tong Yang, Bin Cui:
SketchML: Accelerating Distributed Machine Learning with Data Sketches. 1269-1284
- Manasi Vartak, Joana M. F. da Trindade, Samuel Madden, Matei Zaharia:
MISTIQUE: A System to Store and Query Model Intermediates for Model Diagnosis. 1285-1300
- Sen Wu, Luke Hsiao, Xiao Cheng, Braden Hancock, Theodoros Rekatsinas, Philip Alexander Levis, Christopher Ré:
Fonduer: Knowledge Base Construction from Richly Formatted Data. 1301-1316
- Gensheng Zhang, Damian Jimenez, Chengkai Li:
Maverick: Discovering Exceptional Facts from Knowledge Graphs. 1317-1332
- Jinfeng Li, Xiao Yan, Jie Zhang, An Xu, James Cheng, Jie Liu, Kelvin Kai Wing Ng, Ti-Chung Cheng:
A General and Efficient Querying Method for Learning to Hash. 1333-1347
- Hao Xin, Rui Meng, Lei Chen:
Subjective Knowledge Base Construction Powered By Crowdsourcing and Knowledge Base. 1349-1361
- Jiawei Jiang, Bin Cui, Ce Zhang, Fangcheng Fu:
DimBoost: Boosting Gradient Boosting Decision Tree to Higher Dimensions. 1363-1376
- Zhipeng Huang, Yeye He:
Auto-Detect: Data-Driven Error Detection in Tables. 1377-1392
Industry 4: Graph databases & Query Processing on Modern Hardware
- Pramod A. Jamkhedkar, Theodore Johnson, Yaron Kanza, Aman Shaikh, N. K. Shankaranarayanan, Vladislav Shkapenyuk:
A Graph Database for a Virtualized Network Infrastructure. 1393-1405
- Cagri Balkesen, Nitin Kunal, Georgios Giannikis, Pit Fender, Seema Sundara, Felix Schmidt, Jarod Wen, Sandeep R. Agrawal, Arun Raghavan, Venkatanathan Varadarajan, Anand Viswanathan, Balakrishnan Chandrasekaran, Sam Idicula, Nipun Agarwal, Eric Sedlar:
RAPID: In-Memory Analytical Query Processing Engine with Extreme Performance per Watt. 1407-1419
- Renzo Angles, Marcelo Arenas, Pablo Barceló, Peter A. Boncz, George H. L. Fletcher, Claudio Gutierrez, Tobias Lindaaker, Marcus Paradies, Stefan Plantikow, Juan F. Sequeda, Oskar van Rest, Hannes Voigt:
G-CORE: A Core for Future Graph Query Languages. 1421-1432
- Nadime Francis, Alastair Green, Paolo Guagliardo, Leonid Libkin, Tobias Lindaaker, Victor Marsault, Stefan Plantikow, Mats Rydberg, Petra Selmer, Andrés Taylor:
Cypher: An Evolving Query Language for Property Graphs. 1433-1445
- Michal Nowakiewicz, Eric Boutin, Eric N. Hanson, Robert Walzer, Akash Katipally:
BIPie: Fast Selection and Aggregation on Encoded Data using Operator Specialization. 1447-1459
Research 14: Approximate Query Processing
- Yongjoo Park, Barzan Mozafari, Joseph Sorenson, Junhao Wang:
VerdictDB: Universalizing Approximate Query Processing. 1461-1476
- Jinglin Peng, Dongxiang Zhang, Jiannan Wang, Jian Pei:
AQP++: Connecting Approximate Query Processing With Aggregate Precomputation for Interactive Analytics. 1477-1492
- Yao Lu, Aakanksha Chowdhery, Srikanth Kandula, Surajit Chaudhuri:
Accelerating Machine Learning Inference with Probabilistic Predicates. 1493-1508
- Uzi Cohen, Batya Kenig, Haoyue Ping, Benny Kimelfeld, Julia Stoyanovich:
A Query Engine for Probabilistic Preferences. 1509-1524
- Zhuoyue Zhao, Robert Christensen, Feifei Li, Xiao Hu, Ke Yi:
Random Sampling over Joins Revisited. 1525-1539
Research 15: Databases for Emerging Hardware
- Alexander van Renen, Viktor Leis, Alfons Kemper, Thomas Neumann, Takushi Hashida, Kazuichi Oe, Yoshiyasu Doi, Lilian Harada, Mitsuru Sato:
Managing Non-Volatile Memory in Database Systems. 1541-1555
- Anil Shanbhag, Holger Pirk, Samuel Madden:
Efficient Top-K Query Processing on Massively Parallel Hardware. 1557-1570
- Dong Young Yoon, Mosharaf Chowdhury, Barzan Mozafari:
Distributed Lock Management with RDMA: Decentralization without Starvation. 1571-1586
- Shuo Han, Lei Zou, Jeffrey Xu Yu:
Speeding Up Set Intersections in Graph Algorithms using SIMD Instructions. 1587-1602
- Henning Funke, Sebastian Breß, Stefan Noll, Volker Markl, Jens Teubner:
Pipelined Query Processing in Coprocessor Environments. 1603-1618
- Till Kolditz, Dirk Habich, Wolfgang Lehner, Matthias Werner, Stefan T. J. de Bruijn:
AHEAD: Adaptable Data Hardening for On-the-Fly Hardware Error Detection during Database Query Processing. 1619-1634
Special Session: A Technical Research Agenda in Data Ethics and Responsible Data Management
- Julia Stoyanovich, Bill Howe, H. V. Jagadish:
Special Session: A Technical Research Agenda in Data Ethics and Responsible Data Management. 1635-1636
Tutorials
- Lilong Jiang, Protiva Rahman, Arnab Nandi:
Evaluating Interactive Data Systems: Workloads, Metrics, and Guidelines. 1637-1644
- Xin Luna Dong, Theodoros Rekatsinas:
Data Integration and Machine Learning: A Natural Synergy. 1645-1650
- Georgia Koutrika:
Modern Recommender Systems: from Computing Matrices to Thinking with Neurons. 1651-1654
- Graham Cormode, Somesh Jha, Tejas Kulkarni, Ninghui Li, Divesh Srivastava, Tianhao Wang:
Privacy at Scale: Local Differential Privacy in Practice. 1655-1658
- Paris Koutris, Semih Salihoglu, Dan Suciu:
Algorithmic Aspects of Parallel Query Processing. 1659-1664