default search action
ACM SIGMOD Conference 2016: San Francisco, CA, USA
- Fatma Özcan, Georgia Koutrika, Sam Madden:
Proceedings of the 2016 International Conference on Management of Data, SIGMOD Conference 2016, San Francisco, CA, USA, June 26 - July 01, 2016. ACM 2016, ISBN 978-1-4503-3531-7
Keynote - Jeff Dean
- Jeff Dean:
Building Machine Learning Systems that Understand. 1
Session 1 - Scalable Analytics and Machine Learning
- Maximilian Schleich, Dan Olteanu, Radu Ciucanu:
Learning Linear Regression Models over Factorized Joins. 3-18 - Arun Kumar, Jeffrey F. Naughton, Jignesh M. Patel, Xiaojin Zhu:
To Join or Not to Join?: Thinking Twice about Joins before Feature Selection. 19-34 - Yanxiang Huang, Bin Cui, Jie Jiang, Kunqian Hong, Wenyu Zhang, Yiran Xie:
Real-time Video Recommendation Exploration. 35-46 - Akash Das Sarma, Aditya G. Parameswaran, Jennifer Widom:
Towards Globally Optimal Crowdsourcing Quality Management: The Uniform Worker Setting. 47-62 - Jeff LeFevre, Rui Liu, Cornelio Inigo, Lupita Paz, Edward Ma, Malú Castellanos, Meichun Hsu:
Building the Enterprise Fabric for Big Data with Vertica and Spark Integration. 63-75 - Xin Huang, Wei Lu, Laks V. S. Lakshmanan:
Truss Decomposition of Probabilistic Graphs: Semantics and Algorithms. 77-90 - Rong-Hua Li, Lu Qin, Jeffrey Xu Yu, Rui Mao:
Efficient and Progressive Group Steiner Tree Search. 91-106
Session 2 - Privacy and Security
- Zach Jorgensen, Ting Yu, Graham Cormode:
Publishing Attributed Social Graphs with Formal Privacy Guarantees. 107-122 - Wei-Yen Day, Ninghui Li, Min Lyu:
Publishing Graph Degree Distribution with Node Differential Privacy. 123-138 - Michael Hay, Ashwin Machanavajjhala, Gerome Miklau, Yan Chen, Dan Zhang:
Principled Evaluation of Differentially Private Algorithms using DPBench. 139-154 - Jun Zhang, Xiaokui Xiao, Xing Xie:
PrivTree: A Differentially Private Algorithm for Hierarchical Decompositions. 155-170 - Panagiotis Karras, Artyom Nikitin, Muhammad Saad, Rudrika Bhatt, Denis Antyukhov, Stratos Idreos:
Adaptive Indexing over Encrypted Numeric Data. 171-183 - Ioannis Demertzis, Stavros Papadopoulos, Odysseas Papapetrou, Antonios Deligiannakis, Minos N. Garofalakis:
Practical Private Range Search Revisited. 185-198 - Zhao Chang, Lei Zou, Feifei Li:
Privacy Preserving Subgraph Matching on Large Graphs in Cloud. 199-213
Session 3 - Logical and Physical Database Design
- Benoît Dageville, Thierry Cruanes, Marcin Zukowski, Vadim Antonov, Artin Avanes, Jon Bock, Jonathan Claybaugh, Daniel Engovatov, Martin Hentschel, Jiansheng Huang, Allison W. Lee, Ashish Motivala, Abdul Q. Munir, Steven Pelley, Peter Povinec, Greg Rahn, Spyridon Triantafyllis, Philipp Unterbrunner:
The Snowflake Elastic Data Warehouse. 215-226 - Zhen Hua Liu, Beda Christoph Hammerschmidt, Douglas Mcmahon, Ying Liu, Hui Joe Chang:
Closing the functional and Performance Gap between SQL and NoSQL. 227-238 - Dipti Borkar, Ravi Mayuram, Gerald Sangudi, Michael J. Carey:
Have Your Data and Query It Too: From Key-Value Caching to Big Data Management. 239-251 - Shadi A. Noghabi, Sriram Subramanian, Priyesh Narayanan, Sivabalan Narayanan, Gopalakrishna Holla, Mammad Zadeh, Tianwei Li, Indranil Gupta, Roy H. Campbell:
Ambry: LinkedIn's Scalable Geo-Distributed Object Store. 253-265 - Henning Köhler, Sebastian Link:
SQL Schema Design: Foundations, Normal Forms, and Normalization. 267-279 - Shrainik Jain, Dominik Moritz, Daniel Halperin, Bill Howe, Ed Lazowska:
SQLShare: Results from a Multi-Year SQL-as-a-Service Experiment. 281-293 - Michael DiScala, Daniel J. Abadi:
Automatic Generation of Normalized Relational Schemas from Nested Key-Value Data. 295-310
Session 4 - New Storage and Network Architectures
- Harald Lang, Tobias Mühlbauer, Florian Funke, Peter A. Boncz, Thomas Neumann, Alfons Kemper:
Data Blocks: Hybrid OLTP and OLAP on Compressed Storage using both Vectorization and Compilation. 311-326 - Niv Dayan, Philippe Bonnet, Stratos Idreos:
GeckoFTL: Scalable Flash Translation Techniques For Very Large Flash Devices. 327-342 - Gihwan Oh, Chiyoung Seo, Ravi Mayuram, Yang-Suk Kee, Sang-Won Lee:
SHARE Interface in Flash Storage for Relational and NoSQL Databases. 343-354 - Feng Li, Sudipto Das, Manoj Syamala, Vivek R. Narasayya:
Accelerating Relational Databases by Leveraging Remote Memory and RDMA. 355-370 - Ismail Oukid, Johan Lasperas, Anisoara Nica, Thomas Willhalm, Wolfgang Lehner:
FPTree: A Hybrid SCM-DRAM Persistent and Concurrent B-Tree for Storage Class Memory. 371-386 - Utku Sirin, Pinar Tözün, Danica Porobic, Anastasia Ailamaki:
Micro-architectural Analysis of In-memory OLTP. 387-402
Session 5 - Graphs 1: Infrastructure and Processing on Modern Hardware
- Hang Liu, H. Howie Huang, Yang Hu:
iBFS: Concurrent Breadth-First Search on GPUs. 403-416 - Xiaogang Shi, Bin Cui, Yingxia Shao, Yunhai Tong:
Tornado: A System For Real-Time Iterative Analysis Over Evolving Data. 417-430 - Christopher R. Aberger, Susan Tu, Kunle Olukotun, Christopher Ré:
EmptyHeaded: A Relational Engine for Graph Processing. 431-446 - Min-Soo Kim, Kyuhyeon An, Himchan Park, Hyunseok Seo, Jinwook Kim:
GTS: A Fast and Scalable Graph Processing Method based on Streaming Topology to GPUs. 447-461 - Zechao Shang, Feifei Li, Jeffrey Xu Yu, Zhiwei Zhang, Hong Cheng:
Graph Analytics Through Fine-Grained Parallelism. 463-478 - Zhigang Wang, Yu Gu, Yubin Bao, Ge Yu, Jeffrey Xu Yu:
Hybrid Pulling/Pushing for I/O-Efficient Distributed and Iterative Graph Computing. 479-494
Session 6 - Streaming 1: Systems and Outlier Detection
- Medhabi Ray, Chuan Lei, Elke A. Rundensteiner:
Scalable Pattern Sharing on Event Streams. 495-510 - Milos Nikolic, Mohammad Dashti, Christoph Koch:
How to Win a Hot Dog Eating Contest: Distributed Incremental View Maintenance with Batch Updates. 511-526 - Lei Cao, Jiayuan Wang, Elke A. Rundensteiner:
Sharing-Aware Outlier Analytics over High-Volume Data Streams. 527-540 - Evangelia Kalyvianaki, Marco Fiscato, Theodoros Salonidis, Peter R. Pietzuch:
THEMIS: Fairness in Federated Stream Processing under Overload. 541-553 - Alexandros Koliousis, Matthias Weidlich, Raul Castro Fernandez, Alexander L. Wolf, Paolo Costa, Peter R. Pietzuch:
SABER: Window-Based Hybrid Stream Processing for Heterogeneous Architectures. 555-569 - Miao Qiao, Junhao Gan, Yufei Tao:
Range Thresholding on Streams. 571-582
Session 7 - Approximate Query Processing
- Joy Arulraj, Andrew Pavlo, Prashanth Menon:
Bridging the Archipelago between Row-Stores and Column-Stores for Hybrid Workloads. 583-598 - Yang Cao, Wenfei Fan:
An Effective Syntax for Bounded Relational Queries. 599-614 - Feifei Li, Bin Wu, Ke Yi, Zhuoyue Zhao:
Wander Join: Online Aggregation via Random Walks. 615-629 - Srikanth Kandula, Anil Shanbhag, Aleksandar Vitorovic, Matthaios Olma, Robert Grandl, Surajit Chaudhuri, Bolin Ding:
Quickr: Lazily Approximating Complex AdHoc Queries in BigData Clusters. 631-646 - Shuang Chen, Shunning Jiang, Bingsheng He, Xueyan Tang:
A Study of Sorting Algorithms on Approximate Memory. 647-662 - Ioannis Mytilinis, Dimitrios Tsoumakos, Nectarios Koziris:
Distributed Wavelet Thresholding for Maximum Error Metrics. 663-677 - Bolin Ding, Silu Huang, Surajit Chaudhuri, Kaushik Chakrabarti, Chi Wang:
Sample + Seek: Approximating Aggregates with Distribution Precision Guarantee. 679-694
Session 8 - Networks and the Web
- Hung T. Nguyen, My T. Thai, Thang N. Dinh:
Stop-and-Stare: Optimal Sampling Algorithms for Viral Marketing in Billion-scale Networks. 695-710 - Yasir Mehmood, Francesco Bonchi, David García-Soriano:
Spheres of Influence for More Effective Viral Marketing. 711-726 - Yu Yang, Xiangbo Mao, Jian Pei, Xiaofei He:
Continuous Influence Maximization: What Discounts Should We Offer to Social Network Users? 727-741 - Sainyam Galhotra, Akhil Arora, Shourya Roy:
Holistic Influence Maximization: Combining Scalability and Efficiency with Opinion-Aware Models. 743-758 - Astrid Rheinländer, Mario Lehmann, Anja Kunkel, Jörg Meier, Ulf Leser:
Potential and Pitfalls of Domain-Specific Information Extraction at Web Scale. 759-771 - Tim Furche, Jinsong Guo, Sebastian Maneth, Christian Schallhart:
Robust and Noise Resistant Wrapper Induction. 773-784
Session 9 - Data Discovery and Extraction
- Alon Y. Halevy, Flip Korn, Natalya Fridman Noy, Christopher Olston, Neoklis Polyzotis, Sudip Roy, Steven Euijong Whang:
Goods: Organizing Google's Datasets. 795-806 - Tomer Sagi, Avigdor Gal, Omer Barkol, Ruth Bergman, Alexander Avram:
Multi-Source Uncertain Entity Resolution at Yad Vashem: Transforming Holocaust Victim Reports into People. 807-819 - Thorsten Papenbrock, Felix Naumann:
A Hybrid Approach to Functional Dependency Discovery. 821-833 - Yang Chen, Sean Goldberg, Daisy Zhe Wang, Soumitra Siddharth Johri:
Ontological Pathfinding. 835-846 - Ce Zhang, Jaeho Shin, Christopher Ré, Michael J. Cafarella, Feng Niu:
Extracting Databases from Dark Data with DeepDive. 847-859 - Yeounoh Chung, Michael Lind Mortensen, Carsten Binnig, Tim Kraska:
Estimating the Impact of Unknown Unknowns on Aggregate Query Results. 861-876
Session 10 - Data Integration / Cleaning
- Shaoxu Song, Han Zhu, Jianmin Wang:
Constraint-Variance Tolerant Data Repairing. 877-892 - Jian He, Enzo Veltri, Donatello Santoro, Guoliang Li, Giansalvatore Mecca, Paolo Papotti, Nan Tang:
Interactive and Deterministic Data Cleaning. 893-907 - Aoqian Zhang, Shaoxu Song, Jianmin Wang:
Sequential Data Cleaning: A Statistical Approach. 909-924 - Asif Iqbal Baba, Manfred Jaeger, Hua Lu, Torben Bach Pedersen, Wei-Shinn Ku, Xike Xie:
Learning-Based Cleansing for Indoor RFID Data. 925-936 - Sanjay Krishnan, Jiannan Wang, Michael J. Franklin, Ken Goldberg, Tim Kraska:
PrivateClean: Data Cleaning and Differential Privacy. 937-951 - Sebastian Kruse, Anja Jentzsch, Thorsten Papenbrock, Zoi Kaoudi, Jorge-Arnulfo Quiané-Ruiz, Felix Naumann:
RDFind: Scalable Conditional Inclusion Dependency Discovery in RDF Datasets. 953-967 - Chengliang Chai, Guoliang Li, Jian Li, Dong Deng, Jianhua Feng:
Cost-Effective Crowdsourced Entity Resolution: A Partial-Order Approach. 969-984
Session 11 - Spatio / Temporal Databases
- Kaiqi Zhao, Lisi Chen, Gao Cong:
Topic Exploration in Spatio-Temporal Document Collections. 985-998 - Markus Pilman, Martin Kaufmann, Florian Köhl, Donald Kossmann, Damien Profeta:
ParTime: Parallel Temporal Aggregation. 999-1010 - Fernando Chirigati, Harish Doraiswamy, Theodoros Damoulas, Juliana Freire:
Data Polygamy: The Many-Many Relationships among Urban Spatio-Temporal Data Sets. 1011-1025 - Julien Pilourdault, Vincent Leroy, Sihem Amer-Yahia:
Distributed Evaluation of Top-k Temporal Joins. 1027-1039 - Peter Ogden, David B. Thomas, Peter R. Pietzuch:
AT-GIS: Highly Parallel Spatial Query Processing with Associative Transducers. 1041-1054 - Kaiyu Feng, Gao Cong, Sourav S. Bhowmick, Wen-Chih Peng, Chunyan Miao:
Towards Best Region Search for Data Exploration. 1055-1070 - Dong Xie, Feifei Li, Bin Yao, Gefei Li, Liang Zhou, Minyi Guo:
Simba: Efficient In-Memory Spatial Analytics. 1071-1085
Session 12 - Distributed Data Processing
- Guoqiang Jerry Chen, Janet L. Wiener, Shridhar Iyer, Anshul Jaiswal, Ran Lei, Nikhil Simha, Wei Wang, Kevin Wilfong, Tim Williamson, Serhat Yilmaz:
Realtime Data Processing at Facebook. 1087-1098 - Shivaram Venkataraman, Zongheng Yang, Davies Liu, Eric Liang, Hossein Falaki, Xiangrui Meng, Reynold Xin, Ali Ghodsi, Michael J. Franklin, Ion Stoica, Matei Zaharia:
SparkR: Scaling R Programs with Spark. 1099-1104 - Andrei Costea, Adrian Ionescu, Bogdan Raducanu, Michal Switakowski, Cristian Bârca, Juliusz Sompolski, Alicja Luszczak, Michal Szafranski, Giel de Nijs, Peter A. Boncz:
VectorH: Taking SQL-on-Hadoop to the Next Level. 1105-1117 - Chang Yao, Divyakant Agrawal, Gang Chen, Beng Chin Ooi, Sai Wu:
Adaptive Logging: Optimizing Logging and Recovery Costs in Distributed In-memory Databases. 1119-1134 - Alexander Shkapsky, Mohan Yang, Matteo Interlandi, Hsuan Chiu, Tyson Condie, Carlo Zaniolo:
Big Data Analytics with Datalog Queries on Spark. 1135-1149 - Tova Milo, Eyal Altshuler:
An Efficient MapReduce Cube Algorithm for Varied DataDistributions. 1151-1165
Session 13 - Graphs 2: Subgraph-based Optimization Techniques
- Zhengwei Yang, Ada Wai-Chee Fu, Ruifeng Liu:
Diversified Top-k Subgraph Querying in a Large Graph. 1167-1182 - Mohamed S. Hassan, Walid G. Aref, Ahmed M. Aly:
Graph Indexing for Shortest-Path Finding over Dynamic Sub-Graphs. 1183-1197 - Fei Bi, Lijun Chang, Xuemin Lin, Lu Qin, Wenjie Zhang:
Efficient Subgraph Matching by Postponing Cartesian Products. 1199-1214 - Wenfei Fan, Yinghui Wu, Jingbo Xu:
Adding Counting Quantifiers to Graph Patterns. 1215-1230 - Hyeonji Kim, Juneyoung Lee, Sourav S. Bhowmick, Wook-Shin Han, Jeong-Hoon Lee, Seongyun Ko, Moath H. A. Jarrah:
DUALSIM: Parallel Subgraph Enumeration in a Massive Graph on a Single Machine. 1231-1245 - Sairam Gurajada, Martin Theobald:
Distributed Set Reachability. 1247-1261
Session 14 - Main Memory Analytics
- Wenjian Xu, Ziqiang Feng, Eric Lo:
Fast Multi-Column Sorting in Main-Memory Column-Stores. 1263-1278 - Li Wang, Minqi Zhou, Zhenjie Zhang, Yin Yang, Aoying Zhou, Dina Bitton:
Elastic Pipelining in an In-Memory Database Cluster. 1279-1294 - Reza Sherkat, Colin Florendo, Mihnea Andrei, Anil K. Goel, Anisoara Nica, Peter Bumbulis, Ivan Schreter, Günter Radestock, Christian Bensberg, Daniel Booss, Heiko Gerwens:
Page As You Go: Piecewise Columnar Access In SAP HANA. 1295-1306 - Juchang Lee, Hyungyu Shin, Chang Gyoo Park, Seongyun Ko, Jaeyun Noh, Yongjae Chuh, Wolfgang Stephan, Wook-Shin Han:
Hybrid Garbage Collection for Multi-Version Concurrency Control in SAP HANA. 1307-1318 - Manos Athanassoulis, Zheng Yan, Stratos Idreos:
UpBit: Scalable In-Memory Updatable Bitmap Indexing. 1319-1332
Session 15 - Interactive Analytics
- Roee Ebenstein, Niranjan Kamat, Arnab Nandi:
FluxQuery: An Execution Framework for Highly Interactive Query Workloads. 1333-1345 - Kai Zeng, Sameer Agarwal, Ion Stoica:
iOLAP: Managing Uncertainty for Efficient Incremental OLAP. 1347-1361 - Leilani Battle, Remco Chang, Michael Stonebraker:
Dynamic Prefetching of Data Tiles for Interactive Visualization. 1363-1375 - Eirik Bakke, David R. Karger:
Expressive Query Construction through Direct Manipulation of Nested Relational Results. 1377-1392 - Gokul Nath Babu Manoharan, Stephan Ellner, Karl Schnaitter, Sridatta Chegu, Alejandro Estrella-Balderrama, Stephan Gudmundson, Apurv Gupta, Ben Handy, Bart Samwel, Chad Whipkey, Larysa Aharkava, Himani Apte, Nitin Gangahar, Jun Xu, Shivakumar Venkataraman, Divyakant Agrawal, Jeffrey D. Ullman:
Shasta: Interactive Reporting At Scale. 1393-1404 - Lyublena Antova, Rhonda Baldwin, Derrick Bryant, Tuan Cao, Michael Duller, John Eshleman, Zhongxian Gu, Entong Shen, Mohamed A. Soliman, F. Michael Waas:
Datometry Hyper-Q: Bridging the Gap Between Real-Time and Historical Analytics. 1405-1416
Session 16 - Streaming 2: Sketches
- Anshumali Shrivastava, Arnd Christian König, Mikhail Bilenko:
Time Adaptive Sketches (Ada-Sketches) for Summarizing Data Streams. 1417-1432 - Di Chen, Qin Zhang:
Streaming Algorithms for Robust Distinct Elements. 1433-1447 - Pratanu Roy, Arijit Khan, Gustavo Alonso:
Augmented Sketch: Faster and More Accurate Stream Processing. 1449-1463 - Zhewei Wei, Xuancheng Liu, Feifei Li, Shuo Shang, Xiaoyong Du, Ji-Rong Wen:
Matrix Sketching Over Sliding Windows. 1465-1480 - Nan Tang, Qing Chen, Prasenjit Mitra:
Graph Stream Summarization: From Big Bang to Big Crunch. 1481-1496 - Nikos Giatrakos, Antonios Deligiannakis, Minos N. Garofalakis:
Scalable Approximate Query Tracking over Highly Distributed Data Streams. 1497-1512
Session 17 - Transaction Processing
- Amirhesam Shahvarani, Hans-Arno Jacobsen:
A Hybrid B+-tree as Solution for In-Memory Indexing on CPU-GPU Heterogeneous Computing Platforms. 1523-1538 - Kun Ren, Thaddeus Diamond, Daniel J. Abadi, Alexander Thomson:
Low-Overhead Asynchronous Checkpointing in Main-Memory Database Systems. 1539-1551 - Shan-Hung Wu, Tsai-Yu Feng, Meng-Kai Liao, Shao-Kan Pi, Yu-Shan Lin:
T-Part: Partitioning of Transactions for Forward-Pushing in Deterministic Database Systems. 1553-1565 - Huanchen Zhang, David G. Andersen, Andrew Pavlo, Michael Kaminsky, Lin Ma, Rui Shen:
Reducing the Storage Overhead of Main-Memory OLTP Databases with Hybrid Indexes. 1567-1581 - Kun Ren, Jose M. Faleiro, Daniel J. Abadi:
Design Principles for Scaling Multi-core OLTP Under High Contention. 1583-1598 - Dong Young Yoon, Ning Niu, Barzan Mozafari:
DBSherlock: A Performance Diagnostic Tool for Transactional Databases. 1599-1614
Session 18 - Transactions and Consistency
- Natacha Crooks, Youer Pu, Nancy Estrada, Trinabh Gupta, Lorenzo Alvisi, Allen Clement:
TARDiS: A Branch-and-Merge Approach To Weak Consistency. 1615-1628 - Xiangyao Yu, Andrew Pavlo, Daniel Sánchez, Srinivas Devadas:
TicToc: Time Traveling Optimistic Concurrency Control. 1629-1642 - Zhaoguo Wang, Shuai Mu, Yang Cui, Han Yi, Haibo Chen, Jinyang Li:
Scaling Multicore Databases via Constrained Parallel Execution. 1643-1658 - Qian Lin, Pengfei Chang, Gang Chen, Beng Chin Ooi, Kian-Lee Tan, Zhengkui Wang:
Towards a Non-2PC Transaction Management in Distributed Database Systems. 1659-1674 - Kangnyeon Kim, Tianzheng Wang, Ryan Johnson, Ippokratis Pandis:
ERMIA: Fast Memory-Optimized Database System for Heterogeneous Workloads. 1675-1687 - Yingjun Wu, Chee Yong Chan, Kian-Lee Tan:
Transaction Healing: Scaling Optimistic Concurrency Control on Multicores. 1689-1704
Session 19 - Query Optimization
- Mengmeng Liu, Zachary G. Ives, Boon Thau Loo:
Enabling Incremental Query Re-Optimization. 1705-1720 - Wentao Wu, Jeffrey F. Naughton, Harneet Singh:
Sampling-Based Query Re-Optimization. 1721-1736