


default search action
ACM SIGMOD Conference 2014: Snowbird, UT, USA
- Curtis E. Dyreson, Feifei Li, M. Tamer Özsu:
International Conference on Management of Data, SIGMOD 2014, Snowbird, UT, USA, June 22-27, 2014. ACM 2014, ISBN 978-1-4503-2376-5
Keynote 1
- Eric Sedlar:
How i learned to stop worrying and love compilers. 1-2
Research session 1: transaction processing
- Gene Pang, Tim Kraska, Michael J. Franklin, Alan D. Fekete:
PLANET: making progress with commit processing in unpredictable environments. 3-14 - Jose M. Faleiro, Alexander Thomson, Daniel J. Abadi
:
Lazy evaluation of transactions in database systems. 15-26 - Peter Bailis, Alan D. Fekete, Joseph M. Hellerstein, Ali Ghodsi, Ion Stoica:
Scalable atomic visibility with RAMP transactions. 27-38 - Khai Q. Tran, Jeffrey F. Naughton, Bruhathi Sundarmurthy, Dimitris Tsirogiannis:
JECB: a join-extension, code-based approach to OLTP data partitioning. 39-50
Research session 2: social networks 1
- Siyuan Liu, Shuhui Wang, Feida Zhu
, Jinbo Zhang, Ramayya Krishnan:
HYDRA: large-scale social identity linkage via heterogeneous behavior modeling. 51-62 - Kaiyu Feng
, Gao Cong
, Sourav S. Bhowmick
, Shuai Ma:
In search of influential event organizers in online social networks. 63-74 - Youze Tang, Xiaokui Xiao
, Yanchen Shi:
Influence maximization: near-optimal time complexity meets practical efficiency. 75-86 - Guoliang Li, Shuo Chen, Jianhua Feng, Kian-Lee Tan
, Wen-Syan Li:
Efficient location-aware influence maximization. 87-98
Research session 3: spatial data
- Jieming Shi
, Nikos Mamoulis, Dingming Wu, David W. Cheung:
Density-based place clustering in geo-social networks. 99-110 - Cheng Long
, Raymond Chi-Wing Wong
, Bin Zhang, Min Xie:
Hypersphere dominance: an optimal approach. 111-122 - Zitong Chen, Yubao Liu, Raymond Chi-Wing Wong
, Jiamin Xiong, Ganglin Mai, Cheng Long
:
Efficient algorithms for optimal location queries in road networks. 123-134 - Di Chen, Christian Konrad, Ke Yi, Wei Yu, Qin Zhang
:
Robust set reconciliation. 135-146
Industry session 1: real-time/complex data analytics
- Ankit Toshniwal, Siddarth Taneja, Amit Shukla, Karthikeyan Ramasamy, Jignesh M. Patel, Sanjeev Kulkarni, Jason Jackson, Krishna Gade, Maosong Fu, Jake Donham, Nikunj Bhagat, Sailesh Mittal, Dmitriy V. Ryaboy:
Storm@twitter. 147-156 - Fangjin Yang, Eric Tschetter, Xavier Léauté, Nelson Ray, Gian Merlino, Deep Ganguli:
Druid: a real-time analytical data store. 157-168 - Sheng Huang, Yaoliang Chen, Xiaoyan Chen, Kai Liu, Xiaomin Xu, Chen Wang, Kevin Brown, Inge Halilovic:
The next generation operational data historian for IoT based on informix. 169-176 - Rebecca Taft, Manasi Vartak, Nadathur Rajagopalan Satish, Narayanan Sundaram, Samuel Madden, Michael Stonebraker:
GenBase: a complex analytics genomics benchmark. 177-188
Tutorial 1
- Anastasia Ailamaki, Erietta Liarou, Pinar Tözün
, Danica Porobic
, Iraklis Psaroudakis:
How to stop under-utilization and love multicores. 189-192
Research session 4: streams and complex event processing
- Yasuko Matsubara, Yasushi Sakurai, Christos Faloutsos
:
AutoPlait: automatic mining of co-evolving time sequences. 193-204 - Yoshitaka Yamamoto, Koji Iwanuma, Shoshi Fukuda:
Resource-oriented approximation for frequent itemset mining from bursty data streams. 205-216 - Haopeng Zhang, Yanlei Diao, Neil Immerman:
On complexity and optimization of expensive queries in complex event processing. 217-228 - Yingmei Qi, Lei Cao
, Medhabi Ray, Elke A. Rundensteiner:
Complex event analytics: online aggregation of stream sequence patterns. 229-240
Research session 5: data analytics
- Arijit Khan
, Pouya Yanki, Bojana Dimcheva, Donald Kossmann:
Towards indexing functions: answering scalar product queries. 241-252 - Milos Nikolic, Mohammed Elseidy, Christoph Koch:
LINVIEW: incremental view maintenance for complex analytical queries. 253-264 - Ce Zhang, Arun Kumar, Christopher Ré:
Materialization optimizations for feature selection workloads. 265-276 - Kai Zeng
, Shi Gao, Barzan Mozafari, Carlo Zaniolo:
The analytical bootstrap: a new method for fast error estimation in approximate query processing. 277-288
Research session 6: graph and RDF data processing
- Sairam Gurajada, Stephan Seufert, Iris Miliaraki, Martin Theobald
:
TriAD: a distributed shared-nothing RDF engine based on asynchronous message passing. 289-300 - Wenfei Fan
, Xin Wang, Yinghui Wu:
Querying big graphs within bounded resources. 301-312 - Lei Zou, Ruizhe Huang, Haixun Wang, Jeffrey Xu Yu, Wenqiang He, Dongyan Zhao:
Natural language question answering over RDF: a graph data driven approach. 313-324 - Mitsuru Kusumoto, Takanori Maehara, Ken-ichi Kawarabayashi:
Scalable similarity search for SimRank. 325-336
Industry session 2: query optimization
- Mohamed A. Soliman, Lyublena Antova, Venkatesh Raghavan, Amr El-Helw, Zhongxian Gu, Entong Shen
, George C. Caragea, Carlos Garcia-Alvarado, Foyzur Rahman, Michalis Petropoulos, Florian Waas, Sivaramakrishnan Narayanan, Konstantinos Krikellas, Rhonda Baldwin:
Orca: a modular query optimizer architecture for big data. 337-348 - Pedram Ghodsnia, Ivan T. Bowman, Anisoara Nica:
Parallel I/O aware query optimization. 349-360 - Guido Moerkotte, David DeHaan, Norman May, Anisoara Nica, Alexander Böhm:
Exploiting ordered dictionaries to efficiently construct histograms with q-error guarantees in SAP HANA. 361-372 - Lyublena Antova, Amr El-Helw, Mohamed A. Soliman, Zhongxian Gu, Michalis Petropoulos, Florian Waas:
Optimizing queries over partitioned tables in MPP systems. 373-384
Research session 7: multidimensional data
- Spyros Blanas, Kesheng Wu
, Surendra Byna
, Bin Dong, Arie Shoshani:
Parallel data analysis directly on scientific file formats. 385-396 - Tilmann Zäschke, Christoph Zimmerli, Moira C. Norrie:
The PH-tree: a space-efficient storage structure and multi-dimensional index. 397-408 - Jennie Duggan
, Michael Stonebraker:
Incremental elasticity for array databases. 409-420 - Jie Xu, Dmitri V. Kalashnikov, Sharad Mehrotra:
Efficient summarization framework for multi-attribute uncertain data. 421-432
Research session 8: data cleaning
- Ravali Pochampally, Anish Das Sarma, Xin Luna Dong, Alexandra Meliou
, Divesh Srivastava:
Fusing data with correlations. 433-444 - Anup Chalamalla, Ihab F. Ilyas, Mourad Ouzzani, Paolo Papotti
:
Descriptive and prescriptive data cleaning. 445-456 - Jiannan Wang, Nan Tang:
Towards dependable data repairing with fixing rules. 457-468 - Jiannan Wang, Sanjay Krishnan, Michael J. Franklin, Ken Goldberg
, Tim Kraska, Tova Milo:
A sample-and-clean framework for fast and accurate query processing on dirty data. 469-480
Research session 9: data exploration
- Sameer Agarwal, Henry Milner, Ariel Kleiner, Ameet Talwalkar, Michael I. Jordan
, Samuel Madden, Barzan Mozafari, Ion Stoica:
Knowing when you're wrong: building fast and reliable approximate query processing systems. 481-492 - Yanyan Shen, Kaushik Chakrabarti, Surajit Chaudhuri, Bolin Ding, Lev Novik:
Discovering queries based on example tuples. 493-504 - Alexander Kalinin, Ugur Çetintemel, Stanley B. Zdonik:
Interactive data exploration using semantic windows. 505-516 - Kyriaki Dimitriadou, Olga Papaemmanouil
, Yanlei Diao:
Explore-by-example: an automatic query steering framework for interactive data exploration. 517-528
Industry session 3: storage management
- Woon-Hak Kang, Sang-Won Lee, Bongki Moon, Yang-Suk Kee, Moonwook Oh:
Durable write cache in flash memory SSD for relational and NoSQL databases. 529-540 - Aakash Goel, Bhuwan Chopra, Ciprian Gerea, Dhruv Mátáni
, Josh Metzler, Fahim Ul Haq, Janet L. Wiener:
Fast database restarts at facebook. 541-549 - Khaled Elmeleegy, Christopher Olston, Benjamin C. Reed
:
SpongeFiles: mitigating data skew in mapreduce using distributed memory. 551-562 - Richard Michael Grantham Wesley, Pawel Terlecki:
Leveraging compression in the tableau data engine. 563-573
Keynote 2
- Maurice Herlihy:
Fun with hardware transactional memory. 575
Research session 10: crowdsourcing
- Hyunjung Park, Jennifer Widom:
CrowdFill: collecting structured data from the crowd. 577-588 - Yael Amsterdamer
, Susan B. Davidson, Tova Milo, Slava Novgorodov, Amit Somech
:
OASSIS: query driven crowd mining. 589-600 - Chaitanya Gokhale, Sanjib Das, AnHai Doan, Jeffrey F. Naughton, Narasimhan Rampalli, Jude W. Shavlik, Xiaojin Zhu:
Corleone: hands-off crowdsourcing for entity matching. 601-612
Research session 11: parallel graph processing
- Yingxia Shao, Lei Chen
, Bin Cui
:
Efficient cohesive subgraphs detection in parallel. 613-624 - Yingxia Shao, Bin Cui
, Lei Chen
, Lin Ma, Junjie Yao, Ning Xu:
Parallel subgraph listing in a large-scale graph. 625-636 - Jinha Kim, Wook-Shin Han, Sangyeon Lee, Kyungyeol Park, Hwanjo Yu:
OPT: a new framework for overlapped and parallel triangulation in large-scale graphs. 637-648
Research session 12: potpouri
- Yang Chen, Daisy Zhe Wang:
Knowledge expansion over probabilistic knowledge bases. 649-660 - Dongqing Xiao, Mohamed Y. Eltabakh:
InsightNotes: summary-based annotation management in relational databases. 661-672 - Dong Deng, Guoliang Li, Jianhua Feng:
A pivotal prefix based filtering algorithm for string similarity search. 673-684
Demo A
- Astrid Rheinländer, Martin Beckmann, Anja Kunkel, Arvid Heise, Thomas Stoltmann, Ulf Leser:
Versatile optimization of UDF-heavy data flows with sofa. 685-688 - Tim Kiefer, Thomas Kissinger, Benjamin Schlegel, Dirk Habich, Daniel Molka, Wolfgang Lehner:
ERIS live: a NUMA-aware in-memory storage engine for tera-scale multiprocessor systems. 689-692 - Tomas Karnagel, Matthias Hille, Mario Ludwig, Dirk Habich, Wolfgang Lehner, Max Heimel, Volker Markl:
Demonstrating efficient query processing in heterogeneous environments. 693-696 - Tobias Mühlbauer, Wolf Rödiger, Robert Seilbeck, Angelika Reiser, Alfons Kemper, Thomas Neumann
:
One DBMS for all: the brawny few and the wimpy crowd. 697-700 - Alkis Simitsis, Kevin Wilkinson, Jason Blais, Joe Walsh:
VQA: vertica query analyzer. 701-704 - Fei Chen, Tere Gonzalez, Jun Li, Manish Marwah, Jim Pruyne, Krishnamurthy Viswanathan, Mijung Kim:
Palette: enabling scalable analytics for big-memory, multicore machines. 705-708 - Fei Li, H. V. Jagadish:
NaLIR: an interactive natural language interface for querying relational databases. 709-712 - Petar Jovanovic
, Alkis Simitsis, Kevin Wilkinson:
BabbleFlow: a translator for analytic data flow programs. 713-716 - Justin J. Levandoski, David B. Lomet, Sudipta Sengupta, Adrian Birka, Cristian Diaconu:
Indexing on modern hardware: hekaton and beyond. 717-720 - Chen Jason Zhang
, Ziyuan Zhao, Lei Chen
, H. V. Jagadish, Caleb Chen Cao:
CrowdMatcher: crowd-assisted schema matching. 721-724
Tutorial 2
- Zoi Kaoudi
, Ioana Manolescu
:
Cloud-based RDF data management. 725-729
Research session 13: data management over modern hardware
- Badrish Chandramouli, Jonathan Goldstein:
Patience is a virtue: revisiting merge and sort on modern processors. 731-742 - Viktor Leis, Peter A. Boncz, Alfons Kemper, Thomas Neumann
:
Morsel-driven parallelism: a NUMA-aware query evaluation framework for the many-core age. 743-754 - Orestis Polychroniou, Kenneth A. Ross:
A comprehensive study of main-memory partitioning and its application to large-scale comparison- and radix-sort. 755-766 - Oliver Arnold, Sebastian Haas, Gerhard P. Fettweis, Benjamin Schlegel, Thomas Kissinger, Wolfgang Lehner:
An application-specific instruction set for accelerating set-oriented database primitives. 767-778
Research session 14: non-traditional data
- Arash Termehchy, Ali Vakilian
, Yodsawalai Chodpathumwan, Marianne Winslett:
Which concepts are worth extracting? 779-790 - Curtis E. Dyreson
, Sourav S. Bhowmick
, Ryan Grapp:
Querying virtual hierarchies using virtual prefix-based numbers. 791-802 - Sumit Gulwani, Mark Marron:
NLyze: interactive programming by natural language for spreadsheet data analysis and manipulation. 803-814 - Daniel Tahara, Thaddeus Diamond, Daniel J. Abadi
:
Sinew: a SQL system for multi-structured data. 815-826
Research session 15: mapreduce processing
- Lu Qin
, Jeffrey Xu Yu, Lijun Chang, Hong Cheng, Chengqi Zhang
, Xuemin Lin
:
Scalable big graph processing in MapReduce. 827-838 - Alper Okcan, Mirek Riedewald:
Anti-combining for MapReduce. 839-850 - Jeff LeFevre
, Jagan Sankaranarayanan, Hakan Hacigümüs, Jun'ichi Tatemura, Neoklis Polyzotis, Michael J. Carey:
Opportunistic physical design for big data analytics. 851-862 - Roy Levin, Yaron Kanza:
Stratified-sampling over social networks using mapreduce. 863-874
Demo B
- Daniel Halperin
, Victor Teixeira de Almeida, Lee Lee Choo, Shumo Chu, Paraschos Koutris, Dominik Moritz, Jennifer Ortiz, Vaspol Ruamviboonsuk, Jingjing Wang, Andrew Whitaker, Shengliang Xu, Magdalena Balazinska, Bill Howe
, Dan Suciu
:
Demonstration of the Myria big data management service. 881-884 - Aditya G. Parameswaran
, Ming Han Teh, Hector Garcia-Molina, Jennifer Widom:
DataSift: a crowd-powered search toolkit. 885-888 - Iraklis Psaroudakis, Manos Athanassoulis
, Matthaios Olma, Anastasia Ailamaki:
Reactive and proactive sharing across concurrent analytical queries. 889-892 - Shengqi Yang, Yanan Xie, Yinghui Wu, Tianyi Wu, Huan Sun, Jian Wu, Xifeng Yan:
SLQ: a user-friendly graph querying system. 893-896 - Louai Alarabi
, Ahmed Eldawy
, Rami Alghamdi, Mohamed F. Mokbel:
TAREEG: a MapReduce-based web service for extracting spatial data from OpenStreetMap. 897-900 - Davide Mottin
, Matteo Lissandrini
, Yannis Velegrakis
, Themis Palpanas:
Searching with XQ: the exemplar query search engine. 901-904 - Mehdi Kargar, Aijun An, Nick Cercone, Parke Godfrey, Jaroslaw Szlichta, Xiaohui Yu
:
MeanKS: meaningful keyword search in relational databases with complex schema. 905-908 - Nikolaos Papailiou, Dimitrios Tsoumakos, Ioannis Konstantinou
, Panagiotis Karras, Nectarios Koziris:
H2RDF+: an efficient data management system for big RDF graphs. 909-912 - Carsten Binnig
, Abdallah Salama, Erfan Zamanian:
DoomDB: kill the query. 913-916
Panel
- Bill Howe
, Michael J. Franklin, Juliana Freire
, James Frew, Tim Kraska, Raghu Ramakrishnan:
Should we all be teaching "intro to data science" instead of "intro to databases"? 917-918
Research session 16: distributed and parallel data management
- Theodoros Rekatsinas
, Xin Luna Dong, Divesh Srivastava:
Characterizing and selecting fresh data sources. 919-930 - Alvin Cheung
, Samuel Madden, Armando Solar-Lezama
:
Sloth: being lazy is a virtue (when issuing database queries). 931-942 - Konstantinos Karanasos, Andrey Balmin
, Marcel Kutsch, Fatma Ozcan
, Vuk Ercegovac, Chunyang Xia, Jesse Jackson:
Dynamically optimizing queries over large scale data platforms. 943-954 - PengCheng Xiong
, Hakan Hacigümüs, Jeffrey F. Naughton:
A software-defined networking based approach for performance management of analytical queries on distributed data stores. 955-966
Research session 17: graph analytics
- Panos Parchas, Francesco Gullo
, Dimitris Papadias, Francesco Bonchi:
The pursuit of a good possible world: extracting representative instances of uncertain graphs. 967-978 - Nadathur Satish, Narayanan Sundaram, Md. Mostofa Ali Patwary, Jiwon Seo, Jongsoo Park, Muhammad Amber Hassaan, Shubho Sengupta, Zhaoming Yin, Pradeep Dubey:
Navigating the maze of graph analytics frameworks using massive graph datasets. 979-990 - Wanyun Cui, Yanghua Xiao, Haixun Wang, Wei Wang:
Local search of communities in large graphs. 991-1002 - Akhil Arora, Mayank Sachan, Arnab Bhattacharya:
Mining statistically significant connected subgraphs in vertex labeled graphs. 1003-1014
Research session 18: query processing and optimization 1
- Ioana Ileana, Bogdan Cautis, Alin Deutsch, Yannis Katsis:
Complete yet practical search for minimal query reformulations under constraints. 1015-1026 - James Cheney
, Sam Lindley
, Philip Wadler:
Query shredding: efficient relational evaluation of queries over nested multisets. 1027-1038 - Anshuman Dutt, Jayant R. Haritsa:
Plan bouquets: query processing without selectivity estimation. 1039-1050 - Fei Li, Tianyin Pan, H. V. Jagadish:
Schema-free SQL. 1051-1062
Demo C
- You Wu, Brett Walenz, Peggy Li, Andrew Shim, Emre Sonmez, Pankaj K. Agarwal, Chengkai Li, Jun Yang, Cong Yu:
iCheck: computationally combating "lies, d-ned lies, and statistics". 1063-1066 - Kai Zeng
, Shi Gao, Jiaqi Gu, Barzan Mozafari, Carlo Zaniolo:
ABS: a system for scalable approximate queries with accuracy guarantees. 1067-1070 - Ahmed K. Elmagarmid, Ihab F. Ilyas, Mourad Ouzzani, Jorge-Arnulfo Quiané-Ruiz
, Nan Tang, Si Yin:
NADEEF/ER: generic and interactive entity resolution. 1071-1074 - Alex Cheng, Mary Malit, Chuanxi Zhang, Nick Koudas:
SerpentTI: flexible analytics of users, boards and domains for pinterest. 1075-1078 - Esther Galbrun, Pauli Miettinen
:
Interactive redescription mining. 1079-1082 - Carlos Garcia-Alvarado, Carlos Ordonez:
ONTOCUBO: cube-based ontology construction and exploration. 1083-1086 - Tobias Emrich, Maximilian Franzke, Hans-Peter Kriegel, Johannes Niedermayer, Matthias Renz, Andreas Züfle:
An extendable framework for managing uncertain spatio-temporal data. 1087-1090 - Fangbo Tao, George Brova, Jiawei Han, Heng Ji, Chi Wang, Brandon Norick, Ahmed El-Kishky, Jialu Liu, Xiang Ren, Yizhou Sun:
NewsNetExplorer: automatic construction and exploration of news information networks. 1091-1094 - Davide Mottin
, Alice Marascu, Senjuti Basu Roy, Gautam Das
, Themis Palpanas, Yannis Velegrakis
:
IQR: an interactive query relaxation system for the empty-answer problem. 1095-1098 - Shiming Zhang, Yin Yang
, Wei Fan, Liang Lan
, Mingxuan Yuan:
OceanRT: real-time analytics over large temporal data. 1099-1102