![](https://dblp.uni-trier.de/img/logo.ua.320x120.png)
![](https://dblp.uni-trier.de/img/dropdown.dark.16x16.png)
![](https://dblp.uni-trier.de/img/peace.dark.16x16.png)
Остановите войну!
for scientists:
![search dblp search dblp](https://dblp.uni-trier.de/img/search.dark.16x16.png)
![search dblp](https://dblp.uni-trier.de/img/search.dark.16x16.png)
default search action
33rd ICDE 2017: San Diego, CA, USA
- 33rd IEEE International Conference on Data Engineering, ICDE 2017, San Diego, CA, USA, April 19-22, 2017. IEEE Computer Society 2017, ISBN 978-1-5090-6543-1
Keynotes
- Volker Markl:
Mosaics: Stratosphere, Flink and Beyond. 3 - Laura M. Haas:
Leveraging Data and People to Accelerate Data Science. 4
TKDE Posters
- Wentao Wu, Hongsong Li, Haixun Wang, Kenny Q. Zhu:
Semantic Bootstrapping: A Theoretical Perspective. 7-8 - Gang Hu, Jie Shao, Fenglin Liu, Yuan Wang, Heng Tao Shen:
IF-Matching: Towards Accurate Map-Matching with Information Fusion. 9-10 - Qi Zhang, Yang Wang, Jin Qian, Binbin Deng, Xuanjing Huang:
A Mixed Generative-Discriminative Based Hashing Method. 11-12 - Yung-Chun Chang, Chien Chin Chen
, Wen-Lian Hsu:
SPIRIT: A Tree Kernel-Based Method for Topic Person Interaction Detection (Extended Abstract). 13-14 - Ying Zhang, Yu-Ling Hsueh
, Wang-Chien Lee, Yi-Hao Jhang:
Efficient Cache-Supported Path Planning on Roads (Extended Abstract). 15-16 - Jianxin Li
, Chengfei Liu
, Jeffrey Xu Yu, Yi Chen, Timos Sellis
, J. Shane Culpepper
:
Personalized Influential Topic Search via Social Network Summarization. 17-18 - Jun Chen, Chaokun Wang, Jianmin Wang
, Philip S. Yu:
Recommendation for Repeat Consumption from User Implicit Feedback. 19-20 - Meng Wang
, Hui Li, Jiangtao Cui, Ke Deng, Sourav S. Bhowmick
, Zhenhua Dong:
PINOCCHIO: Probabilistic Influence-Based Location Selection over Moving Objects. 21-22 - Zeyuan Shang, Yaxiao Liu, Guoliang Li, Jianhua Feng:
K-Join: Knowledge-Aware Similarity Join. 23-24 - Feng Tian, Tian Lan, Qinghua Zheng, Kuo-Ming Chao
, Nick Godwin, Nazaraf Shah
, Fan Zhang:
Mining Suspicious Tax Evasion Groups in Big Data. 25-26 - Long Guo, Dongxiang Zhang, Gao Cong
, Wei Wu, Kian-Lee Tan
:
Influence Maximization in Trajectory Databases. 27-28 - Chenyun Yu, Sarana Nutanong, Hangyu Li
, Cong Wang
, Xingliang Yuan
:
A Generic Method for Accelerating LSH-Based Similarity Join Processing (Extended Abstract). 29-30 - Yu Gu, Guanli Liu
, Jianzhong Qi
, Hongfei Xu, Ge Yu, Rui Zhang:
The Moving K Diversified Nearest Neighbor Query. 31-32 - Binbin Gu
, Zhixu Li, Xiangliang Zhang
, An Liu, Guanfeng Liu
, Kai Zheng, Lei Zhao, Xiaofang Zhou
:
The Interaction Between Schema Matching and Record Matching in Data Integration (Extended Abstract). 33-34 - Dixin Luo, Hongteng Xu, Yi Zhen, Bistra Dilkina
, Hongyuan Zha, Xiaokang Yang, Wenjun Zhang:
Learning Mixtures of Markov Chains from Aggregate Data with Structural Constraints (Extended Abstract). 35-36 - Hongteng Xu, Weichang Wu, Shamim Nemati, Hongyuan Zha:
Patient Flow Prediction via Discriminative Learning of Mutually-Correcting Processes (Extended Abstract). 37-38 - Guoliang Li, Jiannan Wang, Yudian Zheng, Michael J. Franklin:
Crowdsourced Data Management: A Survey. 39-40 - Cheng Chen, Lan Zheng, Venkatesh Srinivasan, Alex Thomo
, Kui Wu, Anthony Sukow:
Conflict-Aware Weighted Bipartite b-Matching and Its Application to E-Commerce. 41-42 - Xinpeng Zhang, Yasuhito Asano, Masatoshi Yoshikawa:
Mutually Beneficial Confluent Routing. 43-44 - Bo Tang, Man Lung Yiu
, Kien A. Hua:
Exploit Every Bit: Effective Caching for High-Dimensional Nearest Neighbor Search. 45-46 - Sreevani, C. A. Murthy:
Bridging Feature Selection and Extraction: Compound Feature Generation (Extended Abstract). 47-48 - Shuang Hao, Nan Tang, Guoliang Li, Jian He, Na Ta, Jianhua Feng:
A Novel Cost-Based Model for Data Repairing. 49-50 - Mahsa Salehi
, Christopher Leckie
, James C. Bezdek, Tharshan Vaithianathan, Xuyun Zhang
:
Fast Memory Efficient Local Outlier Detection in Data Streams (Extended Abstract). 51-52 - Yunjun Gao, Qing Liu, Gang Chen, Linlin Zhou, Baihua Zheng
:
Finding Causality and Responsibility for Probabilistic Reverse Skyline Query Non-Answers. 53-54 - Xiaoyang Wang
, Ying Zhang
, Wenjie Zhang
, Xuemin Lin
, Chen Chen
:
Bring Order into the Samples: A Novel Scalable Method for Influence Maximization (Extended Abstract). 55-56 - Linhong Zhu, Dong Guo, Junming Yin, Greg Ver Steeg, Aram Galstyan:
Scalable Temporal Latent Space Inference for Link Prediction in Dynamic Social Networks (Extended Abstract). 57-58 - Shuo Shang, Lisi Chen, Zhewei Wei, Christian S. Jensen
, Ji-Rong Wen, Panos Kalnis
:
Collective Travel Planning in Spatial Networks. 59-60 - Shuai Ma, Kaiyu Feng
, Jianxin Li, Haixun Wang, Gao Cong
, Jinpeng Huai:
Proxies for Shortest Path and Distance Queries. 61-62 - Guillaume Bagan, Angela Bonifati
, Radu Ciucanu, George H. L. Fletcher, Aurélien Lemay, Nicky Advokaat:
gMark: Schema-Driven Generation of Graphs and Queries. 63-64 - Yoones A. Sekhavat
, Jeffrey Parsons
:
SEDEX: Scalable Entity Preserving Data Exchange. 65-66 - Yanfeng Zhang, Shimin Chen, Ge Yu:
Efficient Distributed Density Peaks for Clustering Large Data Sets in MapReduce. 67-68 - Junbeom Hur, Dongyoung Koo, Young-joo Shin
, Kyungtae Kang:
Secure Data Deduplication with Dynamic Ownership Management in Cloud Storage. 69-70 - Libin Zheng, Lei Chen:
Maximizing Acceptance in Rejection-aware Spatial Crowdsourcing. 71-72
ICDE Short Paper Posters
- Hongzhi Yin
, Liang Chen, Weiqing Wang
, Xingzhong Du, Nguyen Quoc Viet Hung
, Xiaofang Zhou
:
Mobi-SAGE: A Sparse Additive Generative Model for Mobile App Recommendation. 75-78 - Zhao Kang, Chong Peng, Qiang Cheng:
Clustering with Adaptive Manifold Structure Learning. 79-82 - Xiang Ao, Ping Luo
, Jin Wang, Fuzhen Zhuang, Qing He:
Mining Precise-Positioning Episode Rules from Event Sequences. 83-86 - Shubhadip Mitra, Priya Saraf, Richa Sharma, Arnab Bhattacharya, Sayan Ranu, Harsh Bhandari:
NetClus: A Scalable Framework for Locating Top-K Sites for Placement of Trajectory-Aware Services. 87-90 - Gang Hu, Jie Shao, Dongxiang Zhang, Yang Yang, Heng Tao Shen:
Preserving-Ignoring Transformation Based Index for Approximate k Nearest Neighbor Search. 91-94 - Lijun Chang, Chen Zhang, Xuemin Lin
, Lu Qin
:
Scalable Top-K Structural Diversity Search. 95-98 - Anuradha Awasthi, Arnab Bhattacharya, Sanchit Gupta, Ujjwal Kumar Singh:
K-Dominant Skyline Join Queries: Extending the Join Paradigm to K-Dominant Skylines. 99-102 - Tong Yang, Lingtong Liu, Yibo Yan, Muhammad Shahzad, Yulong Shen, Xiaoming Li, Bin Cui
, Gaogang Xie:
SF-sketch: A Fast, Accurate, and Memory Efficient Data Structure to Store Frequencies of Data Items. 103-106 - Lei Chen, Yafei Li, Jianliang Xu
, Christian S. Jensen
:
Direction-Aware Why-Not Spatial Keyword Top-k Queries. 107-110 - Yilin Shen, Yanping Chen, Eamonn J. Keogh, Hongxia Jin:
Searching Time Series with Invariance to Large Amounts of Uniform Scaling. 111-114 - Shengli Sun, Yimo Wang, Weilong Liao, Wei Wang:
Mining Maximal Cliques on Dynamic Graphs Efficiently by Local Strategies. 115-118 - Zhongle Xie, Qingchao Cai, H. V. Jagadish, Beng Chin Ooi, Weng-Fai Wong
:
Parallelizing Skip Lists for In-Memory Multi-Core Database Systems. 119-122 - Long Guo, Dongxiang Zhang, Huayu Wu, Bin Cui
, Kian-Lee Tan
:
From Raw Footprints to Personal Interests: Bridging the Semantic Gap via Trip Intention Aggregation. 123-126 - Yunfan Chen, Lei Chen, Chen Jason Zhang:
CrowdFusion: A Crowdsourced Approach on Data Fusion Refinement. 127-130 - David B. Blumenthal
, Johann Gamper
:
Correcting and Speeding-Up Bounds for Non-Uniform Graph Edit Distance. 131-134 - Dimitrios Karapiperis
, Aris Gkoulalas-Divanis, Vassilios S. Verykios
:
Distance-Aware Encoding of Numerical Values for Privacy-Preserving Record Linkage. 135-138 - Ibrahim Abdelaziz
, Essam Mansour, Mourad Ouzzani, Ashraf Aboulnaga
, Panos Kalnis
:
Query Optimizations over Decentralized RDF Graphs. 139-142 - Klaus Broelemann, Thomas Gottron, Gjergji Kasneci:
LTD-RBM: Robust and Fast Latent Truth Discovery Using Restricted Boltzmann Machines. 143-146 - Young-Seok Kim, Taewoo Kim, Michael J. Carey, Chen Li:
A Comparative Study of Log-Structured Merge-Tree-Based Spatial Indexes for Big Data. 147-150 - Angen Zheng, Alexandros Labrinidis, Christos Faloutsos:
Skew-Resistant Graph Partitioning. 151-154 - Ahsanul Haque, Swarup Chandra, Latifur Khan
, Kevin W. Hamlen, Charu C. Aggarwal:
Efficient Multistream Classification Using Direct Density Ratio Estimation. 155-158 - Victor Amelkin, Petko Bogdanov, Ambuj K. Singh:
A Distance Measure for the Analysis of Polar Opinion Dynamics in Social Networks. 159-162 - Jingchao Ni, Hongliang Fei, Wei Fan, Xiang Zhang:
Cross-Network Clustering and Cluster Ranking for Medical Diagnosis. 163-166 - Qiang Huang
, Jianlin Feng, Qiong Fang:
Reverse Query-Aware Locality-Sensitive Hashing for High-Dimensional Furthest Neighbor Search. 167-170 - Nhat X. T. Le, Vagelis Hristidis
, Neal E. Young
:
Ontology- and Sentiment-Aware Review Summarization. 171-174 - Fengchao Peng, Qiong Luo
, Lionel M. Ni:
ACTS: An Active Learning Method for Time Series Classification. 175-178 - Ran Yu
, Ujwal Gadiraju, Besnik Fetahu, Stefan Dietze:
FuseM: Query-Centric Data Fusion on Structured Web Markup. 179-182 - Zhaonian Zou, Faming Li, Jianzhong Li, Yingshu Li:
Scalable Processing of Massive Uncertain Graph Data: A Simultaneous Processing Approach. 183-186 - Charu C. Aggarwal, Yao Li, Philip S. Yu, Yuchen Zhao:
On Edge Classification in Networks with Structure and Content. 187-190 - Konstantinos Lolos, Ioannis Konstantinou, Verena Kantere, Nectarios Koziris:
Adaptive State Space Partitioning of Markov Decision Processes for Elastic Resource Management. 191-194 - Neha Sengupta, Amitabha Bagchi, Srikanta Bedathur, Maya Ramanath:
Sampling and Reconstruction Using Bloom Filters. 195-198 - Chunyao Song, Xuanming Liu, Tingjian Ge:
Top-k Frequent Items and Item Frequency Tracking over Sliding Windows of Any Sizes. 199-202 - Linfei Zhou, Claudia Plant
, Christian Böhm:
Joint Gaussian Based Measures for Multiple-Instance Learning. 203-206 - Mohamed Sarwat, Yuhan Sun:
Answering Location-Aware Graph Reachability Queries on GeoSocial Data. 207-210 - Moloud Shahbazi, Matthew T. Wiley, Vagelis Hristidis
:
IRanker: Query-Specific Ranking of Reviewed Items. 211-214 - Yang Zhang, Yusu Wang, Srinivasan Parthasarathy
:
Analyzing and Visualizing Scalar Fields on Graphs. 215-218 - Jiawei Zhang, Philip S. Yu, Yuanhua Lv:
Enterprise Community Detection. 219-222 - Chaoyue Niu, Zhenzhe Zheng, Fan Wu, Xiaofeng Gao, Guihai Chen
:
Trading Data in Good Faith: Integrating Truthfulness and Privacy Preservation in Data Markets. 223-226 - Kasper Grud Skat Madsen
, Yongluan Zhou
, Jianneng Cao:
Integrative Dynamic Reconfiguration in a Parallel Stream Processing Engine. 227-230 - Xinyu Lei, Alex X. Liu, Rui Li:
Secure KNN Queries over Encrypted Data: Dimensionality Is Not Always a Curse. 231-234
Industry Posters
- Anjan Kumar Amirishetty, Yunrui Li, Tolga Yurek, Mahesh Girkar, Wilson Chan, Graham Ivey, Vsevolod Panteleenko, Ken Wong:
Improving Predictable Shared-Disk Clusters Performance for Database Clouds. 237-242 - Dong Wang, Wei Cao, Jian Li, Jieping Ye:
DeepSD: Supply-Demand Prediction for Online Car-Hailing Services Using Deep Neural Networks. 243-254 - Sung Jin Kim, Mohammed Al-Kateb, Paul Sinclair, Alain Crolotte, Chengyang Zhang, Linda Rose:
Dynamic Statistics Collection in the Teradata Unified Data Architecture. 255-258 - Joan Serrà, Ilias Leontiadis, Alexandros Karatzoglou, Konstantina Papagiannaki:
Hot or Not? Forecasting Cellular Network Hot Spots Using Sector Performance Indicators. 259-270 - Lijun Tang, Eric Yi Liu:
Joint User-Entity Representation Learning for Event Recommendation in Social Network. 271-280 - Jie Jiang, Jiawei Jiang, Bin Cui
, Ce Zhang:
TencentBoost: A Gradient Boosting Tree System with Parameter Server. 281-284 - Yongseok Son, Jaeyoon Choi, Jekyeom Jeon, Cheolgi Min, Sunggon Kim
, Heon Young Yeom, Hyuck Han:
SSD-Assisted Backup and Recovery for Database Systems. 285-296 - Ilias Leontiadis, Joan Serrà, Alessandro Finamore, Giorgos Dimopoulos
, Konstantina Papagiannaki:
The Good, the Bad, and the KPIs: How to Combine Performance Metrics to Better Capture Underperforming Sectors in Mobile Networks. 297-308 - Yosef Moatti, Eran Rom, Raúl Gracia Tinedo, Dalit Naor, Doron Chen, Josep Sampé, Marc Sánchez Artigas, Pedro García López, Filip Gluszak, Eric Deschdt, Francesco Pace, Daniele Venzano, Pietro Michiardi:
Too Big to Eat: Boosting Analytics Data Ingestion from Object Stores with Scoop. 309-320
Research Track
Session: Graphs
- Xiongcai Luo, Jun Gao, Chang Zhou, Jeffrey Xu Yu:
UniWalk: Unidirectional Random Walk Based Scalable SimRank Computation over Large Graph. 325-336 - Yikai Zhang, Jeffrey Xu Yu, Ying Zhang
, Lu Qin
:
A Fast Order-Based Approach for Core Maintenance. 337-348 - Son T. Mai, Martin Storgaard Dieu, Ira Assent
, Jon Jacobsen, Jesper Kristensen, Mathias Skovgaard Birk:
Scalable and Interactive Graph Clustering Algorithm on Multicore CPUs. 349-360 - Shuai Ma, Renjun Hu, Luoshu Wang, Xuelian Lin, Jinpeng Huai:
Fast Computation of Dense Temporal Subgraphs. 361-372
Session: Keyword Search, Text and Strings
- Xike Xie, Xin Lin, Jianliang Xu
, Christian S. Jensen
:
Reverse Keyword-Based Location Search. 375-386 - Jingwen Zhao, Yunjun Gao, Gang Chen, Christian S. Jensen
, Rui Chen, Deng Cai:
Reverse Top-k Geo-Social Keyword Queries in Road Networks. 387-398 - Yangjun Chen, Yujia Wu:
BWT Arrays and Mismatching Trees: A New Way for String Matching with k Mismatches. 399-410 - Justin Wood, Patrick Tan, Wei Wang
, Corey W. Arnold:
Source-LDA: Enhancing Probabilistic Topic Models Using Prior Knowledge Sources. 411-422
Session: Data Mining
- Michele Coscia, Frank M. H. Neffke
:
Network Backboning with Noisy Data. 425-436 - Guoyao Feng, Lukasz Golab, Divesh Srivastava:
Scalable Informative Rule Mining. 437-448 - Yu Zhang, Kanat Tangwongsan, Srikanta Tirthapura:
Streaming k-Means Clustering with Fast Queries. 449-460 - Md Farhadur Rahman, Weimo Liu, Saad Bin Suhaim, Saravanan Thirumuruganathan, Nan Zhang, Gautam Das
:
Density Based Clustering over Location Based Services. 461-469
Session: Query Optimization and Provenance
- Xing Niu, Raghav Kapoor, Boris Glavic
, Dieter Gawlick, Zhen Hua Liu, Venkatesh Radhakrishnan:
Provenance-Aware Query Optimization. 473-484 - Seokki Lee
, Sven Köhler, Bertram Ludäscher, Boris Glavic
:
A SQL-Middleware Unifying Why and Why-Not Provenance for First-Order Queries. 485-496 - Marios Meimaris, George Papastefanatos
, Nikos Mamoulis, Ioannis Anagnostopoulos:
Extended Characteristic Sets: Graph Indexing for SPARQL Query Optimization. 497-508 - Jianye Yang, Wenjie Zhang
, Shiyu Yang, Ying Zhang
, Xuemin Lin
:
TT-Join: Efficient Set Containment Join. 509-520
Session: Systems for New Analytics
- Shangyu Luo, Zekai J. Gao, Michael N. Gubanov, Luis Leopoldo Perez, Christopher M. Jermaine:
Scalable Linear Algebra on a Relational Database System. 523-534 - Evan R. Sparks, Shivaram Venkataraman, Tomer Kaftan, Michael J. Franklin, Benjamin Recht:
KeystoneML: Optimizing Pipelines for Large-Scale Advanced Analytics. 535-546 - Buwen Wu, Yongluan Zhou
, Hai Jin, Amol Deshpande:
Parallel SPARQL Query Optimization. 547-558 - Christos Anagnostopoulos
, Peter Triantafillou:
Efficient Scalable Accurate Regression Queries in In-DBMS Analytics. 559-570 - Hui Miao, Ang Li, Larry S. Davis, Amol Deshpande:
Towards Unified Data and Lifecycle Management for Deep Learning. 571-582
Session: Top-K, KNN and Skyline Querying
- Farhana Murtaza Choudhury
, Zhifeng Bao
, J. Shane Culpepper
, Timos Sellis
:
Monitoring the Top-m Rank Aggregation of Spatial Objects in Streaming Queries. 585-596 - Sheng Wang, Zhifeng Bao
, J. Shane Culpepper
, Timos Sellis
, Mark Sanderson
, Xiaolin Qin:
Answering Top-k Exemplar Trajectory Queries. 597-608 - Bilong Shen, Ying Zhao, Guoliang Li, Weimin Zheng, Yue Qin, Bo Yuan, Yongming Rao:
V-Tree: Efficient kNN Search on Moving Objects with Road-Network Constraints. 609-620 - Guoyang Chen, Yufei Ding, Xipeng Shen
:
Sweet KNN: An Efficient KNN on GPU through Reconciliation between Redundancy Removal and Regularity. 621-632 - Jinfei Liu, Juncheng Yang, Li Xiong
, Jian Pei
:
Secure Skyline Queries on Cloud Platform. 633-644
Session: New Hardware
- David Broneske, Veit Köppen
, Gunter Saake, Martin Schäler:
Accelerating Multi-Column Selection Predicates in Main-Memory - The Elf Approach. 647-658 - Shuhao Zhang
, Bingsheng He
, Daniel Dahlmeier, Amelie Chi Zhou, Thomas Heinze:
Revisiting the Design of Data Stream Processing Systems on Multi-Core Processors. 659-670 - Kai Zhang, Jiayu Hu, Bingsheng He
, Bei Hua:
DIDO: Dynamic Pipelines for In-Memory Key-Value Stores on Coupled CPU-GPU Architectures. 671-682 - Risi Thonangi, Jun Yang:
On Log-Structured Merge for Solid-State Drives. 683-694
Session: Security and Encryption
- Rui Li, Alex X. Liu:
Adaptively Secure Conjunctive Query Processing over Encrypted Data for Cloud Computing. 697-708 - Pietro Colombo
, Elena Ferrari
:
Towards a Unifying Attribute Based Access Control Approach for NoSQL Datastores. 709-720 - Boxiang Dong
, Wendy Hui Wang:
Frequency-Hiding Dependency-Preserving Encryption for Outsourced Databases. 721-732 - Kerim Yasin Oktay, Murat Kantarcioglu, Sharad Mehrotra:
Secure and Efficient Query Processing over Hybrid Clouds. 733-744
Session: Similarity Search
- Georgios Damaskinos, Rachid Guerraoui
, Rhicheek Patra:
Capturing the Moment: Lightweight Similarity Computations. 747-758 - Yong Zhang, Xiuxing Li, Jin Wang, Ying Zhang, Chunxiao Xing
, Xiaojie Yuan:
An Efficient Framework for Exact Set Similarity Search Using Tree Structure Indexes. 759-770 - Pratik Vinay Gupte, Balaraman Ravindran
, Srinivasan Parthasarathy
:
Role Discovery in Graphs Using Global Features: Algorithms, Applications and a Novel Evaluation Strategy. 771-782 - Yongjiang Liang, Peixiang Zhao:
Similarity Search in Graph Databases: A Multi-Layered Indexing Approach. 783-794
Session: Potpourri
- Xuan Zhou, Xin Zhou, Zhengtai Yu, Kian-Lee Tan
:
Posterior Snapshot Isolation. 797-808 - Ning Wang, Xiaokui Xiao
, Yin Yang
, Zhenjie Zhang, Yu Gu, Ge Yu:
PrivSuper: A Superset-First Approach to Frequent Itemset Mining under Differential Privacy. 809-820 - Yang Cao
, Masatoshi Yoshikawa, Yonghui Xiao, Li Xiong
:
Quantifying Differential Privacy under Temporal Correlations. 821-832 - Haida Zhang, Zengfeng Huang
, Zhewei Wei, Wenjie Zhang
, Xuemin Lin
:
Tracking Matrix Approximation over Distributed Sliding Windows. 833-844
Session: Social Networks
- Chonggang Song, Wynne Hsu
, Mong-Li Lee
:
Temporal Influence Blocking: Minimizing the Effect of Misinformation in Social Networks. 847-858 - Yurong Cheng, Ye Yuan, Lei Chen, Christophe G. Giraud-Carrier
, Guoren Wang:
Complex Event-Participant Planning and Its Incremental Variant. 859-870 - Jianxin Li
, Xinjue Wang, Ke Deng, Xiaochun Yang, Timos Sellis
, Jeffrey Xu Yu:
Most Influential Community Search over Large Social Networks. 871-882 - Yishi Lin, Wei Chen
, John C. S. Lui:
Boosting Information Spread: An Algorithmic Approach. 883-894
Session: Data Cleaning
- Joeri Rammelaere, Floris Geerts
, Bart Goethals
:
Cleaning Data with Forbidden Itemsets. 897-908 - Yasser Altowim, Sharad Mehrotra:
Parallel Progressive Approach to Entity Resolution Using MapReduce. 909-920 - Angelika Kimmig, Alex Memory, Renée J. Miller, Lise Getoor:
A Collective, Probabilistic Approach to Schema Mapping. 921-932 - Shuang Hao, Nan Tang, Guoliang Li, Jian Li:
Cleaning Relations Using Knowledge Bases. 933-944
Session: Learning and Outlier Detection
- Thach Le Nguyen, Severin Gsponer, Georgiana Ifrim
:
Time Series Classification by Sequence Learning in All-Subsequence Space. 947-958 - Lei Cao
, Yizhou Yan, Caitlin Kuhlman, Qingyang Wang, Elke A. Rundensteiner, Mohamed Y. Eltabakh:
Multi-Tactic Distance-Based Outlier Detection. 959-970 - Jiawei Zhang, Jianhui Chen, Shi Zhi, Yi Chang
, Philip S. Yu, Jiawei Han:
Link Prediction across Aligned Networks with Sparse and Low Rank Matrix Estimation. 971-982 - Xuyun Zhang
, Wan-Chun Dou, Qiang He, Rui Zhou
, Christopher Leckie
, Kotagiri Ramamohanarao, Zoran A. Salcic
:
LSHiForest: A Generic Framework for Fast Tree Isolation Based Ensemble Anomaly Analysis. 983-994
Session: Crowdsourcing and Recommender Systems
- Peng Cheng
, Xiang Lian
, Lei Chen, Cyrus Shahabi:
Prediction-Based Task Assignment in Spatial Crowdsourcing. 997-1008 - Tianshu Song, Yongxin Tong
, Libin Wang, Jieying She, Bin Yao, Lei Chen, Ke Xu:
Trichromatic Online Matching in Real-Time Spatial Crowdsourcing. 1009-1020 - Caleb Chen Cao, Jiayang Tu, Zheng Liu
, Lei Chen, H. V. Jagadish:
Tuning Crowdsourced Human Computation. 1021-1032 - Reinhard Heckel, Michail Vlachos
, Thomas P. Parnell, Celestine Dünner:
Scalable and Interpretable Product Recommendations via Overlapping Co-Clustering. 1033-1044
Session: Distributed Processing
- Yongyang Yu, MingJie Tang, Walid G. Aref, Qutaibah M. Malluhi, Mostafa M. Abbas, Mourad Ouzzani:
In-Memory Distributed Matrix Computation Processing and Optimization. 1047-1058 - Chuitian Rong
, Chunbin Lin, Yasin N. Silva, Jianguo Wang, Wei Lu, Xiaoyong Du:
Fast and Scalable Distributed Set Similarity Joins for Big Data Analytics. 1059-1070 - Namyong Park, Sejoon Oh
, U Kang:
Fast and Scalable Distributed Boolean Tensor Factorization. 1071-1082 - Claudio Martella, Dionysios Logothetis, Andreas Loukas, Georgos Siganos:
Spinner: Scalable Graph Partitioning in the Cloud. 1083-1094 - Zhida Chen, Gao Cong
, Zhenjie Zhang, Tom Z. J. Fu, Lisi Chen:
Distributed Publish/Subscribe Query Processing on the Spatio-Textual Data Stream. 1095-1106
Industry Track
Session: Predictive Analytics
- Lalitha Viswanathan, Bikash Chandra, Willis Lang, Karthik Ramachandra, Jignesh M. Patel, Ajay Kalhan, David J. DeWitt, Alan Halverson:
Predictive Provisioning: Efficiently Anticipating Usage in Azure SQL Database. 1111-1116 - Raja Subramaniam Thangaraj, Koyel Mukherjee, Gurulingesh Raravi, Asmita Metrewar, Narendra Annamaneni, Koushik Chattopadhyay:
Xhare-a-Ride: A Search Optimized Dynamic Ride Sharing System with Approximation Guarantee. 1117-1128 - Quanzhi Li, Armineh Nourbakhsh, Sameena Shah, Xiaomo Liu:
Real-Time Novel Event Detection from Social Media. 1129-1139 - Hanna Mazzawi, Gal Dalal, David Rozenblat, Liat Ein-Dor, Matan Ninio, Ofer Lavi, Allon Adir, Ehud Aharoni, Einat Kermany:
Anomaly Detection in Large Databases Using Behavioral Patterning. 1140-1149 - Abdeltawab M. Hendawi, Jayant Gupta, Youying Shi, Hossam Fattah, Mohamed H. Ali:
The Microsoft Reactive Framework Meets the Internet of Moving Things. 1150-1161
Session: New Systems
- Maosong Fu, Ashvin Agrawal, Avrilia Floratou, Bill Graham, Andrew Jorgensen, Mark Li, Neng Lu, Karthik Ramasamy, Sriram Rao, Cong Wang:
Twitter Heron: Towards Extensible Streaming Engines. 1165-1172 - An Qin, Yuan Yuan, Dai Tan, Pengyu Sun, Xiang Zhang, Hao Cao, Rubao Lee, Xiaodong Zhang:
Feisu: Fast Query Execution over Heterogeneous Data Sources on Large-Scale Clusters. 1173-1182 - Sijie Guo, Robin Dhamankar, Leigh Stewart:
DistributedLog: A High Performance Replicated Log Service. 1183-1194 - Sam Lightstone, Russ Ohanian, Michael Haide, James Cho, Michael Springgay, Torsten Steinbach:
Making Big Data Simple with dashDB Local. 1195-1205
Session: Optimization and Benchmarks
- Mohammed Al-Kateb, Paul Sinclair, Alain Crolotte, Lu Ma, Grace Au, Sanjay Nair:
Optimizing UNION ALL Join Queries in Teradata. 1209-1212 - Shuhao Zhang
, Hoang Tam Vo, Daniel Dahlmeier, Bingsheng He
:
Multi-Query Optimization for Complex Event Processing in SAP ESP. 1213-1224 - Ahmad Ghazal, Todor Ivanov, Pekka Kostamaa, Alain Crolotte, Ryan Voong, Mohammed Al-Kateb, Waleed Ghazal, Roberto V. Zicari:
BigBench V2: The New and Improved BigBench. 1225-1236 - Arun Iyengar:
Providing Enhanced Functionality for Data Store Clients. 1237-1248 - Alexander Ulanov, Andrey Simanovsky, Manish Marwah:
Modeling Scalability of Distributed Machine Learning. 1249-1254
Applications Track
Session 1
- Victor J. Marin, Tobin Pereira, Srinivas Sridharan, Carlos R. Rivero
:
Automated Personalized Feedback in Introductory Java Programming MOOCs. 1259-1270 - Deokwoo Jung, Zhenjie Zhang, Marianne Winslett:
Vibration Analysis for IoT Enabled Predictive Maintenance. 1271-1282 - Guojun Wu, Yichen Ding, Yanhua Li, Jie Bao, Yu Zheng, Jun Luo:
Mining Spatio-Temporal Reachable Regions over Massive Trajectory Data. 1283-1294 - Abdeltawab M. Hendawi, Aqeel Rustum, Amr A. Ahmadain, David Hazel, Ankur Teredesai, Dev Oliver, Mohamed H. Ali, John A. Stankovic:
Smart Personalized Routing for Smart Cities. 1295-1306
Session 2
- Jose Cordova-Garcia, Xin Wang
:
Robust Power Line Outage Detection with Unreliable Phasor Measurements. 1309-1319 - Mohamed Sarwat, Raha Moraffah, Mohamed F. Mokbel, James L. Avery:
Database System Support for Personalized Recommendation Applications. 1320-1331 - Constantinos Costa
, Georgios Chatzimilioudis, Demetrios Zeinalipour-Yazti
, Mohamed F. Mokbel:
Efficient Exploration of Telco Big Data with Compression and Decaying. 1332-1343 - Jie-Teng Wang, Wen-Yang Lin:
Privacy Preserving Anonymity for Periodical SRS Data Publishing. 1344-1355
Demo Track
Session 1: Cloud, Stream, Query Processing, Provenance
- Ryan Marcus
, Sofiya Semenova, Olga Papaemmanouil:
A Learning-Based Service for Cost and Performance Management of Cloud Databases. 1361-1362 - Zujian Weng, Qi Guo, Chunkai Wang, Xiaofeng Meng, Bingsheng He
:
AdaStorm: Resource Efficient Storm with Adaptive Configuration. 1363-1364 - Patrick Tan, Yichao Zhou, Xinxin Huang, Giuseppe M. Mazzeo, Chelsea Ju:
AZTEC: A Cloud-based Computational Platform to Integrate Biomedical Resources. 1365-1366 - Laurynas Siksnys, Torben Bach Pedersen:
Demonstrating SolveDB: An SQL-Based DBMS for Optimization Applications. 1367-1368 - Saad Bin Suhaim, Nan Zhang, Gautam Das
, Ali Jaoua
:
HDBExpDetector: Aggregate Sudden-Change Detector over Dynamic Web Databases. 1369-1370 - Qing Liu, Yunjun Gao, Linlin Zhou, Gang Chen:
IS2R: A System for Refining Reverse Top-k Queries. 1371-1372 - Pierre Bourhis, Daniel Deutch, Yuval Moskovitch:
POLYTICS: Provenance-Based Analytics of Data-Centric Applications. 1373-1374 - Sergey Hardock, Ilia Petrov, Robert Gottstein, Alejandro P. Buchmann:
Selective In-Place Appends for Real: Reducing Erases on Wear-prone DBMS Storage. 1375-1376 - Cetin Sahin, Aaron Magat, Victor Zakhary, Amr El Abbadi, Huijia (Rachel) Lin, Stefano Tessaro:
Understanding the Security Challenges of Oblivious Cloud Storage with Asynchronous Accesses. 1377-1378 - Tania K. Roblot, Sebastian Link
:
Urd: A Data Summarization Tool for the Acquisition of Meaningful Cardinality Constraints with Probabilistic Intervals. 1379-1380
Session 2: Graph Analytics, Social Networks, Machine Learning
- Amr Magdy
, Mohamed F. Mokbel:
Demonstration of Kite: A Scalable System for Microblogs Data Management. 1383-1384 - Abdulrahman Alsaudi, Mehdi Sadri, Yasser Altowim, Sharad Mehrotra:
Adaptive Topic Follow-Up on Twitter. 1385-1386 - Mohammad Hossein Namaki, Keyvan Sasani, Yinghui Wu, Tingjian Ge:
BEAMS: Bounded Event Detection in Graph Streams. 1387-1388 - Chunbin Lin, Jianguo Wang, Yannis Papakonstantinou:
GQFast: Fast Graph Exploration with Context-Aware Autocompletion. 1389-1390 - J. W. Zhang, Anwesha Mal, Y. C. Tay:
GscalerCloud: A Web-Based Graph Scaling Service. 1391-1392 - Hui Miao, Ang Li, Larry S. Davis, Amol Deshpande:
ModelHub: Deep Learning Lifecycle Management. 1393-1394 - Theodore Georgiou, Amr El Abbadi, Xifeng Yan:
Privacy Cyborg: Towards Protecting the Privacy of Social Media Users. 1395-1396 - Hongyun Cai, Vincent Wenchen Zheng, Penghe Chen, Fanwei Zhu, Kevin Chen-Chuan Chang, Zi Huang
:
SocialLens: Searching and Browsing Communities by Content and Interaction. 1397-1398 - Jan Neerbek, Ira Assent
, Peter Dolog:
TABOO: Detecting Unstructured Sensitive Information Using Recursive Neural Networks. 1399-1400
Session 3: Applications, Data Visualization, Text Analysis, Data Integration
- Zuozhi Wang, Flavio Bayer, Seungjin Lee, Kishore Narendran, Xuxi Pan, Qing Tang, Jimmy Wang, Chen Li:
A Demonstration of TextDB: Declarative and Scalable Text Analytics on Large Data Sets. 1403-1404 - Zhaoqiang Chen, Qun Chen, Zhanhuai Li:
A Human-and-Machine Cooperative Framework for Entity Resolution with Quality Guarantees. 1405-1406 - Dimitris Stripelis, José Luis Ambite
, Yao-Yi Chiang, Sandrah P. Eckel, Rima Habre
:
A Scalable Data Integration and Analysis Architecture for Sensor Data of Pediatric Asthma. 1407-1408 - Ahmad Assadi, Tova Milo, Slava Novgorodov:
DANCE: Data Cleaning with Constraints and Experts. 1409-1410 - Behrooz Omidvar-Tehrani, Arnab Nandi
, Nicholas Meyer, Dalton Flanagan, Seth Young:
DV8: Interactive Analysis of Aviation Data. 1411-1412 - Jia Yu
, Raha Moraffah, Mohamed Sarwat:
Hippo in Action: Scalable Indexing of a Billion New York City Taxi Trips and Beyond. 1413-1414 - Aibek Musaev, Calton Pu:
Landslide Information Service Based on Composition of Physical and Social Sensors. 1415-1416 - Furong Li, Mong-Li Lee
, Wynne Hsu
:
MAROON+: A System for Profiling Entities over Time. 1417-1418 - Constantinos Costa
, Georgios Chatzimilioudis, Demetrios Zeinalipour-Yazti
, Mohamed F. Mokbel:
SPATE: Compacting and Exploring Telco Big Data. 1419-1420 - Xin Mou, Hasan M. Jamil
, Xiaogang Ma
:
VisFlow: A Visual Database Integration and Workflow Querying System. 1421-1422
PhD Symposium
Session 1
- Magda Balazinska:
Keynote: Research with Real Users. 1425 - Zahid Abul-Basher:
Multiple-Query Optimization of Regular Path Queries. 1426-1430 - Bella Martínez-Seis
:
RELNA: Ranking Attributes in Social Networks to Detect Overlapping Communities Efficiently. 1431-1435
Session 2
- Sujoy Chatterjee
, Anirban Mukhopadhyay
, Malay Bhattacharyya
:
Judgment Analysis Based on Crowdsourced Opinions. 1439-1443 - Anahita Davoudi:
Effects of User Interactions on Online Social Recommender Systems. 1444-1448
Tutorials
- Xin Huang
, Laks V. S. Lakshmanan, Jianliang Xu
:
Community Search over Big Graphs: Models, Algorithms, and Opportunities. 1451-1454 - Chao Zhang
, Quan Yuan, Jiawei Han:
Bringing Semantics to Spatiotemporal Data Mining: Challenges, Methods, and Applications. 1455-1458 - Kostas Stefanidis
, Vassilis Christophides, Vasilis Efthymiou
:
Web-Scale Blocking, Iterative and Progressive Entity Resolution. 1459-1462 - Faisal Nawab, Divyakant Agrawal, Amr El Abbadi:
The Challenges of Global-Scale Data Management. 1463-1466 - Andreas Züfle, Goce Trajcevski, Dieter Pfoser, Matthias Renz, Matthew T. Rice, Timothy Leslie, Paul L. Delamater
, Tobias Emrich:
Handling Uncertainty in Geo-Spatial Data. 1467-1470
Panels
- Bill Howe, Michael J. Franklin, Laura M. Haas, Tim Kraska, Jeffrey D. Ullman:
Data Science Education: We're Missing the Boat, Again. 1473-1474 - Oliver Kennedy, D. Richard Hipp, Stratos Idreos, Amélie Marian, Arnab Nandi
, Carmela Troncoso, Eugene Wu:
Small Data. 1475-1476
HDMM Workshop
Session 1
- Maria Stratigi
, Haridimos Kondylakis, Kostas Stefanidis
:
Fairness in Group Recommendations in the Health Domain. 1481-1488 - Yuta Suzuki, Makito Sato, Hiroaki Shiokawa, Masashi Yanagisawa, Hiroyuki Kitagawa
:
MASC: Automatic Sleep Stage Classification Based on Brain and Myoelectric Signals. 1489-1496 - Yuanyang Zhang, Richard M. Jiang, Linda R. Petzold:
Survival Topic Models for Predicting Outcomes for Trauma Patients. 1497-1504
Session 2
- Vinicius Oliverio, Omero Bendicto Poli-Neto
:
Case Study: Classification Algorithms Comparison for the Multi-Label Problem of Chronic Pelvic Pain Diagnosing. 1507-1509 - NhatHai Phan, Soon Ae Chun, Manasi Bhole, James Geller:
Enabling Real-Time Drug Abuse Detection in Tweets. 1510-1514 - Jing (Melody) Yao:
Mother Smoking During Pregnancy and ADHD in Children. 1515-1522
Session 3
- Vineet K. Raghu, Xiaoyu Ge, Panos K. Chrysanthis
, Panayiotis V. Benos
:
Integrated Theory-and Data-Driven Feature Selection in Gene Expression Data Analysis. 1525-1532 - Luca Bonomi, Xiaoqian Jiang:
A Mortality Study for ICU Patients Using Bursty Medical Events. 1533-1540 - Diogo Ferreira Pacheco, Diego Pinheiro
, Martin Cadeiras, Ronaldo Menezes
:
Characterizing Organ Donation Awareness from Social Media. 1541-1548
DesWeb Workshop
Session 1
- Thu-Le Pham, Alessandra Mileo
, Muhammad Intizar Ali
:
Towards Scalable Non-Monotonic Stream Reasoning via Input Dependency Analysis. 1553-1558 - Marios Meimaris, George Papastefanatos
:
Distance-Based Triple Reordering for SPARQL Query Optimization. 1559-1562 - Sutanay Choudhury, Khushbu Agarwal, Sumit Purohit, Baichuan Zhang, Meg Pirrung, William P. Smith, Mathew Thomas:
NOUS: Construction and Querying of Dynamic Knowledge Graphs. 1563-1565
Session 2
- Siti Aminah
, Iis Afriyanti, Adila Krisnadhi:
Ontology-Based Approach for Academic Evaluation System. 1569-1574 - Michael N. Gubanov:
PolyFuse: A Large-Scale Hybrid Data Fusion System. 1575-1578 - Kostas Stefanidis
, Haridimos Kondylakis, Georgia Troullinou:
On Recommending Evolution Measures: A Human-Aware Approach. 1579-1581
Active and HardDB Workshops
HardDB Keynote
- Andre Putnam:
The Configurable Cloud - Accelerating Hyperscale Datacenter Services with FPGA. 1587
HardDB Papers
- Jianting Zhang, Simin You, Le Gruenwald:
Parallel Selectivity Estimation for Optimizing Multidimensional Spatial Join Processing on GPUs. 1591-1598 - Marcus Pinnecke, David Broneske, Gabriel Campero Durand, Gunter Saake:
Are Databases Fit for Hybrid Workloads on GPUs? A Storage Engine's Perspective. 1599-1606
Active Invited Talks (Academic)
- Bingsheng He
:
Data Management Systems on Future Hardware: Challenges and Opportunities. 1609 - Stratis D. Viglas:
Processing Declarative Queries through Generating Imperative Code in Managed Runtimes. 1610-1611 - Xiaodong Zhang:
Enabling Effective Utilization of GPUs for Data Management Systems. 1612
Active Invited Talks (Industry)
- Roger Moussalli:
Tradeoffs and Considerations in the Design of Accelerators for Database Applications. 1615 - Evangelia A. Sitaridi:
Hardware Acceleration of Database Analytics. 1616
![](https://dblp.uni-trier.de/img/cog.dark.24x24.png)
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.