Stop the war!
Остановите войну!
for scientists:
default search action
BigData Conference 2017: Boston, MA, USA
- Jian-Yun Nie, Zoran Obradovic, Toyotaro Suzumura, Rumi Ghosh, Raghunath Nambiar, Chonggang Wang, Hui Zang, Ricardo Baeza-Yates, Xiaohua Hu, Jeremy Kepner, Alfredo Cuzzocrea, Jian Tang, Masashi Toyoda:
2017 IEEE International Conference on Big Data (IEEE BigData 2017), Boston, MA, USA, December 11-14, 2017. IEEE Computer Society 2017, ISBN 978-1-5386-2715-0 - Carla E. Brodley:
Human-in-the-loop applied machine learning. 1 - Alan Edelman:
A more open efficient future for AI development and data science with an introduction to Julia. 2 - John Langford:
Contextual reinforcement learning. 3 - Jure Leskovec:
Large-scale graph representation learning. 4 - Satoshi Matsuoka:
Being "BYTES-oriented" in HPC leads to an open big data/AI ecosystem and further advances into the post-moore era. 5 - ChengXiang Zhai:
TextScope: Enhance human perception via text mining. 6 - Feng Chen, Chunpai Wang, Jin-Hee Cho:
Collective subjective logic: Scalable uncertainty-based opinion inference. 7-16 - Natascha Harth, Christos Anagnostopoulos:
Quality-aware aggregation & predictive analytics at the edge. 17-26 - Sheng Li, Yun Fu:
Robust multi-label semi-supervised classification. 27-36 - Xiaoli Li, Sai Nivedita Chandrasekaran, Jun Huan:
Lifelong multi-task multi-view learning using latent spaces. 37-46 - Natalia Ponomareva, Thomas Colthurst, Gilbert Hendry, Salem Haykal, Soroush Radpour:
Compact multi-class boosted trees. 47-56 - Daniel Yue Zhang, Dong Wang, Yang Zhang:
Constraint-aware dynamic truth discovery in big data social media sensing. 57-66 - Peter Baumann:
Standardizing big earth datacubes. 67-73 - Salima Benbernou, Mourad Ouziri:
Enhancing data quality by cleaning inconsistent big RDF data. 74-79 - Byron J. Gao, Robert Tung, Yong Yang:
Iterative matrix correlation for bisection clustering. 80-87 - Diego Granziol, Stephen J. Roberts:
Entropic determinants of massive matrices. 88-93 - Er-Chen Huang, Hsing-Kuo Pao, Yuh-Jye Lee:
Big active learning. 94-101 - Hasan Kurban, Mehmet M. Dalkilic:
A novel approach to optimization of iterative machine learning algorithms: Over heap structure. 102-109 - Sheng Li, Hongfu Liu, Zhiqiang Tao, Yun Fu:
Multi-view graph learning with adaptive label propagation. 110-115 - Christian S. Schmid, Bruce A. Desmarais:
Exponential random graph models with big networks: Maximum pseudolikelihood estimation and the parametric bootstrap. 116-121 - Sam Wood, Rohit Muthyala, Yi Jin, Yixing Qin, Nilaj Rukadikar, Amit Rai, Hua Gao:
Automated industry classification with deep learning. 122-129 - Jonghyun Bae, Hakbeom Jang, Wenjing Jin, Jun Heo, Jaeyoung Jang, Joo Young Hwang, Sangyeun Cho, Jae W. Lee:
Jointly optimizing task granularity and concurrency for in-memory mapreduce frameworks. 130-140 - Nathanael Cheriere, Gabriel Antoniu:
How fast can one scale down a distributed file system? 141-150 - Thomas Swearingen, Will Drevo, Bennett Cyphers, Alfredo Cuesta-Infante, Arun Ross, Kalyan Veeramachaneni:
ATM: A distributed, collaborative, scalable system for automated machine learning. 151-162 - Ioannis Giannakopoulos, Dimitrios Tsoumakos, Nectarios Koziris:
A decision tree based approach towards adaptive modeling of big data applications. 163-172 - Shashank Gugnani, Xiaoyi Lu, Houliang Qi, Li Zha, Dhabaleswar K. Panda:
Characterizing and accelerating indexing techniques on distributed ordered tables. 173-182 - Yuki Ito, Ryo Matsumiya, Toshio Endo:
ooc_cuDNN: Accommodating convolutional neural networks over GPU memory capacity. 183-192 - HyeongSik Kim, Padmashree Ravindra, Kemafor Anyanwu:
A semantics-aware storage framework for scalable processing of knowledge graphs on Hadoop. 193-202 - Konstantinos Lolos, Ioannis Konstantinou, Verena Kantere, Nectarios Koziris:
Elastic management of cloud applications using adaptive reinforcement learning. 203-212 - Xiaoyi Lu, Haiyang Shi, Dipti Shankar, Dhabaleswar K. Panda:
Performance characterization and acceleration of big data workloads on OpenPOWER system. 213-222 - Diego Marron, Eduard Ayguadé, José R. Herrero, Jesse Read, Albert Bifet:
Low-latency multi-threaded ensemble learning for dynamic big data streams. 223-232 - Arnab Kumar Paul, Arpit Goyal, Feiyi Wang, Sarp Oral, Ali Raza Butt, Michael J. Brim, Sangeetha B. Srinivasa:
I/O load balancing for big data HPC applications. 233-242 - Bo Peng, Bingjing Zhang, Langshi Chen, Mihai Avram, Robert Henschel, Craig A. Stewart, Shaojuan Zhu, Emily McCallum, Lisa Smith, Tom Zahniser, Jon Omer, Judy Qiu:
HarpLDA+: Optimizing latent dirichlet allocation for parallel efficiency. 243-252 - Jim Pivarski, Peter Elmer, Brian Bockelman, Zhe Zhang:
Fast access to columnar, hierarchically nested data via code transformation. 253-262 - Alex Watson, Deepigha Shree Vittal Babu, Suprio Ray:
Sanzu: A data science benchmark. 263-272 - Luna Xu, Seung-Hwan Lim, Min Li, Ali Raza Butt, Ramakrishnan Kannan:
Scaling up data-parallel analytics platforms: Linear algebraic operation cases. 273-282 - Xiaodong Yu, Kaixi Hou, Hao Wang, Wu-chun Feng:
Robotomata: A framework for approximate pattern matching of big data on an automata processor. 283-292 - Yunming Zhang, Vladimir Kiriansky, Charith Mendis, Saman P. Amarasinghe, Matei Zaharia:
Making caches work for graph analytics. 293-302 - Bilal Akil, Ying Zhou, Uwe Röhm:
On the usability of Hadoop MapReduce, Apache Spark & Apache flink for data science. 303-310 - Mohammed M. Alawad, Hong-Jun Yoon, Georgia D. Tourassi:
Energy efficient stochastic-based deep spiking neural networks for sparse datasets. 311-318 - Lars Arge, Mathias Rav, Svend C. Svendsen, Jakob Truelsen:
External memory pipelining made easy with TPIE. 319-324 - Dapeng Dong, John Herbert:
Compressed domain-specific data processing and analysis. 325-330 - Celestine Dünner, Thomas P. Parnell, Kubilay Atasu, Manolis Sifalakis, Haralampos Pozidis:
Understanding and optimizing the performance of distributed machine learning applications on apache spark. 331-338 - Xiao Meng, Lukasz Golab:
Optimal reducer placement to minimize data transfer in MapReduce-style processing. 339-346 - Michael Mercier, David Glesser, Yiannis Georgiou, Olivier Richard:
Big data and HPC collocation: Using HPC idle resources for Big Data analytics. 347-352 - Axel Oehmichen, Florian Guitton, Kai Sun, Jean Grizet, Thomas Heinis, Yike Guo:
eTRIKS analytical environment: A modular high performance framework for medical data analysis. 353-360 - Ilia Pietri, Yannis Chronis, Yannis E. Ioannidis:
Multi-objective optimization of scheduling dataflows on heterogeneous cloud resources. 361-368 - Md. Wasi-ur-Rahman, Nusrat Sharmin Islam, Xiaoyi Lu, Dhabaleswar K. Panda:
NVMD: Non-volatile memory assisted design for accelerating MapReduce and DAG execution frameworks on HPC systems. 369-374 - Xinhui Tian, Yuanqing Guo, Jianfeng Zhan, Lei Wang:
Towards memory and computation efficient graph processing on spark. 375-382 - Alexander Ulanov, Manish Marwah, Mijung Kim, Roshan Dathathri, Carlos Zubieta, Jun Li:
Sandpiper: Scaling probabilistic inferencing to large scale graphical models. 383-388 - Nikos Zacheilas, Stathis Maroulis, Vana Kalogeraki:
Dione: Profiling spark applications exploiting graph similarity. 389-394 - Mohammad Asghari, Cyrus Shahabi:
On on-line task assignment in spatial crowdsourcing. 395-404 - Ilir Fetai, Alexander Stiemer, Heiko Schuldt:
QuAD: A quorum protocol for adaptive data management in the cloud. 405-414 - Valérie Hayot-Sasson, Yongping Gao, Yuhong Yan, Tristan Glatard:
Sequential algorithms to split and merge ultra-high resolution 3D images. 415-424 - Shahab Helmi, Farnoush Banaei Kashani:
Spatiotemporal range pattern queries on large-scale co-movement pattern datasets. 425-434 - Srinivasan Venkatramanan, Sichao Wu, Bowen Shi, Achla Marathe, Madhav V. Marathe, Stephen G. Eubank, Lalit P. Sah, A. P. Giri, Luke A. Colavito, K. S. Nitin, V. Sridhar, R. Asokan, Rangaswamy Muniappan, G. Norton, Abhijin Adiga:
Towards robust models of food flows and their role in invasive species spread. 435-444 - Juan A. Colmenares, Reza Dorrigiv, Daniel G. Waddington:
A single-node datastore for high-velocity multidimensional sensor data. 445-452 - Isabelle Comyn-Wattiau, Jacky Akoka:
Model driven reverse engineering of NoSQL property graph databases: The case of Neo4j. 453-458 - Helge Holzmann, Vinay Goel, Emily Novak Gustainis:
Universal distant reading through metadata proxies with archivespark. 459-464 - Md. S. Q. Zulkar Nine, Kemal Guner, Ziyun Huang, Xiangyu Wang, Jinhui Xu, Tevfik Kosar:
Big data transfer optimization based on offline knowledge discovery and adaptive sampling. 465-472 - Ramyar Saeedi, Skyler Norgaard, Assefaw Hadish Gebremedhin:
A closed-loop deep learning architecture for robust activity recognition using wearable sensors. 473-479 - Haiying Shen, Heng Zhou:
CStorage: An efficient classification-based image storage system in cloud datacenters. 480-485 - Dingwen Tao, Sheng Di, Zizhong Chen, Franck Cappello:
In-depth exploration of single-snapshot lossy compression techniques for N-body simulations. 486-493 - Xian Wu, Yuxiao Dong, Jun Tao, Chao Huang, Nitesh V. Chawla:
Reliable fake review detection via modeling temporal and behavioral patterns. 494-499 - Masahiro Yokoyama, Takahiro Hara, Sanjay Kumar Madria:
Efficient diversified set monitoring for mobile sensor stream environments. 500-507 - Yangwen Yu, James Jian Qiao Yu, Victor O. K. Li, Jacqueline C. K. Lam:
Low-rank singular value thresholding for recovering missing air quality data. 508-513 - Lina Yu, Michael L. Rilee, Yu Pan, Feiyu Zhu, Kwo-Sen Kuo, Hongfeng Yu:
Visual analytics with unparalleled variety scaling for big earth data. 514-521 - Ming Zeng, Tong Yu, Xiao Wang, Le T. Nguyen, Ole J. Mengshoel, Ian R. Lane:
Semi-supervised convolutional neural networks for human activity recognition. 522-529 - Xibo Zhou, Ye Ding, Fengchao Peng, Qiong Luo, Lionel M. Ni:
Detecting unmetered taxi rides from trajectory data. 530-535 - Giambattista Amati, Simone Angelini, Giorgio Gambosi, Gianluca Rossi, Paola Vocca:
Estimation of distance-based metrics for very large graphs with MinHash Signatures. 536-545 - Philipp Baumann, Dorit S. Hochbaum, Quico Spaen:
High-performance geometric algorithms for sparse computation in big data analytics. 546-555 - Sreyasee Das Bhattacharjee, Ashit Talukder, Bala Venkatram Balantrapu:
Active learning based news veracity detection with feature weighting and deep-shallow fusion. 556-565 - Chandramani Chaudhary, Poonam Goyal, Yi-Ping Phoebe Chen:
Exploiting visual and textual neighborhood information to improve image-tag relevance. 566-575 - Limeng Cui, Jiawei Zhang, Zhensong Chen, Yong Shi, Philip S. Yu:
Inverse extreme learning machine for learning with label proportions. 576-585 - Vachik S. Dave, Nesreen K. Ahmed, Mohammad Al Hasan:
E-CLoG: Counting edge-centric local graphlets. 586-595 - Bo Dong, Yifan Li, Yang Gao, Ahsanul Haque, Latifur Khan, Mohammad M. Masud:
Multistream regression with asynchronous concept drift detection. 596-605 - Roohollah Etemadi, Jianguo Lu:
Bias correction in clustering coefficient estimation. 606-615 - Guyue Han, Harish Sethu:
Closed walk sampler: An efficient method for estimating the spectral radius of large graphs. 616-625 - Jun Hu, Yuxin Wang, Ping Li:
Online city-scale hyper-local event detection via analysis of social media and human mobility. 626-635 - Jianfeng Jia, Chen Li, Michael J. Carey:
Drum: A rhythmic approach to interactive analytics on large data. 636-645 - Ryoya Kaneko, Kohei Miyaguchi, Kenji Yamanishi:
Detecting changes in streaming data with information-theoretic windowing. 646-655 - Foteini Katsarou, Nikos Ntarmos, Peter Triantafillou:
Hybrid algorithms for subgraph pattern queries in graph databases. 656-665 - Sarasi Lalithsena, Sujan Perera, Pavan Kapanipathi, Amit P. Sheth:
Domain-specific hierarchical subgraph extraction: A recommendation use case. 666-675 - Panagiotis Liakos, Alexandros Ntoulas, Alex Delis:
COEUS: Community detection via seed-set expansion on graph streams. 676-685 - Panagiotis Liakos, Alexandros Ntoulas, Alex Delis:
Rhea: Adaptively sampling authoritative content from social activity streams. 686-695 - Ismini Lourentzou, Alex Morales, ChengXiang Zhai:
Text-based geolocation prediction of social media users with neural networks. 696-705 - Alessandro Lulli, Luca Oneto, Davide Anguita:
Crack random forest for arbitrary large datasets. 706-715 - Suchismit Mahapatra, Varun Chandola:
S-Isomap++: Multi manifold learning from streaming data. 716-725 - Sheikh Motahar Naim, Arnold P. Boedihardjo, Mahmud Shahriar Hossain:
A scalable model for tracking topical evolution in large document collections. 726-735 - Mehrnaz Najafi, Lifang He, Philip S. Yu:
Error-robust multi-view clustering. 736-745 - Axel-Cyrille Ngonga Ngomo, Michael Hoffmann, Ricardo Usbeck, Kunal Jha:
Holistic and scalable ranking of RDF data. 746-755 - Haekyu Park, Jinhong Jung, U Kang:
A comparative study of matrix factorization and random walk with restart in recommender systems. 756-765 - Chao Shang, Aaron Palmer, Jiangwen Sun, Ko-Shin Chen, Jin Lu, Jinbo Bi:
VIGAN: Missing view imputation with generative adversarial networks. 766-775 - Lorenzo De Stefani, Erisa Terolli, Eli Upfal:
Tiered sampling: An efficient method for approximate counting sparse motifs in massive graph streams. 776-786 - Cheng-Chin Tu, Mi-Yen Yeh, Tei-Wei Kuo:
A fast non-volatile memory aware algorithm for generating random scale-free networks. 787-796 - Nguyen Vo, Kyumin Lee, Thanh Tran:
MRAttractor: Detecting communities from large-scale graphs. 797-806 - Yueyao Wang, Qinmin Hu, Yang Song, Liang He:
Potentiality of healthcare big data: Improving search by automatic query reformulation. 807-816 - Ichitaro Yamazaki, Stanimire Tomov, Jack J. Dongarra:
Sampling algorithms to update truncated SVD. 817-826 - Yizhou Yan, Lei Cao, Elke A. Rundensteiner:
Distributed Top-N local outlier detection in big data. 827-836 - Tong Yang, Binchao Yin, Hang Li, Muhammad Shahzad, Steve Uhlig, Bin Cm, Xiaoming Li:
Rectangular hash table: Bloom filter and bitmap assisted hash table with high speed. 837-846 - Xinli Yu, Zheng Chen, Wei-Shih Yang, Xiaohua Hu, Erjia Yan, Guangrong Li:
Large-scale joint topic, sentiment & user preference analysis for online reviews. 847-856 - Chuxu Zhang, Lu Yu, Xiangliang Zhang, Nitesh V. Chawla:
ImWalkMF: Joint matrix factorization and implicit walk integrative learning for recommendation. 857-866 - Lei Zheng, Bokai Cao, Vahid Noroozi, Philip S. Yu, Nianzu Ma:
Hierarchical collaborative embedding for context-aware recommendations. 867-876 - Ebad Ahmadzadeh, Philip K. Chan:
Mining pros and cons of actions from social media for decision support. 877-882 - Masato Asahara, Ryohei Fujimaki:
Distributed Bayesian piecewise sparse linear models. 883-888 - Kubilay Atasu, Thomas P. Parnell, Celestine Dünner, Manolis Sifalakis, Haralampos Pozidis, Vasileios Vasileiadis, Michail Vlachos, Cesar Berrospi, Abdel Labbi:
Linear-complexity relaxed word Mover's distance with GPU acceleration. 889-896 - Ricardo Baeza-Yates, Zeinab Liaghat:
Quality-efficiency trade-offs in machine learning for text processing. 897-904 - Jose Cadena, Saliya Ekanayake, Anil Vullikanti:
Fast graph scan statistics optimization using algebraic fingerprints. 905-910 - Zaineb Chelly Dagdia, Christine Zarges, Gaël Beck, Mustapha Lebbah:
A distributed rough set theory based algorithm for an efficient big data pre-processing under the spark framework. 911-916 - Hoang Anh Dau, Diego Furtado Silva, François Petitjean, Germain Forestier, Anthony J. Bagnall, Eamonn J. Keogh:
Judicious setting of Dynamic Time Warping's window width allows more accurate classification of time series. 917-922 - Alexander Denzler, Michael Kaufmann:
Toward granular knowledge analytics for data intelligence: Extracting granular entity-relationship graphs for knowledge profiling. 923-928 - Ankit Desai, Sanjay Chaudhary:
Distributed decision tree v.2.0. 929-934 - Mohammad M. Ghassemi, Willow Jarvis, Tuka Alhanai, Emery N. Brown, Roger G. Mark, M. Brandon Westover:
An open-source tool for the transcription of paper-spreadsheet data: Code and supplemental materials available online: Https: //github.com/deskool/images to spreadsheets. 935-941 - Poonam Goyal, Jagat Sesh Challa, Shivin Shrivastava, Navneet Goyal:
AnyFI: An anytime frequent itemset mining algorithm for data streams. 942-947 - Tatsuru Kobayashi, Shin Matsushima, Taito Lee, Kenji Yamanishi:
Discovering potential traffic risks in Japan using a supervised learning approach. 948-955 - Martin Koehler, Alex Bogatu, Cristina Civili, Nikolaos Konstantinou, Edward Abel, Alvaro A. A. Fernandes, John A. Keane, Leonid Libkin, Norman W. Paton:
Data context informed data wrangling. 956-963 - Naama Kraus, David Carmel, Idit Keidar:
Fishing in the stream: Similarity search over endless data. 964-969 - Liang Ma, Guohong Cao, Lance M. Kaplan:
Graphical approach for influence maximization in social networks under generic threshold-based non-submodular model. 970-975 - Aritra Mandal, Mohammad Al Hasan:
A distributed k-core decomposition algorithm on spark. 976-981 - Mohammad Hossein Namaki, Peng Lin, Yinghui Wu:
Event pattern discovery by keywords in graph streams. 982-987 - Michael Nelson, Sridhar Radhakrishnan, Amlan Chatterjee, Chandra N. Sekharan:
Queryable compression on streaming social networks. 988-993 - Fengchao Peng, Yudian Ji, Qiong Luo, Lionel M. Ni:
Event-based non-parametric clustering of team sport trajectories. 994-999 - Sumit Purohit, Sutanay Choudhury, Lawrence B. Holder:
Application-specific graph sampling for frequent subgraph mining and community detection. 1000-1005 - Hung Tran-The, Koji Zettsu:
Discovering co-occurrence patterns of heterogeneous events from unevenly-distributed spatiotemporal data. 1006-1011 - Takeaki Uno, Hiroki Maegawa, Takanobu Nakahara, Yukinobu Hamuro, Ryo Yoshinaka, Makoto Tatsuta:
Micro-clustering by data polishing. 1012-1018 - Chenwei Zhang, Nan Du, Wei Fan, Yaliang Li, Chun-Ta Lu, Philip S. Yu:
Bringing semantic structures to user intent detection in online medical queries. 1019-1026 - Daniel Yue Zhang, Dong Wang, Hao Zheng, Xin Mu, Qi Li, Yang Zhang:
Large-scale point-of-interest category prediction using natural language processing models. 1027-1032 - Alexander Heifetz, Vaikkunth Mugunthan, Lalana Kagal:
Shade: A differentially-private wrapper for enterprise big data. 1033-1042 - Balaji Palanisamy, Chao Li, Prashant Krishnamurthy:
Group privacy-aware disclosure of association graph data. 1043-1052 - Lichao Sun, Xiaokai Wei, Jiawei Zhang, Lifang He, Philip S. Yu, Witawas Srisa-an:
Contaminant removal for Android malware detection systems. 1053-1062 - Xi Zhang, Yu Zeng, Xiao-Bo Jin, Zhiwei Yan, Guang-Gang Geng:
Boosting the phishing detection performance by semantic analysis. 1063-1070 - Robert A. Bridges, Jessie D. Jamieson, Joel W. Reed:
Setting the threshold for high throughput detectors: A mathematical approach for ensembles of dynamic, heterogeneous, probabilistic anomaly detectors. 1071-1078 - Dong Chen, David E. Irwin:
Weatherman: Exposing weather-based privacy threats in big energy data. 1079-1086 - Jiuyong Li, Jixue Liu, Lin Liu, Thuc Duy Le, Saisai Ma, Yizhao Han:
Discrimination detection by causal effect estimation. 1087-1094 - Amit Pande, Vishal Ahuja:
WEAC: Word embeddings for anomaly classification from event logs. 1095-1100 - Shuo Wang, Richard O. Sinnott, Surya Nepal:
Privacy-protected place of activity mining on big location data. 1101-1108 - Shuo Wang, Richard O. Sinnott, Surya Nepal:
Sensitive gazetteer discovery and protection for mobile social media users. 1109-1116 - Tianqing Zhu, Ping Xiong, Gang Li, Wanlei Zhou, Philip S. Yu:
Differentially private query learning: From data publishing to model publishing. 1117-1122 - Eric Breck, Shanqing Cai, Eric Nielsen, Michael Salib, D. Sculley:
The ML test score: A rubric for ML production readiness and technical debt reduction. 1123-1132 - Meng-Fen Chiang, Ee-Peng Lim, Wang-Chien Lee, Agus Trisnajaya Kwee:
BTCI: A new framework for identifying congestion cascades using bus trajectory data. 1133-1142 - Pankaj Goel, Aniruddha Datta, M. Sam Mannan:
Application of big data analytics in process safety and risk management. 1143-1152 - Lei Huang, Weijia Xu, Si Liu, Venktesh Pandey, Natalia Ruiz-Juri:
Enabling versatile analysis of large scale traffic video data with deep learning and HiveQL. 1153-1162 - Hiroshi Inoue:
Fast interpolation of grid data at a non-grid point. 1163-1172 - Xiaowei Jia, Yifan Hu, Ankush Khandelwal, Anuj Karpatne, Vipin Kumar:
Joint sparse auto-encoder: A semi-supervised spatio-temporal approach in mapping large-scale croplands. 1173-1182 - Pasan Karunaratne, Masud Moshtaghi, Shanika Karunasekera, Aaron Harwood, Trevor Cohn:
Multi-step prediction with missing smart sensor data using multi-task Gaussian processes. 1183-1192 - Abhinav Maurya, Rahul Telang:
Bayesian multi-view models for member-job matching and personalized skill recommendations. 1193-1202 - Mai H. Nguyen, Daniel Crawl, Jiaxin Li, Dylan Uys, Ilkay Altintas:
Automated scalable detection of location-specific Santa Ana conditions from weather data using unsupervised learning. 1203-1212 - Haoyu Wang, Jiaqi Gong, Yan Zhuang, Haiying Shen, John C. Lach:
HealthEdge: Task scheduling for edge computing with health emergency and human behavior consideration in smart homes. 1213-1222 - Jingyuan Zhang, Chun-Ta Lu, Bokai Cao, Yi Chang, Philip S. Yu:
Connecting emerging relationships from news via tensor factorization. 1223-1232 - Yuan Zhang, Chen Lin, Min Chi, Julie S. Ivy, Muge Capan, Jeanne M. Huddleston:
LSTM for septic shock: Adding unreliable labels to reliable predictions. 1233-1242 - Baoxin Zhao, Chengzhong Xu, Siyuan Liu:
A data-driven congestion diffusion model for characterizing traffic in metrocity scales. 1243-1252 - Allard J. van Altena, Perry D. Moerland, Aeilko H. Zwinderman, Sílvia D. Olabarriaga:
Analysis of the term 'big data': Usage in biomedical publications. 1253-1258 - Marzieh Bakhshandeh, Dennis M. M. Schunselaar, Henrik Leopold, Hajo A. Reijers:
Predicting treatment repetitions in the implant denture therapy process. 1259-1264 - Jian Cao, Fangzhou Yang, Yuchang Xu, Yudong Tan, Quan-Wu Xiao:
Personalized flight recommendations via paired choice modeling. 1265-1270 - Zhitang Chen, Ke He, Jian Li, Yanhui Geng:
Seq2Img: A sequence-to-image based approach towards IP traffic classification using convolutional neural networks. 1271-1276 - Chung Ming Cheung, Palash Goyal, Viktor K. Prasanna, Arash Saber Tehrani:
OReONet: Deep convolutional network for oil reservoir optimization. 1277-1282 - Giuseppe Cuccu, Somayeh Danafar, Philippe Cudré-Mauroux, Martin Gassner, Stefano Bernero, Krzysztof Kryszczuk:
A data-driven approach to predict NOx-emissions of gas turbines. 1283-1288 - Angelo Furno, Nour-Eddin El Faouzi, Rajesh Sharma, Eugenio Zimeo:
Two-level clustering fast betweenness centrality computation for requirement-driven approximation. 1289-1294 - Xueying Guo, George Trimponias, Xiaoxiao Wang, Zhitang Chen, Yanhui Geng, Xin Liu:
Cellular network configuration via online learning and joint optimization. 1295-1300 - Jiankun Huang, Wenjun Wu:
T-BMIRT: Estimating representations of student knowledge and educational components in online education. 1301-1306 - Xinjiang Lu, Zhiwen Yu, Chuanren Liu, Yanchi Liu, Hui Xiong, Bin Guo:
Forecasting the rise and fall of volatile point-of-interests. 1307-1312 - Stanislav Sobolevsky, Emanuele Massaro, Iva Bojic, Juan Murillo Arias, Carlo Ratti:
Predicting regional economic indices using big data of individual bank card transactions. 1313-1318 - Chuishi Meng, Yu Cui, Qing He, Lu Su, Jing Gao:
Travel purpose inference with GPS trajectories, POIs, and geo-tagged social media data. 1319-1324 - Jennifer Sleeman, Milton Halem, Tim Finin, Mark Cane:
Discovering scientific influence using cross-domain dynamic topic modeling. 1325-1332 - Mohiuddin Solaimani, Sayeed Salam, Latifur Khan, Patrick T. Brandt, Vito D'Orazio:
RePAIR: Recommend political actors in real-time from news websites. 1333-1340 - Xing Su, Yuan Yao, Qing He, Jie Lu, Hanghang Tong:
Personalized travel mode detection with smartphone sensors. 1341-1348 - Ashish Tapdiya, Daniel Fabbri:
A comparative analysis of state-of-the-art SQL-on-Hadoop systems for interactive analytics. 1349-1356 - Tingyang Xu, Tan Yan, Dongjin Song, Wei Cheng, Haifeng Chen, Geoff Jiang, Jinbo Bi:
Identifying and quantifying nonlinear structured relationships in complex manufactural systems. 1357-1362 - Yuchang Xu, Jian Cao:
OTPS: A decision support service for optimal airfare Ticket Purchase. 1363-1368 - Hu Xu, Sihong Xie, Lei Shu, Philip S. Yu:
Product function need recognition via semi-supervised attention network. 1369-1374 - Wenbo Zhang, Dheeraj Kumar, Satish V. Ukkusuri:
Exploring the dynamics of surge pricing in mobility-on-demand taxi services. 1375-1380 - Yihua Shi Astle, Xuning Tang, Craig Freeman:
Application of dynamic logistic regression with unscented Kalman filter in predictive coding. 1381-1389 - Mansurul Alam Bhuiyan, Mohammad Al Hasan:
RAVEN: Web-based smart home exploration system through interactive pattern discovery. 1390-1399 - Simon Bin, Patrick Westphal, Jens Lehmann, Axel Ngonga:
Implementing scalable structured machine learning for big data in the SAKE project. 1400-1407 - Zheng Chen, Xinli Yu, Chi Zhang, Jin Zhang, Cui Lin, Bo Song, Jianliang Gao, Xiaohua Hu, Wei-Shih Yang, Erjia Yan:
Fast botnet detection from streaming logs using online lanczos method. 1408-1417 - Yuheng Du, Alexander Herzog, André Luckow, Ramu Nerella, Christopher Gropp, Amy W. Apon:
Representativeness of latent dirichlet allocation topics estimated from data samples with application to common crawl. 1418-1427 - Rishi Chhatwal, Nathaniel Huber-Fliflet, Robert Keeling, Jianping Zhang, Haozhen Zhao:
Empirical evaluations of active learning strategies in legal document review. 1428-1437 - T. F. Kennedy, Robert S. Provence, James L. Broyan, Patrick W. Fink, Phong H. Ngo, Lazaro D. Rodriguez:
Topic models for RFID data modeling and localization. 1438-1446 - Ishita K. Khan, Prathyusha Senthil Kumar, Daniel Miranda, David Goldberg:
What is skipped: Finding desirable items in e-commerce search by discovering the worst title tokens. 1447-1456 - Youngho Kim, Petros Zerfos, Vadim Sheinin, Nancy Greco:
Ranking the importance of ontology concepts using document summarization techniques. 1457-1466 - Lay Wai Kong:
Performance optimization in scale-out storage using design of experiment as heuristic. 1467-1474 - Hyunjong Lee, Youngin Jo, Sanghyuk Chun, Kwangseob Kim:
A study on intelligent personalized push notification with user history. 1475-1482 - Xiaomo Liu, Armineh Nourbakhsh, Quanzhi Li, Sameena Shah, Robert Martin, John Duprey:
Reuters tracer: Toward automated news production using large scale social media data. 1483-1493 - Justin McHugh, Paul E. Cuddihy, Jenny Weisenberg Williams, Kareem S. Aggour, Vijay S. Kumar, Varish Mulwad:
Integrated access to big data polystores through a knowledge-driven framework. 1494-1503 - Jacob Montiel, Albert Bifet, Talel Abdessalem:
Predicting over-indebtedness on batch and streaming data. 1504-1513 - Ye Ouyang, Zhongyuan Li, Le Su, Wenyuan Lu, Zhenyi Lin:
APP-SON: Application characteristics-driven SON to optimize 4G/5G network performance and quality of experience. 1514-1523 - Karthikeyan Natesan Ramamurthy, Dennis Wei, Emily Ray, Moninder Singh, Vijay S. Iyengar, Dmitriy A. Katz-Rogozhnikov, Jingwei Yang, Kevin N. Tran, Gigi Y. Yuen-Reed:
A configurable, big data system for on-demand healthcare cost prediction. 1524-1533 - Syed Yousaf Shah, Zengwen Yuan, Songwu Lu, Petros Zerfos:
Dependency analysis of cloud applications for performance monitoring using recurrent neural networks. 1534-1543 - Walid Shalaby, BahaaEddin AlAila, Mohammed Korayem, Layla Pournajaf, Khalifeh AlJadda, Shannon Quinn, Wlodek Zadrozny:
Help me find a job: A graph-based approach for job recommendation at scale. 1544-1553 - Derrick C. Spell, Xiao-Han T. Zeng, Jae Young Chung, Bahador Nooraei, Richard T. Shomer, Ling-Yong Wang, James C. Gibson, Daniel Kirsche:
Flux: Groupon's automated, scalable, extensible machine learning platform. 1554-1559 - Nenad Stojanovic, Marko Dinic, Ljiljana Stojanovic:
A data-driven approach for multivariate contextualized anomaly detection: Industry use case. 1560-1569 - Dharmashankar Subramanian, Debarun Bhattacharjya, Ruben Rodriguez Torrado, Jeffrey O. Kephart, Vijil Chenthamarakshan, Jesus Rios:
A cognitive assistant for risk identification and modeling. 1570-1579 - Warut D. Vijitbenjaronk, Jinho Lee, Toyotaro Suzumura, Gabriel Tanase:
Scalable time-versioning support for property graph databases. 1580-1589 - Xuchao Zhang, Liang Zhao, Zhiqian Chen, Arnold P. Boedihardjo, Jing Dai, Chang-Tien Lu:
Trendi: Tracking stories in news and microblogs via emerging, evolving and fading topics. 1590-1599 - Zhiwei Zhang, Ning Chen, Jun Wang, Luo Si:
SMART: Sponsored mobile app recommendation by balancing app downloads and appstore profit. 1600-1609 - Wen-Yuan Zhu, Wen-Yueh Shih, Ying-Hsuan Lee, Wen-Chih Peng, Jiun-Long Huang:
A gamma-based regression for winning price estimation in real-time bidding advertising. 1610-1619 - Nirupama Appiktala, Miao Chen, Michael Natkovich, Joshua J. Walters:
Demystifying dark matter for online experimentation. 1620-1626 - Neela Avudaiappan, Alexander Herzog, Sneha Kadam, Yuheng Du, Jason Thatcher, Ilya Safro:
Detecting and summarizing emergent events in microblogs and social media streams by dynamic centralities. 1627-1634 - Russell Chen, Miao Chen, Mahendrasinh Ramsinh Jadav, Joonsuk Bae, Don Matheson:
Faster online experimentation by eliminating traditional A/A validation. 1635-1641 - Ferosh Jacob, Ilamgumaran Karunanithi, Pramod Salian, Ravi Sambhu:
BBC: A DSL for designing cloud-based heterogeneous bigdata pipelines. 1642-1645 - George Mathew:
Architectural considerations for highly scalable computing to support on-demand video analytics. 1646-1649 - Leonardo Maria Millefiori, Paolo Braca, Gianfranco Arcieri:
Scalable distributed change detection and its application to maritime traffic. 1650-1657 - Ankita R. Nambiar, Nikitha Reddy, Debojyoti Dutta:
Connected health: Opportunities and challenges. 1658-1662 - Emmanuel Oyekanlu:
Predictive edge computing for time series of industrial IoT and large scale critical infrastructure based on open-source software analytic of big data. 1663-1669 - Kevin B. Pratt:
Linking many unusual co-incidences. 1670-1675 - Martin Ringsquandl, Evgeny Kharlamov, Daria Stepanova, Steffen Lamparter, Raffaello Lepratti, Ian Horrocks, Peer Kröger:
On event-driven knowledge graph completion in digital factories. 1676-1681 - Giannis Spiliopoulos, Konstantinos Chatzikokolakis, Dimitrios Zissis, Evmorfia Biliri, Dimitrios Papaspyros, Giannis Tsapelas, Spyros Mouzakitis:
Knowledge extraction from maritime spatiotemporal data: An evaluation of clustering algorithms on Big Data. 1682-1687 - Xuchao Zhang, Zhiqian Chen, Liang Zhao, Arnold P. Boedihardjo, Chang-Tien Lu:
TRACES: Generating Twitter stories via shared subspace and temporal smoothness. 1688-1693 - Christine Balili, Aviv Segev, Uichin Lee:
Tracking and predicting the evolution of research topics in scientific literature. 1694-1697 - Gong Cheng, Evgeny Kharlamov:
Towards a semantic keyword search over industrial knowledge graphs (extended abstract). 1698-1700 - Ajay Dholakia, Prasad Venkatachar, Kshitij A. Doshi, Ravikanth Durgavajhala, Stewart Tate, Berni Schiefer, Matthew Sheard, Ramnath Sai Sagar:
Designing a high performance cluster for large-scale SQL-on-hadoop analytics. 1701-1703 - Maurizio Montagnuolo, Alberto Messina, Nicolo Bidotti, Paolo Platter, Alessio Bosca:
Real time semantic enrichment of broadcast content in the big data age. 1704-1708 - Yiran Zhao, Shuochao Yao, Shaohan Hu, Shiyu Chang, Raghu K. Ganti, Mudhakar Srivatsa, Shen Li, Tarek F. Abdelzaher:
On the improvement of classifying EEG recordings using neural networks. 1709-1711 - Zhou Fa, Guang-Gang Geng, Zhiwei Yan, Xiao-Dong Lee:
A robust internet abuse detection method. 1712-1715 - Alexander Brodsky, Mohan Krishnamoorthy, M. Omar Nachawati, William Z. Bernstein, Daniel A. Menascé:
Manufacturing and contract service networks: Composition, optimization and tradeoff analysis based on a reusable repository of performance models. 1716-1725 - Max Ferguson, Ronay Ak, Yung-Tsun Tina Lee, Kincho H. Law:
Automatic localization of casting defects with convolutional neural networks. 1726-1735 - Yunpeng Li, Heng Zhang, Utpal Roy, Y. Tina Lee:
A data-driven approach for improving sustainability assessment in advanced manufacturing. 1736-1745 - Don Libes, David Lechevalier, Sanjay Jain:
Issues in synthetic data generation for advanced manufacturing. 1746-1754 - Srinivasan Radhakrishnan, Yung-Tsun Tina Lee, Sagar V. Kamarthi:
Estimation of online tool wear in turning processes using recurrence quantification analysis (RQA). 1755-1759 - Heather M. Reed, Richard P. Vinci, Corbin Robeck, Trevor Verdonik, Michael Pires, Maria Castro, Wojciech Z. Misiolek, Christina Viau Haden:
Statistically-substantiated density characterizations of additively manufactured steel alloys through verification, validation, and uncertainty quantification. 1760-1768 - Thurston Sexton, Michael P. Brundage, Michael Hoffman, K. C. Morris:
Hybrid datafication of maintenance logs from AI-assisted human tags. 1769-1777 - Akinori Abe, Yuki Hayashi:
Data treatment from the viewpoint of granular computing. 1778-1785 - Fatemeh Cheraghchi, Ibrahim Y. Abualhaol, Rafael Falcon, Rami S. Abielmona, Bijan Raahemi, Emil M. Petriu:
Big-data-enabled modelling and optimization of granular speed-based vessel schedule recovery problem. 1786-1794 - Lihao Ge, Teng-Sheng Moh:
Improving text classification with word embedding. 1796-1805 - Marek Grzegorowski, Andrzej Janusz, Dominik Slezak, Marcin S. Szczuka:
On the role of feature space granulation in feature selection processes. 1806-1815 - Tzung-Pei Hong, Lu-Hung Chen, Shyue-Liang Wang, Chun-Wei Lin, Bay Vo:
Quasi-erasable itemset mining. 1816-1820 - Tsau Young Lin, Pierre Vachon:
Secure information flow and file movements: A topological theory of discretionary access controls. 1821-1829 - Ahmad M. Mustafa, Gbadebo Ayoade, Khaled Al-Naami, Latifur Khan, Kevin W. Hamlen, Bhavani Thuraisingham, Frederico Araujo:
Unsupervised deep embedding for novel class detection over data stream. 1830-1839 - Dominik Slezak, Agnieszka Chadzynska-Krasowska, Joel Holland, Piotr Synak, Rick Glick, Marcin Perkowski:
Scalable cyber-security analytics with a new summary-based approximate query engine. 1840-1849 - Shusaku Tsumoto, Tomohiro Kimura, Haruko Iwata, Shoji Hirano:
Mining text for disease diagnosis in hospital information system. 1850-1859 - Shuyin Xia, Guoyin Wang, Yunsheng Liu, Qun Liu, Hong Yu:
Noise self-filtering K-nearest neighbors algorithms. 1860-1965 - Josh Jia-Ching Ying, Po-Yu Huang, Chih-Kai Chang, Don-Lin Yang:
A preliminary study on deep learning for predicting social insurance payment behavior. 1866-1875 - Hayri Volkan Agun, Sibel Yilmazel, Özgür Yilmazel:
Effects of language processing in Turkish authorship attribution. 1876-1881 - Nora Alkhamees, Maria Fasli:
Event detection from time-series streams using directional change and dynamic thresholds. 1882-1891 - Yusuf Arslan, Aysenur Birturk, Bekjan Djumabaev, Dilek Küçük:
Real-time Lexicon-based sentiment analysis experiments on Twitter with a mild (more information, less data) approach. 1892-1897 - Inci Batmaz, Pinar Karagoz, Gulsah Serdar:
A comparative study on learning to rank with computational methods. 1898-1906 - Belainine Billal, Alexsandro Fonseca, Fatiha Sadat, Hakim Lounis:
Semi-supervised learning and social media text analysis towards multi-labeling categorization. 1907-1916 - Tugce Dongel, Yasemin Timar:
B3SafirBiyo: Genomic variant analysis with big data technologies. 1917-1925 - Vasco Furtado, Elizabeth Furtado, Carlos Caminha, André Lopes, Victor Dantas, Caio Ponte, Sofia Cavalcante:
A data-driven approach to help understanding the preferences of public transport users. 1926-1935 - Lovedeep Gondara, Ke Wang:
Recovering loss to followup information using denoising autoencoders. 1936-1945 - Muhittin Isik, Hasan Dag:
A recommender model based on trust value and time decay: Improve the quality of product rating score in E-commerce platforms. 1946-1955 - Maryam Bahojb Imani, Swarup Chandra, Samuel Ma, Latifur Khan, Bhavani Thuraisingham:
Focus location extraction from political news reports with bias correction. 1956-1964 - Kishlay Jha, Guangxu Xun, Vishrawas Gopalakrishnan, Aidong Zhang:
Augmenting word embeddings through external knowledge-base for biomedical application. 1965-1974 - Shady S. Refaat, Amira Mohamed, Haitham Abu-Rub:
Big data impact on stability and reliability improvement of smart grid. 1975-1982 - Ibrahim Kok, Mehmet Ulvi Simsek, Suat Özdemir:
A deep learning model for air quality prediction in smart cities. 1983-1990 - Giannis V. Koumoutsos, Maria Fasli, Ian Lewin, David Milward:
Graph-based information exploration over structured and unstructured data. 1991-2000 - Paula Lauren, Guangzhi Qu, Paul Watta:
Convolutional neural network for clinical narrative categorization. 2001-2008 - Kwan Hui Lim, Shanika Karunasekera, Aaron Harwood:
ClusTop: A clustering-based topic modelling algorithm for twitter using word networks. 2009-2018 - Long Hoang Nguyen, Andrew Salopek, Liang Zhao, Fang Jin:
A natural language normalization approach to enhance social media text reasoning. 2019-2026 - Mustafa V. Nural, Hao Peng, John A. Miller:
Using meta-learning for model type selection in predictive big data analytics. 2027-2036 - Aras Can Onal, Omer Berat Sezer, A. Murat Ozbayoglu, Erdogan Dogdu:
Weather data analysis and sensor fault detection using an extended IoT framework with semantics, big data, and machine learning. 2037-2046 - Yiming Pan, Xuefeng Peng, Tianran Hu, Jiebo Luo:
Understanding what affects career progression using linkedin and twitter data. 2047-2055 - Thomas Papastergiou, Vasileios Megalooikonomou:
A distributed proximal gradient descent method for tensor completion. 2056-2065 - Xuefeng Peng, Yiming Pan, Jiebo Luo:
Predicting high taxi demand regions using social media check-ins. 2066-2075 - Xuefeng Peng, Jiebo Luo, Catherine Glenn, Li-Kai Chi, Jingyao Zhan:
Sleep-deprived fatigue pattern analysis using large-scale selfies from social media. 2076-2084 - Harun Pirim:
Mathematical programming for social network analysis. 2085-2088 - Ali Sekmen, Ahmet Bugra Koku, Mustafa Parlaktuna, Ayad Abdul-Malek, Nagendrababu Vanamala:
Unsupervised deep learning for subspace clustering. 2089-2094 - Ali Sekmen, Akram Aldroubi, Ahmet Bugra Koku, Keaton Hamm:
Principal coordinate clustering. 2095-2101 - Gokberk Serin, M. Ugur Gudelek, A. Murat Ozbayoglu, Hakki Özgür Ünver:
Estimation of parameters for the free-form machining with deep neural network. 2102-2111 - M. Omair Shafiq, Eric Torunski:
Towards MapReduce based Bayesian deep learning network for monitoring big data applications. 2112-2121 - Walid Shalaby, Wlodek Zadrozny:
Mined semantic analysis: A new concept space model for semantic representation of textual data. 2122-2131 - Adisak Sukul, Baskar Gopalakrishnan, Wallapak Tavanapong, David A. M. Peterson:
Online video ad measurement for political science research. 2132-2140 - Fangzhou Sun, Abhishek Dubey, Jules White:
DxNAT - Deep neural networks for explaining non-recurring traffic congestion. 2141-2150 - Imtiaz Ullah, Qusay H. Mahmoud:
A filter-based feature selection model for anomaly-based intrusion detection systems. 2151-2159 - Imtiaz Ullah, Qusay H. Mahmoud:
A hybrid model for anomaly-based intrusion detection in SCADA networks. 2160-2167 - Daniel Xie, Jiejun Xu, Tsai-Ching Lu:
What's trending tomorrow, today: Using early adopters to discover popular posts on Tumblr. 2168-2176 - Zhou Yang, Long Hoang Nguyen, Joshua Stuve, Guofeng Cao, Fang Jin:
Harvey flooding rescue in social media. 2177-2185 - Ozlem Yavanoglu, Murat Aydos:
A review on cyber security datasets for machine learning algorithms. 2186-2193 - Jianbo Yuan, Han Guo, Zhiwei Jin, Hongxia Jin, Xianchao Zhang, Jiebo Luo:
One-shot learning for fine-grained relation extraction via convolutional siamese neural network. 2194-2199 - Semih Yumusak, Riza Emre Aras, Elif Uysal, Erdogan Dogdu, Halife Kodaz, Kasim Oztoprak:
SpEnD portal: Linked data discovery using SPARQL endpoints. 2200-2202 - Philipp Zehnder, Dominik Riemer:
Modeling self-service machine-learning agents for distributed stream processing. 2203-2212 - Bethany G. Anderson, Christopher J. Prom, Kevin Hamilton, James A. Hutchinson, Mark Sammons, Alex Dolski:
The cybernetics thought collective project: Using computational methods to reveal intellectual context in archival material. 2213-2218 - Tobias Blanke, Jon Wilson:
Identifying epochs in text archives. 2219-2224 - Mike Bryant:
GraphQL for archival metadata: An overview of the EHRI GraphQL API. 2225-2230 - Pascal Dugenie, Nuno Freire, Daan Broeder:
Building new knowledge from distributed scientific corpus: HERBADROP & EUROPEANA: Two concrete case studies for exploring big archival data. 2231-2239 - Todd Richard Goodall, Maria Esteva, Sandra Sweat, Alan C. Bovik:
Towards automated quality curation of video collections from a realistic perspective. 2240-2245 - Nicola Horsley:
What can a knowledge complexity approach reveal about big data and archival practice? 2246-2250 - Tim Hutchinson:
Protecting privacy in the archives: Preliminary explorations of topic modeling for born-digital collections. 2251-2255 - Benjamin Charles Germain Lee:
Line detection in binary document scans: A case study with the international tracing service archives. 2256-2261 - Myeong Lee, Yuheng Zhang, Shiyun Chen, Edel Spencer, Jhon Dela Cruz, Hyeonggi Hong, Richard Marciano:
Heuristics for assessing Computational Archival Science (CAS) research: The case of the human face of big data project. 2262-2270 - Victoria L. Lemieux:
A typology of blockchain recordkeeping solutions and some reflections on their implications for the future of archival preservation. 2271-2278 - Ji-Ping Lin:
An infrastructure and application of computational archival science to enrich and integrate big digital archival data: Using Taiwan Indigenous Peoples Open Research Data (TIPD) as an example. 2279-2287 - Nathaniel Payne, Jason R. Baron:
Auto-categorization methods for digital archives. 2288-2298 - T. D. Smith:
The blockchain litmus test. 2299-2308 - William Underwood, Richard Marciano, Sandra Laib, Carl Apgar, Luis Beteta, Waleed Falak, Marisa Gilman, Riss Hardcastle, Keona Holden, Yun Huang, David Baasch, Brittni Ballard, Tricia Glaser, Adam Gray, Leigh Plummer, Zeynep Diker, Mayanka Jha, Aakanksha Singh, Namrata Walanj:
Computational curation of a digitized record series of WWII Japanese-American Internment. 2309-2313 - Darlan Arruda, Nazim H. Madhavji:
Towards a requirements engineering artefact model in the context of big data software development projects: Research in progress. 2314-2319 - David K. Becker:
Predicting outcomes for big data projects: Big Data Project Dynamics (BDPD): Research in progress. 2320-2330 - Nancy W. Grady, Jason A. Payne, Huntley Parker:
Agile big data analytics: AnalyticsOps for data science. 2331-2339 - Mike Lakoju, Alan Serrano:
Saving costs with a big data strategy framework. 2340-2347 - Jeffrey S. Saltz, Ivan Shamshurin:
Does pair programming work in a data science context? An initial case study. 2348-2354 - Jeffrey S. Saltz, Nancy W. Grady:
The ambiguity of data science team roles and the need for a data science workforce framework. 2355-2361 - Toshiyuki Shimono:
Make accumulated data in companies eloquent by SQL statement constructors. 2362-2369 - Shaaban Abbady, Cheng-Yuan Ke, Jennifer Lavergne, Jian Chen, Vijay V. Raghavan, Ryan Benton:
Online mining for association rules and collective anomalies in data streams. 2370-2379 - Junzhi Gong, Tong Yang, Yang Zhou, Dongsheng Yang, Shigang Chen, Bin Cui, Xiaoming Li:
ABC: A practicable sketch framework for non-uniform multisets. 2380-2389 - Vibhuti Gupta, Rattikorn Hewett:
Harnessing the power of hashtags in tweet analytics. 2390-2395 - Ayae Ichinose, Atsuko Takefusa, Hidemoto Nakada, Masato Oguchi:
A study of a video analysis framework using Kafka and spark streaming. 2396-2401 - Ovidiu-Cristian Marcu, Alexandru Costan, Gabriel Antoniu, María S. Pérez-Hernández, Radu Tudoran, Stefano Bortoli, Bogdan Nicolae:
Towards a unified storage and ingestion architecture for stream processing. 2402-2407 - Salman Ahmed Shaikh, Hiroyuki Kitagawa:
Smart distributed query execution over data streams. 2408-2413 - Georgios Touloupas, Ioannis Konstantinou, Nectarios Koziris:
RASP: Real-time network analytics with distributed NoSQL stream processing. 2414-2419 - Qian Zhao, Christian Klaue, Chih Lai:
Predicting concept drift via dynamic Naïve Bayes. 2420-2425 - Hadeel Alghamdi, Farhana H. Zulkernine, Patrick Martin:
Leveraging distributed big data storage support in CLAaaS for WINGS workflow management system. 2426-2432 - Hanieh Alipour, Yan Liu:
Online machine learning for cloud resource provisioning of microservice backend systems. 2433-2441 - Chin-Jung Hsu, Vincent W. Freeh, Flavio Villanustre:
Trilogy: Data placement to improve performance and robustness of cloud computing. 2442-2451 - Bipin Karunakaran, Debdipto Misra, Kyle Marshall, Dhruv Mathrawala, Shravan Kethireddy:
Closing the loop - Finding lung cancer patients using NLP. 2452-2461 - Meike Klettke, Hannes Awolin, Uta Störl, Daniel Müller, Stefanie Scherzinger:
Uncovering the evolution history of data lakes. 2462-2471 - Joichiro Kon, Naoki Mizusawa, Ayaka Umezawa, Saneyasu Yamaguchi, Jian Tao:
Highly consolidated servers with container-based virtualization. 2472-2479 - Leandro Ordoñez-Ante, Thomas Vanhove, Gregory van Seghbroeck, Tim Wauters, Bruno Volckaert, Filip De Turck:
Dynamic data transformation for low latency querying in big data systems. 2480-2489 - Marco Vogt, Alexander Stiemer, Heiko Schuldt:
Icarus: Towards a multistore database system. 2490-2499 - Chenxiao Wang, Jason Arenson, Florian Helff, Le Gruenwald, Laurent d'Orazio:
Improving user interaction in mobile-cloud database query processing. 2500-2507 - Kaihui Zhang, Yusuke Tanimura, Hidemoto Nakada, Hirotaka Ogawa:
Understanding and improving disk-based intermediate data caching in Spark. 2508-2517 - Azim Ahmadzadeh, Dustin J. Kempton, Michael A. Schuh, Rafal A. Angryk:
Improving the functionality of tamura directionality on solar images. 2518-2526 - Sunitha Basodi, Berkay Aydin, Rafal A. Angryk:
Parallel computation of magnetic field parameters from HMI active region patches. 2527-2532 - Soukaina Filali Boubrahimi, Berkay Aydin, Petrus C. Martens, Rafal A. Angryk:
On the prediction of >100 MeV solar energetic particle events using GOES satellite data. 2533-2542 - Shah Muhammad Hamdi, Dustin Kempton, Ruizhe Ma, Soukaina Filali Boubrahimi, Rafal A. Angryk:
A time series classification-based approach for solar flare prediction. 2543-2551 - Ahmet Küçük, Berkay Aydin, Rafal A. Angryk:
Multi-wavelength solar event detection using faster R-CNN. 2552-2558 - Hasan Kurban, Can Kockan, Mark Jenne, Mehmet M. Dalkilic:
Improving expectation maximization algorithm over stellar data. 2559-2568 - Ruizhe Ma, Soukaina Filali Boubrahimi, Shah Muhammad Hamdi, Rafal A. Angryk:
Solar flare prediction using multivariate time series decision trees. 2569-2578 - Simon Marcin, André Csillaghy:
Accelerating scientific algorithms in array databases with GPUs. 2579-2587 - Adrienne Colborne, Michael Smit:
Identifying and mitigating risks to the quality of open data in the post-truth era. 2588-2594 - Matthew L. Dering, Conrad S. Tucker:
Generative adversarial networks for increasing the veracity of big data. 2595-2602 - Junhua Ding, XinChuan Li, Venkat N. Gudivada:
Augmentation and evaluation of training data for deep learning. 2603-2611 - Kim Hee:
Is data quality enough for a clinical decision?: Apply machine learning and avoid bias. 2612-2619 - Alina Lazar, Ling Jin, C. Anna Spurlock, Kesheng Wu, Alex Sim:
Data quality challenges with missing values and mixed types in joint sequence analysis. 2620-2627 - Daniel Muller, Yiea-Funk Te, Pratiksha Jain:
Improving data quality through high precision gender categorization. 2628-2636 - Tim Marple, Bruce A. Desmarais, Kevin L. Young:
Collapsing corporate confusion: Leveraging network structures for effective entity resolution in relational corporate data. 2637-2643 - Shahab Tayeb, Matin Pirouz, Brittany Cozzens, Richard Huang, Maxwell Jay, Kyle Khembunjong, Sahan Paliskara, Felix Zhan, Mark Zhang, Justin Zhan, Shahram Latifi:
Toward data quality analytics in signature verification using a convolutional neural network. 2644-2651 - Yongle Chen, Hui Li, Kejiao Li, Jiyang Zhang:
An improved P2P file system scheme based on IPFS and Blockchain. 2652-2657 - Hui Li, Jiawei Hu, Huajun Ma, Ting Huang:
The architecture of distributed storage system under mimic defense theory. 2658-2663 - Haopeng Li, Hui Li:
A scheduling strategy based on multi-queues of Cassandra. 2664-2669 - Zhili Lin, Kedan Li, Hanxu Hou, Xin Yang, Hui Li:
MDFS: A mimic defense theory based architecture for distributed file system. 2670-2675 - Jiyang Zhang, Hanxu Hou, Kedan Li, Hui Li:
On the implementation of BRS codes in Ceph. 2676-2681 - Mahsa Badami, Olfa Nasraoui, Wenlong Sun, Patrick Shafto:
Detecting polarization in ratings: An automated pipeline and a preliminary quantification on several benchmark data sets. 2682-2690 - Stephen Bonner, John Brennan, Ibad Kureshi, Georgios Theodoropoulos, Andrew Stephen McGough, Boguslaw Obara:
Evaluating the quality of graph embeddings via topological feature reconstruction. 2691-2700 - Wei-Lun Chang:
Using sentiment analysis to explore the degree of risk in sharing economy. 2701-2709 - Hsin-Yu Chen, Cheng-Te Li:
PSEISMIC: A personalized self-exciting point process model for predicting tweet popularity. 2710-2713 - Anahita Davoudi, Mainak Chatterjee:
Detection of profile injection attacks in social recommender systems using outlier analysis. 2714-2719 - Benjamin Flesch, Ravi Vatrapu, Raghava Rao Mukkamala:
A big social media data study of the 2017 german federal election based on social set analysis of political party Facebook pages with SoSeVi. 2720-2729 - K. M. George:
Using an asset price bubble model in tweet analytics. 2730-2739 - Takako Hashimoto, Hiroshi Okamoto, Tetsuji Kuboyama, Kilho Shin:
Topic life cycle extraction from big Twitter data based on community detection in bipartite networks. 2740-2745 - Hsiao-Wei Hu, Ching-Han Cheng, Yun-Chu Chung, Chia-Yu Lee:
Ticket-purchase behavior under the effects of marketing campaigns on facebook fan pages. 2746-2751 - Dijana Kosmajac, Vlado Keselj:
Language identification in multilingual, short and noisy texts using common N-grams. 2752-2759 - Thomas-Joseph Loiseau, Sonia Djebali, Thomas Raimbault, Bérengère Branchet, Gaël Chareyron:
Characterization of daily tourism behaviors based on place sequence analysis from photo sharing websites. 2760-2765 - Gang Wu, Viswanathan Swaminathan, Saayan Mitra, Ratnesh Kumar:
Digital content recommendation system using implicit feedback data. 2766-2771 - Nadiya Straton, Raghava Rao Mukkamala, Ravi Vatrapu:
Big social data analytics for public health: Comparative methods study and performance indicators of health care content on Facebook. 2772-2777 - Tianqi Xia, Xuan Song, Dou Huang, Satoshi Miyazawa, Zipei Fan, Renhe Jiang, Ryosuke Shibasaki:
Outbound behavior analysis through social network data: A case study of Chinese people in Japan. 2778-2786 - Tariq Abughofa, Farhana H. Zulkernine:
Towards online graph processing with spark streaming. 2787-2794 - Maaike de Boer, Barry Nouwt, Michael van Bekkum:
SUDS: System for uncertainty decision support. 2795-2803 - Giuseppe Bruno, Demetrio Condello, Alberto Falzone, Andrea Luciani:
Big data processing: Is there a framework suitable for economists and statisticians? 2804-2811 - Keren Ouaknine, Michael J. Carey:
A performance study of AsterixDB. 2812-2820 - Sheriffo Ceesay, Adam Barker, Blesson Varghese:
Plug and play bench: Simplifying big data benchmarking using containers. 2821-2828 - Wanghu Chen, Xintian Li, Jing Li, Jianwu Wang:
Enhancing the MapReduce training of BP neural networks based on local weight matrix evolution. 2829-2835 - Wei-Chun Chung, Jan-Ming Ho, Chung-Yen Lin, D. T. Lee:
CloudEC: A MapReduce-based algorithm for correcting errors in next-generation sequencing big data. 2836-2842 - Rustem Dautov, Salvatore Distefano:
Quantifying volume, velocity, and variety to support (Big) data-intensive application development. 2843-2852 - Janakiram Dharanipragada, Srikant Padala, Balaji Kammili, Vikram Kumar:
Tula: A disk latency aware balancing and block placement strategy for Hadoop. 2853-2858 - Sina Gholamian, Wojciech M. Golab, Paul A. S. Ward:
Efficient incremental data analytics with apache spark. 2859-2868 - Pei Guo, Jianwu Wang, Zhiyuan Chen:
A comparison of big data application programming approaches: A travel companion case study. 2869-2878 - Andrew Halterman, Jill Irvine, Manar Landis, Phanindra Jalla, Yan Liang, Christan Grant, Mohiuddin Solaimani:
Adaptive scalable pipelines for political event data generation. 2879-2883 - Chengzhi Lu, Kejiang Ye, Guoyao Xu, Cheng-Zhong Xu, Tongxin Bai:
Imbalance in the cloud: An analysis on Alibaba cluster trace. 2884-2892 - Piotr Luszczek, Jakub Kurzak, Ichitaro Yamazaki, David J. Keffer, Jack J. Dongarra:
Scaling point set registration in 3D across thread counts on multicore and hardware accelerator platforms through autotuning for large scale analysis of scientific point clouds. 2893-2902 - Yuri Nishikawa, Hitoshi Sato, Jun Ozawa:
Performance evaluation of multiple sports player tracking system based on graph optimization. 2903-2910 - Pouria Pirzadeh, Michael J. Carey, Till Westmann:
A performance study of big data analytics platforms. 2911-2920 - Vincent Reniers, Dimitri Van Landuyt, Ansar Rafique, Wouter Joosen:
Schema design support for semi-structured data: Finding the sweet spot between NF and De-NF. 2921-2930 - Shanshan Huang, Jungang Xu, Renfeng Liu, Husheng Liao:
A novel compression algorithm decision method for spark shuffle process. 2931-2940 - Lili Xu, Edin Muharemagic, Amy W. Apon:
ECL-watch: A big data application performance tuning tool in the HPCC systems platform. 2941-2950 - Huayi Fang, Baijian Yang, Tonglin Zhang:
Finding the best box-cox transformation from massive datasets on spark. 2951-2960 - Elisa Bertino, Geeth de Mel, Alessandra Russo, Seraphin B. Calo, Dinesh C. Verma:
Community-based self generation of policies and processes for assets: Concepts and research directions. 2961-2969 - Seraphin B. Calo, Emil Lupu, Elisa Bertino, Saritha Arunkumar, Gregory H. Cirincione, Brian Rivera, Alan Cullen:
Research challenges in dynamic policy-based autonomous security. 2970-2973 - Tiziana Catarci, Monica Scannapieco, Marco Console, Camil Demetrescu:
My (fair) big data. 2974-2979 - Supriyo Chakraborty, Wentao Robin Ouyang, Mani B. Srivastava:
LightSpy: Optical eavesdropping on displays using light sensors on mobile devices. 2980-2989 - Emre Göynügür, Murat Sensoy, Geeth de Mel:
Combining semantic web and IoT to reason with health and safety policies. 2990-2997 - Erisa Karafili, Emil C. Lupu, Alan Cullen, Bill Williams, Saritha Arunkumar, Seraphin B. Calo:
Improving data sharing in data rich environments. 2998-3005 - Antara Palit, Mudhakar Srivatsa, Raghu K. Ganti, Christopher Simpkin:
Identifying sensor accesses from service descriptions. 3006-3011 - Seraphin B. Calo, Maroun Touma, Dinesh C. Verma, Alan Cullen:
Edge computing architecture for applying AI to IoT. 3012-3016 - Dinesh C. Verma, Graham A. Bent:
Policy enabled caching for distributed AI. 3017-3023 - Hussain Z. Al-Ajmi:
Case: Big geosciences data validation challenges and achievements. 3024-3030 - Priyaa Thavasimani, Jacek Cala, Paolo Missier:
Why-Diff: Explaining differences amongst similar workflow runs by exploiting scientific metadata. 3031-3041 - Benjamin E. Bagozzi, Ore Koren:
Using machine learning methods to identify atrocity perpetrators. 3042-3051 - Shouji Fujimoto, Atushi Ishikawa, Takayuki Mizuno:
Comparison between spatial distributions of tweet base and population in Japan. 3052-3057 - Masanori Fujita, Hiroto Inoue, Takao Terano:
Evaluating funding programs through network centrality measures of co-author networks of technical papers. 3058-3063 - Kouki Hayashi, Eiichi Umehara, Yuuki Ogawa:
Analysis of twitter messages about the osaka metropolis plan in Japan. 3064-3070 - Ayae Ide, Kazuya Yamashita, Yoichi Motomura, Takao Terano:
Analyzing regional characteristics of living activities of elderly people from large survey data with probabilistic latent spatial semantic structure modeling. 3071-3077 - Akira Ishii, Takayuki Mizuno, Yasuko Kawahata:
Position-sensitive propagation of information on social media using social physics approach. 3078-3085 - Shotaro Ito, Koji Eguchi:
Time dependent analysis of financial networks using supervised latent feature relational models. 3086-3090 - Mitsuki Murase, Masanori Takano, Reiji Suzuki, Takaya Arita:
A statistical analysis of behavioral bursts occurring in a social networking game. 3091-3097 - Daniel Rajchwald, Natasha Markuzon, Edoardo M. Airoldi:
Bias reduction of peer influence effects with latent coordinates and community membership. 3098-3103 - Takuto Sakamoto, Hiroki Takikawa:
Cross-national measurement of polarization in political discourse: Analyzing floor debate in the U.S. the Japanese legislatures. 3104-3110 - Yuya Shibuya:
Mining social media for disaster management: Leveraging social media data for community recovery. 3111-3118 - Jinsei Shima, Mitsuo Yoshida, Kyoji Umemura:
When do users change their profile information on twitter? 3119-3122 - Nadiya Straton, Ravi Vatrapu, Raghava Rao Mukkamala:
Facebook and public health: A study to understand facebook post performance with organizations' strategy. 3123-3132 - Hirohiko Suwa, Yuki Ogawa, Eiichi Umehara, Kento Kakigi, Keiichi Yasumoto, Tatsuo Yamashita, Kota Tsubouchi:
Develop method to predict the increase in the Nikkei VI index. 3133-3138 - Masanori Takano, Hiroki Mizukami, Fujio Toriumi, Makoto Takeuchi, Kazuya Wada, Masahiro Yasuda, Ichiro Fukiida:
Analysis of the changes in listening trends of a music streaming service. 3139-3142 - Hiroki Takikawa, Kikuko Nagayoshi:
Political polarization in social media: Analysis of the "Twitter political field" in Japan. 3143-3150 - Toshimichi Wakabayashi, Yasuko Kawahata, Akira Ishii:
Analysis of EXILE TRIBE in the music scene using mathematical model of hit phenomenon. 3151-3155 - Kenta Yamada, Takayuki Mizuno:
Relationships between market impact characteristics and order book properties. 3156-3161 - Kenta Yamada:
Detecting two types of seasonal words using simple autocorrelation analysis. 3162-3167 - Take Yo, Kazutoshi Sasahara:
Inference of personal attributes from tweets using machine learning. 3168-3174 - Jacob Bolewski, Stavros Papadopoulos:
Managing massive multi-dimensional array data with TileDB: - Invited demo paper. 3175-3176 - Subhasis Dasgupta, Charles McKay, Amarnath Gupta:
Generating polystore ingestion plans - A demonstration with the AWESOME system. 3177-3179 - Hayden Jananthan, Ziqi Zhou, Vijay Gadepally, Dylan Hutchison, Suna Kim, Jeremy Kepner:
Polystore mathematics of relational algebra. 3180-3189 - Yasar Khan, Antoine Zimmermann, Alokkumar Jha, Dietrich Rebholz-Schuhmann, Ratnesh Sahay:
Querying web polystores. 3190-3195 - Antonios Makris, Konstantinos Tserpes, Dimosthenis Anagnostopoulos:
A novel object placement protocol for minimizing the average response time of get operations in distributed key-value stores. 3196-3205 - Jonathan Rivers:
SciDB: An array-native computational database for heterogeneous, multi-dimensional data sets. 3206-3210 - Ran Tan, Rada Chirkova, Vijay Gadepally, Timothy G. Mattson:
Enabling query processing across heterogeneous data models: A survey. 3211-3220 - Ashwin Kumar Vajantri, Kunwar Deep Singh Toor, Edmon Begoli, Jack Bates:
An apache calcite-based polystore variation for federated querying of heterogeneous healthcare sources. 3221-3227 - Jose Luis, Guerrero Cusumano:
A detection mechanism with text mining cross correlation approach. 3228-3232 - Gürdal Ertek, Xu Chi, Allan N. Zhang, Sobhan Asian:
Text mining analysis of wind turbine accidents: An ontology-based framework. 3233-3241 - Aloysious J. L. Lee, D. Paul, W. J. Yan, Allan N. Zhang, Mark Goh:
A model for analysing a disrupted supply chain's time-to-recovery under uncertainty. 3242-3247 - Yong Oh Lee, Jun Jo, Jongwoon Hwang:
Application of deep neural network and generative adversarial network to industrial maintenance: A case study of induction motor fault detection. 3248-3253 - Haoye Lu, Anand Srinivasan, Amiya Nayak:
Learning automata based method for solving demand and supply problem with periodic behaviors. 3254-3260 - Nigel Pugh, Lauren B. Davis:
Forecast and analysis of food donations using support vector regression. 3261-3267 - Murat Mustafa Tunç, Alexandru Valcov, Allan N. Zhang, Wenjing Yan, Rong Wen:
Association analysis of supply chain risk and company sales. 3268-3277 - Rong Wen, Wenjing Yan, Allan N. Zhang:
Adaptive spatio-temporal mining for route planning and travel time estimation. 3278-3284 - Yi-Hsin Wu, Sheng-De Wang, Li-Jung Chen, Cheng-Juei Yu:
Streaming analytics processing in manufacturing performance monitoring and prediction. 3285-3289 - Dazhi Yang, Allan N. Zhang, Wenjing Yan:
Performing literature review using text mining, Part I: Retrieving technology infrastructure using Google Scholar and APIs. 3290-3296 - Dazhi Yang, Jihoon Hong:
Performing literature review using text mining, Part II: Expanding domain knowledge with abbreviation identification. 3297-3301 - Md. Maksudul Alam, Kalyan S. Perumalla:
GPU-based parallel algorithm for generating massive scale-free networks using the preferential attachment model. 3302-3311 - Md Hasanuzzaman Bhuiyan, Maleq Khan, Madhav V. Marathe:
A parallel algorithm for generating a random graph with a prescribed degree sequence. 3312-3321 - Florian Demesmaeker, Amine Ghrab, Siegfried Nijssen, Sabri Skhiri:
Discovering interesting patterns in large graph cubes. 3322-3331 - Colleen Heinemann, Talita Perciano, Daniela Ushizima, E. Wes Bethel:
Distributed memory parallel Markov random fields using graph partitioning. 3332-3341 - Weiyi Liu, Toyotaro Suzumura, Lingli Chen, Guangmin Hu:
A generalized incremental bottom-up community detection framework for highly dynamic graphs. 3342-3351 - Hannu Reittu, Ilkka Norros:
Regular decomposition of large graphs and other structures: Scalability and robustness towards missing data. 3352-3357 - Xiangnan Ren, Olivier Curé, Hubert Naacke, Jérémy Lhez, Ke Li:
StriderR: Massive and distributed RDF graph stream reasoning. 3358-3367 - Akira Tanaka, Nozomi Hata, Nariaki Tateiwa, Katsuki Fujisawa:
Practical approach to evacuation planning via network flow and deep learning. 3368-3377 - Adil Alim, Aparna Joshi, Feng Chen, Catherine T. Lawson:
Techniques for efficient detection of rapid weather changes and analysis of their impacts on a highway network. 3378-3387 - Elena Baralis, Andrea Dalla Valle, Paolo Garza, Claudio Rossi, Francesco Scullino:
SQL versus NoSQL databases for geospatial applications. 3388-3397 - Savitha Baskaran, Shiaofen Fang, Shenhui Jiang:
Spatiotemporal visualization of traffic paths using color space time curve. 3398-3405 - Peter Baumann, Eric Hirschorn, Joan Masó, Vlad Merticariu, Dimitar Misev:
All in One: Encoding spatio-temporal big data in XML, JSON, and RDF without information loss. 3406-3415 - Thaleia Dimitra Doudali, Ioannis Konstantinou, Nectarios Koziris:
Spaten: A spatio-temporal and textual big data generator. 3416-3421 - Ronald D. Hagan, Charles A. Phillips, Michael A. Langston, Bradley J. Rhodes:
Multiscale graph theoretical tools reveal subtle patterns in big geospatial data. 3422-3425 - Masahiko Itoh, Daisaku Yokoyama, Masashi Toyoda, Masaru Kitsuregawa:
Optimal viewpoint finding for 3D visualization of spatio-temporal vehicle trajectories on caution crossroads detected from vehicle recorder big data. 3426-3434 - Kulsawasd Jitkajornwanich, Peerapon Vateekul, Teerapong Panboonyuen, Siam Lawawirojwong, Siwapon Srisonphan:
Road map extraction from satellite imagery using connected component analysis and landscape metrics. 3435-3442 - Sangchul Kim, Junhee Lee, Taehoon Kim, Bongki Moon:
Scalable parallel data loading in SciDB. 3443-3446 - Zhicheng Liu, Jun Cao, Junyan Yang, Qiao Wang:
Discovering dynamic patterns of urban space via semi-nonnegative matrix factorization. 3447-3453 - Adway Mitra:
Identifying coherent anomalies in multi-scale spatio-temporal data using Markov random fields. 3454-3460 - Rene Richard, Suprio Ray:
A tale of two cities: Analyzing road accidents with big spatial data. 3461-3470 - Victor Saquicela, Luis Manuel Vilches Blázquez, Andres Tello:
Challenges and trends about smart big geospatial data: A position paper. 3471-3475 - Purnima Shah, Deepak B. Hiremath, Sanjay Chaudhary:
Towards development of spark based agricultural information system including geo-spatial data. 3476-3481 - Dongbo Zhou, Hao Li, Sannyuya Liu, Bo Song, Xiaohua Tony Hu:
A map-based visual analysis method for patterns discovery of mobile learning in education with big data. 3482-3491 - Mehdi Assefi, Ehsun Behravesh, Guangchi Liu, Ahmad Pahlavan Tafti:
Big data machine learning using apache spark MLlib. 3492-3498 - Christophe Cérin, Jean-Luc Gaudiot, Mustapha Lebbah, Foutse Yuehgoh:
Return of experience on the mean-shift clustering for heterogeneous architecture use case. 3499-3507 - Alex Kaplunovich, Yelena Yesha:
Cloud big data decision support system for machine learning on AWS: Analytics of analytics. 3508-3516 - Hui Zhang, Yiwen Zhong, Juan Lin:
Divide-and-conquer strategies for large-scale simulations in R. 3517-3523 - Mihaela Malita, Gheorghe M. Stefan:
Map-scan node accelerator for big-data. 3524-3529 - Cuong Nguyen, Charles Lovering, Rodica Neamtu:
Ranked time series matching by interleaving similarity distances. 3530-3539 - Sergiy Peredriy, Deovrat Kakde, Arin Chaudhuri:
Kernel bandwidth selection for SVDD: The sampling peak criterion method for large data. 3540-3549 - Hong Yan, Zhongqiang Zhang, Jian Zou:
An online spatio-temporal model for inference and predictions of taxi demand. 3550-3557 - Halim Abbas, Ford Garberson, Eric Glover, Dennis P. Wall:
Machine learning for early detection of autism (and other conditions) using a parental questionnaire and home video screening. 3558-3561 - Ravi Santosh Arvapally, Hasan Hicsasmaz, Wally Lo Faro:
Artificial intelligence applied to challenges in the fields of operations and customer support. 3562-3569 - Ricardo Baeza-Yates:
Semantic search (invited talk). 3570 - Richard Boire:
Artificial intelligence(AI), automation, and its impact on data science. 3571-3574 - Yong Cai, Shaorong Liu, Jinlong Hu, Guihong Bai, Shoubin Dong:
A hybrid bipartite graph based recommendation algorithm for mobile games. 3575-3582 - Brian Johnston, Benjamin Zweig, Michael Peran, Charlie Wang, Rachel Rosenfeld:
Estimating skill fungibility and forecasting services labor demand. 3583-3585 - Eva K. Lee:
Innovation in big data analytics: Applications of mathematical programming in medicine and healthcare. 3586-3595 - Srishty Saha, Karuna P. Joshi, Renee Frank, Michael Aebig, Jiayong Lin:
Automated knowledge extraction from the federal acquisition regulations system (FARS). 3596-3603 - Paul Squires, Harold G. Kaufman, Julian Togelius, Catalina M. Jaramillo:
A comparative sequence analysis of career paths among knowledge workers in a multinational bank. 3604-3612 - Xin Xu Lei, Tang Venkat Rangan:
Hitting your number or not? A robust & intelligent sales forecast system. 3613-3622 - Atsushi Yamada, Michael Peran:
Governance framework for enterprise analytics and data. 3623-3631 - Anja Evelyn Amundsen, Kenneth M. Ovens:
Forensics analysis of Wi-Fi communication traces in mobile devices. 3632-3637 - Sreyasee Das Bhattacharjee, Bala Venkatram Balantrapu, William J. Tolone, Ashit Talukder:
Identifying extremism in social media with multi-view context-aware subset optimization. 3638-3647 - Isuf Deliu, Carl Leichter, Katrin Franke:
Extracting cyber threat intelligence from hacker forums: Support vector machines versus convolutional neural networks. 3648-3656 - Asif Iqbal, Mathias Ekstedt, Hanan Alobaidli:
Exploratory studies into forensic logs for criminal investigation using case studies in industrial control systems in the power sector. 3657-3661 - Pierre Lison, Vasileios Mavroeidis:
Neural reputation models learned from passive DNS data. 3662-3671 - Andrii Shalaginov, Jan William Johnsen, Katrin Franke:
Cyber crime investigations in the era of big data. 3672-3676 - Shih-Chieh Su:
Topical behavior prediction from massive logs. 3677-3683 - Peter Xenopoulos:
Introducing DeepBalance: Random deep belief network ensembles to address class imbalance. 3684-3689 - Haohua Sun Yin, Ravi Vatrapu:
A first estimation of the proportion of cybercriminal entities in the bitcoin ecosystem using supervised machine learning. 3690-3699 - Joshua Sablatura, Bing Zhou:
Forensic database reconstruction. 3700-3704 - Conrad Bielski, V. O'Brien, C. Whitmore, Kaisa Riikka Ylinen, I. Juga, P. Nurmi, Juha Pekka Kilpinen, I. Porras, J. M. Sole, P. Gamez, M. Navarro, Azra Alikadic, Andrea Gobbi, Cesare Furlanello, Gunter Zeug, M. Weirathe, J. Martinez, R. Yuste, S. Castro, V. Moreno, T. Velin, Claudio Rossi:
Coupling early warning services, crowdsourcing, and modelling for improved decision support and wildfire emergency management. 3705-3712 - Luca Cagliero:
Summarization of emergency news articles driven by relevance feedback. 3713-3721 - Evelina Di Corso, Francesco Ventura, Tania Cerquitelli:
All in a twitter: Self-tuning strategies for a deeper understanding of a crisis tweet collection. 3722-3726 - Antonella Frisiello, Quynh Nhu Nguyen, Claudio Rossi:
Gamified crowdsourcing for disaster risk management. 3727-3733 - Andrea Gobbi, Azra Alikadic, Kaisa Riikka Ylinen, Federico Angaramo, Cesare Furlanello:
A heat wave forecast system for Europe. 3734-3738 - Jacopo Longhini, Claudio Rossi, Claudio Casetti, Federico Angaramo:
A language-agnostic approach to exact informative tweets during emergency situations. 3739-3475 - Laura Lopez-Fuentes, Claudio Rossi, Harald Skinnemoen:
River segmentation for flood monitoring. 3746-3749 - Timothy Nugent, Fabio Petroni, Natraj Raman, Lucas Carstens, Jochen L. Leidner:
A comparison of classification models for natural disaster and critical event detection from news. 3750-3759 - Jasmin Pielorz, Matthias Prandtstetter, Markus Straub, Christoph H. Lampert:
Optimal geospatial volunteer allocation needs realistic distances. 3760-3763 - Tomoichi Takahashi, Katsuki Ichinose:
Crowd control and evacuation guidance based on simulations. 3764-3768 - Francesco Tarasconi, Michela Farina, Antonio Mazzei, Alessio Bosca:
The role of unstructured data in real-time disaster-related social media monitoring. 3769-3778 - Luca Venturini, Evelina Di Corso:
Analyzing spatial data from twitter during a disaster. 3779-3783 - Marco Brambilla, Paolo Mascetti, Andrea Mauri:
Comparison of different driving style analysis approaches based on trip segmentation over GPS information. 3784-3791 - Qian Fu, John M. Easton:
Understanding data quality: Ensuring data quality by design in the rail industry. 3792-3799 - Emmanuel Nii Martey, Ahmed Lasisi, Nii O. Attoh-Okine:
Track geometry big data analysis: A machine learning approach. 3800-3809 - Federico Perrotta, Tony Parry, Luís C. Neves:
Application of machine learning for fuel consumption modelling of trucks. 3810-3815 - Gene P. K. Wu, Keith C. C. Chan:
Privacy-preserving trajectory classification of driving trip data based on pattern discovery techniques. 3816-3825 - Jerzy Bala, Michael Kellar, Fred Ramberg:
Predictive analytics for litigation case management. 3826-3830 - Han Qin, Kit Riehle, Haozhen Zhao:
Using google analytics to support cybersecurity forensics. 3831-3834 - Thanasis Schoinas, Ghulam Qadir:
A feasibility experiment on the application of predictive coding to instant messaging corpora. 3835-3840 - Alexander Acker, Florian Schmidt, Anton Gulenko, Reinhard Kietzmann, Odej Kao:
Patient-individual morphological anomaly detection in multi-lead electrocardiography data streams. 3841-3846 - Fahima Amin Bhuyan, Shiyong Lu, Ishtiaq Ahmed, Jia Zhang:
Predicting efficacy of therapeutic services for autism spectrum disorder using scientific workflows. 3847-3856 - Elham Hassanain:
A multimedia big data retrieval framework to detect dyslexia among children. 3857-3860 - Wei Hong Lee, En Tzu Wang, Arbee L. P. Chen:
Mining accompanying relationships between diseases from patient records. 3861-3868 - Ning Liu, Soundar R. T. Kumara, Eric Reich:
Explainable data-driven modeling of patient satisfaction survey data. 3869-3876 - Goutam Mylavarapu, Johnson P. Thomas:
A multi-task machine learning approach for comorbid patient prioritization. 3877-3881 - Xianjun Shen, Xianchao Zhu, Xingpeng Jiang, Li Gao, Tingting He, Xiaohua Hu:
Visualization of non-metric relationships by adaptive learning multiple maps t-SNE regularization. 3882-3887 - Ahmad Pahlavan Tafti, Ehsun Behravesh, Mehdi Assefi, Eric LaRose, Jonathan C. Badger, John Mayer, AnHai Doan, David Page, Peggy L. Peissig:
bigNN: An open-source big data toolkit focused on biomedical sentence classification. 3888-3896 - Shahab Tayeb, Matin Pirouz, Johann Sun, Kaylee Hall, Andrew Chang, Jessica Li, Connor Song, Apoorva Chauhan, Michael Ferra, Theresa Sager, Justin Zhan, Shahram Latifi:
Toward predicting medical conditions using k-nearest neighbors. 3897-3903 - Anuja Tike, Sanket Tavarageri:
A medical price prediction system using hierarchical decision trees. 3904-3913 - Iulian Voicu, Denis Kouame:
High dimensional data processing for fetal activity evaluation. 3914-3915 - Lina Yu, Hengle Jiang, Hongfeng Yu, Chi Zhang, Josiah Mcallister, Dandan Zheng:
iVAR: Interactive visual analytics of radiomics features from large-scale medical images. 3916-3923 - Xin Deng:
Big data technology and ethics considerations in customer behavior and customer feedback mining. 3924-3927 - Duyen Do, Phuc Huynh, Phuong Vo, Tu Vu:
Customer churn prediction in an internet service provider. 3928-3933 - Michael Kranzlein, Dan Chia-Tien Lo:
Training on the poles for review sentiment polarity classification. 3934-3937 - Pegah Nokhiz, Fengjun Li:
Understanding rating behavior based on moral foundations: The case of Yelp reviews. 3938-3945 - Yixuan Qiu, Wutao Wei:
A scalable sequential principal component analysis algorithm (SeqPCA) with application to user access control analysis. 3946-3754 - Ross Smith:
Towards an ethical application of customer feedback data. 3955-3957 - Wutao Wei, Le Zhang, Qi Ding, Bingrou Zhou:
Dynamic Bayesian predictive model for box office forecasting. 3958-3964 - Donghui Wu:
A big data analytics framework for forecasting rare customer complaints: A use case of predicting MA members' complaints to CMS. 3965-3967 - Yizhou Zang, Xiaohua Hu:
Heterogeneous knowledge transfer via domain regularization for improving cross-domain collaborative filtering. 3968-3974 - Paulo S. C. Alencar, Donald D. Cowan, Douglas W. Mulholland, Bruce MacVicar, Simon Courtenay, Stephen Murphy, Fred McGarry:
iEnvironment: A software platform for integrated environmental monitoring and modeling of surface water. 3975-3978 - Rumi Chunara:
New data paradigms: From the crowd and back. 3979-3980 - Holden Karau:
Unifying the open big data world: The possibilities∗ of apache BEAM. 3981 - Georgia D. Tourassi:
Deep learning enabled national cancer surveillance. 3982-3983 - Lee Wilson, Adrienne Colborne, Michael Smit:
Preparing data managers to support open ocean science: Required competencies, assessed gaps, and the role of experiential learning. 3984-3993 - Xuan Zhou, Wenjun Wu, Yong Han:
Modeling multiple subskills by extending knowledge tracing model using logistic regression. 3994-4003 - Tsumugi Tairaku, Akihiro Nakao, Saneyasu Yamaguchi, Masato Oguchi:
Application specific traffic control using network virtualization node in large-scale disasters. 4004-4009 - Martino Trevisan, Idilio Drago, Marco Mellia, Maurizio M. Munafò:
Automatic detection of DNS manipulations. 4010-4015 - Luca Vassio, Marco Mellia, Flavio V. D. de Figueiredo, Ana Paula Couto da Silva, Jussara M. Almeida:
Mining and modeling web trajectories from passive traces. 4016-4021 - Richard de Groof, Haiping Xu:
Automatic topic discovery of online hospital reviews using an improved LDA with Variational Gibbs Sampling. 4022-4029 - Noriaki Koide, Yu Ichifuji:
Fragrance to vector as scent technology. 4030-4034 - Deepak Kumar, Chetan Kumar, Ming Shao:
Cross-database mammographic image analysis through unsupervised domain adaptation. 4035-4042 - Christine Bassem, Azer Bestavros:
GuideMe: Routes coordination of participating agents in mobile crowd sensing platforms. 4043-4049 - Yimin Chen, Jin Wen:
A whole building fault detection using weather based pattern matching and feature based PCA method. 4050-4057 - Donald D. Cowan, Paulo S. C. Alencar, Kyle Young, Bryan Smale, Ryan Erb, Fred McGarry:
A model for the socially smart city practical uses of city-level socio-economic indicators. 4058-4067 - Mickael Figueredo, Nélio Cacho, Antonio Thome, Andréa Cacho, Frederico Lopes, Maria Valeria Araujo:
Using social media photos to identify tourism preferences in smart tourism destination. 4068-4073 - Paul G. Flikkema, Morgan Vigil-Hayes:
Self-adaptive and resilient urban networking infrastructure for disasters and smart city services. 4074-4079 - Kyoichi Ito, Masaki Ito, Kosuke Miyazaki, Keishi Tanimoto, Kaoru Sezaki:
Data analysis on train transportation data with nonnegative matrix factorization. 4080-4085 - Anderson Araujo, Rubem Kalebe, Gustavo Girão, Itamir Filho, Kayo Goncalves, Bianor Neto:
Reliability analysis of an IoT-based smart parking application for smart cities. 4086-4091 - Makoto Kawano, Kazuhiro Mikami, Satoshi Yokoyama, Takuro Yonezawa, Jin Nakazawa:
Road marking blur detection with drive recorder. 4092-4097 - Yasue Kishino, Koh Takeuchi, Yoshinari Shirai, Futoshi Naya, Naonori Ueda:
Datafying city: Detecting and accumulating spatio-temporal events by vehicle-mounted sensors. 4098-4104 - Takahiro Komamizu, Jin Nakazawa, Toshiyuki Amagasa, Hiroyuki Kitagawa, Hideyuki Tokuda:
Analytical toolbox for smart city applications: Garbage collection log use case. 4105-4110 - Shuhua Liu, Patrick Jansson:
City event detection from social media with neural embeddings and topic model visualization. 4111-4116 - Zohreh Pourzolfaghar, Markus Helfert, Viviana Angely Bastidas Melo, Ahmad Khalilijafarabad:
Proposing an access gate to facilitate knowledge exchange for smart city services. 4117-4122 - Naoya Shibahara, Ryoma Kondo, Masayuki Iwai:
MM360: A GPS-assisted 360-degree video sharing system for participatory events. 4123-4127 - Jonathan Creighton, Farhana H. Zulkernine:
Towards building a hybrid model for predicting stock indexes. 4128-4133 - Dongmei Guo, Jialong Zheng, Xiaolan Yang:
Agglomeration, network and urban development - - A study on newspaper connection network index of cities. 4134-4141 - Lin Huo, Xiaoli Sun:
An augmented fama and french three-factor model using social interaction. 4142-4147 - Quan Jin, Kun Guo, Yi Sun:
Stock price forecasting using support vector regression: Based on network behavior data. 4148-4153 - Daniel Muller, Yiea-Funk Te:
Insurance premium optimization using motor insurance policies - A business growth classification approach. 4154-4158 - Daniel Muller, Yiea-Funk Te, Pratiksha Jain:
Predicting business performance through patent applications. 4159-4164 - Shaolong Sun, Shouyang Wang, Yunjie Wei, Xianduan Yang, Kwok-Leung Tsui:
Forecasting tourist arrivals with machine learning and internet search index. 4165-4169 - Minggang Wang, André L. M. Vilela, Lixin Tian, Hua Xu, Ruijin Du:
A new time series prediction method based on complex network theory. 4170-4175 - Jinxin Wang, Wei Shang, Zhengyang Liu, Shouyang Wang:
An enhanced LGSA-SVM for S&P 500 index forecast. 4176-4183 - Yunjie Wei, Xun Zhang, Shouyang Wang:
Can search data help forecast inflation? Evidence from a 13-country panel. 4184-4188 - Qingqing Zhang, Darren Jian, Rui Xu, Wei Dai, Ying Liu:
Integrating heterogeneous data sources for traffic flow prediction through extreme learning machine. 4189-4194 - Guihuan Zheng, Qikun Yao, Xingfen Wang, Zhou Yang:
The construction and application of expectations index on monetary policy. 4199-4203 - Giuseppe Bruno, Demetrio Condello, Alberto Falzone, Andrea Luciani:
Big data processing: Is there a framework suitable for economists and statisticians? 4204-4211 - Anne M. Denton, Arighna Roy:
Cluster-overlap algorithm for assessing preprocessing choices in environmental sustainability. 4212-4220 - Chu-hua Kuei, Christian N. Madu, Picheng Lee:
Critical enablers of sustainable water management (SWM): Text evidences from 10 countries. 4221-4227 - Aki-Hiro Sato:
Characterization of cities based on world grid square statistics about specific properties. 4228-4237 - Aki-Hiro Sato, Shoki Nishimura, Hiroe Tsubaki:
World grid square codes: Definition and an example of world grid square data. 4238-4247 - Hiroshi Tsuda, Masakazu Ando, Yu Ichifuji:
Statistical analysis of hotel plan popularity in regional tourist areas. 4248-4254 - Craig S. Wright, Antoaneta Serguieva:
Sustainable blockchain-enabled services: Smart contracts. 4255-4264 - Ailun Ye, Venkata L. Raju Chinthalapati, Antoaneta Serguieva, Edward P. K. Tsang:
Developing sustainable trading strategies using directional changes with high frequency data. 4265-4271 - Arunkumar Bagavathi, Pranava Mummoju, Katarzyna A. Tarnowska, Angelina A. Tzacheva, Zbigniew W. Ras:
SARGS method for distributed actionable pattern mining using spark. 4272-4281 - I-Cheng Chang, Yudi Pratama Halim, Chun-Man Lin:
Vehicle path estimation using dual-level clustering and multi-source prediction. 4282-4286 - Helena F. Deus, Corey A. Harper, Darin McBeath, Ron Daniel Jr.:
Combining pattern matching with word embeddings for the extraction of experimental variables from scientific literature. 4287-4292 - Kulsawasd Jitkajornwanich, Peerapon Vateekul, Upa Gupta, Teeranai Kormongkolkul, Arnon Jirakittayakorn, Siam Lawawirojwong, Siwapon Srisonphan:
Ocean surface current prediction based on HF radar observations using trajectory-oriented association rule mining. 4293-4300 - Liling Li, Tyler Danner, Jesse Eickholt, Erin McCann, Kevin Pangle, Nicholas Johnson:
A distributed pipeline for DIDSON data processing. 4301-4306 - Tse-Yu Pan, Yi-Zhu Dai, Wan-Lun Tsai, Min-Chun Hu:
Deep model style: Cross-class style compatibility for 3D furniture within a scene. 4307-4313 - A. Aziz Altowayan, Ashraf Elnagar:
Improving Arabic sentiment analysis with sentiment-specific embeddings. 4314-4320 - Jose Berengueres, Dani Castro:
Differences in emoji sentiment perception between readers and writers. 4321-4328 - Patrick Jansson, Shuhua Liu:
Topic modelling enriched LSTM models for the detection of novel and emerging named entities from social media. 4329-4336 - Bingjing Jia, Bin Wu, Jinna Lv, Pengpeng Zhou, Yao Bu, Ying Xing:
An entity disambiguation method based on LeaderRank. 4337-4342 - Nicolai Pogrebnyakov, Edgar A. Maldonado:
Identifying emergency stages in facebook posts of police departments with convolutional and recurrent neural networks and support vector machines. 4343-4352 - Ian Stewart, Stevie Chancellor, Munmun De Choudhury, Jacob Eisenstein:
#Anorexia, #anarexia, #anarexyia: Characterizing online community practices with orthographic variation. 4353-4361 - Joseph A. Cottam, Leslie M. Blaha, Dimitri Zarzhitsky, Mathew Thomas, Elliott Skomski:
Crossing the Streams: Fuzz testing with user input. 4362-4371 - Xiaoni Duan, Keishi Tajima:
Improving classification accuracy in crowdsourcing through hierarchical reorganization. 4372-4374 - Yuzuki Furuhashi, Masaki Matsubara, Atsuyuki Morishima:
Crowd-based best-effort number estimation. 4375-4377 - Austin Graham, Yan Liang, Le Gruenwald, Christan Grant:
[Research paper] formalizing interruptible algorithms for human over-the-loop analytics. 4378-4383 - Munenari Inoguchi, Keiko Tamura, Kei Horie, Haruo Hayashi:
Clarifying the transition of workload for victims life reconstruction support programs in affected local governments using the victims master database - Comparison between the 2007 Chuetsu-oki earthquake and the 2016 Kumamoto Earthquake-. 4384-4388 - Masahiro Kazama, Viviane Takahashi:
Active preference learning for generative adversarial networks. 4389-4393 - Naoki Kobayashi, Masaki Matsubara, Keishi Tajima, Atsuyuki Morishima:
A crowd-in-the-loop approach for generating conference programs with microtasks. 4394-4396 - Koyo Kobayashi, Hidehiko Shishido, Yoshinari Kameda, Itaru Kitahara:
Method to generate disaster-damage map using 3D photometry and crowd sourcing. 4397-4399 - Takahiro Komamizu, Toshiyuki Amagasa, Hiroyuki Kitagawa:
Implicit order join: Joining log data with property data by discovering implicit order-oriented keys with human assistance. 4400-4406 - Mamiko Matsubayashi, Keiko Kurata:
Conceptual design for comprehensive research support platform: Successful research data management generating big data from little data. 4407-4409 - Yoshitaka Matsuda, Yu Suzuki, Satoshi Nakamura:
A trade-off between estimation accuracy of worker quality and task complexity. 4410-4416 - Hiroki Morise, Satoshi Oyama, Masahito Kurihara:
Collaborative filtering and rating aggregation based on multicriteria rating. 4417-4422 - Michalis Papakostas, Konstantinos Tsiakas, Theodoros Giannakopoulos, Fillia Makedon:
Towards predicting task performance from EEG signals. 4423-4425 - Hidehiko Shishido, Yutaka Ito, Youhei Kawamura, Toshiya Matsui, Atsuyuki Morishima, Itaru Kitahara:
Proactive preservation of world heritage by crowdsourcing and 3D reconstruction technology. 4426-4428 - Panote Siriaraya, Yuriko Yamaguchi, Mimpei Morishita, Yoichi Inagaki, Reyn Y. Nakamoto, Jianwei Zhang, Junichi Aoi, Shinsuke Nakajima:
Using categorized web browsing history to estimate the user's latent interests for web advertisement recommendation. 4429-4434 - Keiko Tamura, Naoshi Hirata:
"DEKATSU" activity of data and service collaboration among private companies and academic institutions for Tokyo metropolitan resilience project. 4435-4437 - Agniva Banerjee, Karuna Pande Joshi:
Link before you share: Managing privacy policies through blockchain. 4438-4447 - Ruth Bearden, Dan Chia-Tien Lo:
Automated microsoft office macro malware detection using machine learning. 4448-4452 - Alina Campan, Alfredo Cuzzocrea, Traian Marius Truta:
Fighting fake news spread in online social networks: Actual trends and future research directions. 4453-4457 - Anthony Carella, Murat Kotsoev, Traian Marius Truta:
Impact of security awareness training on phishing click-through rates. 4458-4466 - Alfredo Cuzzocrea, Hossain Shahriar:
Data masking techniques for NoSQL database security: A systematic review. 4467-4473 - Alfredo Cuzzocrea, Fabio Martinelli, Francesco Mercaldo, Gianni Viardo Vercelli:
Tor traffic analysis and detection via machine learning techniques. 4474-4480 - Anirban Das, Min-Yi Shen, Jisheng Wang:
Modeling user communities for identifying security risks in an organization. 4481-4486 - Philip Derbeko, Shlomi Dolev, Ehud Gudes, Jeffrey D. Ullman:
Efficient and private approximations of distributed databases calculations. 4487-4496 - Kangsoo Jung, Seog Park:
Collaborative caching techniques for privacy-preserving location-based services in peer-to-peer environments. 4497-4506 - Haya Shajaiah, Ahmed Abdelhadi, Charles Clancy:
Secure power scheduling auction for smart grids using homomorphic encryption. 4507-4512 - Ugur Sopaoglu, Osman Abul:
A top-down k-anonymization implementation for apache spark. 4513-4521 - Shahab Tayeb, Matin Pirouz, Gabriel Esguerra, Kimiya Ghobadi, Jimson Huang, Robin Hill, Derwin Lawson, Stone Li, Tiffany Zhan, Justin Zhan, Shahram Latifi:
Securing the positioning signals of autonomous vehicles. 4522-4528 - Trishita Tiwari, Ata Turk, Alina Oprea, Katzalin Olcoz, Ayse K. Coskun:
User-profile-based analytics for detecting cloud security breaches. 4529-4535 - Conrad M. Albrecht, Marcus Freitag, Theodore G. van Kessel, Siyuan Lu, Hendrik F. Hamann:
Event clustering & event series characterization on expected frequency. 4536-4541 - Roger N. Anderson:
'Petroleum Analytics Learning Machine' for optimizing the Internet of Things of today's digital oil field-to-refinery petroleum system. 4542-4545 - Hung Cao, Monica Wachowicz, Sangwhan Cha:
Developing an edge computing platform for real-time descriptive analytics. 4546-4554 - Domitille Couloumb, Charbel El Kaed, Ayush Garg, Chris Healey, Jonathan Healey, Stuart Sheehan:
Energy efficiency driven by a storage model and analytics on a multi-system semantic integration. 4555-4561 - Aurora González-Vidal, Alfonso P. Ramallo-González, Fernando Terroso-Saenz, Antonio F. Skarmeta:
Data driven modeling for energy consumption prediction in smart buildings. 4562-4569 - Christoph A. Keller, Mathew J. Evans, J. Nathan Kutz, Steven Pawson:
Machine learning and air quality modeling. 4570-4576 - Theodore G. van Kessel, Ramachandran Muralidhar, Josephine B. Chang, Jun-Song Wang, Michael A. Schappert, Hendrik F. Hamann:
A low maintenance particle pollution sensing system using the Minimum Airflow Particle Counter (MAPC). 4577-4582 - Levente J. Klein, Theodore G. van Kessel, Dhruv Nair, Ramachandran Muralidhar, Nigel Hinds, Hendrik F. Hamann, Norma E. Sosa:
Distributed wireless sensing for fugitive methane leak detection. 4583-4591 - Joshua Lieberman, Alan Leidner, George Percivall, Carsten Rönsdorf:
Using big data analytics and IoT principles to keep an eye on underground infrastructure. 4592-4601 - Aekyeung Moon, Jaeyoung Kim, Jialing Zhang, Hang Liu, Seung Woo Son:
Understanding the impact of lossy compressions on IoT smart farm analytics. 4602-4611 - Dinesh C. Verma, Geeth de Mel:
Measures of network centricity for edge deployment of IoT applications. 4612-4620 - Xiaochi Zhou, Vinícius Amaral, John D. Albertson:
Source characterization of airborne emissions using a sensor network: Examining the impact of sensor quality, quantity, and wind climatology. 4621-4629 - Dabiah Ahmed Alboaneen, Huaglory Tianfield, Yan Zhang:
Sentiment analysis via multi-layer perceptron trained by meta-heuristic optimisation. 4630-4635 - Olga Babko-Malaya, Rebecca Cathey, Steve Hinton, David Maimon, Taissa Gladkova:
Detection of hacking behaviors and communication patterns on social media. 4636-4641 - Adam Dalton, Bonnie J. Dorr, Leon Liang, Kristy Hollingshead:
Improving cyber-attack predictions through information foraging. 4642-4647 - Jordan DeLoach, Doina Caragea:
Twitter-enhanced Android malware detection. 4648-4657 - Mohammed Eslami, George Zheng, Hamed Eramian, Georgiy Levchuk:
Deriving cyber use cases from graph projections of cyber data represented as bipartite graphs. 4658-4663 - Jhu-Sin Luo, Dan Chia-Tien Lo:
Binary malware image classification using machine learning with local binary pattern. 4664-4667 - David Maimon, Andrew Fukuda, Steve Hinton, Olga Babko-Malaya, Rebecca Cathey:
On the relevance of social media platforms in predicting the volume and patterns of web defacement attacks. 4668-4673 - Fernando Maymi, Robert Bixler, Randolph M. Jones, Scott D. Lathrop:
Towards a definition of cyberspace tactics, techniques and procedures. 4674-4679 - Hau Tran, An Nguyen, Phuong Vo, Tu Vu:
DNS graph mining for malicious domain detection. 4680-4685 - Xiaoyan Zhuo, Jialing Zhang, Seung Woo Son:
Network intrusion detection using word embeddings. 4686-4695 - Sung Whan Jeon, Hye Jin Lee, Sungzoon Cho:
Building industry network based on business text: Corporate disclosures and news. 4696-4704 - Yang Jiao, Jérémie Jakubowicz:
Predicting stock movement direction with machine learning: An extensive study on S&P 500 stocks. 4705-4713 - Naomi Simumba, Suguru Okami, Naohiko Kohtake:
Credit decision tool using mobile application data for microfinance in agriculture. 4714-4721 - Masanori Ajito, Yasuko Kawahata, Akira Ishii:
Analysis of national election using mathematical model of hit phenomenon. 4722-4724 - Darlan Arruda, Nazim H. Madhavji:
Towards a big data requirements engineering artefact model in the context of big data software development projects: Poster extended abstract. 4725-4726 - Shilpa Balan, Nishant Shristiraj, Vrunda Shah, Anusha Manjappa:
Big data analysis of youth tobacco smoking trends in the United States. 4727-4729 - Shaunak D. Bopardikar, George S. Eskander Ekladious:
Towards scalable kernel machines for streaming data analytics. 4730-4732 - Chaochao Chen, Xinxing Yang, Li Wang, Jun Zhou, Xiaolong Li:
Large scale app recommendation in Ant Financial. 4733-4735 - Ranjeet Devarakonda, Michael Giansiracusa, Jitendra Kumar, Harold Shanafield:
Social media based NPL system to find and retrieve ARM data: Concept paper. 4736-4737 - Mohammed Elshambakey, Mohamed Khalefa, William J. Tolone, Sreyasee Das Bhattacharjee, Huikyo Lee, Luca Cinquini, Shannon Schlueter, Isaac Cho, Wenwen Dou, Daniel J. Crichton:
Towards a distributed infrastructure for data-driven discoveries & analysis. 4738-4740 - Mohammed Eslami, George Zheng, Hamed Eramian, Georgiy Levchuk:
Anomaly detection on bipartite graphs for cyber situational awareness and threat detection. 4741-4743 - Iwao Fujino, Christophe Claramunt, Abdel-Ouahab Boudraa:
Extracting route patterns of vessels from AIS data by using topic model. 4744-4746 - Michel Généreux, Bryor Snejfella, Marta Maslej:
Big data in psychology: Using word embeddings to study theory-of-mind. 4747-4749 - Frank R. Greguska, Thomas Huang, Brian Wilson, Nga Quach, Joe Jacob:
Analyzing big ocean science data with NEXUS. 4750 - Abdeltawab M. Hendawi, Aqeel Rustum, Mohamed H. Ali, John A. Stankovic:
Turning big spatial data into smart routing. 4751-4753 - Mauri Kaipainen, Olli Pitkänen, Perspicamus Ab:
Human-controlled iterative subclustering analysis. 4754-4756 - Kasumi Kato, Atsuko Takefusa, Hidemoto Nakada, Masato Oguchi:
Consideration of parallel data processing over an apache spark cluster. 4757-4759 - Yasuko Kawahata, Yukari Moriyama, Shinichirou Yamada, Mingyi Sun, Taketo Kawamura:
Analytical the large-scale collection of data on the results of the guides for foreigners visiting Japan. 4760-4764 - Saleena Khanna, Yuvraj S. Sethi, Akash R. Nambiar:
iSkin specialist - A big data based expert system for dermatology. 4765-4767 - Thomas Kitson, Paula Olaya, Elizabeth Racca, Michael R. Wyatt II, Mario Guevara, Rodrigo Vargas, Michela Taufer:
Data analytics for modeling soil moisture patterns across united states ecoclimatic domains. 4768-4770 - Anusha Kola, Harshal More, Sean Soderman, Michael N. Gubanov:
Generating Unified Famous Objects (UFOs) from the classified object tables. 4771-4773 - Tai-Yeon Ku, Wan-Ki Park, Hoon Choi:
Energy information collection mechanism using big data correlation map. 4774-4776 - Hyun-Chul Lee, Tong-Il Jang, Kwangsu Moon:
Anticipating human errors from periodic big survey data in nuclear power plants. 4777-4778 - Chen Li, Annisa, Asif Zaman, Yasuhiko Morimoto:
MapReduce-based computation of area skyline query for selecting good locations in a map. 4779-4782 - PrathyushaRani Merla, Yiheng Liang:
Data analysis using hadoop MapReduce environment. 4783-4785 - Kwan Hui Lim, Shanika Karunasekera, Aaron Harwood, Lucia Falzon:
Spatial-based topic modelling using wikidata knowledge base. 4786-4788 - Lixin Liu, Jun Chen:
The influences of deep-sea vision data quality on observational analysis. 4789-4791 - Amin Majd, Elena Troubitsyna:
Data-driven approach to ensuring fault tolerance and efficiency of swarm systems. 4792-4794 - Javier Mata, Ignacio de Miguel, Ramón J. Durán, Juan Carlos Aguado, Noemí Merayo, Lidia Ruiz-Perez, Patricia Fernández, Rubén M. Lorenzo, Evaristo J. Abril:
A SVM approach for lightpath QoT estimation in optical transport networks. 4795-4797 - Kenji Nakashima, Joichiro Kon, Saneyasu Yamaguchi, Gil Jae Lee, José A. B. Fortes:
1A study on big data I/O performance with modern storage systems. 4798-4799 - Monika Nawrocka, Marcin Lukowski:
Biofeedback EEG data integration and visualization analytics for endurance exercise practices: Data integration and visualization analytics of biofeedback EEG. 4800-4802 - Paul Le Noac'h, Alexandru Costan, Luc Bougé:
A performance evaluation of Apache Kafka in support of big data streaming applications. 4803-4806 - Steven Ortiz, Caner Enbatan, Maksim Podkorytov, Dylan Soderman, Michael N. Gubanov:
Hybrid.JSON: High-velocity parallel in-memory polystore JSON ingest. 4807-4809 - Kaine Black, Monica Wachowicz, Alec Parise:
Using Bi-partite graphs to cluster complex networks. 4810-4812 - Nat Pavasant, Hiroshi Furutani, Masayuki Numao, Ken-ichi Fukui:
ART-2b: Adapted ART-2a for large scale data clustering on PM2.5 mass spectra. 4813-4815 - Tayfun Pay, Stephen Lucci:
Automatic keyword extraction: An ensemble method. 4816-4818 - Iulia Popescu, Kurt Portelli, Christos Anagnostopoulos, Nikos Ntarmos:
The case for graph-based recommendations. 4819-4821 - Jason Radford, Luke Horgan, David Lazer:
Baselines for demographic inference on a new gold standard twitter corpus. 4822-4823 - Jason Radford:
Piloting a theory-based approach to inferring gender in big data. 4824-4826 - Bharath K. Samanthula:
Privacy-preserving outsourced collaborative frequent itemset mining in the cloud. 4827-4829 - Shohei Shirataki, Saneyasu Yamaguchi:
A study on interpretability of decision of machine learning. 4830-4831 - Mark Simmons, Daniel Armstrong, Dylan Soderman, Michael N. Gubanov:
Hybrid.media: High velocity video ingestion in an in-memory scalable analytical polystore. 4832-4834 - Lisa Singh, Raghu Pemmaraju:
EOS: A multilingual text archive of international newspaper & blog articles. 4835-4837 - Tsumugi Tairaku, Akihiro Nakao, Saneyasu Yamaguchi, Masato Oguchi:
Application specific traffic control in large-scale disasters. 4838-4840 - Masashi Toyoda, Daisaku Yokoyama, Junpei Komiyama, Masahiko Itoh:
Road safety estimation utilizing big and heterogeneous vehicle recorder data. 4841-4842 - Sebastian Trinks, Carsten Felden:
Real time analytics - State of the art: Potentials and limitations in the smart factory. 4843-4845 - Akira Umayabara, Hayato Yamana:
MCMalloc: A scalable memory allocator for multithreaded applications on a many-core shared-memory machine. 4846-4848 - Santiago Villasenor, Tom Nguyen, Anusha Kola, Sean Soderman, Michael N. Gubanov:
Scalable spam classifier for web tables. 4849-4851 - Jonathan Wang, Kesheng Wu, Alex Sim, Seongwook Hwangbo:
Accurate signal timing from high frequency streaming data. 4852-4854 - Yifang Wei, Lisa Singh:
Understanding the impact of sampling and noise on detecting events using twitter. 4855-4857 - Yoshiko Yasumura, Hiroki Imabayashi, Hayato Yamana:
Attribute-based proxy re-encryption method for revocation in cloud data storage. 4858-4860 - Daisaku Yokoyama, Masashi Toyoda:
Towards constructing a driver management system based on large-scale driving operation records. 4861-4862 - Takuya Yonezawa, Ismail Arai, Toyokazu Akiyama, Kazutoshi Fujikawa:
Proposal of classification method of bus operation states using sensor data. 4863-4865 - Haiyan Yu, Kun Xiang, Jiang Yu:
Understanding a moderating effect of physicians' endorsement to online workload: An empirical study in online health-care communities. 4866-4868 - Philipp Zehnder, Dominik Riemer:
Towards automatic infrastructure provisioning for highly dynamic streaming applications. 4869-4871 - Binyam A. Zemede, Byron J. Gao:
Personalized search with editable profiles. 4872-4874 - Yin Zhang, Jiming Hu:
Discovering the interdisciplinary nature of big data research. 4875-4877 - Ziwei Zhu, Weijia Xu, Wei He:
Big data system for information aggregation and model comparison for precison medicine. 4878-4880
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.