


BigData Conference 2014: Washington, DC, USA
- Jimmy Lin, Jian Pei, Xiaohua Hu, Wo Chang, Raghunath Nambiar, Charu C. Aggarwal, Nick Cercone, Vasant G. Honavar, Jun Huan, Bamshad Mobasher, Saumyadipta Pyne:
2014 IEEE International Conference on Big Data (IEEE BigData 2014), Washington, DC, USA, October 27-30, 2014. IEEE Computer Society 2014, ISBN 978-1-4799-5665-4 - Lengdong Wu, Li-Yan Yuan, Jia-Huai You:
BASIC: An alternative to BASE for large-scale data management system. 5-14 - Sushovan De, Yuheng Hu, Yi Chen, Subbarao Kambhampati:
BayesWipe: A multimodal system for data cleaning and consistent query answering on structured bigdata. 15-24 - Stéphan Clémençon, Patrice Bertail, Emilie Chautru:
Scaling up M-estimation via sampling designs: The Horvitz-Thompson stochastic gradient descent. 25-30 - Jane Greenberg, Adrian Ogletree, Angela P. Murillo, Thomas P. Caruso, Herbie Huang:
Metadata capital: Simulating the predictive value of Self-Generated Health Information (SGHI). 31-36 - Raghvendra Mall, Vilen Jumutc, Rocco Langone, Johan A. K. Suykens:
Representative subsets for big data learning using k-NN graphs. 37-42 - Rubing Duan, Rick Siow Mong Goh, Feng Yang, Yong Kiam Tan, Jesus F. B. Valenzuela:
Towards building and evaluating a personalized location-based recommender system. 43-48 - Sarker Tanzir Ahmed, Dmitri Loguinov:
On the performance of MapReduce: A stochastic approach. 49-54 - Khalifeh AlJadda, Mohammed Korayem, Camilo Ortiz, Trey Grainger, John A. Miller, William S. York:
PGMHD: A scalable probabilistic graphical model for massive hierarchical data problems. 55-60 - Dongfang Zhao, Zhao Zhang, Xiaobing Zhou, Tonglin Li, Ke Wang, Dries Kimpe, Philip H. Carns, Robert B. Ross, Ioan Raicu:
FusionFS: Toward supporting data-intensive scientific applications on extreme-scale high-performance computing systems. 61-70 - Teng Wang, Sarp Oral, Yandong Wang, Bradley W. Settlemyer, Scott Atchley, Weikuan Yu:
BurstMem: A high-performance burst buffer system for scientific applications. 71-79 - Junwhan Kim:
Partial rollback-based scheduling on in-memory transactional data grids. 80-89 - Hao Chen, Sastry S. Duri, Vasanth Bala, Nilton T. Bila, Canturk Isci, Ayse K. Coskun:
Detecting and identifying system changes in the cloud via discovery by example. 90-99 - Kyungho Jeon, Sharath Chandrashekhara, Feng Shen, Shikhar Mehra, Oliver Kennedy, Steven Y. Ko:
PigOut: Making multiple Hadoop clusters work together. 100-109 - Zhisong Fu, Harish Kumar Dasari, Bradley R. Bebee, Martin Berzins, Bryan B. Thompson:
Parallel Breadth First Search on GPU clusters. 110-118 - Ke Wang, Xiaobing Zhou, Tonglin Li, Dongfang Zhao, Michael Lang, Ioan Raicu:
Optimizing load balancing and data-locality with data-aware scheduling. 119-128 - Zhang Fu, Magnus Almgren, Olaf Landsiedel, Marina Papatriantafilou:
Online temporal-spatial analysis for detection of critical events in Cyber-Physical Systems. 129-134 - Xuejie Xiao, Jian Tang, Zhenhua Chen, Jielong Xu, Chonggang Wang:
A cross-job framework for MapReduce scheduling. 135-140 - Jia-Chun Lin, Ming-Chang Lee, Ramin Yahyapour:
Scheduling MapReduce tasks on virtual MapReduce clusters from a tenant's perspective. 141-146 - Takatsugu Ono, Yotaro Konishi, Teruo Tanimoto, Noboru Iwamatsu, Takashi Miyoshi, Jun Tanaka:
FlexDAS: A flexible direct attached storage for I/O intensive applications. 147-152 - Lena Mashayekhy, Mahyar Movahed Nejad, Daniel Grosu:
A two-sided market mechanism for trading big data computing commodities. 153-158 - Zhiyuan Lin, Minsuk Kahng, Kaeser Md. Sabrin, Duen Horng (Polo) Chau, Ho Lee, U Kang:
MMap: Fast billion-scale graph computation on a PC via memory mapping. 159-164 - Arian Bär, Alessandro Finamore, Pedro Casas, Lukasz Golab, Marco Mellia:
Large-scale network traffic monitoring with DBStream, a system for rolling big data analysis. 165-170 - Jason W. Anderson, Ken E. Kennedy, Linh Bao Ngo, André Luckow, Amy W. Apon:
Synthetic data generation for the internet of things. 171-176 - Diana Gudu, Marcus Hardt, Achim Streit:
Evaluating the performance and scalability of the Ceph distributed storage system. 177-182 - Li Jiang, Hideyuki Kawashima, Osamu Tatebe:
Incremental window aggregates over array database. 183-188 - Michel Angelo Roger, Yiqi Xu, Ming Zhao:
BigCache for big-data systems. 189-194 - Evie Kassela, Christina Boumpouka, Ioannis Konstantinou, Nectarios Koziris:
Automated workload-aware elasticity of NoSQL clusters in the cloud. 195-200 - Khoa Luu, Chenchen Zhu, Marios Savvides:
Distributed class dependent feature analysis - A big data approach. 201-206 - Krish K. R., M. Safdar Iqbal, Ali Raza Butt:
VENU: Orchestrating SSDs in hadoop storage. 207-212 - Nusrat Sharmin Islam, Xiaoyi Lu, Md. Wasi-ur-Rahman, Raghunath Rajachandrasekar, Dhabaleswar K. Panda:
In-memory I/O and replication for HDFS with Memcached: Early experiences. 213-218 - Douglas Otstott, Noah Evans, Latchesar Ionkov, Ming Zhao, Michael Lang:
Enabling composite applications through an asynchronous shared memory interface. 219-224 - Silu Huang, Ada Wai-Chee Fu:
k-Balanced sorting and skew join in MPI and MapReduce. 225-230 - Dongfang Zhao, Jian Yin, Kan Qiao, Ioan Raicu:
Virtual chunks: On supporting random accesses to scientific data in compressible storage systems. 231-240 - Mohammed Nazim Feroz, Susan A. Mengel:
Examination of data, rule generation and detection of phishing URLs using online logistic regression. 241-250 - Mohan Yang, Carlo Zaniolo:
Main memory evaluation of recursive queries on multicore machines. 251-260 - Shigeru Maya, Kai Morino, Kenji Yamanishi:
Predicting glaucoma progression using multi-task learning with heterogeneous features. 261-270 - Dong Dai, Yong Chen, Dries Kimpe, Robert B. Ross:
Provenance-based object storage prediction scheme for scientific big data applications. 271-280 - Songchang Jin, Jiawei Zhang, Philip S. Yu, Shuqiang Yang, Aiping Li:
Synergistic partitioning in multiple large scale social networks. 281-290 - Alice Marascu, Pascal Pompey, Eric Bouillet, Michael Wurst, Olivier Verscheure, Martin Grund, Philippe Cudré-Mauroux:
TRISTAN: Real-time analytics on massive time series using sparse dictionary compression. 291-300 - Hao Li, Di Yu, Anand Kumar, Yi-Cheng Tu:
Performance modeling in CUDA streams - A means for high-throughput data processing. 301-310 - Patrick Leyshock, David Maier, Kristin Tufte:
Minimizing data movement through query transformation. 311-316 - Oyindamola O. Akande, Philip J. Rhodes:
Multilevel partitioning of large unstructured grids. 317-322 - Dongeun Lee, Jaesik Choi:
Low complexity sensing for big spatio-temporal data. 323-328 - Jialin Liu, Yin Lu, Yong Chen:
In-advance data analytics for reducing time to discovery. 329-334 - Maria Christoforaki, Torsten Suel:
Estimating pairwise distances in large graphs. 335-344 - Anh Thu Vu, Gianmarco De Francisci Morales, João Gama, Albert Bifet:
Distributed Adaptive Model Rules for mining big data streams. 345-353 - Dorit S. Hochbaum, Philipp Baumann:
Sparse computation for large-scale data mining. 354-363 - Arun S. Maiya, Robert M. Rolfe:
Topic similarity networks: Visual analytics for large document sets. 364-372 - Mayank Daga, Mark Nutter, Mitesh R. Meswani:
Efficient breadth-first search on a heterogeneous processor. 373-382 - Chad A. Steed, Katherine J. Evans, John F. Harney, Brian C. Jewell, Galen M. Shipman, Brian E. Smith, Peter E. Thornton, Dean N. Williams:
Web-based visual analytics for extreme scale climate science. 383-392 - Ryan Compton, David Jurgens, David Allen:
Geotagging one hundred million Twitter accounts with total variation minimization. 393-401 - Ruben Mayer, Boris Koldehofe, Kurt Rothermel:
Meeting predictable buffer limits in the parallel execution of event processing operators. 402-411 - Xiaomeng Zhao, Huadong Ma, Haitao Zhang, Yi Tang, Guangping Fu:
Metadata extraction and correction for large-scale traffic surveillance videos. 412-420 - Ke Tao, Claudia Hauff, Geert-Jan Houben, Fabian Abel, Guido Wachsmuth:
Facilitating Twitter data analytics: Platform, language and functionality. 421-430 - Masahiko Itoh, Daisaku Yokoyama, Masashi Toyoda, Yoshimitsu Tomita, Satoshi Kawamura, Masaru Kitsuregawa:
Visual fusion of mega-city big data: An application to traffic and tweets data analysis of Metro passengers. 431-440 - Alekh Jindal, Samuel Madden:
GRAPHiQL: A graph intuitive query language for relational databases. 441-450 - Ronak Etemadpour, Paul Murray, Angus Graeme Forbes:
Evaluating density-based motion for big data visual analytics. 451-460 - Ulf Johansson, Cecilia Sönströd, Henrik Linusson, Henrik Boström:
Regression trees for streaming data with local performance guarantees. 461-470 - Pei-Ling Chen, Chung-Kuang Chou, Ming-Syan Chen:
Distributed algorithms for k-truss decomposition. 471-480 - George M. Slota, Kamesh Madduri, Sivasankaran Rajamanickam:
PuLP: Scalable multi-objective multi-constraint partitioning for small-world networks. 481-490 - Arash Fard, Satya Manda, Lakshmish Ramaswamy, John A. Miller:
Effective caching techniques for accelerating pattern matching queries. 491-499 - Diana Palsetia, Md. Mostofa Ali Patwary, William Hendrix, Ankit Agrawal, Alok N. Choudhary:
Clique guided community detection. 500-509 - Hideyuki Shamoto, Koichi Shirahata, Aleksandr Drozd, Hitoshi Sato, Satoshi Matsuoka:
Large-scale distributed sorting for GPU-based heterogeneous supercomputers. 510-518 - Chieh-Yen Lin, Cheng-Hao Tsai, Ching-Pei Lee, Chih-Jen Lin:
Large-scale logistic regression and linear support vector machines using spark. 519-528 - Keita Iwabuchi, Hitoshi Sato, Yuichiro Yasui, Katsuki Fujisawa, Satoshi Matsuoka:
NVM-based Hybrid BFS with memory efficient data structure. 529-538 - Muzaffer Can Altinigneli, Bettina Konte, Dan Rujescir, Christian Böhm, Claudia Plant:
Identification of SNP interactions using data-parallel primitives on GPUs. 539-548 - Shan Jiang, ChengXiang Zhai:
Random walks on adjacency graphs for mining lexical relations from big text data. 549-554 - Jonathan Mugan, Ranga Chari, Laura Hitt, Eric McDermid, Marsha Sowell, Yuan Qu, Thayne Coffman:
Entity resolution using inferred relationships and behavior. 555-560 - Rong Gu, Wei Hu, Yihua Huang:
Rainbow: A distributed and hierarchical RDF triple store with dynamic scalability. 561-566 - Hong Yi, Michel E. Rasquin, Jun Fang, Igor A. Bolotnov:
In-situ visualization and computational steering for large-scale simulation of turbulent flows in complex geometries. 567-572 - Thibault Debatty, Pietro Michiardi, Olivier Thonnard, Wim Mees:
Building k-nn graphs from large text data. 573-578 - Raju Balakrishnan, Rajesh Parekh:
Learning to predict subject-line opens for large-scale email marketing. 579-584 - Robert S. Pienta, Acar Tamersoy, Hanghang Tong, Duen Horng Chau:
MAGE: Matching approximate patterns in richly-attributed graphs. 585-590 - Jungkyu Han, Min Luo:
Bootstrapping K-means for big data analysis. 591-596 - Ahsanul Haque, Swarup Chandra, Latifur Khan, Charu C. Aggarwal:
Distributed Adaptive Importance Sampling on graphical models using MapReduce. 597-602 - Bo Liu, Erico N. de Souza, Stan Matwin, Marcin Sydow:
Knowledge-based clustering of ship trajectories using density-based approach. 603-608 - Ciro Donalek, S. George Djorgovski, Alex Cioc, Anwell Wang, Jerry Zhang, Elizabeth Lawler, Stacy Yeh, Ashish Mahabal, Matthew J. Graham, Andrew J. Drake, Scott Davidoff, Jeffrey S. Norris, Giuseppe Longo:
Immersive and collaborative data visualization using virtual reality platforms. 609-614 - Lee Parnell Thompson, Weijia Xu, Daniel P. Miranker:
The Adaptive Projection Forest: Using adjustable exclusion and parallelism in metric space indexes. 615-620 - Tony Worm, Kenneth Chiu:
Scaling up Prioritized Grammar Enumeration for scientific discovery in the cloud. 621-626 - Yun Shen, Olivier Thonnard:
MR-TRIAGE: Scalable multi-criteria clustering for big data security intelligence applications. 627-635 - Todd J. Bodnar, Conrad S. Tucker, Kenneth M. Hopkinson, Sven G. Bilén:
Increasing the veracity of event detection on social media networks through user trust modeling. 636-643 - Vladimir Estivill-Castro, Peter Hough, Md Zahidul Islam:
Empowering users of social networks to assess their privacy risks. 644-649 - Tahereh Babaie, Sanjay Chawla, Sebastien Ardon, Yue Yu:
A unified approach to network anomaly detection. 650-655 - Zhichuan Huang, Hongyao Luo, David Skoda, Ting Zhu, Yu Gu:
E-Sketch: Gathering large-scale energy consumption data based on consumption patterns. 656-665 - Lee Kellogg, Brian E. Ruttenberg, Alison O'Connor, Michael Howard, Avi Pfeffer:
Hierarchical management of large-scale malware data. 666-674 - Sotiris K. Tasoulis, Lu Cheng, Niko Välimäki, Nicholas J. Croucher, Simon R. Harris, William P. Hanage, Teemu Roos, Jukka Corander:
Random projection based clustering for population genomics. 675-682 - Daniela Ushizima, Talita Perciano, Harinarayan Krishnan, Burlen Loring, Hrishikesh Bale, Dilworth Parkinson, James A. Sethian:
Structure recognition from high resolution images of ceramic composites. 683-691 - Sufeng Niu, Guangyu Yang, Nilim Sarma, Pengfei Xuan, Melissa C. Smith, Pradip K. Srimani, Feng Luo:
Combining Hadoop and GPU to preprocess large Affymetrix microarray data. 692-700 - Wenrong Zeng, Yuhao Yang, Bo Luo:
Content-Based Access Control: Use data content to assist access control for large-scale content-centric databases. 701-710 - Yu Zhang, Stephen Wistar, Jose A. Piedra-Fernández, Jia Li, Michael A. Steinberg, James Z. Wang:
Locating visual storm signatures from satellite images. 711-720 - Marc Frîncu, Charalampos Chelmis, Muhammad Usman Noor, Viktor K. Prasanna:
Accurate and efficient selection of the best consumption prediction method in smart grids. 721-729 - Andrew Todd, William T. Scherer, Peter A. Beling, Mark E. Paddrik, Richard Haynes:
Visualizations for sense-making in financial market regulation. 730-735 - Mathias Johanson, Stanislav Belenki, Jonas Jalminger, Magnus Fant, Mats Gjertz:
Big Automotive Data: Leveraging large volumes of data for knowledge-driven product development. 736-741 - Yufei Han, Xiaolan Sha, Etta Grover-Silva, Pietro Michiardi:
On the impact of socio-economic factors on power load forecasting. 742-747 - Jong Hoon Ahnn:
Toward personalized and scalable voice-enabled services powered by big data. 748-753 - Guo-Qiang Zhang, Wei Zhu, Mengmeng Sun, Shiqiang Tao, Olivier Bodenreider, Licong Cui:
MaPLE: A MapReduce Pipeline for Lattice-based Evaluation and its application to SNOMED CT. 754-759 - Bun Theang Ong, Komei Sugiura, Koji Zettsu:
Dynamic pre-training of Deep Recurrent Neural Networks for predicting environmental monitoring data. 760-765 - José Manuel Abuín, Juan Carlos Pichel, Tomás F. Pena, Pablo Gamallo Otero, Marcos García:
Perldoop: Efficient execution of Perl scripts on Hadoop clusters. 766-771 - Dean N. Williams, Giri Palanisamy, Galen M. Shipman, Thomas A. Boden, Jimmy W. Voyles:
Department of energy strategic roadmap for Earth system science data integration. 772-777 - Daniel Fried, Mihai Surdeanu, Stephen G. Kobourov, Melanie Hingle, Dane Bell:
Analyzing the language of food on social media. 778-783 - Wei-Chun Chung, Yu-Jung Chang, D. T. Lee, Jan-Ming Ho:
Using geometric structures to improve the error correction algorithm of high-throughput sequencing data on MapReduce framework. 784-789 - Maryam Panahiazar, Vahid Taslimi, Ashutosh Jadhav, Jyotishman Pathak:
Empowering personalized medicine with big data and semantic web technology: Promises, challenges, and use cases. 790-795 - Amit Gupta, Weijia Xu, Kenneth Perrine, Dennis Bell, Natalia Ruiz-Juri:
On scaling time dependent shortest path computations for Dynamic Traffic Assignment. 796-801 - Tao Zhong, Kshitij A. Doshi, Gang Deng, Xiaoming Yang, Hegao Zhang:
High volume geospatial mapping for internet-of-vehicle solutions with in-memory map-reduce processing. 802-807 - Khalifeh AlJadda, Mohammed Korayem, Trey Grainger, Chris Russell:
Crowdsourced query augmentation through semantic discovery of domain-specific jargon. 808-815 - Peter Bajcsy, Phuong Nguyen, Antoine Vandecreme, Mary Brady:
Spatial computations over terabyte-sized images on hadoop platforms. 816-824 - Vinay Deolalikar, Kave Eshghi:
Lightweight approximate top-k for distributed settings. 835-844 - Vinay Deolalikar:
Query revision during cluster based search on large unstructured corpora. 845-853 - Chaitali Gupta, Mayank Bansal, Tzu-Cheng Chuang, Ranjan Sinha, Sami Ben-Romdhane:
Astro: A predictive model for anomaly detection and feedback-based scheduling on Hadoop. 854-862 - Eric Huang, Andres Quiroz, Luca Ceriani:
Automating data integration with HiperFuse. 863-867 - Jayasimha Katukuri, Tolga Könik, Rajyashree Mukherjee, Santanu Kolay:
Recommending similar items in large-scale online marketplaces. 868-876 - Dhaval C. Lunagariya, Durvasula V. L. N. Somayajulu, P. Radha Krishna:
SE-CDA: A scalable and efficient community detection algorithm. 877-882 - Rohan Malcolm, Cherrelle Morrison, Tyrone Grandison, Sean S. E. Thorpe, Kimron Christie, Akim Wallace, Damian Green, Julian Jarrett, Arnett Campbell:
Increasing the accessibility to Big Data systems via a common services API. 883-892 - Sathyan Munirathinam, Balakrishnan Ramadoss:
Big data predictive analtyics for proactive semiconductor equipment maintenance. 893-902 - Celeste Lyn Paul, Chris Argenta, William C. Elm, Alex Endert:
Future directions of humans in Big Data Research: Summary of the 1st workshop on Human-Centered Big Data Research. 903-904 - Nicolás Poggi, David Carrera, Aaron Call, Sergio Mendoza, Yolanda Becerra, Jordi Torres, Eduard Ayguadé, Fabrizio Gagliardi, Jesús Labarta, Rob Reinauer, Nikola Vujic, Daron Green, José A. Blakeley:
ALOJA: A systematic study of Hadoop deployment variables to enable automated characterization of cost-effectiveness. 905-913