BigData Congress 2014: Anchorage, AK, USA
- 2014 IEEE International Congress on Big Data, Anchorage, AK, USA, June 27 - July 2, 2014. IEEE Computer Society 2014, ISBN 978-1-4799-5057-7
BigData Research Session 1 - BigData Analytics Techniques
- Yang Zhou, Sangeetha Seshadri, Lawrence Chiu, Ling Liu:
GraphLens: Mining Enterprise Storage Workloads Using Graph Analytics. 1-8
- Mansurul Bhuiyan, Mohammad Al Hasan:
FSM-H: Frequent Subgraph Mining Algorithm in Hadoop. 9-16
- Jia Wang, Ada Wai-Chee Fu, James Cheng:
Rectangle Counting in Large Bipartite Graphs. 17-24
BigData Research Session 2 - MapReduce Model
- Jin Soung Yoo, Douglas Boulware, David Kimmey:
A Parallel Spatial Co-location Mining Algorithm Based on MapReduce. 25-31
- Lena Mashayekhy, Mahyar Movahed Nejad, Daniel Grosu, Dajun Lu, Weisong Shi:
Energy-Aware Scheduling of MapReduce Jobs. 32-39
- Huseyin Ulusoy, Murat Kantarcioglu, Erman Pattuk, Kevin W. Hamlen:
Vigiles: Fine-Grained Access Control for MapReduce Systems. 40-47
BigData Research Session 3 - BigData Security
- Jingwei Huang, David M. Nicol, Roy H. Campbell:
Denial-of-Service Threat to Hadoop/YARN Clusters with Multi-tenancy. 48-55
- Samuel Marchal, Xiuyan Jiang, Radu State, Thomas Engel:
A Big Data Architecture for Large Scale Security Monitoring. 56-63
- Michael A. Hayes, Miriam A. M. Capretz:
Contextual Anomaly Detection in Big Sensor Data. 64-71
BigData Research Session 4 - BigData Performance Issues
- Jianting Zhang, Simin You, Le Gruenwald:
High-Performance Spatial Query Processing on Big Taxi Trip Data Using GPGPUs. 72-79
- Liping Zhang, Qi Chen, Kai Miao:
A Compatible LZMA ORC-Based Optimization for High Performance Big Data Load. 80-87
BigData Research Session 5 - BigData Analytics
- Angelos Molfetas, Anthony Wirth, Justin Zobel:
Storing a Collection of Differentially Compressed Files Recursively. 88-95
- Carsten Binnig, Abdallah Salama, Erfan Zamanian, Harald Kornmayer, Sven Listing, Alexander C. Müller:
XDB - A Novel Database Architecture for Data Analytics as a Service. 96-103
- Peter Ivie, Douglas Thain:
DeltaDB: A Scalable Database Design for Time-Varying Schema-Free Data. 104-111
BigData Research Session 6 - MapReduce Framework
- Yifan Chen, Xiang Zhao, Bin Ge, Chuan Xiao, Chi-Hung Chi:
Practising Scalable Graph Similarity Joins in MapReduce. 112-119
- Jessica Hartog, Renan Delvalle, Madhusudhan Govindaraju, Michael J. Lewis:
Configuring a MapReduce Framework for Performance-Heterogeneous Clusters. 120-127
- Johannes Schildgen, Thomas Jörg, Manuel Hoffmann, Stefan Dessloch:
Marimba: A Framework for Making MapReduce Jobs Incremental. 128-135
BigData Research Session 7 - BigData Services
- Stanislav Sobolevsky, Izabela Sitko, Remi Tachet des Combes, Bartosz Hawelka, Juan Murillo Arias, Carlo Ratti:
Money on the Move: Big Data of Bank Card Transactions as the New Proxy for Human Mobility Patterns and Regional Delineation. The Case of Residents and Foreign Visitors in Spain. 136-143
- Xu Tan, Yuanchao Shu, Xie Lu, Peng Cheng, Jiming Chen:
Characterizing and Modeling Package Dynamics in Express Shipping Service Network. 144-151
- Desheng Zhang, Tian He, Shan Lin, Sirajum Munir, John A. Stankovic:
Dmodel: Online Taxicab Demand Model from Big Sensor Data in a Roving Sensor Network. 152-159
BigData Research Session 8 - Distributed BigData Services
- Yanyan Xu, James Cheng, Ada Wai-Chee Fu, Yingyi Bu:
Distributed Maximal Clique Computation. 160-167
- Elif Dede, Bedri Sendir, Pinar Kuzlu, J. Weachock, Madhusudhan Govindaraju, Lavanya Ramakrishnan:
A Processing Pipeline for Cassandra Datasets Based on Hadoop Streaming. 168-175
- Pedro Martins, Maryam Abbasi, Pedro Furtado:
AuDy: Automatic Dynamic Least-Weight Balancing for Stream Workloads Scalability. 176-183
BigData Research Session 9 - BigData Analytics Network
- Mark Thomas, Leigh Metcalf, Jonathan M. Spring, Paul Krystosek, Katherine Prevost:
SiLK: A Tool Suite for Unsampled Network Flow Analysis at Scale. 184-191
- Angelos Molfetas, Anthony Wirth, Justin Zobel:
Using Inter-file Similarity to Improve Intra-file Compression. 192-199
BigData Research Session 10 - BigData Management Model
- Chung-Chih Cheng, Fan-Chieh Cheng, Po-Hsiung Lin, Shih-Chia Huang:
A Cloud-Computing Local Histogram Construction Algorithm for Big Image Data. 200-203
- Tonglin Li, Ioan Raicu, Lavanya Ramakrishnan:
Scalable State Management for Scientific Applications in the Cloud. 204-211
- Eleanna Kafeza, Andreas Kanavos, Christos Makris, Pantelis Vikatos:
T-PICE: Twitter Personality Based Influential Communities Extraction System. 212-219
- Verena Kantere:
A Holistic Framework for Big Scientific Data Management. 220-226
BigData Research Session 11 - Mexico City Satellite Session
- Ángel Fernando Kuri Morales:
Data Base Analysis Using a Compact Data Set. 227-233
- Moisés Quezada Naquid, Ricardo Marcelín-Jiménez, José Luis González Compeán:
The Babel File System. 234-241
BigData Research Session 12 - Coimbra Satellite Session
- Antonio M. Rinaldi:
Using Multimedia Ontologies for Automatic Image Annotation and Classification. 242-249
- Diogo Anjos, Paulo Carreira, Alexandre P. Francisco:
Real-Time Integration of Building Energy Data. 250-257
BigData Research Session 13 - Taipei Satellite Session
- Rafat Hammad, Ching-Seh Wu:
Provenance as a Service: A Data-centric Approach for Real-Time Monitoring. 258-265
- Ching-Han Chen, Ching-Yi Chen, Chih-Hsien Hsia, Guan-Xin Wu:
Big Data Collection Gateway for Vision-Based Smart Meter Reading Network. 266-269
- Wei-Ho Tsai, Cin-Hao Ma:
Triangulation-Based Singer Identification for Duet Music Data Indexing. 270-275
- Wei-Ho Tsai, Cin-Hao Ma:
Speech and Singing Discrimination for Audio Data Indexing. 276-280
- Charles Chin-Ho Lin, Liang-Cheng Huang, Seng-cho Timothy Chou, Chih-Ho Liu, Han-Fang Cheng, I-Jen Chiang:
Temporal Event Tracing on Big Healthcare Data Analytics. 281-287
- Zhu Wang, Tiejian Luo, Guandong Xu, Xiang Wang:
The Application of Cartesian-Join of Bloom Filters to Supporting Membership Query of Multidimensional Data. 288-295
- Chun-Yu Wang, Tzu-Li Tai, Jui-Shing Shu, Jyh-Biau Chang, Ce-Kuen Shieh:
Federated MapReduce to Transparently Run Applications on Multicluster Environment. 296-303
- Xiao Fu, Zhijian Wang, Hao Wu, Jia-qi Yang, Zizhao Wang:
How to Send a Self-Destructing Email: A Method of Self-Destructing Email System. 304-309
- Xiaoqing Yu, Huanhuan Liu, Jianhua Shi, Jenq-Neng Hwang, Wanggen Wan, Jing Lu:
Association Rule Mining of Personal Hobbies in Social Networks. 310-314
BigData Research Session 14 - Data Processing
- Carson Kai-Sang Leung, Richard Kyle MacKinnon, Fan Jiang:
Reducing the Search Space for Big Data Mining for Interesting Patterns from Uncertain Data. 315-322
- Eirini C. Micheli, Giorgos Margaritis, Stergios V. Anastasiadis:
Lethe: Cluster-Based Indexing for Secure Multi-user Search. 323-330
- Yifeng Geng, Xiaomeng Huang, Guangwen Yang:
Adaptive Indexing for Distributed Array Processing. 331-338
BigData Research Session 15 - Shenzhen Satellite Session
- Dingsheng Wan, Yan Xiao, Pengcheng Zhang, Jun Feng, Yuelong Zhu, Qian Liu:
Hydrological Time Series Anomaly Mining Based on Symbolization and Distance Measure. 339-346
- Liqiang Wang, Shijun Liu, Li Pan, Lei Wu, Xiangxu Meng:
Enterprise Relationship Network: Build Foundation for Social Business. 347-354
- Yan Tang, Yu Wang, Kendra M. L. Cooper, Ling Li:
Towards Big Data Bayesian Network Learning - An Ensemble Learning Based Approach. 355-357
- Xue Bai, Fu Chen, Shaobin Zhan:
A Study on Sentiment Computing and Classification of Sina Weibo with Word2vec. 358-363
- Fu Chen, Shaobin Zhan, Guangjun Shi:
A Study on Trend Prediction in Sina Weibo Community. 364-365
- Changjian Wang, Yuxing Peng, Mingxing Tang, Dongsheng Li, Shanshan Li, Pengfei You:
MapCheckReduce: An Improved MapReduce Computing Model for Imprecise Applications. 366-373
- Saixia Lyu, Jianxun Liu, Mingdong Tang, Guosheng Kang, Buqing Cao, Yucong Duan:
Three-Level Views of the Web Service Network: An Empirical Study Based on ProgrammableWeb. 374-381
- Haisu Zhang, Sheng Zhang, Zhaolin Wu, Liwei Huang, Yutao Ma:
Predicting Wikipedia Editor's Editing Interest Based on Factor Graph Model. 382-389
- Junming Zhang, Jinglin Li, Shangguang Wang, Zhihan Liu, Quan Yuan, Fangchun Yang:
On Retrieving Moving Objects Gathering Patterns from Trajectory Data via Spatio-temporal Graph. 390-397
BigData Industry and Application Session 1 - Distributed
- Zhiyun Zheng, Zhimeng Du, Lun Li, Yike Guo:
BigData Oriented Open Scalable Relational Data Model. 398-405
- Ivan Giangreco, Ihab Al Kabary, Heiko Schuldt:
ADAM - A Database and Information Retrieval System for Big Multimedia Collections. 406-413
- Tianjian Chen, Zhengrui Man, Hao Li, Xin Sun, Raymond K. Wong, Zhiwei Yu:
Building a Massive Stream Computing Platform for Flexible Applications. 414-421
- Alan Yu Shyang Tan, Ryan Kok Leong Ko, Grace P. Y. Ng:
OpenStack Café: A Novel Time-Based User-centric Resource Management Framework in the Cloud. 422-429
- Arpit Baheti, Durga Toshniwal:
Trend Analysis of Time Series Data Using Data Mining Techniques. 430-437
- Chongke Bi, Kenji Ono, Lu Yang:
Parallel POD Compression of Time-Varying Big Datasets Using m-Swap on the K Computer. 438-445
BigData Industry and Application Session 3 - BigData Management System
- Bo Hu, Yutao Ma, Liang-Jie Zhang, Jiake Shi, Jiayan Zhong:
A Key-Value Based Application Platform for Enterprise Big Data. 446-453
- Ahmed Abdeen Hamed, Xindong Wu:
Does Social Media Big Data Make the World Smaller? An Exploratory Analysis of Keyword-Hashtag Networks. 454-461
- Aniruddha Desai, Muaz Mian, David Hazel, Ankur Teredesai, Gregory Benner:
Data Visualization in Educational Datasets Using a Rule-Based Inference System. 462-469
BigData Industry and Application Session 4 - BigData Analytics Method
- Anirudh Thommandram, J. Mikael Eklund, Carolyn McGregor, James Edward Pugh, Andrew G. James:
A Rule-Based Temporal Analysis Method for Online Health Analytics and Its Application for Real-Time Detection of Neonatal Spells. 470-477
- Shanqing Li, Lirong Song, Hui Zhao:
A Discriminant Framework for Detecting Similar Scientific Research Projects Based on Big Data Mining. 478-481
BigData Industry and Application Session 5 - BigData Analytics Algorithm
- Anil Kumar, Vikas Kapur, Apangshu Saha, Rajeev Kumar Gupta, Arun Singh, Santanu Chaudhury, Sumeet Agarwal:
Distributed Implementation of Latent Rating Pattern Sharing Based Cross-domain Recommender System Approach. 482-489
- Apostolos Papageorgiou, Manuel Zahn, Ernö Kovacs:
Auto-configuration System and Algorithms for Big Data-Enabled Internet-of-Things Platforms. 490-497
- Matthew Saltz, Ayushi Jain, Abhishek Kothari, Arash Fard, John A. Miller, Lakshmish Ramaswamy:
DualIso: An Algorithm for Subgraph Pattern Matching on Very Large Labeled Graphs. 498-505
BigData Industry and Application Session 6 - BigData Mining
- Philippe Lalanda, Catherine Hamon:
An Autonomic Mediation Framework for Complex Physical Environments. 506-513
- Leila Ismail, Mohammad M. Masud, Latifur Khan:
FSBD: A Framework for Scheduling of Big Data Mining in Cloud Computing. 514-521
- Srividya K. Bansal:
Towards a Semantic Extract-Transform-Load (ETL) Framework for Big Data Integration. 522-529
BigData Industry and Application Session 7 - NoSQL
- Julian Krumeich, Sven Jacobi, Dirk Werth, Peter Loos:
Big Data Analytics for Predictive Manufacturing Control - A Case Study from Process Industry. 530-537
- Yi-Cheng Huang, Wenwey Hseush, Yu-Chun Lai, Michael Fong:
BigObject Store: In-Place Computing for Interactive Analytics. 538-545
- Phani Rohit Mullangi, Gowtham Penematsa, Lakshmish Ramaswamy:
Scalable XPath Evaluation on Large-Scale Continuously Evolving XML Repositories. 546-553
BigData Industry and Application Session 8 - Big Spatial
- Wei Zhou, Chi-Hung Chi, Can Wang, Raymond K. Wong, Chen Ding:
Bridging the Gap between Spatial Data Sources and Mashup Applications. 554-561
- Jorge Alencar, Tibérius O. Bonates, Carlile Lavor:
A Combinatorial Approach to Multidimensional Scaling. 562-569
BigData Industry and Application Session 9 - Hadoop
- Oscar D. Lara Yejas, Weiqiang Zhuang, Adarsh Pannu:
Big R: Large-Scale Analytics on Hadoop Using R. 570-577
- Jun Fan, Xinhui Li, Chi Harold Liu, Jeffrey Buell, Gavin Lu, Luke Lu:
Diagnosing Virtualized Hadoop Performance from Benchmark Results: An Exploratory Study. 578-585
- Sungyong Ahn, Sangkyu Park, Jae-Ki Hong, Wooseok Chang:
Performance Implications of SSDs in Virtualized Hadoop Clusters. 586-593
BigData Industry and Application Session 10 - BigData Privacy
- Abdulkareem Alsudais, Gondy Leroy, Anthony Corso:
We Know Where You Are Tweeting From: Assigning a Type of Place to Tweets Using Natural Language Processing and Random Forests. 594-600
- Jeff Sedayao, Rahul Bhardwaj, Nakul Gorade:
Making Big Data, Privacy, and Anonymization Work Together in the Enterprise: Experiences and Issues. 601-607
BigData Industry and Application Session 11 - Cloud-Based BigData Storage
- Frank Zhigang Wang, Theo Dimitrakos, Na Helian, Sining Wu, Ling Li, Rodric Yates:
CloudJet4BigData: Streamlining Big Data via an Accelerated Socket Interface. 608-615
- Wei-Chih Huang, Chuan-Ming Liu, Chuan-Chi Lai:
Resource Provisioning with QoS in Cloud Storage. 616-620
- Marwan Sabbouh, Kenneth McCracken, Geoff Cooney:
Data Sharing for Cloud Computing Platforms. 621-628
BigData Industry and Application Session 12 - BigData Analytics Architecture
- Raghava Rao Mukkamala, Abid Hussain, Ravi K. Vatrapu:
Towards a Set Theoretical Approach to Big Data Analytics. 629-636
- Aleksandr Drozd, Miquel Pericàs, Satoshi Matsuoka:
Efficient String Sorting on Multi- and Many-Core Architectures. 637-644
- Shantenu Jha, Judy Qiu, André Luckow, Pradeep Kumar Mantha, Geoffrey C. Fox:
A Tale of Two Data-Intensive Paradigms: Applications, Abstractions, and Architectures. 645-652
BigData Industry and Application Session 13 - NoSQL
- Rami Sellami, Sami Bhiri, Bruno Defude:
ODBAPI: A Unified REST API for Relational and NoSQL Data Stores. 653-660
- Richard K. Lomotey, Ralph Deters:
Terms Mining in Document-Based NoSQL: Response to Unstructured Data. 661-668
- Shin'ichi Takeuchi, Yuhei Akahoshi, Bun Theang Ong, Komei Sugiura, Koji Zettsu:
Spatio-temporal Pseudo Relevance Feedback for Large-Scale and Heterogeneous Scientific Repositories. 669-676
- Hang Yang, Huajun Chen, Cai Yuan, Fang Lianhang:
An Intelligent System for Forecasting the Trend of Consumed Electricity. 677-682
BigData Industry and Application Session 14 - NoSQL
- Heather Champion, Nick J. Pizzi, Raja Krishnamoorthy:
Tactical Clinical Text Mining for Improved Patient Characterization. 683-690
BigData Industry and Application Session 15 - Big Social
- Sanat Kumar Bista, Surya Nepal, Cécile Paris:
Multifaceted Visualisation of Annotated Social Media Data. 699-706
BigData Industry and Application Session 16 - Graph Analytics
- Arko Provo Mukherjee, Srikanta Tirthapura:
Enumerating Maximal Bicliques from a Large Graph Using MapReduce. 707-716
- Yue Zhao, Kenji Yoshigoe, Mengjun Xie, Suijian Zhou, Remzi Seker, Jiang Bian:
LightGraph: Lighten Communication in Distributed Graph-Parallel Processing. 717-724
- Hao Lin, Shuo Yang, Samuel P. Midkiff:
RABID: A Distributed Parallel R for Large Datasets. 725-732
BigData Industry and Application Session 17 - Cloud-Based BigData Service
- Shuang Chen, Mahboobeh Ghorbani, Yanzhi Wang, Paul Bogdan, Massoud Pedram:
Trace-Based Analysis and Prediction of Cloud Computing User Behavior Using the Fractal Modeling Technique. 733-739
- Xiangdong Huang, Jianmin Wang, Jian Bai, Guiguang Ding, Mingsheng Long:
Inherent Replica Inconsistency in Cassandra. 740-747
- Miyuru Dayarathna, Toyotaro Suzumura:
Towards Emulation of Large Scale Complex Network Workloads on Graph Databases with XGDBench. 748-755
BigData Work-in-Progress Session 1 - BigData Processing
- Hyejung Moon, Hyun Suk Cho, Seo Hwa Jeong, Jangho Park:
Policy Design Based on Risk at Big Data Era: Case Study of Privacy Invasion in South Korea. 756-759
- Jingwei Huang, Zbigniew T. Kalbarczyk, David M. Nicol:
Knowledge Discovery from Big Data for Intrusion Detection Using LDA. 760-761
- Harsh Kupwade Patil, Ravi Seshadri:
Big Data Security and Privacy Issues in Healthcare. 762-765
- Xingcan Cui, Zhen Dong, Liwei Lin, Renyong Song, Xiaohui Yu:
GrandLand Traffic Data Processing Platform. 766-767
- Cuiwen Xiong, Peng Zhang, Yan Li, Shipeng Zhang, Qingyun Liu, Jianlong Tan:
A Memory-Based Continuous Query Index for Stream Processing. 768-769
BigData Work-in-Progress Session 2 - BigData Analytics
- Jing Zhou, Xiaohui Yu, Yang Liu, Ziqiang Yu:
Ranking Keyword Search Results with Query Logs. 770-771
- Namgyu Kim, William Wong Xiu Shun, Jieun Kim, Kee-Young Kwahk, Seung Ryul Jeong, Hyunchul Ahn:
Constructing an Issue Network from the Perspective of Common R&D Keywords. 772-773
- Jarek Nabrzyski, Cheng Liu, Charles Vardeman, Sandra Gesing, Milan Budhatoki:
Agriculture Data for All - Integrated Tools for Agriculture Data Integration, Analytics, and Sharing. 774-775
- Zhen Zhao:
Asynchronous Service Analysis of Cloud DVR DataCenter. 776-777
- Muaz Mian, Ankur Teredesai, David Hazel, Sreenivasulu Pokuri, Krishna Uppala:
Work in Progress - In-Memory Analysis for Healthcare Big Data. 778-779
- Bo Liu, Liang Wu, Qiuxiang Dong, Yuanchun Zhou:
Large-Scale Heterogeneous Program Retrieval through Frequent Pattern Discovery and Feature Correlation Analysis. 780-781
BigData Work-in-Progress Session 3 - BigData Management Framework
- Chong Yang, Xiaohui Yu, Yang Liu:
Towards Efficient KNN Joins on Data Streams. 782-783
- Fangzhou Yao, Roy H. Campbell:
CouchFS: A High-Performance File System for Large Data Sets. 784-785
- Daniel Lins da Silva, Pedro Luiz Pizzigatti Corrêa, Silvio Luiz Stanzani, Paulo Andre Filipak, Andreiwid Sheffer Corrêa:
A Computational Framework for Integrating and Retrieving Biodiversity Data on a Large Scale. 786-787
- Matt MacDuff, Benno Lee, Sherman Beus:
Versioning Complex Data. 788-791
- Alexander Ditter, Dietmar Fey, Tobias Schön, Steven Oeckl:
On the Way to Big Data Applications in Industrial Computed Tomography. 792-793
BigData Work-in-Progress Session 4 - BigData Decision Support System
- Carlos R. Rivero, Hasan M. Jamil:
Towards a Novel Model for Distributed Big Data Service Composition Using Functional Graph Matching. 794-795
- Kerrie Holley, Gandhi Sivakumar, Kalapriya Kannan:
Enrichment Patterns for Big Data. 796-799
- Melyssa Barata, Jorge Bernardino, Pedro Furtado:
YCSB and TPC-H: Big Data and Decision Support Benchmarks. 800-801
- Jongbok Byun, Diane Rasmussen Pennington, Jorge Cardenas, Srabasti Dutta, Jeral Kirwan:
Understanding Student Behaviors in Online Classroom: Data Scientific Approach. 802-803
- Katherine G. Herbert, Emily Hill, Jerry Alan Fails, Joseph O. Ajala, Richard T. Boniface, Paul W. Cushman:
Scientific Data Infrastructure for Sustainability Science Mobile Applications. 804-805
- Andreiwid Sheffer Corrêa, Pedro Luiz Pizzigatti Corrêa, Daniel Lins da Silva, Flávio Soares Corrêa da Silva:
Really Opened Government Data: A Collaborative Transparency at Sight. 806-807
- Rebecca Copeland, Noël Crespi:
Classifying and Aggregating Context Attributes for Business Service Requests - No 'One-Size-Fits-All'. 808-815
- Lídice García Ríos, José Alberto Incera Diéguez:
Big Data Infrastructure for analyzing data generated by Wireless Sensor Networks. 816-823
- Dymitr Ruta:
Automated Trading with Machine Learning on Big Data. 824-830