


default search action
BigData Conference 2015: Santa Clara, CA, USA
- 2015 IEEE International Conference on Big Data (IEEE BigData 2015), Santa Clara, CA, USA, October 29 - November 1, 2015. IEEE Computer Society 2015, ISBN 978-1-4799-9926-2
- Léon Bottou
:
How big data changes statistical machine learning. 1 - H. V. Jagadish:
Moving past the "Wild West" era for Big Data. 2 - Ion Stoica:
Conquering Big Data with Spark. 3 - Ioanna Filippidou, Yannis Kotidis:
Online and on-demand partitioning of streaming graphs. 4-13 - Christos Anagnostopoulos
, Peter Triantafillou:
Learning to accurately COUNT with query-driven predictive analytics. 14-23 - Inho Cho, Soya Park, Sejun Park, Dongsu Han
, Jinwoo Shin:
Practical message-passing framework for large-scale combinatorial optimization. 24-31 - Padmashree Ravindra, HyeongSik Kim, Kemafor Anyanwu
:
Rewriting complex SPARQL analytical queries for efficient cloud-based processing. 32-37 - Salvador Aguiñaga, Aditya Nambiar, Zuozhu Liu, Tim Weninger:
Concept hierarchies and human navigation. 38-45 - Enric Junqué de Fortuny, Theodoros Evgeniou, David Martens, Foster J. Provost:
Iteratively refining SVMs using priors. 46-52 - Harish S. Bhat
, Nitesh Kumar, Garnet Jason Vaz:
Towards scalable quantile regression trees. 53-60 - Kilho Shin, Tetsuji Kuboyama
, Takako Hashimoto, Dave Shepard:
Super-CWC and super-LCC: Super fast feature selection algorithms. 61-67 - Don Libes, Seung-Jun Shin, Jungyub Woo:
Considerations and recommendations for data availability for data analytics for manufacturing. 68-75 - Toyotaro Suzumura, Koji Ueno:
ScaleGraph: A high-performance library for billion-scale graph analytics. 76-84 - Maria Malik, Setareh Rafatirah, Avesta Sasan, Houman Homayoun:
System and architecture level characterization of big data applications on big and little core server architectures. 85-94 - Ashwin Lall:
Data streaming algorithms for the Kolmogorov-Smirnov test. 95-104 - Jilong Kuang, Daniel G. Waddington, Changhui Lin:
Techniques for fast and scalable time series traffic generation. 105-114 - Katayoun Neshatpour, Maria Malik, Mohammad Ali Ghodrat, Avesta Sasan, Houman Homayoun:
Energy-efficient acceleration of big data analytics applications using FPGAs. 115-123 - Lorenz Fischer, Abraham Bernstein:
Workload scheduling in distributed stream processors using graph partitioning. 124-133 - Arghya Kusum Das, Seung-Jong Park
, Jae-Ki Hong, Wooseok Chang:
Evaluating different distributed-cyber-infrastructure for data and compute intensive scientific application. 134-143 - Vincenzo Gulisano
, Yiannis Nikolakopoulos, Marina Papatriantafilou
, Philippas Tsigas
:
Scalejoin: A deterministic, disjoint-parallel and skew-resilient stream join. 144-153 - Jilong Xue, Zhi Yang, Shian Hou, Yafei Dai:
When computing meets heterogeneous cluster: Workload assignment in graph computation. 154-163 - E. Preston Carman Jr.
, Till Westmann, Vinayak R. Borkar, Michael J. Carey, Vassilis J. Tsotras
:
A scalable parallel XQuery processor. 164-173 - Guoxin Liu, Haiying Shen, Haoyu Wang:
Computing load aware and long-view load balancing for cluster storage systems. 174-183 - Nam-Luc Tran, Thomas Peel, Sabri Skhiri:
Distributed frank-wolfe under pipelined stale synchronous parallelism. 184-192 - Michele Bertoni, Stefano Ceri, Abdulrahman Kaitoua, Pietro Pinoli
:
Evaluating cloud frameworks on genomic applications. 193-202 - Chenxi Qiu, Haiying Shen, Liuhua Chen:
Towards green cloud computing: Demand allocation and pricing policies for cloud service brokerage. 203-212 - Nikos Zacheilas, Vana Kalogeraki
, Nikolaos Zygouras, Nikolaos Panagiotou, Dimitrios Gunopulos
:
Elastic complex event processing exploiting prediction. 213-222 - Xi Yang, Ning Liu, Bo Feng, Xian-He Sun, Shujia Zhou:
PortHadoop: Support direct HPC data processing in Hadoop. 223-232 - John F. Canny, Huasha Zhao, Bobby Jaros, Ye Chen, Jiangchang Mao:
Machine learning at the limit. 233-242 - Nusrat Sharmin Islam, Md. Wasi-ur-Rahman, Xiaoyi Lu, Dipti Shankar, Dhabaleswar K. Panda:
Performance characterization and acceleration of in-memory file systems for Hadoop and Spark applications on HPC clusters. 243-252 - Serafettin Tasci, Murat Demirbas:
Panopticon: A lock broker architecture for scalable transactions in the datacenter. 253-262 - Dongfang Zhao, NagaPramod Mandagere, Gabriel Alatorre, Mohamed Mohamed, Heiko Ludwig:
Toward locality-aware scheduling for containerized cloud services. 263-270 - Min Du, Feifei Li:
ATOM: Automated tracking, orchestration and monitoring of resource usage in infrastructure as a service systems. 271-278 - Dongyao Wu, Sherif Sakr
, Liming Zhu
, Qinghua Lu:
Composable and efficient functional big data processing framework. 279-286 - Hyunjoo Kim, Sriganesh Madhvanath, Tong Sun:
Hybrid active learning for non-stationary streaming data with asynchronous labeling. 287-292 - Srikant Padala, Dinesh Kumar, Arun Raj, Janakiram Dharanipragada:
Octopus: A multi-job scheduler for Graphlab. 293-298 - Rubén Tous
, Anastasios Gounaris, Carlos Tripiana, Jordi Torres, Sergi Girona
, Eduard Ayguadé, Jesús Labarta
, Yolanda Becerra
, David Carrera
, Mateo Valero
:
Spark deployment and performance evaluation on the MareNostrum supercomputer. 299-306 - Zhenhua Chen, Jielong Xu, Jian Tang, Kevin A. Kwiat, Charles A. Kamhoua:
G-Storm: GPU-enabled high-throughput online data processing in Storm. 307-312 - Orcun Yildiz, Shadi Ibrahim, Tran Anh Phuong, Gabriel Antoniu:
Chronos: Failure-aware scheduling in shared Hadoop clusters. 313-318 - Kousuke Nakabasami, Toshiyuki Amagasa
, Salman Ahmed Shaikh
, Franck Gass, Hiroyuki Kitagawa
:
An architecture for stream OLAP exploiting SPE and OLAP engine. 319-326 - Wei Xie, Jiang Zhou, Mark Reyes, Jason Noble, Yong Chen
:
Two-mode data distribution scheme for heterogeneous storage in data centers. 327-332 - Teng Li, Jian Tang, Jielong Xu:
A predictive scheduling framework for fast and distributed stream data processing. 333-338 - Anthony Kleerekoper
, Michael Pappas, Adam Craig Pocock, Gavin Brown
, Mikel Luján:
A scalable implementation of information theoretic feature selection for high dimensional data. 339-346 - S. M. Faisal, Georgios Tziantzioulis, Ali Murat Gok, Nikolaos Hardavellas
, Seda Ogrenci Memik
, Srinivasan Parthasarathy
:
Edge importance identification for energy efficient graph processing. 347-354 - Keira Zhou, Jack Wadden, Jeffrey J. Fox
, Ke Wang, Donald E. Brown, Kevin Skadron
:
Regular expression acceleration on the micron automata processor: Brill tagging as a case study. 355-360 - Suprio Ray, Angela Demke Brown, Nick Koudas, Rolando Blanco, Anil K. Goel:
Parallel in-memory trajectory-based spatiotemporal topological join. 361-370 - Bin Dong, Surendra Byna
, Kesheng Wu
:
Spatially clustered join on heterogeneous scientific data sets. 371-380 - Chung-Yi Li, Wei-Lun Su, Todd G. McKenzie, Fu-Chun Hsu, Shou-De Lin
, Jane Yung-jen Hsu, Phillip B. Gibbons:
Recommending missing sensor values. 381-390 - Cheng-Te Li, Yu-Jen Lin, Mi-Yen Yeh
:
The roles of network communities in social information diffusion. 391-400 - Vasilis Efthymiou, Kostas Stefanidis
, Vassilis Christophides:
Big data entity resolution: From highly to somehow similar entity descriptions in the Web. 401-410 - Vasilis Efthymiou, George Papadakis
, George Papastefanatos
, Kostas Stefanidis
, Themis Palpanas:
Parallel meta-blocking: Realizing scalable entity resolution over large, heterogeneous data. 411-420 - Bogdan Simion, Daniel N. Ilha, Suprio Ray, Leslie Barron, Angela Demke Brown, Ryan Johnson:
Slingshot: A modular framework for designing data processing systems. 421-430 - Eser Kandogan, Mary Roth, Peter M. Schwarz, Joshua Hui, Ignacio G. Terrizzano, Christina Christodoulakis, Renée J. Miller:
LabBook: Metadata-driven social collaborative data analysis. 431-440 - Huseyin Ulusoy, Murat Kantarcioglu, Erman Pattuk:
TrustMR: Computation integrity assurance system for MapReduce. 441-450 - Huseyin Ulusoy, Murat Kantarcioglu, Erman Pattuk, Lalana Kagal:
AccountableMR: Toward accountable MapReduce systems. 451-460 - Eleazar Leal, Le Gruenwald, Jianting Zhang, Simin You:
TKSimGPU: A parallel top-K trajectory similarity query processing algorithm for GPGPUs. 461-469 - Anand Tripathi, Bhagavathi Dhass Thirunavukarasu:
A transaction model for management of replicated data with multiple consistency levels. 470-477 - Jianting Zhang, Simin You, Le Gruenwald:
Quadtree-based lightweight data compression for large-scale geospatial rasters on multi-core CPUs. 478-484 - Roee Ebenstein
, Gagan Agrawal:
DSDQuery DSI - Querying scientific data repositories with structured operators. 485-492 - Smruti Padhy, Greg Jansen
, Jay Alameda, Edgar F. Black, Liana Diesendruck, Mike Dietze, Praveen Kumar, Rob Kooper, Jong Lee
, Rui Liu, Richard Marciano, Luigi Marini, Dave Mattson, Barbara S. Minsker
, Chris Navarro, Marcus Slavenas, William C. Sullivan
, Jason Votava, Inna Zharnitsky, Kenton McHenry:
Brown Dog: Leveraging everything towards autocuration. 493-500 - Afsin Akdogan, Saratchandra Indrakanti, Ugur Demiryurek, Cyrus Shahabi:
Cost-efficient partitioning of spatial data on cloud. 501-506 - Pouria Pirzadeh, Michael J. Carey, Till Westmann:
BigFUN: A performance study of big data management system functionality. 507-514 - Tonglin Li, Ke Wang, Dongfang Zhao, Kan Qiao, Iman Sadooghi, Xiaobing Zhou, Ioan Raicu:
A flexible QoS fortified distributed key-value storage system for the cloud. 515-522 - Mahdi Ebrahimi, Aravind Mohan, Shiyong Lu, Robert G. Reynolds:
TPS: A task placement strategy for big data workflows. 523-530 - Yuqing Zhu
, Yilei Wang:
Improving transaction processing performance by consensus reduction. 531-538 - Dipti Shankar, Xiaoyi Lu, Md. Wasi-ur-Rahman, Nusrat S. Islam, Dhabaleswar K. Panda:
Benchmarking key-value stores on high-performance storage and interconnects for web-scale workloads. 539-544 - Roberto Tardío, Alejandro Maté
, Juan Trujillo
:
An iterative methodology for big data management, analysis and visualization. 545-550 - Chin-Chi Hsu, Perng-Hwa Kung, Mi-Yen Yeh
, Shou-De Lin
, Phillip B. Gibbons:
Bandwidth-efficient distributed k-nearest-neighbor search with dynamic time warping. 551-560 - Liang Zhao, Feng Chen, Chang-Tien Lu
, Naren Ramakrishnan
:
Dynamic theme tracking in Twitter. 561-570 - Sean Massung, ChengXiang Zhai:
SyntacticDiff: Operator-based transformation for comparative text mining. 571-580 - Yixian Zheng, Wenchao Wu
, Huamin Qu, Chunyan Ma, Lionel M. Ni:
Visual analysis of bi-directional movement behavior. 581-590 - Yuncheng Li, Tao Mei, Yang Cong, Jiebo Luo
:
User-curated image collections: Modeling and recommendation. 591-600 - Ke Wang, Ping Guo
, A-Li Luo
:
Angular quantization based affinity propagation clustering and its application to astronomical big spectra data. 601-608 - Yibo Yao, Lawrence B. Holder:
Scalable classification for large dynamic networks. 609-618 - Ruslan Mavlyutov, Philippe Cudré-Mauroux
:
CINTIA: A distributed, low-latency index for big interval data. 619-628 - Yang Wang, Kwan-Liu Ma:
Revealing the fog-of-war: A visualization-directed, uncertainty-aware approach for exploring high-dimensional data. 629-638 - Bokai Cao, Francine Chen, Dhiraj Joshi, Philip S. Yu:
Inferring crowd-sourced venues for tweets. 639-648 - Huanhuan Wu, James Cheng, Yi Lu, Yiping Ke
, Yuzhen Huang, Da Yan, Hejun Wu:
Core decomposition in large temporal graphs. 649-658 - Jason H. D. Cho, Yanen Li, Roxana Girju, Chengxiang Zhai:
Recommending forum posts to designated experts. 659-666 - Mark Gates
, Hartwig Anzt
, Jakub Kurzak, Jack J. Dongarra:
Accelerating collaborative filtering using concepts from high performance computing. 667-676 - Wei Xie, Feida Zhu
, Siyuan Liu, Ke Wang:
Modelling cascades over time in microblogs. 677-686 - Yasser Salem, Jun Hong, Weiru Liu
:
CSFinder: A cold-start friend finder in large-scale social networks. 687-696 - Hien To, Seon Ho Kim, Cyrus Shahabi:
Effectively crowdsourcing the acquisition and analysis of visual data for disaster response. 697-706 - Zhen Chen, Hanghang Tong
, Lei Ying
:
Full diffusion history reconstruction in networks. 707-716 - Demetris Trihinas, George Pallis
, Marios D. Dikaiakos:
AdaM: An adaptive monitoring framework for sampling and filtering on IoT devices. 717-726 - Suchismit Mahapatra
, Varun Chandola:
Modeling graphs using a mixture of Kronecker models. 727-736 - Stephen Bonner, Andrew Stephen McGough, Ibad Kureshi
, John Brennan
, Georgios Theodoropoulos
, Laura Moss
, David Corsar
, Grigoris Antoniou
:
Data quality assessment and anomaly detection via map/reduce and linked data: A case study in the medical domain. 737-746 - Tian Guo, Jean-Paul Calbimonte
, Hao Zhuang, Karl Aberer:
SigCO: Mining significant correlations via a distributed real-time computation engine. 747-756 - Yen-Kai Wang, Wei-Ming Chen, Cheng-Te Li, Shou-De Lin
:
Identifying smallest unique subgraphs in a heterogeneous social network. 757-766 - Jiejun Xu, Tsai-Ching Lu:
Toward precise user-topic alignment in online social media. 767-775 - Masahiko Itoh, Daisaku Yokoyama, Masashi Toyoda, Masaru Kitsuregawa:
Visual interface for exploring caution spots from vehicle recorder big data. 776-784 - Amir Bahmani, Frank Mueller:
ACURDION: An adaptive clustering-based algorithm for tracing large-scale MPI applications. 785-792 - Max C. Watson:
Time maps: A tool for visualizing many discrete events across multiple timescales. 793-800 - Xugang Ye, Zijie Qi, Dan Massey:
Learning relevance from click data via neural network based similarity models. 801-806 - Chad A. Steed
, Margaret Drouhard
, Justin M. Beaver
, Joshua Pyle, Paul Logasa Bogen:
Matisse: A visual analytics system for exploring emotion trends in social media text streams. 807-814 - Sihong Xie, Qingbo Hu, Jingyuan Zhang, Jing Gao, Wei Fan, Philip S. Yu:
Robust crowd bias correction via dual knowledge transfer from multiple overlapping sources. 815-820 - Deepika Lalwani, Durvasula V. L. N. Somayajulu, P. Radha Krishna:
A community driven social recommendation system. 821-826 - Yongfeng Zhang, Min Zhang, Yiqun Liu, Tat-Seng Chua, Yi Zhang
, Shaoping Ma:
Task-based recommendation on a web-scale. 827-836 - Xiaowei Jia, Aosen Wang, Xiaoyi Li, Guangxu Xun
, Wenyao Xu, Aidong Zhang:
Multi-modal learning for video recommendation based on mobile application usage. 837-842 - Xiaoyi Li, Xiaowei Jia, Guangxu Xun
, Aidong Zhang:
Improving EEG feature learning via synchronized facial video. 843-848 - Muyi Liu, Michael Gribskov
:
MMC-margin: Identification of maximum frequent subgraphs by metropolis Monte Carlo sampling. 849-856 - Yue Wang, Ke Wang, Ada Wai-Chee Fu, Raymond Chi-Wing Wong:
KeyLabel algorithms for keyword search in large graphs. 857-864 - Chung-Hsien Yu, Dong Luo, Wei Ding
, Joseph Paul Cohen, David L. Small, Shafiqul Islam:
Spatio-temporal asynchronous co-occurrence pattern for big climate data towards long-lead flood prediction. 865-870 - Luca Pappalardo
, Dino Pedreschi
, Zbigniew Smoreda
, Fosca Giannotti:
Using big data to study the link between human mobility and socio-economic development. 871-878 - Tri Kurniawan Wijaya, Matteo Vasirani, Samuel Humeau, Karl Aberer:
Cluster-based aggregate forecasting for residential electricity demand using smart meter data. 879-887 - Masayo Ota, Huy T. Vo, Cláudio T. Silva, Juliana Freire
:
A scalable approach for data-driven taxi ride-sharing simulation. 888-897 - Desheng Zhang, Ruobing Jiang
, Shuai Wang, Yanmin Zhu, Bo Yang, Jian Cao, Fan Zhang, Tian He:
EveryoneCounts: Data-driven digital advertising with uncertain demand model in metro networks. 898-907 - Liang Zhao, Wen-Zhan Song
, Xiaojing Ye:
Fast decentralized gradient descent method and applications to in-situ seismic tomography. 908-917 - Zhao Zhang, Kyle Barbary, Frank Austin Nothaft, Evan Randall Sparks, Oliver Zahn, Michael J. Franklin, David A. Patterson, Saul Perlmutter:
Scientific computing meets big data technology: An astronomy use case. 918-927 - Michael Nalisnik, David A. Gutman, Jun Kong, Lee A. D. Cooper:
An interactive learning framework for scalable classification of pathology images. 928-935 - Yu Wang, Jianbo Yuan, Jiebo Luo
:
America Tweets China: A fine-grained analysis of the state and individual characteristics regarding attitudes towards China. 936-943 - Yu Jin, Joseph F. JáJá, Rong Chen, Edward H. Herskovits:
A data-driven approach to extract connectivity structures from diffusion tensor imaging data. 944-951 - Georgios Chatzigeorgakidis, Sophia Karagiorgou, Spiros Athanasiou, Spiros Skiadopoulos
:
A MapReduce based k-NN joins probabilistic classifier. 952-957 - Alessandro Lulli, Thibault Debatty, Matteo Dell'Amico
, Pietro Michiardi, Laura Ricci
:
Scalable k-NN based text clustering. 958-963 - Yuwen Chen, Jian Cao, Shanshan Feng, Yudong Tan:
An ensemble learning based approach for building airfare forecast service. 964-969 - Mack Sweeney, Jaime Lester, Huzefa Rangwala:
Next-term student grade prediction. 970-975 - Sofia Apreleva, Alejandro Cantarero:
Predicting the location of users on Twitter from low density graphs. 976-983 - Elias Alevizos
, Alexander Artikis, Kostas Patroumpas, Marios Vodas
, Yannis Theodoridis
, Nikos Pelekis:
How not to drown in a sea of information: An event recognition approach. 984-990 - Jiaoyan Chen, Huajun Chen, Daning Hu, Jeff Z. Pan, Yalin Zhou:
Smog disaster forecasting using social web data and physical sensor data. 991-998 - Kamalika Das, Kanishka Bhaduri, Bryan L. Matthews, Nikunj C. Oza:
Large scale support vector regression for aviation safety. 999-1006 - Lorenzo Gabrielli, Barbara Furletti, Roberto Trasarti
, Fosca Giannotti, Dino Pedreschi
:
City users' classification with mobile phone data. 1007-1012 - Anas Abu-Doleh, Ümit V. Çatalyürek:
Spaler: Spark and GraphX based de novo genome assembler. 1013-1018 - Florin Schimbinschi, Xuan Vinh Nguyen
, James Bailey, Christopher Leckie
, Hai Le Vu
, Rao Kotagiri
:
Traffic forecasting in complex urban networks: Leveraging big data and machine learning. 1019-1024 - Karla L. Caballero Barajas, Ram Akella:
Prediction of physiological subsystem failure and its impact in the prediction of patient mortality. 1025-1030 - Fei Shao, Li-Yung Ho, Jan-Jan Wu, Pangfeng Liu
:
Efficient distributed maximum matching for solving the container exchange problem in the maritime industry. 1031-1036 - Robert P. Trevino, Steve A. Kawamoto, Thomas J. Lamkin, Huan Liu:
Cell analytics in compound hit selection of bacterial inhibitors. 1037-1042 - Xiuqiang He, Wenyuan Dai, Guoxiang Cao, Ruiming Tang
, Mingxuan Yuan, Qiang Yang:
Mining target users for online marketing based on App Store data. 1043-1052 - Ahmed Metwally, Jia-Yu Pan, Minh Doan, Christos Faloutsos:
Scalable community discovery from multi-faceted graphs. 1053-1062 - Ernesto Diaz-Aviles, Fabio Pinelli
, Karol Lynch, Zubair Nabi, Yiannis Gkoufas, Eric Bouillet, Francesco Calabrese, Eoin Coughlan, Peter Holland, Jason Salzwedel:
Towards real-time customer experience prediction for telecommunication operators. 1063-1072 - I. Stephen Choi, Weiqing Yang, Yang-Suk Kee:
Early experience with optimizing I/O performance using high-performance SSDs for in-memory cluster computing. 1073-1083 - Hyunsik Choi, Jongyoung Park, Yong In Lee, Kangho Roh, Kwanghyun La:
An evaluation of alternative shared-nothing architecture for analytical processing systems. 1084-1093 - Anjan Goswami, Wei Han, Zhenrui Wang, Angela Jiang:
Controlled experiments for decision-making in e-Commerce search. 1094-1102 - Jenny Weisenberg Williams, Paul Cuddihy, Justin McHugh, Kareem S. Aggour
, Arvind Menon, Steven M. Gustafson, Timothy Healy:
Semantics for Big Data access & integration: Improving industrial equipment design through increased data usability. 1103-1112 - Laura Rettig, Mourad Khayati
, Philippe Cudré-Mauroux
, Michal Piórkowski:
Online anomaly detection over Big Data streams. 1113-1122 - Aungon Nag Radon, Ke Wang, Uwe Glässer, Hans Wehn, Andrew Westwell-Roper:
Contextual verification for false alarm reduction in maritime anomaly detection. 1123-1133 - Tanay Kumar Saha
, Mohammad Al Hasan, Chandler Burgess, Md. Ahsan Habib, Jeff Johnson:
Batch-mode active learning for technology-assisted review. 1134-1143 - Mayank Kejriwal, Qiaoling Liu, Ferosh Jacob, Faizan Javed:
A pipeline for extracting and deduplicating domain-specific knowledge bases. 1144-1153 - Fang-Hsiang Su, Manas Somaiya, Shrish Mishra, Rajyashree Mukherjee:
EXOS: Expansion on session for enhancing effectiveness of query auto-completion. 1154-1163 - Gergely Ács, Jagdish Prasad Achara, Claude Castelluccia:
Probabilistic km-anonymity efficient anonymization of large set-valued datasets. 1164-1173 - Sauptik Dhar, Congrui Yi, Naveen Ramakrishnan, Mohak Shah:
ADMM based scalable machine learning on Spark. 1174-1182 - Dapeng Dong
, John Herbert:
Record-aware compression for big textual data analysis acceleration. 1183-1190 - Alekh Jindal, Samuel Madden, Malú Castellanos, Meichun Hsu:
Graph analytics using vertica relational database. 1191-1200 - André Luckow, Ken Kennedy, Fabian Manhardt
, Emil Djerekarov, Bennie Vorster, Amy W. Apon:
Automotive big data: Applications, workloads and infrastructures. 1201-1210 - Goktug T. Cinar
, Jeffrey Thompson, Soundar Srinivasan:
Cost-sensitive optimization of automated inspection. 1211-1219 - Nicolás Poggi
, Josep Lluis Berral
, David Carrera
, Aaron Call
, Fabrizio Gagliardi, Rob Reinauer, Nikola Vujic, Daron Green, José A. Blakeley:
From performance profiling to predictive analytics while evaluating hadoop cost-efficiency in ALOJA. 1220-1229 - Mohammed Korayem, Camilo Ortiz, Khalifeh AlJadda, Trey Grainger:
Query sense disambiguation leveraging large scale user behavioral data. 1230-1237 - Viet Ha-Thuc, Ganesh Venkataraman, Mario Rodriguez, Shakti Sinha, Senthil Sundaram, Lin Guo:
Personalized expertise search at LinkedIn. 1238-1247 - Vinay Deolalikar:
How valuable is your data? A quantitative approach using data mining. 1248-1253 - Kang Li, Vinay Deolalikar, Neeraj Pradhan:
Mining lifestyle personas at scale in e-commerce. 1254-1261 - Petros Zerfos, Hangu Yeo, Brent D. Paulovicks, Vadim Sheinin:
SDFS: Secure distributed file system for data-at-rest security for Hadoop-as-a-service. 1262-1271 - Sreenivas R. Sukumar:
Open research challenges with Big Data - A data-scientist's perspective. 1272-1278 - Hamed Yaghoubi Shahir, Uwe Glässer, Amir Yaghoubi Shahir, Hans Wehn:
Maritime situation analysis framework: Vessel interaction classification and anomaly detection. 1279-1289 - Levente J. Klein, Fernando J. Marianno, Conrad M. Albrecht
, Marcus Freitag, Siyuan Lu, Nigel Hinds, Xiaoyan Shao, Sergio Bermudez Rodriguez, Hendrik F. Hamann:
PAIRS: A scalable geo-spatial data analytics platform. 1290-1298 - Jayasimha Katukuri, Tolga Könik, Rajyashree Mukherjee, Santanu Kolay:
Post-purchase recommendations in large-scale online marketplaces. 1299-1305 - Hong-Han Shuai
, Chih-Ya Shen, Hsiang-Chun Hsu, De-Nian Yang
, Chung-Kuang Chou
, Jihg-Hong Lin, Ming-Syan Chen
:
Revenue maximization for telecommunications company with social viral marketing. 1306-1310 - Stephanie Rosenthal, Scott McMillan, Matthew E. Gaston:
Developer toolchains for large-scale analytics: Two case studies. 1311-1316 - Ramakrishna Vadakattu, Bibek Panda, Swarnim Narayan, Harshal Godhia:
Enterprise subscription churn prediction. 1317-1321 - Joshua Seeger, Aron Culotta, Jason Keller, Patrick van Kessel, Michael Jugovich:
Data deidentification in medical transcriptions using regular expressions and machine learning. 1322-1323 - Qinlong Luo, Meng Zhao, Faizan Javed, Ferosh Jacob:
Macau: Large-scale skill sense disambiguation in the online recruitment domain. 1324-1329 - Wei-Yi Liu, Hui-I Hsiao, Shih-Yao Dai:
Genomic analysis with MapReduce. 1330-1335 - Chaitali Gupta, Ranjan Sinha, Yong Zhang:
Eagle: User profile-based anomaly detection for securing Hadoop clusters. 1336-1343 - Manuel Diaz-Granados, Javier Diaz Montes, Manish Parashar:
Investigating insurance fraud using social media. 1344-1349 - Luca Cazzanti, Leonardo Maria Millefiori
, Gianfranco Arcieri:
A document-based data model for large scale computational maritime situational awareness. 1350-1356 - Jhao-Yin Li, Mi-Yen Yeh, Ming-Syan Chen
, Jihg-Hong Lin:
Modeling social influences from call records and mobile web browsing histories. 1357-1361 - Christian Seebode, Matthias Ort, Peter Hufnagl, Christian R. A. Regenbrecht
:
Next generation biobanks. 1362-1367 - Mikel Nino
, José Miguel Blanco
, Arantza Illarramendi
:
Business understanding, challenges and issues of Big Data Analytics for the servitization of a capital equipment manufacturer. 1368-1377 - Divya Sardana, Raj Bhatnagar, Radu Pavel, Jonathan Iverson:
Data driven predictive analytics for a spindle's health. 1378-1387 - Yunpeng Li, Utpal Roy, Seung-Jun Shin, Y. Tina Lee:
A "smart component" data model in PLM. 1388-1397 - Nenad Stojanovic, Marko Dinic, Ljiljana Stojanovic:
Big data process analytics for continuous process improvement in manufacturing. 1398-1407 - Saideep Nannapaneni, Sankaran Mahadevan, David Lechevalier, Anantha Narayanan, Sudarsan Rachuri
:
Automated uncertainty quantification analysis using a system model and data. 1408-1417 - Alexander Brodsky, Guodong Shao, Mohan Krishnamoorthy, Anantha Narayanan, Daniel A. Menascé, Ronay Ak:
Analysis and optimization in smart manufacturing based on a reusable knowledge base for process performance models. 1418-1427 - David Lechevalier, Steven Hudak, Ronay Ak, Y. Tina Lee, Sebti Foufou:
A neural network meta-model and its application for manufacturing. 1428-1435 - Luca Oneto
, Ilenia Orlandi, Davide Anguita
:
Performance assessment and uncertainty quantification of predictive models for smart manufacturing systems. 1436-1445 - Ashwin K. Thillai Natarajan, Sagar V. Kamarthi:
Time complexity and architecture of a cloud based prognostics system for a multi-client condition monitoring activity. 1446-1450 - Jinkyoo Park, Kincho H. Law, Raunak Bhinge, Mason Chen, David Dornfeld, Sudarsan Rachuri
:
Real-time energy prediction for a milling machine tool using sparse Gaussian process regression. 1451-1460 - Kannan Govindarajan, David Boulanger, Vivekanandan Suresh Kumar
, Kinshuk:
Parallel Particle Swarm Optimization (PPSO) clustering for learning analytics. 1461-1465 - Gökhan Silahtaroglu
, Hale Donertasli:
Analysis and prediction of Ε-customers' behavior by mining clickstream data. 1466-1472 - Jeyhun Karimov, A. Murat Ozbayoglu
:
High quality clustering of big data and solving empty-clustering problem with an evolutionary hybrid algorithm. 1473-1478 - Renaud Richardet, Jean-Cédric Chappelier, Shreejoy J. Tripathy
, Sean L. Hill:
Agile text mining with Sherlok. 1479-1484 - Golnoosh Farnadi
, Zeinab Mahdavifar, Ivan Keller, Jacob Nelson, Ankur Teredesai, Marie-Francine Moens, Martine De Cock
:
Scalable adaptive label propagation in Grappa. 1485-1491 - Kasim Oztoprak:
Profiling subscribers according to their internet usage characteristics and behaviors. 1492-1499 - Magdalini Eirinaki
, Sweta Patel:
QueRIE reloaded: Using matrix factorization to improve database query recommendations. 1500-1508 - Ran Pang, Agustin Baretto, Henry A. Kautz
, Jiebo Luo
:
Monitoring adolescent alcohol use via multimodal analysis in social multimedia. 1509-1518 - Raj Bhatnagar, Lalit Kumar:
An efficient map-reduce algorithm for computing formal concepts from binary data. 1519-1528 - Jagadeesh Patchala, Raj Bhatnagar:
Learning relaxed 3-clusters from pairs of related datasets. 1529-1538 - Jun Meng, Rui Li, Jing Zhang:
Parallel information fusion method for microarray data analysis. 1539-1544 - Xuezhi Ji, Lixiang Liu, Pei Zhao, Dapeng Wang:
A-Star algorithm based on-demand routing protocol for hierarchical LEO/MEO satellite networks. 1545-1549 - Lukasz Sosnowski
, Marcin S. Szczuka
, Dominik Slezak:
Granular modeling with fuzzy comparators. 1550-1555 - I-Jen Chiang:
Agglomerative algorithm to discover semantics from unstructured big data. 1556-1563 - Alexander Denzler, Marcel Wehrle, Andreas Meier:
A granular approach for identifying user knowledge. 1564-1569 - Liang Wu, Teng-Sheng Moh, Natalia Khuri:
Twitter opinion mining for adverse drug reactions. 1570-1574 - Shusaku Tsumoto, Shoji Hirano, Haruko Iwata:
Data decomposition and dual clustering for clinical care management. 1475-1584 - Maria Pershina, Mohamed Yakout, Kaushik Chakrabarti:
Holistic entity matching across knowledge graphs. 1585-1590 - Zehua Chen, He Ma, Yu Zhang:
GrC-based statistic optimization algorithm for big truth table. 1591-1596 - Patrick G. Clark, Jerzy W. Grzymala-Busse:
Mining incomplete data with many attribute-concept values and "do not care" conditions. 1597-1602 - Tsau-Young Lin:
Chinese wall security policies information flows in business cloud. 1603-1607 - Shusaku Tsumoto, Shoji Hirano:
Granular formalization of medical diagnostic process. 1608-1614 - Karan Khare, Teng-Sheng Moh:
Mobile gesture-based iPhone user authentication. 1615-1621 - Chris Tseng, Tien Nguyen, Chetan Sharma:
Cost and data exploration considerations for big data prediction on the cloud. 1622-1628 - Chao-Lin Liu, Chih-Kai Huang, Hongsu Wang, Peter K. Bol:
Mining local gazetteers of literary Chinese with CRF and pattern based methods for biographical information in Chinese history. 1629-1638 - Giles Greenway, Leonard Mack, Tobias Blanke
, Mark Coté
, Tom Heath:
Towards a mobile social data commons. 1639-1642 - Matthew Coole
, Paul Rayson
, John A. Mariani
:
Scaling out for extreme scale corpus data. 1643-1649 - Stefan Pernes:
Metaphor mining in historical german novels: An unsupervised learning approach. 1650-1652 - Mehrdad Yazdani, Lev Manovich:
Predicting social trends from non-photographic images on Twitter. 1653-1660 - Dallas Liddle:
The coding of literary form: Data mining and the information structure of historical texts. 1661-1666 - Benjamin M. Schmidt:
Plot arceology: A vector-space model of narrative structure. 1667-1672 - Ben Miller, Jennifer Olive, Shakthidhar Reddy Gopavaram, Yanjun Zhao, Ayush Shrestha, Cynthia Berger:
A method for cross-document narrative alignment of a two-hundred-sixty-million word corpus. 1673-1677 - Nadya A. Calderón, Brian D. Fisher
, Jeff J. Hemsley
, Billy Ceskavich, Greg Jansen
, Richard Marciano
, Victoria L. Lemieux
:
Mixed-initiative social media analytics at the World Bank: Observations of citizen sentiment in Twitter data to explore "trust" of political actors and state institutions and its relationship to social protest. 1678-1687 - Ilir Fetai, Damian Murezzan, Heiko Schuldt
:
Workload-driven adaptive data partitioning and distribution - The Cumulus approach. 1688-1697 - Gabor Madl, Ramani Routray, Yang Song, Rakesh Jain:
Account clustering in multi-tenant storage management environments. 1698-1707 - Marlon McKenzie, Hua Fan
, Wojciech M. Golab:
Fine-tuning the consistency-latency trade-off in quorum-replicated distributed storage systems. 1708-1717 - Sathiya Prabhu Kumar, Sylvain Lefebvre
, Minyoung Kim, Mark-Oliver Stehr:
Priority register: Application-defined replacement orderings for ad hoc reconciliation. 1718-1727 - Matthew Bihis, Sohini Roychowdhury:
A generalized flow for multi-class and binary classification tasks: An Azure ML approach. 1728-1737 - Alexander Stiemer
, Ilir Fetai, Heiko Schuldt
:
Comparison of eager and quorum-based replication in a cloud environment. 1738-1748 - Alexander Lenk, Leif Bonorden
, Astrid Hellmanns, Nico Rödder, Stefan Jähnichen:
Towards a taxonomy of standards in smart data. 1749-1754 - Nan Zhu, Wenbo He
, Yu Hua, Yixin Chen:
Marlin: Taming the big streaming data in large scale video similarity search. 1755-1764 - Chong Zhang, Xiaoying Chen, Bin Ge, Weidong Xiao:
Indexing historical spatio-temporal data in the cloud. 1765-1774 - Vladimir Grupcev, Yi-Cheng Tu, Joseph C. Fogarty, Sagar Pandit:
Push-based system for molecular simulation data analysis. 1775-1784 - Guan Xu, Jun Yang, Bin Dai:
Challenges and opportunities on network resource management in DCN with SDN. 1785-1790 - Lijia Lu, Hui Li
, Jun Chen, Bing Zhu, Weijuan Yin:
On the implementation of Zigzag codes for distributed storage system. 1791-1796 - Arun Kumar Kalakanti, Vinay Sudhakaran, Varsha Raveendran, Nisha Menon:
A comprehensive evaluation of NoSQL datastores in the context of historians and sensor data analysis. 1797-1806 - Harris T. Lin
, Ngot Bui, Vasant G. Honavar
:
Learning classifiers from remote RDF data stores augmented with RDFS subclass hierarchies. 1807-1813 - Guoyao Feng, Xiao Meng, Khaled Ammar:
DISTINGER: A distributed graph data structure for massive dynamic graph processing. 1814-1822 - Olivier Curé, Hubert Naacke, Tendry Randriamalala
, Bernd Amann
:
LiteMat: A scalable, cost-efficient inference encoding scheme for large RDF graphs. 1823-1830 - Alireza Rezaei Mahdiraji, Peter Baumann
:
MQuery: A query language for scientific meshes. 1831-1838 - Shaikh Arifuzzaman, Maleq Khan, Madhav V. Marathe:
A fast parallel algorithm for counting triangles in graphs using dynamic load balancing. 1839-1847 - Janani Balaji, Rajshekhar Sunderraman:
Scalable storage structure for pattern matching on big graph data. 1848-1855 - Serafettin Tasci, Murat Demirbas:
Employing in-memory data grids for distributed graph processing. 1856-1864 - Ather Sharif, Sarah Cooney, Shengqi Gong, Drew Vitek:
Current security threats and prevention measures relating to cloud services, Hadoop concurrent processing, and big data. 1865-1870 - Jinoh Kim, Bin Dong, Surendra Byna
, Kesheng Wu
:
Security for the scientific data services framework. 1871-1875 - Santosh Aditham, Nagarajan Ranganathan:
A novel framework for mitigating insider attacks in big data systems. 1876-1885 - Katerina Doka, Mingqiang Xue, Dimitrios Tsoumakos, Panagiotis Karras, Alfredo Cuzzocrea, Nectarios Koziris:
Heterogeneous k-anonymization with high utility. 1886-1890 - Lee A. Carraher, Philip A. Wilsey, Anindya Moitra, Sayantan Dey
:
Multi-probe random projection clustering to secure very large distributed datasets. 1891-1900 - Dymitr Ruta, Ling Cen, Ernesto Damiani
:
Fast summarization and anonymization of multivariate big time series. 1901-1904 - Ernesto Damiani
:
Toward big data risk analysis. 1905-1909 - Alfredo Cuzzocrea, Gianluigi Folino
, Pietro Sabatino:
A distributed framework for supporting adaptive ensemble-based intrusion detection. 1910-1916 - Andy Bengel, Amin Shawki, Dippy Aggarwal:
Simplifying web analytics for digital marketing. 1917-1918 - Dawn N. Jutla, Peter Bodorik
:
PAUSE: A privacy architecture for heterogeneous big data environments. 1919-1928 - Xiaoying Chen, Chong Zhang, Bin Ge, Weidong Xiao:
Spatio-temporal queries in HBase. 1929-1937 - V. Gyurjyan, Aron Bartle, Constantine Lukashin, S. Mancilla, R. Oyarzun, A. Vakhnin:
Component based dataflow processing framework. 1938-1942 - Constantine Lukashin, Aron Bartle, E. Callaway, V. Gyijrjyan, S. Mancilla, R. Oyarzun, A. Vakhnin:
Earth science data fusion with event building approach. 1943-1947 - Seungwon Lee, Lei Pan, Chengxing Zhai, Benyang Tang, Terry Kubar, Jia Zhang, Wei Wang:
Climate model diagnostic analyzer. 1948-1952 - David Haynes
, Suprio Ray, Steven M. Manson
, Ankit Soni:
High performance analysis of big spatial data. 1953-1957 - Akinori Asahara, Hideki Hayashi, Nobuhiro Ishimaru
, Ryosuke Shibasaki, Hiroshi Kanasugi:
International standard "OGC® moving features" to address "4Vs" on locational bigdata. 1958-1966 - Luis A. Lopez, Ruth E. Duerr
, Siri Jodha Singh Khalsa
:
Optimizing apache nutch for domain specific crawling at large scale. 1967-1971 - Shujia Zhou, Xi Yang, Xiaowen Li, Toshihisa Matsui, Si Liu, Xian-He Sun, Wei-Kuo Tao:
A Hadoop-based visualization and diagnosis framework for earth science data. 1972-1977 - Saman Biookaghazadeh, Yiqi Xu, Shujia Zhou, Ming Zhao:
Enabling scientific data storage and processing on big-data systems. 1978-1984 - Kevin Paul, Sheri A. Mickelson, John M. Dennis, Haiying Xu, David Brown:
Light-weight parallel Python tools for earth system modeling workflows. 1985-1994 - In Kee Kim, Jacob Steele, Anthony M. Castronova
, Jonathan L. Goodall
, Marty Humphrey:
WDCloud: An end to end system for large-scale watershed delineation on cloud. 1995-2004 - Lesley Wyborn, Benjamin J. K. Evans:
Integrating 'Big' geoscience data into the petascale national environmental research interoperability platform (NERDIP): Successes and unforeseen challenges. 2005-2009 - Fatih Akdag, Christoph F. Eick:
An optimized interestingness hotspot discovery framework for large gridded spatio-temporal datasets. 2010-2019 - Rahul Palamuttam, Renato Javier Marroquín Mogrovejo, Chris Mattmann, Brian Wilson, Kim Whitehall, Rishi Verma, Lewis J. McGibbney, Paul M. Ramirez:
SciSpark: Applying in-memory distributed computing to weather event detection and tracking. 2020-2026 - Amelia Yzaguirre, Robert Warren, Mike Smit:
Detecting environmental disasters in digital news archives. 2027-2035 - Yuzhong Yan, Lei Huang, Liqi Yi:
Is Apache Spark scalable to seismic data analytics and computations? 2036-2045 - Peter Baumann
, Vlad Merticariu:
On the efficient evaluation of array joins. 2046-2055 - Torsten Priebe
, Stefan Markus:
Business information modeling: A methodology for data-intensive projects, data science and big data governance. 2056-2065 - Jeffrey S. Saltz
:
The need for new processes, methodologies and tools to support big data teams and improve big data project effectiveness. 2066-2071 - Manirupa Das, Renhao Cui, David R. Campbell, Gagan Agrawal, Rajiv Ramnath:
Towards methods for systematic research on big data. 2072-2081 - Marco Pospiech, Carsten Felden:
Towards a big data theory model. 2082-2090 - Kerk F. Kee:
Three critical matters in big data projects for e-science: Different user groups, the mutually constitutive perspective, and virtual organizational capacity. 2091-2097 - Jeffrey S. Saltz
, Ivan Shamshurin:
Exploring the process of doing data science via an ethnographic study of a media advertising company. 2098-2105 - Dazhi Yang
, Gary S. W. Goh, Chi Xu, Allan N. Zhang
, Orkan Akcan:
Forecast UPC-level FMCG demand, Part I: Exploratory analysis and visualization. 2106-2112 - Dazhi Yang
, Gary S. W. Goh, Siwei Jiang, Allan N. Zhang
, Orkan Akcan:
Forecast UPC-level FMCG demand, Part II: Hierarchical reconciliation. 2113-2121 - B. Y. Ong, S. W. Goh, Chi Xu:
Sparsity adjusted information gain for feature selection in sentiment analysis. 2122-2128 - Sam Iosevich, Georgiy Arutyunyants, Z. Hou:
Dynamic aggregation for time series forecasting. 2129-2131 - Wenjing Yan, Xianshun Chen, Orkan Akcan, Jasmine J. Lim, Dazhi Yang
:
Big data analytics for empowering milk yield prediction in dairy supply chains. 2132-2137 - Gürdal Ertek
, Xu Chi, Gabriel Yee, Ong Boon Yong, Byung-Geun Choi:
Profit estimation error analysis in recommender systems based on association rules. 2138-2142 - Gürdal Ertek
, Byung-Geun Choi, Xu Chi, Dazhi Yang
, Ong Boon Yong:
Graph-based analysis of resource dependencies in project networks. 2143-2149 - Prapa Rattadilok
, John A. W. McCall
, Trevor Burbridge, Andrea Soppera, Philip Eardley:
A data fusion framework for large-scale measurement platforms. 2150-2158 - Nijat Mehdiyev, Julian Krumeich, Dirk Werth, Peter Loos:
Sensor event mining with hybrid ensemble learning and evolutionary feature subset selection model. 2159-2168 - Huikyo Lee, Luca Cinquini, Daniel J. Crichton, Amy Braverman:
Optimization of system architecture for Big Data analysis in climate science. 2169-2172 - Goutham Kamath
, Wen-Zhan Song
:
In-situ analytics for tomographic imaging in sensor network. 2173-2176 - Beth Huffer, Marc Cotnoir, Jonathan Gleason:
Ontology-drive data access at the NASA earth exchange. 2177-2181 - Dean N. Williams, Michael Lautenschlager, Venkatramani Balaji, Luca Cinquini, Cecelia DeLuca, Sebastien Denvil
, Daniel Duffy, Benjamin J. K. Evans, Robert D. Ferraro
, Martin Juckes, Claire Trenham
:
Strategie roadmap for the earth system grid federation. 2182-2190 - Shin'ichi Takeuchi, Komei Sugiura, Yuhei Akahoshi, Koji Zettsu:
Constrained region selection method based on configuration space for visualization in scientific dataset search. 2191-2200 - Peter Baumann
, Dimitar Misev:
Enhancing science support in SQL. 2201-2204 - Ramezan Paravi Torghabeh, Narayana Prasad Santhanam:
Modeling community detection using slow mixing random walks. 2205-2211 - Jorge David Destephen Lavaire, Anshuman Singh, Mahmoud Yousef, Sumi Singh, Xiaodong Yue:
Dimensional scalability of supervised and unsupervised concept drift detection: An empirical study. 2212-2218 - Spiros V. Georgakopoulos
, Sotiris K. Tasoulis, Vassilis P. Plagianakos:
Efficient change detection for high dimensional data streams. 2219-2222 - Charalampos Chelmis
, Jahanvi Kolte, Viktor K. Prasanna:
Big data analytics for demand response: Clustering over space and time. 2223-2232 - Fatimah Binta Abdullahi
, Frans Coenen
, Russell Martin:
Finding banded patterns in big data using sampling. 2233-2242 - Gheorghi Guzun, Joel E. Tosado, Guadalupe Canahuate:
Scalable preference queries for high-dimensional data using map-reduce. 2243-2252 - Chuan Hu, Huiping Cao
:
Discovering time-evolving influence from dynamic heterogeneous graphs. 2253-2262 - Kanji Matsutani, Masahito Kumano, Masahiro Kimura, Kazumi Saito, Kouzou Ohara, Hiroshi Motoda:
Combining activity-evaluation information with NMF for trust-link prediction in social media. 2263-2272 - Nemanja Spasojevic, Adithya Rao:
Identifying actionable messages on social media. 2273-2281 - Adithya Rao, Nemanja Spasojevic, Zhisheng Li, Trevor DSouza:
Klout score: Measuring influence across multiple social networks. 2282-2289 - Fei Liu, Yan Jia:
Top (k1, k2) Distance-based outliers detection in an uncertain dataset. 2290-2299 - Guirong Chen, Ning Wang, Fengqin Zhang, Hua Jiang:
Understanding the time characteristic of user behavior on online forums. 2300-2306 - Yu Liu, Bin Wu, Bai Wang:
Characterizing super spreading in microblog: An epidemic-based model. 2307-2313 - Yang Wang, Liutong Xu, Bin Wu:
A community detection method based on K-shell. 2314-2319 - Divya Rao, Wee Keong Ng
:
How much is your information worth - A method for revenue generation for your information. 2320-2326 - Rong Gu, Yun Tang, Zhaokang Wang, Shuai Wang, Xusen Yin, Chunfeng Yuan, Yihua Huang:
Efficient large scale distributed matrix computation with spark. 2327-2336 - Bailing Wang, Junheng Huang, Libing Ou, Rui Wang:
A collaborative filtering algorithm fusing user-based, item-based and social networks. 2337-2343 - Man Li, Ruisheng Shi:
Mining the relation between dorm arrangement and student performance. 2344-2347 - Fang Lv, Bailing Wang, Junheng Huang, Yushan Sun, Yuliang Wei:
A proactive discovery and filtering solution on phishing websites. 2348-2355 - Yunlei Zhang, Bin Wu:
Finding community structure via rough K-means in social network. 2356-2361 - Shuang Zhang, Xuefeng Zheng, Changjun Hu:
A survey of semantic similarity and its application to social network analysis. 2362-2367 - Fei Jiang, Jin Xu:
Dynamic community detection based on game theory in social networks. 2368-2373 - Michel de Rougemont, Guillaume Vimont:
The value of analytical queries on Social Networks. 2374-2383 - Rui Wang, Bailing Wang, Junheng Huang:
A collaborative filtering algorithm based on social network information. 2384-2389 - Alina Campan, Traian Marius Truta, Matthew Beckerich:
Efficient approximation algorithms to determine minimum partial dominating sets in social networks. 2390-2397 - Garisha Chowdhary, Sanghamitra Bandyopadhyay:
Ties that matter. 2398-2403 - Hao Wang, Jorge A. Castanon:
Sentiment expression via emoticons on social media. 2404-2408 - Michael L. Nelson
, Sridhar Radhakrishnan, Amlan Chatterjee, Chandra N. Sekharan:
On compressing massive streaming graphs with Quadtrees. 2409-2417 - Benjamin Flesch, Ravi Vatrapu
, Raghava Rao Mukkamala
, Abid Hussain:
Social set visualizer: A set theoretical approach to big social data analytics of real-world events. 2418-2427 - Gavin Smith, James Goulding:
A novel symbolization technique for time-series outlier detection. 2428-2436 - Jian Zou, Yunbo An, Hong Yan:
Volatility matrix inference in high-frequency finance with regularization and efficient computations. 2437-2444 - Oliver Bieh-Zimmert, Carsten Felden:
Shaping data: Visualization under construction. 2445-2452 - Margaret Drouhard
, Chad A. Steed
, Steven E. Hahn
, Thomas Proffen
, Jamison Daniel, Michael A. Matheson:
Immersive visualization for materials science data analysis using the Oculus Rift. 2453-2461 - Hideki Hayashi, Akinori Asahara, Natsuko Sugaya, Yuichi Ogawa, Hitoshi Tomita:
Spatio-temporal similarity search method for disaster estimation. 2462-2469 - Hui Zhang, Riqing Chen, Guangchen Ruan, Masatoshi Ando:
Scalable dental computing on cyberinfrastructure. 2470-2478 - Christopher Jordan, David Walling, Weijia Xu, Stephen A. Mock, Niall Gaffney, Dan Stanzione:
Wrangler's user environment: A software framework for management of data-intensive computing system. 2479-2486 - Wanbo Luo, Hui Zhang:
Visual analysis of large-scale LiDAR point clouds. 2487-2492 - Yin Huang, Yelena Yesha, Shujia Zhou:
A database-based distributed computation architecture with Accumulo and D4M: An application of eigensolver for large sparse matrix. 2493-2500 - Jieting Wu, Lina Yu, Hongfeng Yu:
Texture-based edge bundling: A web-based approach for interactively visualizing large graphs. 2501-2508 - Jianwu Wang, Daniel Crawl, Shweta Purawat, Mai H. Nguyen, Ilkay Altintas:
Big data provenance: Challenges, state of the art and opportunities. 2509-2516 - Ruizhu Huang, Weijia Xu:
Performance evaluation of enabling logistic regression for big data with R. 2517-2524 - Shinichi Yamagiwa, Yoshinobu Kawahara
, Noriyuki Tabuchi, Yoshinobu Watanabe, Takeshi Naruo:
Skill grouping method: Mining and clustering skill differences from body movement BigData. 2525-2534 - Vilen Jumutc
, Rocco Langone
, Johan A. K. Suykens
:
Regularized and sparse stochastic k-means for distributed large-scale clustering. 2535-2540 - Ran Rui, Hao Li, Yi-Cheng Tu:
Join algorithms on GPUs: A revisit after seven years. 2541-2550 - Martha Ganser, Sauptik Dhar, Unmesh Kurup, Carlos Cunha, Aca Gacic:
A data-driven approach towards patient identification for telehealth programs. 2551-2559 - Max Metzger, Michael Howard, Lee Kellogg, Rishi Kundi:
Ensemble prediction of vascular injury in Trauma care: Initial efforts towards data-driven, low-cost screening. 2560-2568 - Jinghe Zhang, Haoyi Xiong
, Yu Huang, Hao Wu, Kevin Leach, Laura E. Barnes:
M-SEQ: Early detection of anxiety and depression via temporal orders of diagnoses in electronic health data. 2569-2577 - Katherine Senter, Sreenivas R. Sukumar, Robert M. Patton, Edward Chaum
:
Using clinical data, hypothesis generation tools and PubMed trends to discover the association between diabetic retinopathy and antihypertensive drugs. 2578-2582 - Rina Singh, Jeffrey A. Graves, Sangkeun Lee, Sreenivas R. Sukumar, Mallikarjun Shankar
:
Enabling graph appliance for genome assembly. 2583-2590 - Daniel Muller, Stefan Mau, Irena Pletikosa Cvijikj:
A framework for consensual and online privacy preserving record linkage in real-time. 2591-2599 - Tao Feng, Zhenyun Zhuang, Yi Pan, Haricharan Ramachandra:
A memory capacity model for high performing data-filtering applications in Samza framework. 2600-2605 - Chieh-Han Wu, Yang Song:
Robust and distributed web-scale near-dup document conflation in microsoft academic service. 2606-2611 - Alicia L. Nobles, Ketki Vilankar, Hao Wu, Laura E. Barnes:
Evaluation of data quality of multisite electronic health record data for secondary analysis. 2612-2620 - Asma Abboura, Soror Sahri, Mourad Ouziri, Salima Benbernou:
CrowdMD: Crowdsourcing-based approach for deduplication. 2621-2627 - Laure Berti-Équille
:
Data veracity estimation with ensembling truth discovery methods. 2628-2636 - Lavanya Sainik:
Distributed life cycle scheduling for cascaded data processing. 2637-2643 - David Becker, Trish Dunn King, Bill McMullen:
Big data, big data quality problem. 2644-2653 - Dhana Rao, Venkat N. Gudivada, Vijay V. Raghavan:
Data quality issues in big data. 2654-2660 - N. Keshan, P. V. Parimi, Isabelle Bichindaritz:
Machine learning for stress detection from ECG signals in automobile drivers. 2661-2669 - Kunal Malhotra, Tanner C. Hobson
, Silvia Valkova, Laura L. Pullum
, Arvind Ramanathan:
Sequential pattern mining of electronic healthcare reimbursement claims: Experiences and challenges in uncovering how patients are treated by physicians. 2670-2679 - Akshay Grover, Jay Gholap, Vandana P. Janeja, Yelena Yesha, Raghu Chintalapati, Harsh Marwaha, Kunal Modi:
SQL-like big data environments: Case study in clinical trial analytics. 2680-2689 - Minh-Son Dao, Koji Zettsu, Siripen Pongpaichet, Laleh Jalali, Ramesh C. Jain:
Exploring spatio-temporal-theme correlation between physical and social streaming data for event detection and pattern interpretation from heterogeneous sensors. 2690-2699 - Aki-Hiro Sato
:
Microdata analysis of the accommodation survey in Japanese tourism statistics. 2700-2708 - Shihan Wang, Takao Terano:
Detecting rumor patterns in streaming social media. 2709-2715 - Hông-Ân Cao, Tri Kurniawan Wijaya, Karl Aberer, Nuno Nunes
:
A collaborative framework for annotating energy datasets. 2716-2725 - Atushi Ishikawa, Shouji Fujimoto
, Takayuki Mizuno, Tsutomu Watanabe:
The relation between firm age distributions and the decay rate of firm activities in the united states and Japan. 2726-2731 - Aki-Hiro Sato
, Isao Ito
, Hidefumi Sawai, Kentaro Iwata:
An epidemic simulation with a delayed stochastic SIR model based on international socioeconomic-technological databases. 2732-2741 - Bilal Sadiq, Faizan Ur Rehman
, Akhlaq Ahmad, Md. Abdur Rahman
, Sohaib Ghani, Abdullah Murad, Saleh M. Basalamah
, Ahmed Lbath:
A spatio-temporal multimedia big data framework for a large crowd. 2742-2751 - Naveen Ramakrishnan, Rumi Ghosh:
Distributed dynamic elastic nets: A scalable approach for regularization in dynamic manufacturing environments. 2752-2761 - Marc Goessling, Shan Kang:
Directional decision lists. 2762-2766 - Ningxuan Kang, Cong Zhao, Jingshan Li, John A. Horst:
Analysis of key operation performance data in manufacturing systems. 2767-2770 - Abhinav Jauhri
, Bradley McDanel, Chris Connor:
Outlier detection for large scale manufacturing processes. 2771-2774 - Daniela Ushizima, Talita Perciano
, Dilworth Parkinson:
Fast detection of material deformation through structural dissimilarity. 2775-2781 - Ronay Ak, Raunak Bhinge:
Data analytics and uncertainty quantification for energy prediction in manufacturing. 2782-2784 - Mariam Kiran, Peter Murphy, Inder Monga
, Jon Dugan, Sartaj Singh Baveja:
Lambda architecture for cost-effective batch and speed big data processing. 2785-2792 - Thomas Renner, Lauritz Thamsen, Odej Kao:
Network-aware resource management for scalable data analytics frameworks. 2793-2800 - Parinaz Ameri, Jörg Meyer
, Achim Streit
:
On a new approach to the index selection problem using mining algorithms. 2801-2810 - Ranjeet Devarakonda
, Yaxing Wei
, Michele Thornton, Ben Mayer, Peter E. Thornton
, Bob Cook:
Preparing, storing, and distributing multi-dimensional scientific data. 2811-2813 - Ranjeet Devarakonda
, Les A. Hook, Terri Killeffer
, Misha Krassovski, Tom Boden, Stan D. Wullschleger
:
Use of a metadata documentation and search tool for large data volumes: The NGEE arctic example. 2814-2816 - Erica Yang, Derek Ross, Srikanth Nagella, Martin J. Turner
, Winfried Kockelmann, Genoveva Burca
, Federico Montesino-Pouzols:
Data optimised computing for heterogeneous big data computing applications. 2817-2819 - Vasilis Efthymiou, Kostas Stefanidis
, Eirini Ntoutsi:
Top-k computations in MapReduce: A case study on recommendations. 2820-2822 - Kai Chen, Yi Zhou, Fangyan Dai:
A LSTM-based method for stock returns prediction: A case study of China stock market. 2823-2824 - Kazuya Uesato, Hiroki Asai, Hayato Yamana
:
Predicting various types of user attributes in Twitter by using personalized pagerank. 2825-2827 - Asmelash Teka Hadgu, Aastha Nigam, Ernesto Diaz-Aviles:
Large-scale learning with AdaGrad on Spark. 2828-2830 - Sudip Mittal, Karuna P. Joshi
, Claudia Pearce, Anupam Joshi
:
Parallelizing natural language techniques for knowledge extraction from cloud service level agreements. 2831-2833 - Christian Beecks, Merih Seran Uysal, Thomas Seidl
:
Gradient-based signatures for big multimedia data. 2834-2835 - Dimitrios Rafailidis
, Stefanos Antaris:
Indexing media storms on Flink. 2836-2838 - Connor Stokes, Anoop Kumar, Frederick Choi, Ralph M. Weischedel:
Scaling NLP algorithms to meet high demand. 2839 - Bonnie J. Dorr
, Craig S. Greenberg, Peter C. Fontana, Mark A. Przybocki, Marion Le Bras, Cathryn A. Ploehn, Oleg Aulov, Wo Chang:
The NIST data science evaluation series: Part of the NIST information access division data science initiative. 2840-2842 - Alexei Samoylov, Jason Schlachter:
Flexible ingest framework: A scalable architecture for dynamic routing through composable pipelines. 2843-2845 - Priya Govindan, Ruobing Chen, Katya Scheinberg
, Soundararajan Srinivasan:
A scalable solution for group feature selection. 2846-2848 - Luna M. Zhang:
Genetic deep neural networks using different activation functions for financial data mining. 2849-2851 - Ryota Takei, Ayahiko Niimi:
Performance of graph reconstruction method for large-scale web graph analysis. 2852-2854 - Altti Ilari Maarala
, Mika Rautiainen, Miikka Salmi, Susanna Pirttikangas
, Jukka Riekki:
Low latency analytics for streaming traffic data with Apache Spark. 2855-2858 - Divya Rao, Wee Keong Ng
:
How to make money from your information and keep your privacy. 2859-2861 - B. Kezia Rani, A. Vinaya Babu:
Scheduling of Big Data application workflows in cloud and inter-cloud environments. 2862-2864 - Peter Li, Simon N. Yates, Jenna K. Lovely, David W. Larson:
Patient-like-mine: A real time, visual analytics tool for clinical decision support. 2865-2867 - Samuel D. Johnson, Kang-Yu Ni:
A pricing mechanism using social media and web data to infer dynamic consumer valuations. 2868-2870 - Yifan Hao, Huiping Cao, Yan Qi, Chuan Hu, Sukumar Brahma, Jingyu Han:
Efficient keyword search on graphs using MapReduce. 2871-2873 - Yuqing Zhu
:
Non-blocking one-phase commit made possible for distributed transactions over replicated data. 2874-2876 - Daisaku Yokoyama, Masashi Toyoda:
A large scale examination of vehicle recorder data to understand relationship between drivers' behaviors and their past driving histories. 2877-2879 - Yoshitaka Yamamoto, Koji Iwanuma:
Online pattern mining for high-dimensional data streams. 2880-2882 - Zhenhui Liu, Jingjing He, Yufei Xue, Zhenzhong Huang, Manli Li, Zhihui Du
:
Modeling the learning behaviors of massive open online courses. 2883-2885 - Jian Yin, Dongfang Zhao:
Data confidentiality challenges in big data applications. 2886-2888 - Anh-Phuong Ta:
Factorization machines with follow-the-regularized-leader for CTR prediction in display advertising. 2889-2891 - Aakash Deep Singh, Wei Wu, Shili Xiang, Shonali Krishnaswamy:
Taxi trip time prediction using similar trips and road network data. 2892-2894 - Long Ma, Yanqing Zhang:
Using Word2Vec to process big text data. 2895-2897 - Longbiao Chen
, Jérémie Jakubowicz:
Inferring bike trip patterns from bike sharing system open data. 2898-2900 - Xiaobing Zhou, Tonglin Li, Ke Wang, Dongfang Zhao, Iman Sadooghi, Ioan Raicu:
MHT: A light-weight scalable zero-hop MPI enabled distributed key-value store. 2901-2903 - Hangu Yeo, Catherine H. Crawford:
Big Data: Cloud computing in genomics applications. 2904-2906 - Maoyuan Zhang, Fang Yuan, Jianping Zhu:
Integrating semantic knowledge into Tag-LDA model through cloud model. 2907-2909 - Yunkai Liu, Christopher Magno:
A case study to apply mobile technology into individual's local community. 2910-2912 - Biying Tan, Sangaralingam Kajanan, Vivek Kumar Singh, Chandra Sekhar Saripaka, Giuseppe Manai:
Clairvoyant-push: A real-time news personalized push notifier using topic modeling and social scoring for enhanced reader engagement. 2913-2915 - Hua Fang, Honggang Wang, Chonggang Wang, Mahmoud Daneshmand:
Using probabilistic approach to joint clustering and statistical inference: Analytics for big investment data. 2916-2918 - Jing Wang, Nikos Ntarmos
, Peter Triantafillou:
Towards a subgraph/supergraph cached query-graph index. 2919-2921 - Ratna Madhuri Maddipatla, Mirsad Hadzikadic, Dipti Patel Misra, Lixia Yao:
30 Day hospital readmission analysis. 2922-2924 - Mehrdad Yazdani, Larry Smarr:
Using pairwise difference features to measure temporal changes in the microbial ecology. 2925-2927 - Ardi Imawan, Joonho Kwon:
A timeline visualization system for road traffic big data. 2928-2929 - Gaël Chareyron
, Bérengère Branchet, Sebastien Jacquot:
A new area tourist ranking method. 2930-2932 - Maoyuan Zhang, Jianping Zhu, Lijun Hua, Fang Yuan:
Text retrieval based on the feature conversion of vector space. 2933-2935 - Kang Li, Vinay Deolalikar, Neeraj Pradhan:
Big data gathering and mining pipelines for CRM using open-source. 2936-2938 - Jay Gholap, Vandana P. Janeja, Yelena Yesha:
Unified framework for clinical data analytics (U-CDA). 2939-2941 - Chanpaul Jin Wang, Hua Fang, Chonggang Wang, Mahmoud Daneshmand, Honggang Wang:
A novel initialization method for particle swarm optimization-based FCM in big biomedical data. 2942-2944 - Chandra Khatri, Suman Voleti, Sathish Veeraraghavan, Nish Parikh, Atiq Islam, Shifa Mahmood, Neeraj Garg, Vivek Singh:
Algorithmic content generation for products. 2945-2947 - Ismini Lourentzou, Graham Dyer, Abhishek Sharma, ChengXiang Zhai:
Hotspots of news articles: Joint mining of news text & social media to discover controversial points in news. 2948-2950 - Khalifeh AlJadda, Mohammed Korayem, Trey Grainger:
Improving the quality of semantic relationships extracted from massive user behavioral data. 2951-2953 - Maruthi Prithivirajan, Vivian Lai, Kyong Jin Shim
, Koo Ping Shung:
Analysis of star ratings in consumer reviews: A case study of Yelp. 2954-2956 - S. George Djorgovski, Ashish Mahabal, Daniel J. Crichton, Basit Chaudhry:
From stars to patients: Lessons from space science and astrophysics for health care informatics. 2957-2959

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.