BigData Conference 2015: Santa Clara, CA, USA
- 2015 IEEE International Conference on Big Data (IEEE BigData 2015), Santa Clara, CA, USA, October 29 - November 1, 2015. IEEE Computer Society 2015, ISBN 978-1-4799-9926-2
- Léon Bottou:
How big data changes statistical machine learning. 1
- H. V. Jagadish:
Moving past the "Wild West" era for Big Data. 2
- Ion Stoica:
Conquering Big Data with Spark. 3
- Ioanna Filippidou, Yannis Kotidis:
Online and on-demand partitioning of streaming graphs. 4-13
- Christos Anagnostopoulos, Peter Triantafillou:
Learning to accurately COUNT with query-driven predictive analytics. 14-23
- Inho Cho, Soya Park, Sejun Park, Dongsu Han, Jinwoo Shin:
Practical message-passing framework for large-scale combinatorial optimization. 24-31
- Padmashree Ravindra, HyeongSik Kim, Kemafor Anyanwu:
Rewriting complex SPARQL analytical queries for efficient cloud-based processing. 32-37
- Salvador Aguiñaga, Aditya Nambiar, Zuozhu Liu, Tim Weninger:
Concept hierarchies and human navigation. 38-45
- Enric Junqué de Fortuny, Theodoros Evgeniou, David Martens, Foster J. Provost:
Iteratively refining SVMs using priors. 46-52
- Harish S. Bhat, Nitesh Kumar, Garnet Jason Vaz:
Towards scalable quantile regression trees. 53-60
- Kilho Shin, Tetsuji Kuboyama, Takako Hashimoto, Dave Shepard:
Super-CWC and super-LCC: Super fast feature selection algorithms. 61-67
- Don Libes, Seung-Jun Shin, Jungyub Woo:
Considerations and recommendations for data availability for data analytics for manufacturing. 68-75
- Toyotaro Suzumura, Koji Ueno:
ScaleGraph: A high-performance library for billion-scale graph analytics. 76-84
- Maria Malik, Setareh Rafatirah, Avesta Sasan, Houman Homayoun:
System and architecture level characterization of big data applications on big and little core server architectures. 85-94
- Ashwin Lall:
Data streaming algorithms for the Kolmogorov-Smirnov test. 95-104
- Jilong Kuang, Daniel G. Waddington, Changhui Lin:
Techniques for fast and scalable time series traffic generation. 105-114
- Katayoun Neshatpour, Maria Malik, Mohammad Ali Ghodrat, Avesta Sasan, Houman Homayoun:
Energy-efficient acceleration of big data analytics applications using FPGAs. 115-123
- Lorenz Fischer, Abraham Bernstein:
Workload scheduling in distributed stream processors using graph partitioning. 124-133
- Arghya Kusum Das, Seung-Jong Park, Jae-Ki Hong, Wooseok Chang:
Evaluating different distributed-cyber-infrastructure for data and compute intensive scientific application. 134-143
- Vincenzo Gulisano, Yiannis Nikolakopoulos, Marina Papatriantafilou, Philippas Tsigas:
ScaleJoin: A deterministic, disjoint-parallel and skew-resilient stream join. 144-153
- Jilong Xue, Zhi Yang, Shian Hou, Yafei Dai:
When computing meets heterogeneous cluster: Workload assignment in graph computation. 154-163
- E. Preston Carman Jr., Till Westmann, Vinayak R. Borkar, Michael J. Carey, Vassilis J. Tsotras:
A scalable parallel XQuery processor. 164-173
- Guoxin Liu, Haiying Shen, Haoyu Wang:
Computing load aware and long-view load balancing for cluster storage systems. 174-183
- Nam-Luc Tran, Thomas Peel, Sabri Skhiri:
Distributed Frank-Wolfe under pipelined stale synchronous parallelism. 184-192
- Michele Bertoni, Stefano Ceri, Abdulrahman Kaitoua, Pietro Pinoli:
Evaluating cloud frameworks on genomic applications. 193-202
- Chenxi Qiu, Haiying Shen, Liuhua Chen:
Towards green cloud computing: Demand allocation and pricing policies for cloud service brokerage. 203-212
- Nikos Zacheilas, Vana Kalogeraki, Nikolaos Zygouras, Nikolaos Panagiotou, Dimitrios Gunopulos:
Elastic complex event processing exploiting prediction. 213-222
- Xi Yang, Ning Liu, Bo Feng, Xian-He Sun, Shujia Zhou:
PortHadoop: Support direct HPC data processing in Hadoop. 223-232
- John F. Canny, Huasha Zhao, Bobby Jaros, Ye Chen, Jiangchang Mao:
Machine learning at the limit. 233-242
- Nusrat Sharmin Islam, Md. Wasi-ur-Rahman, Xiaoyi Lu, Dipti Shankar, Dhabaleswar K. Panda:
Performance characterization and acceleration of in-memory file systems for Hadoop and Spark applications on HPC clusters. 243-252
- Serafettin Tasci, Murat Demirbas:
Panopticon: A lock broker architecture for scalable transactions in the datacenter. 253-262
- Dongfang Zhao, NagaPramod Mandagere, Gabriel Alatorre, Mohamed Mohamed, Heiko Ludwig:
Toward locality-aware scheduling for containerized cloud services. 263-270
- Min Du, Feifei Li:
ATOM: Automated tracking, orchestration and monitoring of resource usage in infrastructure as a service systems. 271-278
- Dongyao Wu, Sherif Sakr, Liming Zhu, Qinghua Lu:
Composable and efficient functional big data processing framework. 279-286
- Hyunjoo Kim, Sriganesh Madhvanath, Tong Sun:
Hybrid active learning for non-stationary streaming data with asynchronous labeling. 287-292
- Srikant Padala, Dinesh Kumar, Arun Raj, Janakiram Dharanipragada:
Octopus: A multi-job scheduler for GraphLab. 293-298
- Rubén Tous, Anastasios Gounaris, Carlos Tripiana, Jordi Torres, Sergi Girona, Eduard Ayguadé, Jesús Labarta, Yolanda Becerra, David Carrera, Mateo Valero:
Spark deployment and performance evaluation on the MareNostrum supercomputer. 299-306
- Zhenhua Chen, Jielong Xu, Jian Tang, Kevin A. Kwiat, Charles A. Kamhoua:
G-Storm: GPU-enabled high-throughput online data processing in Storm. 307-312
- Orcun Yildiz, Shadi Ibrahim, Tran Anh Phuong, Gabriel Antoniu:
Chronos: Failure-aware scheduling in shared Hadoop clusters. 313-318
- Kousuke Nakabasami, Toshiyuki Amagasa, Salman Ahmed Shaikh, Franck Gass, Hiroyuki Kitagawa:
An architecture for stream OLAP exploiting SPE and OLAP engine. 319-326
- Wei Xie, Jiang Zhou, Mark Reyes, Jason Noble, Yong Chen:
Two-mode data distribution scheme for heterogeneous storage in data centers. 327-332
- Teng Li, Jian Tang, Jielong Xu:
A predictive scheduling framework for fast and distributed stream data processing. 333-338
- Anthony Kleerekoper, Michael Pappas, Adam Craig Pocock, Gavin Brown, Mikel Luján:
A scalable implementation of information theoretic feature selection for high dimensional data. 339-346
- S. M. Faisal, Georgios Tziantzioulis, Ali Murat Gok, Nikolaos Hardavellas, Seda Ogrenci Memik, Srinivasan Parthasarathy:
Edge importance identification for energy efficient graph processing. 347-354
- Keira Zhou, Jack Wadden, Jeffrey J. Fox, Ke Wang, Donald E. Brown, Kevin Skadron:
Regular expression acceleration on the Micron Automata Processor: Brill tagging as a case study. 355-360
- Suprio Ray, Angela Demke Brown, Nick Koudas, Rolando Blanco, Anil K. Goel:
Parallel in-memory trajectory-based spatiotemporal topological join. 361-370
- Bin Dong, Surendra Byna, Kesheng Wu:
Spatially clustered join on heterogeneous scientific data sets. 371-380
- Chung-Yi Li, Wei-Lun Su, Todd G. McKenzie, Fu-Chun Hsu, Shou-De Lin, Jane Yung-jen Hsu, Phillip B. Gibbons:
Recommending missing sensor values. 381-390
- Cheng-Te Li, Yu-Jen Lin, Mi-Yen Yeh:
The roles of network communities in social information diffusion. 391-400
- Vasilis Efthymiou, Kostas Stefanidis, Vassilis Christophides:
Big data entity resolution: From highly to somehow similar entity descriptions in the Web. 401-410
- Vasilis Efthymiou, George Papadakis, George Papastefanatos, Kostas Stefanidis, Themis Palpanas:
Parallel meta-blocking: Realizing scalable entity resolution over large, heterogeneous data. 411-420
- Bogdan Simion, Daniel N. Ilha, Suprio Ray, Leslie Barron, Angela Demke Brown, Ryan Johnson:
Slingshot: A modular framework for designing data processing systems. 421-430
- Eser Kandogan, Mary Roth, Peter M. Schwarz, Joshua Hui, Ignacio G. Terrizzano, Christina Christodoulakis, Renée J. Miller:
LabBook: Metadata-driven social collaborative data analysis. 431-440
- Huseyin Ulusoy, Murat Kantarcioglu, Erman Pattuk:
TrustMR: Computation integrity assurance system for MapReduce. 441-450
- Huseyin Ulusoy, Murat Kantarcioglu, Erman Pattuk, Lalana Kagal:
AccountableMR: Toward accountable MapReduce systems. 451-460
- Eleazar Leal, Le Gruenwald, Jianting Zhang, Simin You:
TKSimGPU: A parallel top-K trajectory similarity query processing algorithm for GPGPUs. 461-469
- Anand Tripathi, Bhagavathi Dhass Thirunavukarasu:
A transaction model for management of replicated data with multiple consistency levels. 470-477
- Jianting Zhang, Simin You, Le Gruenwald:
Quadtree-based lightweight data compression for large-scale geospatial rasters on multi-core CPUs. 478-484
- Roee Ebenstein, Gagan Agrawal:
DSDQuery DSI - Querying scientific data repositories with structured operators. 485-492
- Smruti Padhy, Greg Jansen, Jay Alameda, Edgar F. Black, Liana Diesendruck, Mike Dietze, Praveen Kumar, Rob Kooper, Jong Lee, Rui Liu, Richard Marciano, Luigi Marini, Dave Mattson, Barbara S. Minsker, Chris Navarro, Marcus Slavenas, William C. Sullivan, Jason Votava, Inna Zharnitsky, Kenton McHenry:
Brown Dog: Leveraging everything towards autocuration. 493-500
- Afsin Akdogan, Saratchandra Indrakanti, Ugur Demiryurek, Cyrus Shahabi:
Cost-efficient partitioning of spatial data on cloud. 501-506
- Pouria Pirzadeh, Michael J. Carey, Till Westmann:
BigFUN: A performance study of big data management system functionality. 507-514
- Tonglin Li, Ke Wang, Dongfang Zhao, Kan Qiao, Iman Sadooghi, Xiaobing Zhou, Ioan Raicu:
A flexible QoS fortified distributed key-value storage system for the cloud. 515-522
- Mahdi Ebrahimi, Aravind Mohan, Shiyong Lu, Robert G. Reynolds:
TPS: A task placement strategy for big data workflows. 523-530
- Yuqing Zhu, Yilei Wang:
Improving transaction processing performance by consensus reduction. 531-538
- Dipti Shankar, Xiaoyi Lu, Md. Wasi-ur-Rahman, Nusrat S. Islam, Dhabaleswar K. Panda:
Benchmarking key-value stores on high-performance storage and interconnects for web-scale workloads. 539-544
- Roberto Tardío, Alejandro Maté, Juan Trujillo:
An iterative methodology for big data management, analysis and visualization. 545-550
- Chin-Chi Hsu, Perng-Hwa Kung, Mi-Yen Yeh, Shou-De Lin, Phillip B. Gibbons:
Bandwidth-efficient distributed k-nearest-neighbor search with dynamic time warping. 551-560
- Liang Zhao, Feng Chen, Chang-Tien Lu, Naren Ramakrishnan:
Dynamic theme tracking in Twitter. 561-570
- Sean Massung, ChengXiang Zhai:
SyntacticDiff: Operator-based transformation for comparative text mining. 571-580
- Yixian Zheng, Wenchao Wu, Huamin Qu, Chunyan Ma, Lionel M. Ni:
Visual analysis of bi-directional movement behavior. 581-590
- Yuncheng Li, Tao Mei, Yang Cong, Jiebo Luo:
User-curated image collections: Modeling and recommendation. 591-600
- Ke Wang, Ping Guo, A-Li Luo:
Angular quantization based affinity propagation clustering and its application to astronomical big spectra data. 601-608
- Yibo Yao, Lawrence B. Holder:
Scalable classification for large dynamic networks. 609-618
- Ruslan Mavlyutov, Philippe Cudré-Mauroux:
CINTIA: A distributed, low-latency index for big interval data. 619-628
- Yang Wang, Kwan-Liu Ma:
Revealing the fog-of-war: A visualization-directed, uncertainty-aware approach for exploring high-dimensional data. 629-638
- Bokai Cao, Francine Chen, Dhiraj Joshi, Philip S. Yu:
Inferring crowd-sourced venues for tweets. 639-648
- Huanhuan Wu, James Cheng, Yi Lu, Yiping Ke, Yuzhen Huang, Da Yan, Hejun Wu:
Core decomposition in large temporal graphs. 649-658
- Jason H. D. Cho, Yanen Li, Roxana Girju, Chengxiang Zhai:
Recommending forum posts to designated experts. 659-666
- Mark Gates, Hartwig Anzt, Jakub Kurzak, Jack J. Dongarra:
Accelerating collaborative filtering using concepts from high performance computing. 667-676
- Wei Xie, Feida Zhu, Siyuan Liu, Ke Wang:
Modelling cascades over time in microblogs. 677-686
- Yasser Salem, Jun Hong, Weiru Liu:
CSFinder: A cold-start friend finder in large-scale social networks. 687-696
- Hien To, Seon Ho Kim, Cyrus Shahabi:
Effectively crowdsourcing the acquisition and analysis of visual data for disaster response. 697-706
- Zhen Chen, Hanghang Tong, Lei Ying:
Full diffusion history reconstruction in networks. 707-716
- Demetris Trihinas, George Pallis, Marios D. Dikaiakos:
AdaM: An adaptive monitoring framework for sampling and filtering on IoT devices. 717-726
- Suchismit Mahapatra, Varun Chandola:
Modeling graphs using a mixture of Kronecker models. 727-736
- Stephen Bonner, Andrew Stephen McGough, Ibad Kureshi, John Brennan, Georgios Theodoropoulos, Laura Moss, David Corsar, Grigoris Antoniou:
Data quality assessment and anomaly detection via map/reduce and linked data: A case study in the medical domain. 737-746
- Tian Guo, Jean-Paul Calbimonte, Hao Zhuang, Karl Aberer:
SigCO: Mining significant correlations via a distributed real-time computation engine. 747-756
- Yen-Kai Wang, Wei-Ming Chen, Cheng-Te Li, Shou-De Lin:
Identifying smallest unique subgraphs in a heterogeneous social network. 757-766
- Jiejun Xu, Tsai-Ching Lu:
Toward precise user-topic alignment in online social media. 767-775
- Masahiko Itoh, Daisaku Yokoyama, Masashi Toyoda, Masaru Kitsuregawa:
Visual interface for exploring caution spots from vehicle recorder big data. 776-784
- Amir Bahmani, Frank Mueller:
ACURDION: An adaptive clustering-based algorithm for tracing large-scale MPI applications. 785-792
- Max C. Watson:
Time maps: A tool for visualizing many discrete events across multiple timescales. 793-800
- Xugang Ye, Zijie Qi, Dan Massey:
Learning relevance from click data via neural network based similarity models. 801-806
- Chad A. Steed, Margaret Drouhard, Justin M. Beaver, Joshua Pyle, Paul Logasa Bogen:
Matisse: A visual analytics system for exploring emotion trends in social media text streams. 807-814
- Sihong Xie, Qingbo Hu, Jingyuan Zhang, Jing Gao, Wei Fan, Philip S. Yu:
Robust crowd bias correction via dual knowledge transfer from multiple overlapping sources. 815-820
- Deepika Lalwani, Durvasula V. L. N. Somayajulu, P. Radha Krishna:
A community driven social recommendation system. 821-826
- Yongfeng Zhang, Min Zhang, Yiqun Liu, Tat-Seng Chua, Yi Zhang, Shaoping Ma:
Task-based recommendation on a web-scale. 827-836
- Xiaowei Jia, Aosen Wang, Xiaoyi Li, Guangxu Xun, Wenyao Xu, Aidong Zhang:
Multi-modal learning for video recommendation based on mobile application usage. 837-842
- Xiaoyi Li, Xiaowei Jia, Guangxu Xun, Aidong Zhang:
Improving EEG feature learning via synchronized facial video. 843-848
- Muyi Liu, Michael Gribskov:
MMC-margin: Identification of maximum frequent subgraphs by metropolis Monte Carlo sampling. 849-856
- Yue Wang, Ke Wang, Ada Wai-Chee Fu, Raymond Chi-Wing Wong:
KeyLabel algorithms for keyword search in large graphs. 857-864
- Chung-Hsien Yu, Dong Luo, Wei Ding, Joseph Paul Cohen, David L. Small, Shafiqul Islam:
Spatio-temporal asynchronous co-occurrence pattern for big climate data towards long-lead flood prediction. 865-870
- Luca Pappalardo, Dino Pedreschi, Zbigniew Smoreda, Fosca Giannotti:
Using big data to study the link between human mobility and socio-economic development. 871-878
- Tri Kurniawan Wijaya, Matteo Vasirani, Samuel Humeau, Karl Aberer:
Cluster-based aggregate forecasting for residential electricity demand using smart meter data. 879-887
- Masayo Ota, Huy T. Vo, Cláudio T. Silva, Juliana Freire:
A scalable approach for data-driven taxi ride-sharing simulation. 888-897
- Desheng Zhang, Ruobing Jiang, Shuai Wang, Yanmin Zhu, Bo Yang, Jian Cao, Fan Zhang, Tian He:
EveryoneCounts: Data-driven digital advertising with uncertain demand model in metro networks. 898-907
- Liang Zhao, Wen-Zhan Song, Xiaojing Ye:
Fast decentralized gradient descent method and applications to in-situ seismic tomography. 908-917
- Zhao Zhang, Kyle Barbary, Frank Austin Nothaft, Evan Randall Sparks, Oliver Zahn, Michael J. Franklin, David A. Patterson, Saul Perlmutter:
Scientific computing meets big data technology: An astronomy use case. 918-927
- Michael Nalisnik, David A. Gutman, Jun Kong, Lee A. D. Cooper:
An interactive learning framework for scalable classification of pathology images. 928-935
- Yu Wang, Jianbo Yuan, Jiebo Luo:
America Tweets China: A fine-grained analysis of the state and individual characteristics regarding attitudes towards China. 936-943
- Yu Jin, Joseph F. JáJá, Rong Chen, Edward H. Herskovits:
A data-driven approach to extract connectivity structures from diffusion tensor imaging data. 944-951
- Georgios Chatzigeorgakidis, Sophia Karagiorgou, Spiros Athanasiou, Spiros Skiadopoulos:
A MapReduce based k-NN joins probabilistic classifier. 952-957
- Alessandro Lulli, Thibault Debatty, Matteo Dell'Amico, Pietro Michiardi, Laura Ricci:
Scalable k-NN based text clustering. 958-963
- Yuwen Chen, Jian Cao, Shanshan Feng, Yudong Tan:
An ensemble learning based approach for building airfare forecast service. 964-969
- Mack Sweeney, Jaime Lester, Huzefa Rangwala:
Next-term student grade prediction. 970-975
- Sofia Apreleva, Alejandro Cantarero:
Predicting the location of users on Twitter from low density graphs. 976-983
- Elias Alevizos, Alexander Artikis, Kostas Patroumpas, Marios Vodas, Yannis Theodoridis, Nikos Pelekis:
How not to drown in a sea of information: An event recognition approach. 984-990
- Jiaoyan Chen, Huajun Chen, Daning Hu, Jeff Z. Pan, Yalin Zhou:
Smog disaster forecasting using social web data and physical sensor data. 991-998
- Kamalika Das, Kanishka Bhaduri, Bryan L. Matthews, Nikunj C. Oza:
Large scale support vector regression for aviation safety. 999-1006
- Lorenzo Gabrielli, Barbara Furletti, Roberto Trasarti, Fosca Giannotti, Dino Pedreschi:
City users' classification with mobile phone data. 1007-1012
- Anas Abu-Doleh, Ümit V. Çatalyürek:
Spaler: Spark and GraphX based de novo genome assembler. 1013-1018
- Florin Schimbinschi, Xuan Vinh Nguyen, James Bailey, Christopher Leckie, Hai Le Vu, Rao Kotagiri:
Traffic forecasting in complex urban networks: Leveraging big data and machine learning. 1019-1024
- Karla L. Caballero Barajas, Ram Akella:
Prediction of physiological subsystem failure and its impact in the prediction of patient mortality. 1025-1030
- Fei Shao, Li-Yung Ho, Jan-Jan Wu, Pangfeng Liu:
Efficient distributed maximum matching for solving the container exchange problem in the maritime industry. 1031-1036
- Robert P. Trevino, Steve A. Kawamoto, Thomas J. Lamkin, Huan Liu:
Cell analytics in compound hit selection of bacterial inhibitors. 1037-1042
- Xiuqiang He, Wenyuan Dai, Guoxiang Cao, Ruiming Tang, Mingxuan Yuan, Qiang Yang:
Mining target users for online marketing based on App Store data. 1043-1052
- Ahmed Metwally, Jia-Yu Pan, Minh Doan, Christos Faloutsos:
Scalable community discovery from multi-faceted graphs. 1053-1062
- Ernesto Diaz-Aviles, Fabio Pinelli, Karol Lynch, Zubair Nabi, Yiannis Gkoufas, Eric Bouillet, Francesco Calabrese, Eoin Coughlan, Peter Holland, Jason Salzwedel:
Towards real-time customer experience prediction for telecommunication operators. 1063-1072
- I. Stephen Choi, Weiqing Yang, Yang-Suk Kee:
Early experience with optimizing I/O performance using high-performance SSDs for in-memory cluster computing. 1073-1083
- Hyunsik Choi, Jongyoung Park, Yong In Lee, Kangho Roh, Kwanghyun La:
An evaluation of alternative shared-nothing architecture for analytical processing systems. 1084-1093
- Anjan Goswami, Wei Han, Zhenrui Wang, Angela Jiang:
Controlled experiments for decision-making in e-Commerce search. 1094-1102
- Jenny Weisenberg Williams, Paul Cuddihy, Justin McHugh, Kareem S. Aggour, Arvind Menon, Steven M. Gustafson, Timothy Healy:
Semantics for Big Data access & integration: Improving industrial equipment design through increased data usability. 1103-1112
- Laura Rettig, Mourad Khayati, Philippe Cudré-Mauroux, Michal Piórkowski:
Online anomaly detection over Big Data streams. 1113-1122
- Aungon Nag Radon, Ke Wang, Uwe Glässer, Hans Wehn, Andrew Westwell-Roper:
Contextual verification for false alarm reduction in maritime anomaly detection. 1123-1133
- Tanay Kumar Saha, Mohammad Al Hasan, Chandler Burgess, Md. Ahsan Habib, Jeff Johnson:
Batch-mode active learning for technology-assisted review. 1134-1143
- Mayank Kejriwal, Qiaoling Liu, Ferosh Jacob, Faizan Javed:
A pipeline for extracting and deduplicating domain-specific knowledge bases. 1144-1153
- Fang-Hsiang Su, Manas Somaiya, Shrish Mishra, Rajyashree Mukherjee:
EXOS: Expansion on session for enhancing effectiveness of query auto-completion. 1154-1163
- Gergely Ács, Jagdish Prasad Achara, Claude Castelluccia:
Probabilistic km-anonymity efficient anonymization of large set-valued datasets. 1164-1173
- Sauptik Dhar, Congrui Yi, Naveen Ramakrishnan, Mohak Shah:
ADMM based scalable machine learning on Spark. 1174-1182
- Dapeng Dong, John Herbert:
Record-aware compression for big textual data analysis acceleration. 1183-1190
- Alekh Jindal, Samuel Madden, Malú Castellanos, Meichun Hsu:
Graph analytics using Vertica relational database. 1191-1200
- André Luckow, Ken Kennedy, Fabian Manhardt, Emil Djerekarov, Bennie Vorster, Amy W. Apon:
Automotive big data: Applications, workloads and infrastructures. 1201-1210
- Goktug T. Cinar, Jeffrey Thompson, Soundar Srinivasan:
Cost-sensitive optimization of automated inspection. 1211-1219
- Nicolás Poggi, Josep Lluis Berral, David Carrera, Aaron Call, Fabrizio Gagliardi, Rob Reinauer, Nikola Vujic, Daron Green, José A. Blakeley:
From performance profiling to predictive analytics while evaluating Hadoop cost-efficiency in ALOJA. 1220-1229
- Mohammed Korayem, Camilo Ortiz, Khalifeh AlJadda, Trey Grainger:
Query sense disambiguation leveraging large scale user behavioral data. 1230-1237
- Viet Ha-Thuc, Ganesh Venkataraman, Mario Rodriguez, Shakti Sinha, Senthil Sundaram, Lin Guo:
Personalized expertise search at LinkedIn. 1238-1247
- Vinay Deolalikar:
How valuable is your data? A quantitative approach using data mining. 1248-1253
- Kang Li, Vinay Deolalikar, Neeraj Pradhan:
Mining lifestyle personas at scale in e-commerce. 1254-1261
- Petros Zerfos, Hangu Yeo, Brent D. Paulovicks, Vadim Sheinin:
SDFS: Secure distributed file system for data-at-rest security for Hadoop-as-a-service. 1262-1271
- Sreenivas R. Sukumar:
Open research challenges with Big Data - A data-scientist's perspective. 1272-1278
- Hamed Yaghoubi Shahir, Uwe Glässer, Amir Yaghoubi Shahir, Hans Wehn:
Maritime situation analysis framework: Vessel interaction classification and anomaly detection. 1279-1289
- Levente J. Klein, Fernando J. Marianno, Conrad M. Albrecht, Marcus Freitag, Siyuan Lu, Nigel Hinds, Xiaoyan Shao, Sergio Bermudez Rodriguez, Hendrik F. Hamann:
PAIRS: A scalable geo-spatial data analytics platform. 1290-1298
- Jayasimha Katukuri, Tolga Könik, Rajyashree Mukherjee, Santanu Kolay:
Post-purchase recommendations in large-scale online marketplaces. 1299-1305
- Hong-Han Shuai, Chih-Ya Shen, Hsiang-Chun Hsu, De-Nian Yang, Chung-Kuang Chou, Jihg-Hong Lin, Ming-Syan Chen:
Revenue maximization for telecommunications company with social viral marketing. 1306-1310
- Stephanie Rosenthal, Scott McMillan, Matthew E. Gaston:
Developer toolchains for large-scale analytics: Two case studies. 1311-1316
- Ramakrishna Vadakattu, Bibek Panda, Swarnim Narayan, Harshal Godhia:
Enterprise subscription churn prediction. 1317-1321
- Joshua Seeger, Aron Culotta, Jason Keller, Patrick van Kessel, Michael Jugovich:
Data deidentification in medical transcriptions using regular expressions and machine learning. 1322-1323
- Qinlong Luo, Meng Zhao, Faizan Javed, Ferosh Jacob:
Macau: Large-scale skill sense disambiguation in the online recruitment domain. 1324-1329
- Wei-Yi Liu, Hui-I Hsiao, Shih-Yao Dai:
Genomic analysis with MapReduce. 1330-1335
- Chaitali Gupta, Ranjan Sinha, Yong Zhang:
Eagle: User profile-based anomaly detection for securing Hadoop clusters. 1336-1343
- Manuel Diaz-Granados, Javier Diaz Montes, Manish Parashar:
Investigating insurance fraud using social media. 1344-1349
- Luca Cazzanti, Leonardo Maria Millefiori, Gianfranco Arcieri:
A document-based data model for large scale computational maritime situational awareness. 1350-1356
- Jhao-Yin Li, Mi-Yen Yeh, Ming-Syan Chen, Jihg-Hong Lin:
Modeling social influences from call records and mobile web browsing histories. 1357-1361
- Christian Seebode, Matthias Ort, Peter Hufnagl, Christian R. A. Regenbrecht:
Next generation biobanks. 1362-1367
- Mikel Nino, José Miguel Blanco, Arantza Illarramendi:
Business understanding, challenges and issues of Big Data Analytics for the servitization of a capital equipment manufacturer. 1368-1377
- Divya Sardana, Raj Bhatnagar, Radu Pavel, Jonathan Iverson:
Data driven predictive analytics for a spindle's health. 1378-1387
- Yunpeng Li, Utpal Roy, Seung-Jun Shin, Y. Tina Lee:
A "smart component" data model in PLM. 1388-1397
- Nenad Stojanovic, Marko Dinic, Ljiljana Stojanovic:
Big data process analytics for continuous process improvement in manufacturing. 1398-1407
- Saideep Nannapaneni, Sankaran Mahadevan, David Lechevalier, Anantha Narayanan, Sudarsan Rachuri:
Automated uncertainty quantification analysis using a system model and data. 1408-1417
- Alexander Brodsky, Guodong Shao, Mohan Krishnamoorthy, Anantha Narayanan, Daniel A. Menascé, Ronay Ak:
Analysis and optimization in smart manufacturing based on a reusable knowledge base for process performance models. 1418-1427
- David Lechevalier, Steven Hudak, Ronay Ak, Y. Tina Lee, Sebti Foufou:
A neural network meta-model and its application for manufacturing. 1428-1435
- Luca Oneto, Ilenia Orlandi, Davide Anguita:
Performance assessment and uncertainty quantification of predictive models for smart manufacturing systems. 1436-1445
- Ashwin K. Thillai Natarajan, Sagar V. Kamarthi:
Time complexity and architecture of a cloud based prognostics system for a multi-client condition monitoring activity. 1446-1450
- Jinkyoo Park, Kincho H. Law, Raunak Bhinge, Mason Chen, David Dornfeld, Sudarsan Rachuri:
Real-time energy prediction for a milling machine tool using sparse Gaussian process regression. 1451-1460
- Kannan Govindarajan, David Boulanger, Vivekanandan Suresh Kumar, Kinshuk:
Parallel Particle Swarm Optimization (PPSO) clustering for learning analytics. 1461-1465
- Gökhan Silahtaroglu, Hale Donertasli:
Analysis and prediction of E-customers' behavior by mining clickstream data. 1466-1472
- Jeyhun Karimov, A. Murat Ozbayoglu:
High quality clustering of big data and solving empty-clustering problem with an evolutionary hybrid algorithm. 1473-1478
- Renaud Richardet, Jean-Cédric Chappelier, Shreejoy J. Tripathy, Sean L. Hill:
Agile text mining with Sherlok. 1479-1484
- Golnoosh Farnadi, Zeinab Mahdavifar, Ivan Keller, Jacob Nelson, Ankur Teredesai, Marie-Francine Moens, Martine De Cock:
Scalable adaptive label propagation in Grappa. 1485-1491
- Kasim Oztoprak:
Profiling subscribers according to their internet usage characteristics and behaviors. 1492-1499
- Magdalini Eirinaki, Sweta Patel:
QueRIE reloaded: Using matrix factorization to improve database query recommendations. 1500-1508
- Ran Pang, Agustin Baretto, Henry A. Kautz, Jiebo Luo:
Monitoring adolescent alcohol use via multimodal analysis in social multimedia. 1509-1518
- Raj Bhatnagar, Lalit Kumar:
An efficient map-reduce algorithm for computing formal concepts from binary data. 1519-1528
- Jagadeesh Patchala, Raj Bhatnagar:
Learning relaxed 3-clusters from pairs of related datasets. 1529-1538
- Jun Meng, Rui Li, Jing Zhang:
Parallel information fusion method for microarray data analysis. 1539-1544
- Xuezhi Ji, Lixiang Liu, Pei Zhao, Dapeng Wang:
A-Star algorithm based on-demand routing protocol for hierarchical LEO/MEO satellite networks. 1545-1549
- Lukasz Sosnowski, Marcin S. Szczuka, Dominik Slezak:
Granular modeling with fuzzy comparators. 1550-1555
- I-Jen Chiang:
Agglomerative algorithm to discover semantics from unstructured big data. 1556-1563
- Alexander Denzler, Marcel Wehrle, Andreas Meier:
A granular approach for identifying user knowledge. 1564-1569
- Liang Wu, Teng-Sheng Moh, Natalia Khuri:
Twitter opinion mining for adverse drug reactions. 1570-1574
- Shusaku Tsumoto, Shoji Hirano, Haruko Iwata:
Data decomposition and dual clustering for clinical care management. 1575-1584
- Maria Pershina, Mohamed Yakout, Kaushik Chakrabarti:
Holistic entity matching across knowledge graphs. 1585-1590
- Zehua Chen, He Ma, Yu Zhang:
GrC-based statistic optimization algorithm for big truth table. 1591-1596
- Patrick G. Clark, Jerzy W. Grzymala-Busse:
Mining incomplete data with many attribute-concept values and "do not care" conditions. 1597-1602
- Tsau-Young Lin:
Chinese wall security policies information flows in business cloud. 1603-1607
- Shusaku Tsumoto, Shoji Hirano:
Granular formalization of medical diagnostic process. 1608-1614
- Karan Khare, Teng-Sheng Moh:
Mobile gesture-based iPhone user authentication. 1615-1621
- Chris Tseng, Tien Nguyen, Chetan Sharma:
Cost and data exploration considerations for big data prediction on the cloud. 1622-1628
- Chao-Lin Liu, Chih-Kai Huang, Hongsu Wang, Peter K. Bol:
Mining local gazetteers of literary Chinese with CRF and pattern based methods for biographical information in Chinese history. 1629-1638
- Giles Greenway, Leonard Mack, Tobias Blanke, Mark Coté, Tom Heath:
Towards a mobile social data commons. 1639-1642
- Matthew Coole, Paul Rayson, John A. Mariani:
Scaling out for extreme scale corpus data. 1643-1649
- Stefan Pernes:
Metaphor mining in historical German novels: An unsupervised learning approach. 1650-1652
- Mehrdad Yazdani, Lev Manovich:
Predicting social trends from non-photographic images on Twitter. 1653-1660
- Dallas Liddle:
The coding of literary form: Data mining and the information structure of historical texts. 1661-1666
- Benjamin M. Schmidt:
Plot arceology: A vector-space model of narrative structure. 1667-1672
- Ben Miller, Jennifer Olive, Shakthidhar Reddy Gopavaram, Yanjun Zhao, Ayush Shrestha, Cynthia Berger:
A method for cross-document narrative alignment of a two-hundred-sixty-million word corpus. 1673-1677
- Nadya A. Calderón, Brian D. Fisher, Jeff J. Hemsley, Billy Ceskavich, Greg Jansen, Richard Marciano, Victoria L. Lemieux:
Mixed-initiative social media analytics at the World Bank: Observations of citizen sentiment in Twitter data to explore "trust" of political actors and state institutions and its relationship to social protest. 1678-1687
- Ilir Fetai, Damian Murezzan, Heiko Schuldt:
Workload-driven adaptive data partitioning and distribution - The Cumulus approach. 1688-1697
- Gabor Madl, Ramani Routray, Yang Song, Rakesh Jain:
Account clustering in multi-tenant storage management environments. 1698-1707
- Marlon McKenzie, Hua Fan, Wojciech M. Golab:
Fine-tuning the consistency-latency trade-off in quorum-replicated distributed storage systems. 1708-1717
- Sathiya Prabhu Kumar, Sylvain Lefebvre, Minyoung Kim, Mark-Oliver Stehr:
Priority register: Application-defined replacement orderings for ad hoc reconciliation. 1718-1727
- Matthew Bihis, Sohini Roychowdhury:
A generalized flow for multi-class and binary classification tasks: An Azure ML approach. 1728-1737
- Alexander Stiemer, Ilir Fetai, Heiko Schuldt:
Comparison of eager and quorum-based replication in a cloud environment. 1738-1748
- Alexander Lenk, Leif Bonorden, Astrid Hellmanns, Nico Rödder, Stefan Jähnichen:
Towards a taxonomy of standards in smart data. 1749-1754
- Nan Zhu, Wenbo He, Yu Hua, Yixin Chen:
Marlin: Taming the big streaming data in large scale video similarity search. 1755-1764
- Chong Zhang, Xiaoying Chen, Bin Ge, Weidong Xiao:
Indexing historical spatio-temporal data in the cloud. 1765-1774
- Vladimir Grupcev, Yi-Cheng Tu, Joseph C. Fogarty, Sagar Pandit:
Push-based system for molecular simulation data analysis. 1775-1784
- Guan Xu, Jun Yang, Bin Dai:
Challenges and opportunities on network resource management in DCN with SDN. 1785-1790
- Lijia Lu, Hui Li, Jun Chen, Bing Zhu, Weijuan Yin:
On the implementation of Zigzag codes for distributed storage system. 1791-1796
- Arun Kumar Kalakanti, Vinay Sudhakaran, Varsha Raveendran, Nisha Menon:
A comprehensive evaluation of NoSQL datastores in the context of historians and sensor data analysis. 1797-1806
- Harris T. Lin, Ngot Bui, Vasant G. Honavar:
Learning classifiers from remote RDF data stores augmented with RDFS subclass hierarchies. 1807-1813
- Guoyao Feng, Xiao Meng, Khaled Ammar:
DISTINGER: A distributed graph data structure for massive dynamic graph processing. 1814-1822
- Olivier Curé, Hubert Naacke, Tendry Randriamalala, Bernd Amann:
LiteMat: A scalable, cost-efficient inference encoding scheme for large RDF graphs. 1823-1830
- Alireza Rezaei Mahdiraji, Peter Baumann:
MQuery: A query language for scientific meshes. 1831-1838
- Shaikh Arifuzzaman, Maleq Khan, Madhav V. Marathe:
A fast parallel algorithm for counting triangles in graphs using dynamic load balancing. 1839-1847
- Janani Balaji, Rajshekhar Sunderraman:
Scalable storage structure for pattern matching on big graph data. 1848-1855
- Serafettin Tasci, Murat Demirbas:
Employing in-memory data grids for distributed graph processing. 1856-1864
- Ather Sharif, Sarah Cooney, Shengqi Gong, Drew Vitek:
Current security threats and prevention measures relating to cloud services, Hadoop concurrent processing, and big data. 1865-1870
- Jinoh Kim, Bin Dong, Surendra Byna, Kesheng Wu:
Security for the scientific data services framework. 1871-1875
- Santosh Aditham, Nagarajan Ranganathan:
A novel framework for mitigating insider attacks in big data systems. 1876-1885
- Katerina Doka, Mingqiang Xue, Dimitrios Tsoumakos, Panagiotis Karras, Alfredo Cuzzocrea, Nectarios Koziris:
Heterogeneous k-anonymization with high utility. 1886-1890
- Lee A. Carraher, Philip A. Wilsey, Anindya Moitra, Sayantan Dey:
Multi-probe random projection clustering to secure very large distributed datasets. 1891-1900
- Dymitr Ruta, Ling Cen, Ernesto Damiani:
Fast summarization and anonymization of multivariate big time series. 1901-1904
- Ernesto Damiani:
Toward big data risk analysis. 1905-1909
- Alfredo Cuzzocrea, Gianluigi Folino, Pietro Sabatino:
A distributed framework for supporting adaptive ensemble-based intrusion detection. 1910-1916 - Andy Bengel, Amin Shawki, Dippy Aggarwal:
Simplifying web analytics for digital marketing. 1917-1918 - Dawn N. Jutla, Peter Bodorik:
PAUSE: A privacy architecture for heterogeneous big data environments. 1919-1928 - Xiaoying Chen, Chong Zhang, Bin Ge, Weidong Xiao:
Spatio-temporal queries in HBase. 1929-1937 - V. Gyurjyan, Aron Bartle, Constantine Lukashin, S. Mancilla, R. Oyarzun, A. Vakhnin:
Component based dataflow processing framework. 1938-1942 - Constantine Lukashin, Aron Bartle, E. Callaway, V. Gyurjyan, S. Mancilla, R. Oyarzun, A. Vakhnin:
Earth science data fusion with event building approach. 1943-1947 - Seungwon Lee, Lei Pan, Chengxing Zhai, Benyang Tang, Terry Kubar, Jia Zhang, Wei Wang:
Climate model diagnostic analyzer. 1948-1952 - David Haynes, Suprio Ray, Steven M. Manson, Ankit Soni:
High performance analysis of big spatial data. 1953-1957 - Akinori Asahara, Hideki Hayashi, Nobuhiro Ishimaru, Ryosuke Shibasaki, Hiroshi Kanasugi:
International standard "OGC® moving features" to address "4Vs" on locational bigdata. 1958-1966 - Luis A. Lopez, Ruth E. Duerr, Siri Jodha Singh Khalsa:
Optimizing Apache Nutch for domain specific crawling at large scale. 1967-1971 - Shujia Zhou, Xi Yang, Xiaowen Li, Toshihisa Matsui, Si Liu, Xian-He Sun, Wei-Kuo Tao:
A Hadoop-based visualization and diagnosis framework for earth science data. 1972-1977 - Saman Biookaghazadeh, Yiqi Xu, Shujia Zhou, Ming Zhao:
Enabling scientific data storage and processing on big-data systems. 1978-1984 - Kevin Paul, Sheri A. Mickelson, John M. Dennis, Haiying Xu, David Brown:
Light-weight parallel Python tools for earth system modeling workflows. 1985-1994 - In Kee Kim, Jacob Steele, Anthony M. Castronova, Jonathan L. Goodall, Marty Humphrey:
WDCloud: An end to end system for large-scale watershed delineation on cloud. 1995-2004 - Lesley Wyborn, Benjamin J. K. Evans:
Integrating 'Big' geoscience data into the petascale national environmental research interoperability platform (NERDIP): Successes and unforeseen challenges. 2005-2009 - Fatih Akdag, Christoph F. Eick:
An optimized interestingness hotspot discovery framework for large gridded spatio-temporal datasets. 2010-2019 - Rahul Palamuttam, Renato Javier Marroquín Mogrovejo, Chris Mattmann, Brian Wilson, Kim Whitehall, Rishi Verma, Lewis J. McGibbney, Paul M. Ramirez:
SciSpark: Applying in-memory distributed computing to weather event detection and tracking. 2020-2026 - Amelia Yzaguirre, Robert Warren, Mike Smit:
Detecting environmental disasters in digital news archives. 2027-2035 - Yuzhong Yan, Lei Huang, Liqi Yi:
Is Apache Spark scalable to seismic data analytics and computations? 2036-2045 - Peter Baumann, Vlad Merticariu:
On the efficient evaluation of array joins. 2046-2055 - Torsten Priebe, Stefan Markus:
Business information modeling: A methodology for data-intensive projects, data science and big data governance. 2056-2065 - Jeffrey S. Saltz:
The need for new processes, methodologies and tools to support big data teams and improve big data project effectiveness. 2066-2071 - Manirupa Das, Renhao Cui, David R. Campbell, Gagan Agrawal, Rajiv Ramnath:
Towards methods for systematic research on big data. 2072-2081 - Marco Pospiech, Carsten Felden:
Towards a big data theory model. 2082-2090 - Kerk F. Kee:
Three critical matters in big data projects for e-science: Different user groups, the mutually constitutive perspective, and virtual organizational capacity. 2091-2097 - Jeffrey S. Saltz, Ivan Shamshurin:
Exploring the process of doing data science via an ethnographic study of a media advertising company. 2098-2105 - Dazhi Yang, Gary S. W. Goh, Chi Xu, Allan N. Zhang, Orkan Akcan:
Forecast UPC-level FMCG demand, Part I: Exploratory analysis and visualization. 2106-2112 - Dazhi Yang, Gary S. W. Goh, Siwei Jiang, Allan N. Zhang, Orkan Akcan:
Forecast UPC-level FMCG demand, Part II: Hierarchical reconciliation. 2113-2121 - B. Y. Ong, S. W. Goh, Chi Xu:
Sparsity adjusted information gain for feature selection in sentiment analysis. 2122-2128 - Sam Iosevich, Georgiy Arutyunyants, Z. Hou:
Dynamic aggregation for time series forecasting. 2129-2131 - Wenjing Yan, Xianshun Chen, Orkan Akcan, Jasmine J. Lim, Dazhi Yang:
Big data analytics for empowering milk yield prediction in dairy supply chains. 2132-2137 - Gürdal Ertek, Xu Chi, Gabriel Yee, Ong Boon Yong, Byung-Geun Choi:
Profit estimation error analysis in recommender systems based on association rules. 2138-2142 - Gürdal Ertek, Byung-Geun Choi, Xu Chi, Dazhi Yang, Ong Boon Yong:
Graph-based analysis of resource dependencies in project networks. 2143-2149 - Prapa Rattadilok, John A. W. McCall, Trevor Burbridge, Andrea Soppera, Philip Eardley:
A data fusion framework for large-scale measurement platforms. 2150-2158 - Nijat Mehdiyev, Julian Krumeich, Dirk Werth, Peter Loos:
Sensor event mining with hybrid ensemble learning and evolutionary feature subset selection model. 2159-2168 - Huikyo Lee, Luca Cinquini, Daniel J. Crichton, Amy Braverman:
Optimization of system architecture for Big Data analysis in climate science. 2169-2172 - Goutham Kamath, Wen-Zhan Song:
In-situ analytics for tomographic imaging in sensor network. 2173-2176 - Beth Huffer, Marc Cotnoir, Jonathan Gleason:
Ontology-driven data access at the NASA earth exchange. 2177-2181 - Dean N. Williams, Michael Lautenschlager, Venkatramani Balaji, Luca Cinquini, Cecelia DeLuca, Sebastien Denvil, Daniel Duffy, Benjamin J. K. Evans, Robert D. Ferraro, Martin Juckes, Claire Trenham:
Strategic roadmap for the earth system grid federation. 2182-2190 - Shin'ichi Takeuchi, Komei Sugiura, Yuhei Akahoshi, Koji Zettsu:
Constrained region selection method based on configuration space for visualization in scientific dataset search. 2191-2200 - Peter Baumann, Dimitar Misev:
Enhancing science support in SQL. 2201-2204 - Ramezan Paravi Torghabeh, Narayana Prasad Santhanam:
Modeling community detection using slow mixing random walks. 2205-2211 - Jorge David Destephen Lavaire, Anshuman Singh, Mahmoud Yousef, Sumi Singh, Xiaodong Yue:
Dimensional scalability of supervised and unsupervised concept drift detection: An empirical study. 2212-2218 - Spiros V. Georgakopoulos, Sotiris K. Tasoulis, Vassilis P. Plagianakos:
Efficient change detection for high dimensional data streams. 2219-2222 - Charalampos Chelmis, Jahanvi Kolte, Viktor K. Prasanna:
Big data analytics for demand response: Clustering over space and time. 2223-2232 - Fatimah Binta Abdullahi, Frans Coenen, Russell Martin:
Finding banded patterns in big data using sampling. 2233-2242 - Gheorghi Guzun, Joel E. Tosado, Guadalupe Canahuate:
Scalable preference queries for high-dimensional data using map-reduce. 2243-2252 - Chuan Hu, Huiping Cao:
Discovering time-evolving influence from dynamic heterogeneous graphs. 2253-2262 - Kanji Matsutani, Masahito Kumano, Masahiro Kimura, Kazumi Saito, Kouzou Ohara, Hiroshi Motoda:
Combining activity-evaluation information with NMF for trust-link prediction in social media. 2263-2272 - Nemanja Spasojevic, Adithya Rao:
Identifying actionable messages on social media. 2273-2281 - Adithya Rao, Nemanja Spasojevic, Zhisheng Li, Trevor DSouza:
Klout score: Measuring influence across multiple social networks. 2282-2289 - Fei Liu, Yan Jia:
Top (k1, k2) Distance-based outliers detection in an uncertain dataset. 2290-2299 - Guirong Chen, Ning Wang, Fengqin Zhang, Hua Jiang:
Understanding the time characteristic of user behavior on online forums. 2300-2306 - Yu Liu, Bin Wu, Bai Wang:
Characterizing super spreading in microblog: An epidemic-based model. 2307-2313 - Yang Wang, Liutong Xu, Bin Wu:
A community detection method based on K-shell. 2314-2319 - Divya Rao, Wee Keong Ng:
How much is your information worth - A method for revenue generation for your information. 2320-2326 - Rong Gu, Yun Tang, Zhaokang Wang, Shuai Wang, Xusen Yin, Chunfeng Yuan, Yihua Huang:
Efficient large scale distributed matrix computation with Spark. 2327-2336 - Bailing Wang, Junheng Huang, Libing Ou, Rui Wang:
A collaborative filtering algorithm fusing user-based, item-based and social networks. 2337-2343 - Man Li, Ruisheng Shi:
Mining the relation between dorm arrangement and student performance. 2344-2347 - Fang Lv, Bailing Wang, Junheng Huang, Yushan Sun, Yuliang Wei:
A proactive discovery and filtering solution on phishing websites. 2348-2355 - Yunlei Zhang, Bin Wu:
Finding community structure via rough K-means in social network. 2356-2361 - Shuang Zhang, Xuefeng Zheng, Changjun Hu:
A survey of semantic similarity and its application to social network analysis. 2362-2367 - Fei Jiang, Jin Xu:
Dynamic community detection based on game theory in social networks. 2368-2373 - Michel de Rougemont, Guillaume Vimont:
The value of analytical queries on Social Networks. 2374-2383 - Rui Wang, Bailing Wang, Junheng Huang:
A collaborative filtering algorithm based on social network information. 2384-2389 - Alina Campan, Traian Marius Truta, Matthew Beckerich:
Efficient approximation algorithms to determine minimum partial dominating sets in social networks. 2390-2397 - Garisha Chowdhary, Sanghamitra Bandyopadhyay:
Ties that matter. 2398-2403 - Hao Wang, Jorge A. Castanon:
Sentiment expression via emoticons on social media. 2404-2408 - Michael L. Nelson, Sridhar Radhakrishnan, Amlan Chatterjee, Chandra N. Sekharan:
On compressing massive streaming graphs with Quadtrees. 2409-2417 - Benjamin Flesch, Ravi Vatrapu, Raghava Rao Mukkamala, Abid Hussain:
Social set visualizer: A set theoretical approach to big social data analytics of real-world events. 2418-2427 - Gavin Smith, James Goulding:
A novel symbolization technique for time-series outlier detection. 2428-2436 - Jian Zou, Yunbo An, Hong Yan:
Volatility matrix inference in high-frequency finance with regularization and efficient computations. 2437-2444 - Oliver Bieh-Zimmert, Carsten Felden:
Shaping data: Visualization under construction. 2445-2452 - Margaret Drouhard, Chad A. Steed, Steven E. Hahn, Thomas Proffen, Jamison Daniel, Michael A. Matheson:
Immersive visualization for materials science data analysis using the Oculus Rift. 2453-2461 - Hideki Hayashi, Akinori Asahara, Natsuko Sugaya, Yuichi Ogawa, Hitoshi Tomita:
Spatio-temporal similarity search method for disaster estimation. 2462-2469 - Hui Zhang, Riqing Chen, Guangchen Ruan, Masatoshi Ando:
Scalable dental computing on cyberinfrastructure. 2470-2478 - Christopher Jordan, David Walling, Weijia Xu, Stephen A. Mock, Niall Gaffney, Dan Stanzione:
Wrangler's user environment: A software framework for management of data-intensive computing system. 2479-2486 - Wanbo Luo, Hui Zhang:
Visual analysis of large-scale LiDAR point clouds. 2487-2492 - Yin Huang, Yelena Yesha, Shujia Zhou:
A database-based distributed computation architecture with Accumulo and D4M: An application of eigensolver for large sparse matrix. 2493-2500 - Jieting Wu, Lina Yu, Hongfeng Yu:
Texture-based edge bundling: A web-based approach for interactively visualizing large graphs. 2501-2508 - Jianwu Wang, Daniel Crawl, Shweta Purawat, Mai H. Nguyen, Ilkay Altintas:
Big data provenance: Challenges, state of the art and opportunities. 2509-2516 - Ruizhu Huang, Weijia Xu:
Performance evaluation of enabling logistic regression for big data with R. 2517-2524 - Shinichi Yamagiwa, Yoshinobu Kawahara, Noriyuki Tabuchi, Yoshinobu Watanabe, Takeshi Naruo:
Skill grouping method: Mining and clustering skill differences from body movement BigData. 2525-2534 - Vilen Jumutc, Rocco Langone, Johan A. K. Suykens:
Regularized and sparse stochastic k-means for distributed large-scale clustering. 2535-2540 - Ran Rui, Hao Li, Yi-Cheng Tu:
Join algorithms on GPUs: A revisit after seven years. 2541-2550 - Martha Ganser, Sauptik Dhar, Unmesh Kurup, Carlos Cunha, Aca Gacic:
A data-driven approach towards patient identification for telehealth programs. 2551-2559 - Max Metzger, Michael Howard, Lee Kellogg, Rishi Kundi:
Ensemble prediction of vascular injury in Trauma care: Initial efforts towards data-driven, low-cost screening. 2560-2568 - Jinghe Zhang, Haoyi Xiong, Yu Huang, Hao Wu, Kevin Leach, Laura E. Barnes:
M-SEQ: Early detection of anxiety and depression via temporal orders of diagnoses in electronic health data. 2569-2577 - Katherine Senter, Sreenivas R. Sukumar, Robert M. Patton, Edward Chaum:
Using clinical data, hypothesis generation tools and PubMed trends to discover the association between diabetic retinopathy and antihypertensive drugs. 2578-2582 - Rina Singh, Jeffrey A. Graves, Sangkeun Lee, Sreenivas R. Sukumar, Mallikarjun Shankar:
Enabling graph appliance for genome assembly. 2583-2590 - Daniel Muller, Stefan Mau, Irena Pletikosa Cvijikj:
A framework for consensual and online privacy preserving record linkage in real-time. 2591-2599 - Tao Feng, Zhenyun Zhuang, Yi Pan, Haricharan Ramachandra:
A memory capacity model for high performing data-filtering applications in Samza framework. 2600-2605 - Chieh-Han Wu, Yang Song:
Robust and distributed web-scale near-dup document conflation in Microsoft Academic Service. 2606-2611 - Alicia L. Nobles, Ketki Vilankar, Hao Wu, Laura E. Barnes:
Evaluation of data quality of multisite electronic health record data for secondary analysis. 2612-2620 - Asma Abboura, Soror Sahri, Mourad Ouziri, Salima Benbernou:
CrowdMD: Crowdsourcing-based approach for deduplication. 2621-2627 - Laure Berti-Équille:
Data veracity estimation with ensembling truth discovery methods. 2628-2636 - Lavanya Sainik:
Distributed life cycle scheduling for cascaded data processing. 2637-2643 - David Becker, Trish Dunn King, Bill McMullen:
Big data, big data quality problem. 2644-2653 - Dhana Rao, Venkat N. Gudivada, Vijay V. Raghavan:
Data quality issues in big data. 2654-2660 - N. Keshan, P. V. Parimi, Isabelle Bichindaritz:
Machine learning for stress detection from ECG signals in automobile drivers. 2661-2669 - Kunal Malhotra, Tanner C. Hobson, Silvia Valkova, Laura L. Pullum, Arvind Ramanathan:
Sequential pattern mining of electronic healthcare reimbursement claims: Experiences and challenges in uncovering how patients are treated by physicians. 2670-2679 - Akshay Grover, Jay Gholap, Vandana P. Janeja, Yelena Yesha, Raghu Chintalapati, Harsh Marwaha, Kunal Modi:
SQL-like big data environments: Case study in clinical trial analytics. 2680-2689 - Minh-Son Dao, Koji Zettsu, Siripen Pongpaichet, Laleh Jalali, Ramesh C. Jain:
Exploring spatio-temporal-theme correlation between physical and social streaming data for event detection and pattern interpretation from heterogeneous sensors. 2690-2699 - Aki-Hiro Sato:
Microdata analysis of the accommodation survey in Japanese tourism statistics. 2700-2708 - Shihan Wang, Takao Terano:
Detecting rumor patterns in streaming social media. 2709-2715 - Hông-Ân Cao, Tri Kurniawan Wijaya, Karl Aberer, Nuno Nunes:
A collaborative framework for annotating energy datasets. 2716-2725 - Atushi Ishikawa, Shouji Fujimoto, Takayuki Mizuno, Tsutomu Watanabe:
The relation between firm age distributions and the decay rate of firm activities in the United States and Japan. 2726-2731 - Aki-Hiro Sato, Isao Ito, Hidefumi Sawai, Kentaro Iwata:
An epidemic simulation with a delayed stochastic SIR model based on international socioeconomic-technological databases. 2732-2741 - Bilal Sadiq, Faizan Ur Rehman, Akhlaq Ahmad, Md. Abdur Rahman, Sohaib Ghani, Abdullah Murad, Saleh M. Basalamah, Ahmed Lbath:
A spatio-temporal multimedia big data framework for a large crowd. 2742-2751 - Naveen Ramakrishnan, Rumi Ghosh:
Distributed dynamic elastic nets: A scalable approach for regularization in dynamic manufacturing environments. 2752-2761 - Marc Goessling, Shan Kang:
Directional decision lists. 2762-2766 - Ningxuan Kang, Cong Zhao, Jingshan Li, John A. Horst:
Analysis of key operation performance data in manufacturing systems. 2767-2770 - Abhinav Jauhri, Bradley McDanel, Chris Connor:
Outlier detection for large scale manufacturing processes. 2771-2774 - Daniela Ushizima, Talita Perciano, Dilworth Parkinson:
Fast detection of material deformation through structural dissimilarity. 2775-2781 - Ronay Ak, Raunak Bhinge:
Data analytics and uncertainty quantification for energy prediction in manufacturing. 2782-2784 - Mariam Kiran, Peter Murphy, Inder Monga, Jon Dugan, Sartaj Singh Baveja:
Lambda architecture for cost-effective batch and speed big data processing. 2785-2792 - Thomas Renner, Lauritz Thamsen, Odej Kao:
Network-aware resource management for scalable data analytics frameworks. 2793-2800 - Parinaz Ameri, Jörg Meyer, Achim Streit:
On a new approach to the index selection problem using mining algorithms. 2801-2810 - Ranjeet Devarakonda, Yaxing Wei, Michele Thornton, Ben Mayer, Peter E. Thornton, Bob Cook:
Preparing, storing, and distributing multi-dimensional scientific data. 2811-2813 - Ranjeet Devarakonda, Les A. Hook, Terri Killeffer, Misha Krassovski, Tom Boden, Stan D. Wullschleger:
Use of a metadata documentation and search tool for large data volumes: The NGEE arctic example. 2814-2816 - Erica Yang, Derek Ross, Srikanth Nagella, Martin J. Turner, Winfried Kockelmann, Genoveva Burca, Federico Montesino-Pouzols:
Data optimised computing for heterogeneous big data computing applications. 2817-2819 - Vasilis Efthymiou, Kostas Stefanidis, Eirini Ntoutsi:
Top-k computations in MapReduce: A case study on recommendations. 2820-2822 - Kai Chen, Yi Zhou, Fangyan Dai:
A LSTM-based method for stock returns prediction: A case study of China stock market. 2823-2824 - Kazuya Uesato, Hiroki Asai, Hayato Yamana:
Predicting various types of user attributes in Twitter by using personalized pagerank. 2825-2827 - Asmelash Teka Hadgu, Aastha Nigam, Ernesto Diaz-Aviles:
Large-scale learning with AdaGrad on Spark. 2828-2830 - Sudip Mittal, Karuna P. Joshi, Claudia Pearce, Anupam Joshi:
Parallelizing natural language techniques for knowledge extraction from cloud service level agreements. 2831-2833 - Christian Beecks, Merih Seran Uysal, Thomas Seidl:
Gradient-based signatures for big multimedia data. 2834-2835 - Dimitrios Rafailidis, Stefanos Antaris:
Indexing media storms on Flink. 2836-2838 - Connor Stokes, Anoop Kumar, Frederick Choi, Ralph M. Weischedel:
Scaling NLP algorithms to meet high demand. 2839 - Bonnie J. Dorr, Craig S. Greenberg, Peter C. Fontana, Mark A. Przybocki, Marion Le Bras, Cathryn A. Ploehn, Oleg Aulov, Wo Chang:
The NIST data science evaluation series: Part of the NIST information access division data science initiative. 2840-2842 - Alexei Samoylov, Jason Schlachter:
Flexible ingest framework: A scalable architecture for dynamic routing through composable pipelines. 2843-2845 - Priya Govindan, Ruobing Chen, Katya Scheinberg, Soundararajan Srinivasan:
A scalable solution for group feature selection. 2846-2848 - Luna M. Zhang:
Genetic deep neural networks using different activation functions for financial data mining. 2849-2851 - Ryota Takei, Ayahiko Niimi:
Performance of graph reconstruction method for large-scale web graph analysis. 2852-2854 - Altti Ilari Maarala, Mika Rautiainen, Miikka Salmi, Susanna Pirttikangas, Jukka Riekki:
Low latency analytics for streaming traffic data with Apache Spark. 2855-2858 - Divya Rao, Wee Keong Ng:
How to make money from your information and keep your privacy. 2859-2861 - B. Kezia Rani, A. Vinaya Babu:
Scheduling of Big Data application workflows in cloud and inter-cloud environments. 2862-2864 - Peter Li, Simon N. Yates, Jenna K. Lovely, David W. Larson:
Patient-like-mine: A real time, visual analytics tool for clinical decision support. 2865-2867 - Samuel D. Johnson, Kang-Yu Ni:
A pricing mechanism using social media and web data to infer dynamic consumer valuations. 2868-2870 - Yifan Hao, Huiping Cao, Yan Qi, Chuan Hu, Sukumar Brahma, Jingyu Han:
Efficient keyword search on graphs using MapReduce. 2871-2873 - Yuqing Zhu:
Non-blocking one-phase commit made possible for distributed transactions over replicated data. 2874-2876 - Daisaku Yokoyama, Masashi Toyoda:
A large scale examination of vehicle recorder data to understand relationship between drivers' behaviors and their past driving histories. 2877-2879 - Yoshitaka Yamamoto, Koji Iwanuma:
Online pattern mining for high-dimensional data streams. 2880-2882 - Zhenhui Liu, Jingjing He, Yufei Xue, Zhenzhong Huang, Manli Li, Zhihui Du:
Modeling the learning behaviors of massive open online courses. 2883-2885 - Jian Yin, Dongfang Zhao:
Data confidentiality challenges in big data applications. 2886-2888 - Anh-Phuong Ta:
Factorization machines with follow-the-regularized-leader for CTR prediction in display advertising. 2889-2891 - Aakash Deep Singh, Wei Wu, Shili Xiang, Shonali Krishnaswamy:
Taxi trip time prediction using similar trips and road network data. 2892-2894 - Long Ma, Yanqing Zhang:
Using Word2Vec to process big text data. 2895-2897 - Longbiao Chen, Jérémie Jakubowicz:
Inferring bike trip patterns from bike sharing system open data. 2898-2900 - Xiaobing Zhou, Tonglin Li, Ke Wang, Dongfang Zhao, Iman Sadooghi, Ioan Raicu:
MHT: A light-weight scalable zero-hop MPI enabled distributed key-value store. 2901-2903 - Hangu Yeo, Catherine H. Crawford:
Big Data: Cloud computing in genomics applications. 2904-2906 - Maoyuan Zhang, Fang Yuan, Jianping Zhu:
Integrating semantic knowledge into Tag-LDA model through cloud model. 2907-2909 - Yunkai Liu, Christopher Magno:
A case study to apply mobile technology into individual's local community. 2910-2912 - Biying Tan, Sangaralingam Kajanan, Vivek Kumar Singh, Chandra Sekhar Saripaka, Giuseppe Manai:
Clairvoyant-push: A real-time news personalized push notifier using topic modeling and social scoring for enhanced reader engagement. 2913-2915 - Hua Fang, Honggang Wang, Chonggang Wang, Mahmoud Daneshmand:
Using probabilistic approach to joint clustering and statistical inference: Analytics for big investment data. 2916-2918 - Jing Wang, Nikos Ntarmos, Peter Triantafillou:
Towards a subgraph/supergraph cached query-graph index. 2919-2921 - Ratna Madhuri Maddipatla, Mirsad Hadzikadic, Dipti Patel Misra, Lixia Yao:
30 Day hospital readmission analysis. 2922-2924 - Mehrdad Yazdani, Larry Smarr:
Using pairwise difference features to measure temporal changes in the microbial ecology. 2925-2927 - Ardi Imawan, Joonho Kwon:
A timeline visualization system for road traffic big data. 2928-2929 - Gaël Chareyron, Bérengère Branchet, Sebastien Jacquot:
A new area tourist ranking method. 2930-2932 - Maoyuan Zhang, Jianping Zhu, Lijun Hua, Fang Yuan:
Text retrieval based on the feature conversion of vector space. 2933-2935 - Kang Li, Vinay Deolalikar, Neeraj Pradhan:
Big data gathering and mining pipelines for CRM using open-source. 2936-2938 - Jay Gholap, Vandana P. Janeja, Yelena Yesha:
Unified framework for clinical data analytics (U-CDA). 2939-2941 - Chanpaul Jin Wang, Hua Fang, Chonggang Wang, Mahmoud Daneshmand, Honggang Wang:
A novel initialization method for particle swarm optimization-based FCM in big biomedical data. 2942-2944 - Chandra Khatri, Suman Voleti, Sathish Veeraraghavan, Nish Parikh, Atiq Islam, Shifa Mahmood, Neeraj Garg, Vivek Singh:
Algorithmic content generation for products. 2945-2947 - Ismini Lourentzou, Graham Dyer, Abhishek Sharma, ChengXiang Zhai:
Hotspots of news articles: Joint mining of news text & social media to discover controversial points in news. 2948-2950 - Khalifeh AlJadda, Mohammed Korayem, Trey Grainger:
Improving the quality of semantic relationships extracted from massive user behavioral data. 2951-2953 - Maruthi Prithivirajan, Vivian Lai, Kyong Jin Shim, Koo Ping Shung:
Analysis of star ratings in consumer reviews: A case study of Yelp. 2954-2956 - S. George Djorgovski, Ashish Mahabal, Daniel J. Crichton, Basit Chaudhry:
From stars to patients: Lessons from space science and astrophysics for health care informatics. 2957-2959