


default search action
BigData Conference 2016: Washington DC, USA
- James Joshi, George Karypis, Ling Liu, Xiaohua Hu, Ronay Ak, Yinglong Xia, Weijia Xu, Aki-Hiro Sato, Sudarsan Rachuri, Lyle H. Ungar, Philip S. Yu, Rama Govindaraju, Toyotaro Suzumura:
2016 IEEE International Conference on Big Data (IEEE BigData 2016), Washington DC, USA, December 5-8, 2016. IEEE Computer Society 2016, ISBN 978-1-4673-9005-7 - Chaitanya K. Baru:
Harnessing the data revolution: A perspective from the national science foundation. 2 - Elisa Bertino:
Big data security and privacy. 3 - Jiawei Han:
On the power of big data: Mining structures from massive, unstructured text data. 4 - Mark Johnson:
Leveraging high performance computing to drive advanced manufacturing R&D at the US department of energy. 5-6 - Michael Stonebraker, Dong Deng, Michael L. Brodie:
Database decay and how to avoid it. 7-16 - Christian Böhm, Martin Perdacher
, Claudia Plant
:
Cache-oblivious loops based on a novel space-filling curve. 17-26 - Jagat Sesh Challa, Poonam Goyal
, S. Nikhil
, Aditya Mangla, Sundar Balasubramaniam, Navneet Goyal:
DD-Rtree: A dynamic distributed data structure for efficient data distribution among cluster nodes for spatial data mining algorithms. 27-36 - Ravikant Dindokar, Neel Choudhury, Yogesh Simmhan
:
A meta-graph approach to analyze subgraph-centric distributed programming models. 37-47 - Subhadeep Karan, Jaroslaw Zola
:
Exact structure learning of Bayesian networks by optimal path extension. 48-55 - Walaa Eldin Moustafa, Vicky Papavasileiou, Ken Yocum, Alin Deutsch:
Datalography: Scaling datalog graph analytics on graph processing systems. 56-65 - Yosuke Oyama, Akihiro Nomura
, Ikuro Sato, Hiroki Nishimura, Yukimasa Tamatsu, Satoshi Matsuoka:
Predicting statistics of asynchronous SGD parameters for a large-scale distributed deep learning system on GPU supercomputers. 66-75 - Benjamin Sirb, Xiaojing Ye:
Consensus optimization with delayed and stochastic gradients on decentralized networks. 76-85 - Xiaoli Song, Yan Rui, Xiaohua Hu:
Pairwise topic model and its application to topic transition and evolution. 86-95 - Yuan Yuan
, Sihong Xie, Chun-Ta Lu, Jie Tang, Philip S. Yu:
Interpretable and effective opinion spam detection via temporal patterns mining across websites. 96-105 - Fang Zhou, Mohamed F. Ghalwash
, Zoran Obradovic:
A fast structured regression for large networks. 106-115 - Adiska Fardani Haryadi, Joris Hulstijn, Agung Wahyudi, Haiko Van Der Voort, Marijn Janssen:
Antecedents of big data quality: An empirical examination in financial service organizations. 116-121 - Joseph Jupin, Justin Y. Shi, Eduard C. Dragut:
PSH: A probabilistic signature hash method with hash neighborhood candidate generation for fast edit-distance string comparison on big data. 122-127 - Rocco Langone, Johan A. K. Suykens
:
Efficient multiple scale kernel classifiers. 128-133 - Joaquim F. Silva
, Carlos Gonçalves
, José C. Cunha:
A theoretical model for n-gram distribution in big data corpora. 134-141 - Jonathan Stokes, Steven Weber:
The self-avoiding walk-jump (SAWJ) algorithm for finding maximum degree nodes in large graphs. 142-149 - Xiaoli Song, Xiaotong Wang, Xiaohua Hu:
Semantic pattern mining for text mining. 150-155 - Kenji Yamanishi
, Kohei Miyaguchi:
Detecting gradual changes from data stream using MDL-change statistics. 156-163 - Rongda Zhu, Aston Zhang, Jian Peng, Chengxiang Zhai:
Exploiting temporal divergence of topic distributions for event detection. 164-171 - Timo Bingmann, Michael Axtmann, Emanuel Jöbstl, Sebastian Lamm, Huyen Chau Nguyen, Alexander Noe, Sebastian Schlag, Matthias Stumpp, Tobias Sturm, Peter Sanders:
Thrill: High-performance algorithmic distributed batch data processing with C++. 172-183 - Liuhua Chen, Haiying Shen:
Towards resource-efficient cloud systems: Avoiding over-provisioning in demand-prediction based resource provisioning. 184-193 - Katerina Doka, Nikolaos Papailiou, Victor Giannakouris, Dimitrios Tsoumakos, Nectarios Koziris:
Mix 'n' match multi-engine analytics. 194-203 - Alex Gittens, Aditya Devarakonda
, Evan Racah, Michael F. Ringenburg
, Lisa Gerhardt, Jey Kottalam, Jialin Liu, Kristyn J. Maschhoff, Shane Canon, Jatin Chhugani, Pramod Sharma, Jiyan Yang, James Demmel, Jim Harrell, Venkat Krishnamurthy, Michael W. Mahoney, Prabhat:
Matrix factorizations at scale: A comparison of scientific data analytics in spark and C+MPI using three case studies. 204-213 - Yin Huang, Yelena Yesha, Milton Halem, Yaacov Yesha, Shujia Zhou:
YinMem: A distributed parallel indexed in-memory computation system for large scale data analytics. 214-222 - Nusrat Sharmin Islam, Md. Wasi-ur-Rahman, Xiaoyi Lu, Dhabaleswar K. Panda:
Efficient data access strategies for Hadoop and Spark on HPC cluster with heterogeneous storage. 223-232 - Zhuozhao Li
, Haiying Shen, Jeffrey Denton, Walter B. Ligon III:
Comparing application performance on HPC-based Hadoop platforms with local storage and dedicated storage. 233-242 - Jinwei Liu, Haiying Shen, Husnu S. Narman
:
CCRP: Customized cooperative resource provisioning for high resource utilization in clouds. 243-252 - Xiaoyi Lu, Dipti Shankar, Shashank Gugnani, Dhabaleswar K. Panda:
High-performance design of apache spark with RDMA and its benefits on various workloads. 253-262 - Tomoki Yoshihisa, Takahiro Hara:
A low-load stream processing scheme for IoT environments. 263-272 - Yuan Yuan, Meisam Fathi Salmi, Yin Huai, Kaibo Wang, Rubao Lee, Xiaodong Zhang:
Spark-GPU: An accelerated in-memory data processing engine on clusters. 273-283 - Angen Zheng, Alexandros Labrinidis, Panos K. Chrysanthis
, Jack Lange:
Argo: Architecture-aware graph partitioning. 284-293 - Kareem S. Aggour
, Bülent Yener:
Adapting to data sparsity for efficient parallel PARAFAC tensor decomposition in Hadoop. 294-301 - Yadu N. Babuji, Kyle Chard, Aaron Gerow, Eamon Duede:
Cloud Kotta: Enabling secure and scalable data analytics in the cloud. 302-310 - Chunkun Bo, Ke Wang, Jeffrey J. Fox
, Kevin Skadron
:
Entity resolution acceleration using the automata processor. 311-318 - Kyle Chard, Mike D'Arcy
, Benjamin D. Heavner
, Ian T. Foster, Carl Kesselman
, Ravi K. Madduri
, Alexis A. Rodriguez, Stian Soiland-Reyes
, Carole A. Goble, Kristi Clark, Eric W. Deutsch, Ivo D. Dinov
, Nathan D. Price
, Arthur W. Toga:
I'll take that to go: Big data bags and minimal identifiers for exchange of large, complex datasets. 319-328 - Chun-Chieh Chen, Chih-Ya Shen, Ming-Syan Chen:
Massive parallelism for non-linear and non-stationary data analysis with GPGPU. 329-334 - Stratos Dimopoulos
, Chandra Krintz, Rich Wolski:
Big data framework interference in restricted private cloud settings. 335-340 - Khoa D. Doan
, Amidu O. Oloso, Kwo-Sen Kuo, Thomas L. Clune, Hongfeng Yu, Brian Nelson, Jian Zhang:
Evaluating the impact of data placement to spark and SciDB with an Earth Science use case. 341-346 - Saliya Ekanayake, Supun Kamburugamuve, Pulasthi Wickramasinghe, Geoffrey C. Fox:
Java thread and process performance for parallel machine learning on multicore HPC clusters. 347-354 - Gheorghi Guzun, Josiah C. McClurg, Guadalupe Canahuate, Raghuraman Mudumbai:
Power efficient big data analytics algorithms through low-level operations. 355-361 - Satoshi Imamura
, Keitaro Oka, Yuichiro Yasui, Yuichi Inadomi, Katsuki Fujisawa
, Toshio Endo, Koji Ueno, Keiichiro Fukazawa, Nozomi Hata, Yuta Kakibuka, Koji Inoue, Takatsugu Ono:
Evaluating the impacts of code-level performance tunings on power efficiency. 362-369 - Fan Jiang, Claris Castillo, Charles Schmitt
:
RADU: Bridging the divide between data and infrastructure management to support data-driven collaborations. 370-377 - Jinfeng Li, James Cheng, Yunjian Zhao, Fan Yang, Yuzhen Huang, Haipeng Chen, Ruihao Zhao:
A comparison of general-purpose distributed systems for data processing. 378-383 - Jinwei Liu, Haiying Shen:
A popularity-aware cost-effective replication scheme for high data durability in cloud storage. 384-389 - Luis Pineda-Morales, Ji Liu, Alexandru Costan
, Esther Pacitti, Gabriel Antoniu, Patrick Valduriez, Marta Mattoso
:
Managing hot metadata for scientific workflows on multisite clouds. 390-397 - Hitoshi Sato, Ryo Mizote, Satoshi Matsuoka, Hirotaka Ogawa:
I/O chunking and latency hiding approach for out-of-core sorting acceleration using GPU and flash NVM. 398-403 - Dipti Shankar, Xiaoyi Lu, Dhabaleswar K. Panda:
Boldio: A hybrid and resilient burst-buffer over lustre for accelerating big data I/O. 404-409 - Christoforos Svingos, Theofilos Mailis, Herald Kllapi, Lefteris Stamatogiannakis, Yannis Kotidis, Yannis E. Ioannidis:
Real time processing of streaming and static information. 410-415 - Hans Vandierendonck, Karen L. Murphy, Mahwish Arif, Dimitrios S. Nikolopoulos
:
HPTA: High-performance text analytics. 416-423 - Jorge Veiga
, Roberto R. Expósito
, Xoán C. Pardo, Guillermo L. Taboada
, Juan Touriño
:
Performance evaluation of big data frameworks for large-scale data analytics. 424-431 - Yali Zhao, Rodrigo N. Calheiros
, James Bailey, Richard O. Sinnott
:
SLA-based profit optimization for resource management of big data analytics-as-a-service platforms in cloud computing environments. 432-441 - Kaiji Chen, Yongluan Zhou
:
Materialized view selection in feed following systems. 442-451 - Victor Giannakouris, Nikolaos Papailiou, Dimitrios Tsoumakos, Nectarios Koziris:
MuSQLE: Distributed SQL query execution over multiple engine environments. 452-461 - Ahsanul Haque, Zhuoyi Wang, Swarup Chandra, Yupeng Gao, Latifur Khan
, Charu C. Aggarwal:
Sampling-based distributed Kernel mean matching using spark. 462-471 - Yudian Ji, Yuda Zang, Wuman Luo, Xibo Zhou
, Ye Ding, Lionel M. Ni:
Clockwise compression for trajectory data under road network constraints. 472-481 - Karuna P. Joshi
, Aditi Gupta, Sudip Mittal, Claudia Pearce, Anupam Joshi
, Tim Finin:
Semantic approach to automating management of big data privacy policies. 482-491 - Eleazar Leal, Le Gruenwald, Jianting Zhang:
Handling uncertainty in trajectories of moving objects in unconstrained outdoor spaces. 492-501 - Cuong M. Nguyen, Philip J. Rhodes:
Accelerating range queries for large-scale unstructured meshes. 502-511 - Md. Shiblee Sadik, Le Gruenwald, Eleazar Leal:
In pursuit of outliers in multi-dimensional data streams. 512-521 - Jianpeng Xu, Jiayu Zhou, Pang-Ning Tan
, Xi Liu, Lifeng Luo
:
WISDOM: Weighted incremental spatio-temporal multi-task learning via tensor decomposition. 522-531 - Farrukh Ahmed, Michele Samorani, Colin Bellinger, Osmar R. Zaïane:
Advantage of integration in big data: Feature generation in multi-relational databases for imbalanced learning. 532-539 - Matthew Edwards
, Stephen Wattam, Paul Rayson
, Awais Rashid
:
Sampling labelled profile data for identity resolution. 540-547 - Frank Pallas, Johannes Günther, David Bermbach:
Pick your choice in HBase: Security or performance. 548-554 - Rui Ren, Zhen Jia, Lei Wang, Jianfeng Zhan, Tianxu Yi:
BDTUne: Hierarchical correlation-based performance analysis and rule-based diagnosis for big data systems. 555-562 - Ramyar Saeedi, Hassan Ghasemzadeh, Assefaw Hadish Gebremedhin:
Transfer learning algorithms for autonomous reconfiguration of wearable systems. 563-569 - Mei Saouk, Christos Doulkeridis, Akrivi Vlachou, Kjetil Nørvåg
:
Efficient processing of top-k joins in MapReduce. 570-577 - Ting Wu, Chen Jason Zhang
, Lei Chen, Pan Hui, Siyuan Liu:
Object identification with Pay-As-You-Go crowdsourcing. 578-585 - Nesreen K. Ahmed, Theodore L. Willke, Ryan A. Rossi:
Estimation of local subgraph counts. 586-595 - Christian Beecks, Alexander Graß:
Multi-step threshold algorithm for efficient feature-based query processing in large-scale multimedia databases. 596-605 - Mansurul Alam Bhuiyan, Mohammad Al Hasan:
PRIIME: A generic framework for interactive personalized interesting pattern discovery. 606-615 - Ngot Bui, Thanh Le, Vasant G. Honavar
:
Labeling actors in multi-view social networks by integrating information from within and across multiple views. 616-625 - Hariton Efstathiades, Demetris Antoniades, George Pallis, Marios D. Dikaiakos, Zoltán Szlávik
, Robert-Jan Sips:
Online social network evolution: Revisiting the Twitter graph. 626-635 - Jianliang Gao, Bo Song, Ping Liu, Weimao Ke, Jianxin Wang, Xiaohua Hu:
Parallel top-k subgraph query in massive graphs: Computing from the perspective of single vertex. 636-645 - Xiaoyu Ge, Yanbing Xue, Zhipeng Luo, Mohamed A. Sharaf, Panos K. Chrysanthis
:
REQUEST: A scalable framework for interactive construction of exploratory queries. 646-655 - Chun Guo, Xiaozhong Liu:
Dynamic feature generation and selection on heterogeneous graph for music recommendation. 656-665 - Nguyen Ho, Huy T. Vo, Mai Vu:
An adaptive information-theoretic approach for identifying temporal correlations in big data sets. 666-675 - Chao Huang, Dong Wang, Shenglong Zhu, Daniel Yue Zhang:
Towards unsupervised home location inference from online social media. 676-685 - Wei Jiang, Juan Rodriguez, Torsten Suel:
Improved methods for static index pruning. 686-695 - Wooyeol Kim, Younghoon Kim, Kyuseok Shim:
Parallel computation of k-nearest neighbor joins using MapReduce. 696-705 - Sarasi Lalithsena, Pavan Kapanipathi, Amit P. Sheth:
Harnessing relationships for domain-specific subgraph extraction: A recommendation use case. 706-715 - Panagiotis Liakos, Alexandros Ntoulas, Alex Delis:
Scalable link community detection: A local dispersion-aware approach. 716-725 - Hongfu Liu, Yuchao Zhang, Bo Deng, Yun Fu:
Outlier detection via sampling ensemble. 726-735 - Athanasios N. Nikolakopoulos
, Antonia Korba, John D. Garofalakis:
Random surfing on multipartite graphs. 736-745 - Cheong Hee Park, Youngsoon Kang:
An active learning method for data streams with concept drift. 746-752 - Charles Siegel, Jeff Daily, Abhinav Vishnu:
Adaptive neuron apoptosis for accelerating deep learning on large scale systems. 753-762 - Ata Turk, Hao Chen, Anthony Byrne
, John Knollmeyer, Sastry S. Duri, Canturk Isci, Ayse K. Coskun:
DeltaSherlock: Identifying changes in the cloud. 763-772 - Xiaokai Wei, Bokai Cao, Weixiang Shao, Chun-Ta Lu, Philip S. Yu:
Community detection with partially observable links and node attributes. 773-782 - Yongyi Xian, Yan Liu, Chuanfei Xu:
Parallel gathering discovery over big trajectory data. 783-792 - Hu Xu, Sihong Xie, Lei Shu, Philip S. Yu:
CER: Complementary entity recognition via knowledge expansion on large unlabeled product reviews. 793-802 - Jingyuan Zhang, Chun-Ta Lu, Mianwei Zhou, Sihong Xie, Yi Chang
, Philip S. Yu:
HEER: Heterogeneous graph embedding for emerging relation detection from news. 803-812 - Hao Zhang, Yuanyuan Zhu, Lu Qin
, Hong Cheng, Jeffrey Xu Yu:
Efficient triangle listing for billion-scale graphs. 813-822 - Yating Zhang, Adam Jatowt
, Katsumi Tanaka:
Towards understanding word embeddings: Automatically explaining similarity of terms. 823-832 - Kai Zhao, Denis Khryashchev, Juliana Freire
, Cláudio T. Silva, Huy T. Vo:
Predicting taxi demand at high spatial resolution: Approaching the limit of predictability. 833-842 - Yixian Zheng, Wenchao Wu, Haipeng Zeng
, Nan Cao, Huamin Qu, Mingxuan Yuan, Jia Zeng, Lionel M. Ni:
TelcoFlow: Visual exploration of collective behaviors based on telco data. 843-852 - Morteza Zihayat, Zane Zhenhua Hu, Aijun An, Yonggang Hu:
Distributed and parallel high utility sequential pattern mining. 853-862 - Philip K. Chan, Ebad Ahmadzadeh:
Improving efficiency of maximizing spread in the flow authority model for large sparse networks. 863-868 - Wanying Ding, Yue Zhang, Chaomei Chen
, Xiaohua Hu:
Semi-supervised Dirichlet-Hawkes process with applications of topic detection and tracking in Twitter. 869-874 - Ioanna Filippidou, Yannis Kotidis:
Effective and efficient graph augmentation in large graphs. 875-880 - Ville Hyvönen, Teemu Pitkänen, Sotiris K. Tasoulis, Elias Jaasaari, Risto Tuomainen, Liang Wang, Jukka Corander, Teemu Roos
:
Fast nearest neighbor search through sparse random projections and voting. 881-888 - Saïd Jabbour, Nizar Mhadhbi, Abdesattar Mhadhbi, Badran Raddaoui, Lakhdar Sais:
Summarizing big graphs by means of pseudo-boolean constraints. 889-894 - Uwe Jugel, Zbigniew Jerzak, Volker Markl:
Big data on a few pixels. 895-900 - Mohammad Mahdi Kamani, Farshid Farhat, Stephen Wistar, James Z. Wang
:
Shape matching using skeleton context for automated bow echo detection. 901-908 - Weimao Ke, Javed Mostafa:
Scalability analysis of distributed search in large peer-to-peer networks. 909-914 - Nicolas Kourtellis, Gianmarco De Francisci Morales
, Albert Bifet
, Arinto Murdopo:
VHT: Vertical hoeffding tree. 915-922 - Yuh-Jye Lee
, Hsing-Kuo Pao, Shueh-Han Shih, Jing-Yao Lin, Xin-Rong Chen:
Compressed learning for time series classification. 923-930 - Xiaopeng Li
, Ming Cheung, James She:
Connection discovery using shared images by Gaussian relational topic model. 931-936 - Haofu Liao, Yucheng Li, Tianran Hu, Jiebo Luo
:
Inferring restaurant styles by mining crowd sourced photos from user-review websites. 937-944 - Chang Liu, Bin Wu, Yi Yang, Zhihong Guo:
Multiple submodels parallel support vector machine on spark. 945-950 - Xiang Liu, Torsten Suel:
What makes a group fail: Modeling social group behavior in event-based social networks. 951-956 - Jinna Lv, Bin Wu, Shuai Yang, Bingjing Jia, Peigang Qiu:
Efficient large scale near-duplicate video detection base on spark. 957-962 - Stathis Maroulis, Ioannis Boutsis, Vana Kalogeraki
:
Context-aware point of interest recommendation using tensor factorization. 963-968 - Steven Morse, Marta C. González, Natasha Markuzon:
Persistent cascades: Measuring fundamental communication structure in social networks. 969-975 - Tathagata Mukherjee, Biswas Parajuli, Piyush Kumar, Eduardo L. Pasiliao Jr.:
TruthCore: Non-parametric estimation of truth from a collection of authoritative sources. 976-983 - Sergey Nepomnyachiy, Torsten Suel:
Efficient index updates for mixed update and query loads. 984-991 - Gopi Chand Nutakki, Olfa Nasraoui
:
Compartmentalized adaptive topic mining on social media streams. 992-997 - Aduri Pavan, Paul Quint, Stephen D. Scott, N. V. Vinodchandran, J. Smith:
Computing triangle and open-wedge heavy-hitters in large networks. 998-1005 - Michael L. Rilee, Kwo-Sen Kuo, Thomas L. Clune, Amidu Oloso, Paul G. Brown, Hongfeng Yu:
Addressing the big-earth-data variety challenge with the hierarchical triangular mesh. 1006-1011 - Weixiang Shao, Lifang He
, Chun-Ta Lu, Philip S. Yu:
Online multi-view clustering with incomplete views. 1012-1017 - Chuan Shi, Bowei He, Menghao Zhang, Fuzhen Zhuang, Philip S. Yu, Naiwang Guo:
Expenditure aware rating prediction for recommendation. 1018-1025 - Sreenivas R. Sukumar, Ramakrishnan Kannan, Seung-Hwan Lim, Michael A. Matheson:
Kernels for scalable data analysis in science: Towards an architecture-portable future. 1026-1031 - Ioanna Tsalouchidou, Gianmarco De Francisci Morales
, Francesco Bonchi, Ricardo Baeza-Yates
:
Scalable dynamic graph summarization. 1032-1039 - Koji Ueno, Toyotaro Suzumura, Naoya Maruyama, Katsuki Fujisawa
, Satoshi Matsuoka:
Extreme scale breadth-first search on supercomputers. 1040-1047 - Pascal Welke, Alexander Markowetz, Torsten Suel, Maria Christoforaki:
Three-hop distance estimation in social graphs. 1048-1055 - Tong Yu, Ole J. Mengshoel, Alvin Jude, Eugen Feller, Julien Forgeat, Nimish Radia:
Incremental learning for matrix factorization in recommender systems. 1056-1063 - Abir Zayani, Chiheb-Eddine Ben N'cir
, Nadia Essoussi:
Parallel clustering method for non-disjoint partitioning of large-scale data based on spark framework. 1064-1069 - Da-Chuan Zhang, Mei Li, Chang-Dong Wang:
Point of interest recommendation with social and geographical influence. 1070-1075 - Daniel Yue Zhang, Rungang Han, Dong Wang, Chao Huang:
On robust truth discovery in sparse social media sensing. 1076-1081 - Rajesh Sankaran, Ricardo A. Calix:
On the feasibility of an embedded machine learning processor for intrusion detection. 1082-1089 - Heqing Huang, Cong Zheng, Junyuan Zeng, Wu Zhou, Sencun Zhu, Peng Liu, Suresh Chari, Ce Zhang:
Android malware development on public malware scanning platforms: A large-scale data-driven study. 1090-1099 - Hui Li, Jiangtao Cui, Xiaobin Lin, Jianfeng Ma:
Improving the utility in differential private histogram publishing: Theoretical study and practice. 1100-1109 - Xiao Pan, Jiawei Zhang, Fengjiao Wang, Philip S. Yu:
DistSD: Distance-based social discovery with personalized posterior screening. 1110-1119 - Quan Zhang, Mu Qiao, Ramani R. Routray, Weisong Shi
:
H2O: A hybrid and hierarchical outlier detection method for large scale data protection. 1120-1129 - Ariel Bar, Bracha Shapira
, Lior Rokach, Moshe Unger:
Scalable attack propagation model and algorithms for honeypot systems. 1130-1135 - Bas van Stein
, Matthijs van Leeuwen, Thomas Bäck
:
Local subspace-based outlier detection using global neighbourhoods. 1136-1142 - Shuo Wang, Richard O. Sinnott
, Surya Nepal
:
Protecting the location privacy of mobile social media users. 1143-1150 - Michael J. Anderson, Mihai Capota
, Javier S. Turek
, Xia Zhu, Theodore L. Willke, Yida Wang, Po-Hsuan Chen, Jeremy R. Manning, Peter J. Ramadge, Kenneth A. Norman:
Enabling factor analysis on thousand-subject neuroimaging datasets. 1151-1160 - Yanan Bao, Huasen Wu, Tianxiao Zhang, Albara Ah Ramli
, Xin Liu:
Shooting a moving target: Motion-prediction-based transmission for 360-degree videos. 1161-1170 - Sayan Goswami
, Arghya Kusum Das, Richard Platania, Kisung Lee, Seung-Jong Park
:
Lazer: Distributed memory-efficient assembly of large-scale genomes. 1171-1181 - Zhichuan Huang, Ting Zhu:
Leveraging multi-granularity energy data for accurate energy demand forecast in smart grids. 1182-1191 - Xiaowei Jia, Ankush Khandelwal, James Gerber
, Kimberly Carlson, Paul C. West
, Vipin Kumar:
Learning large-scale plantation mapping from imperfect annotators. 1192-1201 - Darja Krushevskaja, William Simpson, S. Muthukrishnan:
Ad allocation with secondary metrics. 1202-1211 - Azad Naik, Huzefa Rangwala:
Embedding feature selection for large-scale hierarchical classification. 1212-1221 - Naman Shah, Harshil Shah, Matthew Malensek, Sangmi Lee Pallickara, Shrideep Pallickara:
Network analysis for identifying and characterizing disease outbreak influence from voluminous epidemiology data. 1222-1231 - Francesco Versaci
, Luca Pireddu
, Gianluigi Zanetti
:
Scalable genomics: From raw data to aligned reads on Apache YARN. 1232-1241 - Yida Wang, Bryn Keller
, Mihai Capota
, Michael J. Anderson, Narayanan Sundaram, Jonathan D. Cohen, Kai Li, Nicholas B. Turk-Browne, Theodore L. Willke:
Real-time full correlation matrix analysis of fMRI data. 1242-1251 - Yanan Xu, Yanmin Zhu:
When remote sensing data meet ubiquitous urban data: Fine-grained air quality inference. 1252-1261 - Jingyuan Yang, Chuanren Liu
, Mingfei Teng, March Liao, Hui Xiong:
Buyer targeting optimization: A unified customer segmentation perspective. 1262-1271 - Mehrdad Yazdani, Bryn C. Taylor
, Justine W. Debelius
, Weizhong Li, Rob Knight
, Larry Smarr:
Using machine learning to identify major shifts in human gut microbiome protein family abundance in disease. 1272-1280 - Chunqiu Zeng, Qing Wang, Wentao Wang, Tao Li, Larisa Shwartz:
Online inference for time-varying temporal dependency discovery from time series. 1281-1290 - Ke Zhang
, Jianwu Xu, Martin Renqiang Min
, Guofei Jiang, Konstantinos Pelechrinis, Hui Zhang:
Automated IT system failure prediction: A deep learning approach. 1291-1300 - Hông-Ân Cao, Tri Kurniawan Wijaya, Karl Aberer, Nuno Nunes
:
Estimating human interactions with electrical appliances for activity-based energy savings recommendations. 1301-1308 - Zexi Chen, Ranga Raju Vatsavai
, Bharathkumar Ramachandra
, Qiang Zhang, Nagendra Singh, Sreenivas R. Sukumar:
Scalable nearest neighbor based hierarchical change detection framework for crop monitoring. 1309-1314 - Aman Gupta, S. Muthukrishnan, Smita Wadhwa:
Optimizing callout in unified ad markets. 1315-1321 - Zhichuan Huang, Tiantian Xie, Ting Zhu, Jianwu Wang, Qingquan Zhang:
Application-driven sensing data reconstruction and selection based on correlation mining and dynamic feedback. 1322-1327 - Xiaowei Jia, Xi C. Chen, Anuj Karpatne, Vipin Kumar:
Identifying dynamic changes with noisy labels in spatial-temporal data: A study on large-scale water monitoring application. 1328-1333 - Mike Lakoju, Alan Serrano:
A strategic approach for visualizing the value of big data (SAVV-BIGD) framework. 1334-1339 - Mai H. Nguyen, Dylan Uys, Daniel Crawl, Charles Cowart, Ilkay Altintas:
A scalable approach for location-specific detection of Santa Ana conditions. 1340-1345 - Susanna Pirttikangas
, Ekaterina Gilman
, Xiang Su, Teemu Leppänen
, Anja Keskinarkaus, Mika Rautiainen, Mikko Pyykkönen, Jukka Riekki:
Experiences with smart city traffic pilot. 1346-1352 - Elyas Sabeti, Anders Høst-Madsen:
How interesting images are: An atypicality approach for social networks. 1353-1358 - Wenzhao Zhang, Houjun Tang, Stephen Ranshous, Surendra Byna
, Daniel F. Martin, Kesheng Wu
, Bin Dong, Scott Klasky, Nagiza F. Samatova:
Exploring memory hierarchy and network topology for runtime AMR data sharing across scientific applications. 1359-1366 - Pavel A. Dmitriev, Brian Frasca, Somit Gupta, Ron Kohavi, Garnet Jason Vaz:
Pitfalls of long-term online controlled experiments. 1367-1376 - Juergen Heit, Jiayi Liu, Mohak Shah:
An architecture for the deployment of statistical models for the big data era. 1377-1384 - Raya Horesh, Kush R. Varshney, Jinfeng Yi:
Information retrieval, fusion, completion, and clustering for employee expertise estimation. 1385-1393 - Rishi Chhatwal, Nathaniel Huber-Fliflet, Robert Keeling, Jianping Zhang, Haozhen Zhao:
Empirical evaluations of preprocessing parameters' impact on predictive coding's effectiveness. 1394-1401 - Ruoyu Wang, Daniel Sun, Guoqiang Li, Muhammad Atif, Surya Nepal:
LogProv: Logging events as provenance of big data analytics pipelines with trustworthiness. 1402-1411 - Bradford Littooy, Sophie Loire, Michael Georgescu, Igor Mezic
:
Pattern recognition and classification of HVAC rule-based faults in commercial buildings. 1412-1421 - Adetokunbo Makanju, Zahra Farzanyar, Aijun An, Nick Cercone, Zane Zhenhua Hu, Yonggang Hu:
Deep parallelization of parallel FP-growth using parent-child MapReduce. 1422-1431 - Nicolás Poggi
, Josep Lluis Berral
, Thomas Fenech, David Carrera
, José A. Blakeley, Umar Farooq Minhas, Nikola Vujic:
The state of SQL-on-Hadoop in the cloud. 1432-1443 - Emily Grace, Ankit Rai, Elissa M. Redmiles, Rayid Ghani:
Detecting fraud, corruption, and collusion in international development contracts: The design of a proof-of-concept automated system. 1444-1453 - Michele Samorani, Farrukh Ahmed, Osmar R. Zaïane:
Automatic generation of relational attributes: An application to product returns. 1454-1463 - Syed Yousaf Shah, Brent Paulovicks, Petros Zerfos:
Data-at-rest security for spark. 1464-1473 - Mylene Simon, Joe Chalfoun, Mary Brady, Peter Bajcsy:
Do we trust image measurements? Variability, accuracy and traceability of image features. 1474-1482 - Sreenivas R. Sukumar, Michael A. Matheson, Ramakrishnan Kannan, Seung-Hwan Lim:
Mini-apps for high performance data analysis. 1483-1492 - Tomasz Tajmajer, Malwina Splawinska
, Piotr Wasilewski
, Stan Matwin
:
Predicting annual average daily highway traffic from large data and very few measurements. 1493-1501 - Ganesh Venkataraman, Abhimanyu Lad, Lin Guo, Shakti Sinha:
Fast, lenient and accurate: Building personalized instant search experience at LinkedIn. 1502-1511 - Hui Wu, Yi Fang, Huming Wu, Shenhong Zhu:
Diversifying trending topic discovery via Semidefinite Programming. 1512-1521 - Xuchao Zhang, Zhiqian Chen, Weisheng Zhong, Arnold P. Boedihardjo, Chang-Tien Lu
:
Storytelling in heterogeneous Twitter entity network based on hierarchical cluster routing. 1522-1531 - Wenjun Zhou
, Yun Zhu, Faizan Javed, Mahmudur Rahman, Janani Balaji, Matt McNair:
Quantifying skill relevance to job titles. 1532-1541 - Zhenyun Zhuang, Haricharan Ramachandra, Badri Sridharan, Brandon Duncan, Kishore Gopalakrishna, Jean-Francois Im:
SmartCache: Application layer caching to improve performance of large-scale memory mapping. 1542-1550 - Zahra Zohrevand, Uwe Glässer, Hamed Yaghoubi Shahir, Mohammad A. Tayebi, Robert Costanzo:
Hidden Markov based anomaly detection for water supply systems. 1551-1560 - Ilaria Bordino, Andrea Ferretti, Marco Firrincieli, Francesco Gullo
, Marcello Paris, Stefano Pascolutti, Gianluca Sabena:
Advancing NLP via a distributed-messaging approach. 1561-1568 - Luca Cazzanti, Antonio Davoli, Leonardo Maria Millefiori:
Automated port traffic statistics: From raw data to visualisation. 1569-1573 - Hongfeng Chai, Hao Liu, Xibo Zhou
, Yanjun Xu, Shuo He, Jinzhi Hua, Dongjie He, Weihuai Liu:
UStore: An optimized storage system for enterprise data warehouses at UnionPay. 1574-1578 - Vinay Deolalikar, Hernan Laffitte:
Extensive large-scale study of error surfaces in sampling-based distinct value estimators for databases. 1579-1586 - Amita Gajewar, Lizhong Wu, Jignesh Parmar, Ramana Yerneni:
Forecasting squatting of demand in display advertising. 1587-1594 - Archana Ganapathi, Yanpei Chen:
Data quality: Experiences and lessons from operationalizing big data. 1595-1602 - Nancy W. Grady:
KDD meets Big Data. 1603-1608 - Rajaraman Kanagasabai, Anitha Veeramani, Shangfeng Hu, Sangaralingam Kajanan, Giuseppe Manai:
Classification of massive mobile web log URLs for customer profiling & analytics. 1609-1614 - Masahiro Kazama, Issei Sato, Haruaki Yatabe, Tairiku Ogihara, Tetsuro Onishi, Hiroshi Nakagawa:
Company recommendation for new graduates via implicit feedback multiple matrix factorization with Bayesian optimization. 1615-1620 - Yiming Kong, Hui Zang, Xiaoli Ma:
Human network usage patterns revealed by telecom data. 1621-1626 - Leonardo Maria Millefiori, Dimitrios Zissis
, Luca Cazzanti, Gianfranco Arcieri:
A distributed approach to estimating sea port operational regions from lots of AIS data. 1627-1632 - Thibaud Nesztler, Don Kasper, Michael Georgescu, Sophie Loire, Igor Mezic
:
Uniformization, organization, association and use of metadata from multiple content providers and manufacturers: A close look at the Building Automation System (BAS) sector. 1633-1638 - Derrick C. Spell, Ling-Yong Wang, Richard T. Shomer, Bahador Nooraei, Jarrell Waggoner, Xiao-Han T. Zeng, Jae Young Chung, Kai-Chen Cheng, Daniel Kirsche:
QED: Groupon's ETL management and curated feature catalog system for machine learning. 1639-1646 - Ljiljana Stojanovic, Marko Dinic, Nenad Stojanovic, Aleksandar Stojadinovic:
Big-data-driven anomaly detection in industry (4.0): An approach and a case study. 1647-1652 - Jiejun Xu, Samuel D. Johnson, Kang-Yu Ni:
Cross-modal event summarization: A network of networks approach. 1653-1657 - Teruyoshi Zenmyo, Satoshi Iijima, Ichiro Fukuda:
Managing a complicated workflow based on dataflow-based workflow scheduler. 1658-1663 - Li Zhou, Yinglong Xia, Hui Zang, Jian Xu, Mingzhen Xia:
An edge-set based large scale graph processing system. 1664-1669 - Nora Alkhamees, Maria Fasli:
Event detection from social network streams using frequent pattern mining with dynamic support values. 1670-1679 - Victor Perazzolo Barros, Pollyana Notargiacomo
:
Big data analytics in cloud gaming: Players' patterns recognition using artificial neural networks. 1680-1689 - Nada Basit, Yutong Zhang, Hao Wu, Haoran Liu, Jieming Bin, Yijun He, Abdeltawab M. Hendawi:
MapReduce-based deep learning with handwritten digit recognition case study. 1690-1699 - Giuseppe Bruno:
Text mining and sentiment extraction in central bank documents. 1700-1708 - Philip Thruesen, Jaroslav Cechák, Blandine Seznec, Roel Castalio, Nattiya Kanhabua:
To link or not to link: Ranking hyperlinks in Wikipedia using collective attention. 1709-1718 - Ismail Duru, Gulustan Dogan, Banu Diri:
An overview of studies about students' performance analysis and learning analytics in MOOCs. 1719-1723 - Brahim Hnich
, Faisal R. Al-Osaimi, Ata Sasmaz, Ozkan Sayin, Amine Lamine, Majed AlOtaibi:
Smart online vehicle tracking system for security applications. 1724-1733 - Hsiao-Wei Hu, Hao-Chen Chang, Wen-Shiu Lin:
An optimized frequent pattern mining algorithm with multiple minimum supports. 1734-1741 - Ammar Jabakji, Hasan Dag
:
Improving item-based recommendation accuracy with user's preferences on Apache Mahout. 1742-1749 - Sampath Jayarathna, Faryaneh Poursardar:
Change detection and classification of digital collections. 1750-1759 - Yerzhan Kerimbekov, Hasan Sakir Bilge:
A feature selection method based on Lorentzian metric. 1760-1767 - Sercan Külcü, Erdogan Dogdu
, A. Murat Ozbayoglu
:
A survey on semantic Web and big data technologies for social network analysis. 1768-1777 - Quanzhi Li, Sameena Shah, Rui Fang:
Table classification using both structure and content information: A case study of financial documents. 1778-1783 - Xiao Li, Reza Sharifi Sedeh, Liao Wang, Yang Yang:
Patient-record level integration of de-identified healthcare big databases. 1784-1786 - Bingchuan Liu, Yudong Tan, Huimin Zhou:
A Bayesian predictor of airline class seats based on multinomial event model. 1787-1791 - Busra Mutlu, Merve Mutlu, Kasim Oztoprak, Erdogan Dogdu
:
Identifying trolls and determining terror awareness level in social networks using a scalable framework. 1792-1798 - Aparna Oruganti, Fangzhou Sun
, Hiba Baroud
, Abhishek Dubey
:
DelayRadar: A multivariate predictive model for transit systems. 1799-1806 - A. Murat Ozbayoglu
, Yusuf Gökhan Küçükayan
, Erdogan Dogdu
:
A real-time autonomous highway accident detection model based on big data processing and computational intelligence. 1807-1813 - Francisco Padillo, José María Luna
, Sebastián Ventura:
Subgroup discovery on big data: Pruning the search space on exhaustive search algorithms. 1814-1823 - Paul Raff, Ze Jin:
The difference-of-datasets framework: A statistical method to discover insight. 1824-1831 - Yehezkel S. Resheff:
Online trajectory segmentation and summary with applications to visualization and retrieval. 1832-1840 - Ali Sekmen, Akram Aldroubi
, Ahmet Bugra Koku
:
Skeleton decomposition analysis for subspace clustering. 1841-1848 - Omer Berat Sezer, Erdogan Dogdu
, A. Murat Ozbayoglu
, Aras Onal:
An extended IoT framework with semantics, big data, and analytics. 1849-1856 - M. Omair Shafiq:
Event segmentation using MapReduce based big data clustering. 1857-1866 - Madhu Shashanka, Min-Yi Shen, Jisheng Wang:
User and entity behavior analytics for enterprise security. 1867-1874 - Thamarai Selvi Somasundaram, Kannan Govindarajan, Vivekanandan Suresh Kumar:
Swarm Intelligence (SI) based profiling and scheduling of big data applications. 1875-1880 - Jenq-Haur Wang, Jia-Zhi Lin:
Improving clustering efficiency by SimHash-based K-Means algorithm for big data analytics. 1881-1888 - Yuchen Wu, Jianbo Yuan, Quanzeng You, Jiebo Luo
:
The effect of pets on happiness: A data-driven approach via large-scale social media. 1889-1894 - Ozlem Yavanoglu:
Intelligent authorship identification with using Turkish newspapers metadata. 1895-1900 - Jianbo Yuan, Walid Shalaby
, Mohammed Korayem, David Lin, Khalifeh AlJadda, Jiebo Luo
:
Solving cold-start problem in large-scale recommendation engines: A deep learning approach. 1901-1910 - Kai Zhao, Sasu Tarkoma, Siyuan Liu, Huy T. Vo:
Urban human mobility data mining: An overview. 1911-1920 - Yiheng Zhou, Numair Sani, Jiebo Luo
:
Fine-grained mining of illicit drug use patterns using social multimedia data from instagram. 1921-1930 - Zhenwei Du, Haopeng Chen, Jianwei Jiang:
Research on the big data system of massive open online course. 1931-1936 - Srinivasa Rao Kundeti
, J. Vijayananda, Srikanth Mujjiga, M. Kalyan:
Clinical named entity recognition: Challenges and opportunities. 1937-1945 - Tsau-Young Lin:
Very fast frequent itemset mining: Simplicial complex methods (Extended abstract). 1946-1949 - G. S. Smrithy
, Sathyan Munirathinam, Ramadoss Balakrishnan
:
Online anomaly detection using non-parametric technique for big data streams in cloud collaborative environment. 1950-1955 - Shusaku Tsumoto, Michinori Nakata, Hiroshi Sakai
, Chenxi Liu:
A proposal of a privacy-preserving questionnaire by non-deterministic information and its analysis. 1956-1965 - Parul Sharma, Teng-Sheng Moh:
Prediction of Indian election using sentiment analysis on Hindi Twitter. 1966-1971 - Shusaku Tsumoto, Shoji Hirano, Haruko Iwata:
Construction of clinical pathway from histories of clinical actions in hospital information system. 1972-1981 - Shusaku Tsumoto, Shoji Hirano, Haruko Iwata, Norio Yoshimoto, Tomohiro Kimura:
Mining process for improvement of clinical process quality. 1982-1990 - Yan Zhu, Melody Moh, Teng-Sheng Moh:
Multi-layer text classification with voting for consumer reviews. 1991-1999 - Shakti Awaghad:
SCEM: Smart & effective crowd management with a novel scheme of big data analytics. 2000-2003 - Alexander Brodsky, Mohan Krishnamoorthy, William Z. Bernstein, M. Omar Nachawati:
A system and architecture for reusable abstractions of manufacturing processes. 2004-2013 - Max Ferguson, Kincho H. Law, Raunak Bhinge, David Dornfeld, Jinkyoo Park, Yung-Tsun Tina Lee:
Evaluation of a PMML-based GPR scoring engine on a cloud platform and microcomputer board for smart manufacturing. 2014-2023 - Jeff Hebert:
Predicting rare failure events using classification trees on large scale manufacturing data with complex interactions. 2024-2028 - Ankita Mangal, Nishant Kumar:
Using big data to enhance the bosch production line performance: A Kaggle challenge. 2029-2035 - Abhinav Maurya:
Bayesian optimization for predicting rare internal failures in manufacturing processes. 2036-2045 - Bohdan M. Pavlyshenko
:
Machine learning, linear and Bayesian models for logistic regression in failure detection problems. 2046-2050 - Srinivasan Radhakrishnan, Sagar V. Kamarthi:
Convergence and divergence in academic and industrial interests on IOT based manufacturing. 2051-2056 - Srinivasan Radhakrishnan, Sagar V. Kamarthi:
Complexity-entropy feature plane for gear fault detection. 2057-2061 - Dazhong Wu, Connor Jennings
, Janis P. Terpenny, Soundar R. T. Kumara:
Cloud-based machine learning for predictive analytics: Tool wear prediction in milling. 2062-2069 - Darui Zhang, Bin Xu
, Jasmine Wood:
Predict failures in production lines: A two-stage approach with clustering and supervised learning. 2070-2074 - Aharon Abadi, Ashraf Haib, Roie Melamed, Alaa Nassar, Aidan Shribman, Hisham Yasin:
Holistic disaster recovery approach for big data NoSQL workloads. 2075-2080 - Genady Ya. Grabarnik, Mauro Tortonesi, Larisa Shwartz:
Data-driven cloud-based IT services performance forecasting. 2081-2086 - John Harney, Seung-Hwan Lim, Sreenivas R. Sukumar, Dale Stansberry, Peter Xenopoulos:
On-demand data analytics in HPC environments at leadership computing facilities: Challenges and experiences. 2087-2096 - Katsunori Miura, Tazro Ohta
, Courtney Powell, Masaharu Munetomo:
Intercloud brokerages based on PLS method for deploying infrastructures for big data analytics. 2097-2102 - Kayhan Moharreri, Jayashree Ramanathan, Rajiv Ramnath:
Motivating dynamic features for resolution time estimation within IT operations management. 2103-2108 - Alexander C. Shulyak, Lizy K. John:
Identifying performance bottlenecks in Hive: Use of processor counters. 2109-2114 - Alok Singh, Eric G. Stephan, Todd Elsethagen, Matt MacDuff, Bibi Raju, Malachi Schram, Kerstin Kleese van Dam, Darren J. Kerbyson, Ilkay Altintas:
Leveraging large sensor streams for robust cloud control. 2115-2120 - Shuang Song, Xinnian Zheng, Andreas Gerstlauer, Lizy K. John:
Fine-grained power analysis of emerging graph processing workloads for cloud operations management. 2121-2126 - Konstantinos Tsakalozos, Cory Johns, Kevin Monroe, Pete VanderGiessen, Andrew Mcleod, Antonio Rosales:
Open big data infrastructures to everyone. 2127-2129 - Shahbaz Atta, Bilal Sadiq, Akhlaq Ahmad, Sheikh Nasir Saeed, Emad A. Felemban
:
Spatial-crowd: A big data framework for efficient data visualization. 2130-2138 - Anne M. Denton, Mostofa Ahsan, David W. Franzen, John Nowatzki:
Multi-scalar analysis of geospatial agricultural data for sustainability. 2139-2146 - Luciano Gervasoni, Martí Bosch
, Serge Fenet, Peter F. Sturm:
A framework for evaluating urban land use mix from crowd-sourcing data. 2147-2156 - Thong Hoang, Pei Hua Cher, Philips Kokoh Prasetyo, Ee-Peng Lim
:
Crowdsensing and analyzing micro-event tweets for public transportation insights. 2157-2166 - Yu Ichifuji, Yoshihide Matsuo, Noriaki Koide, Nobuhiro Akashi, Yoshitaka Terai, Toru Kobayashi:
A study for understanding of tourist person trip pattern based on log data of Wi-Fi access points. 2167-2174 - Noriaki Koide, Yu Ichifuji, Hideki Yoshii, Noboru Sonehara:
Estimation of national tourism statistics based on Wi-Fi association log data. 2175-2179 - Gaurav Paruthi, Enrique Frías-Martínez, Vanessa Frías-Martínez:
Peer-to-peer microlending platforms: Characterization of online traits. 2180-2189 - Caleb Robinson, Arezoo Shirazi, Mengmeng Liu, Bistra Dilkina
:
Network optimization of food flows in the U.S. 2190-2198 - Aki-Hiro Sato
, Tsutomu Watanabe:
Measuring activities and values of industrial clusters based on job opportunity data collected from an internet Japanese job matching site. 2199-2208 - Xiaoyan Shao, Siyuan Lu, Theodore G. van Kessel, Hendrik F. Hamann, Leda Daehler, Jeffrey Cwagenberg, Alan Li:
Solar irradiance forecasting by machine learning for solar car races. 2209-2216 - Hiroshi Tsuda, Masakazu Ando, Yu Ichifuji:
Hotel plan popularity factor analysis of hotels in the Keihanshin region. 2217-2224 - Laura L. Tupper, David S. Matteson
, John C. Handley
:
Mixed data and classification of transit stops. 2225-2232 - Mahwish Arif, Hans Vandierendonck, Dimitrios S. Nikolopoulos
, Bronis R. de Supinski:
A scalable and composable map-reduce system. 2233-2242 - Amit Gupta, Weijia Xu, Natalia Ruiz-Juri, Kenneth Perrine:
A workload aware model of computational resource selection for big data applications. 2243-2250 - Sunwoo Lee
, Wei-keng Liao
, Ankit Agrawal
, Nikos Hardavellas
, Alok N. Choudhary:
Evaluation of K-means data clustering algorithm on Intel Xeon Phi. 2251-2260 - Ruoqian Liu, Ankit Agrawal
, Wei-keng Liao
, Alok N. Choudhary, Marc De Graef:
Materials discovery: Understanding polycrystals from large-scale electron patterns. 2261-2269 - Fang (Cherry) Liu
, Fu Shen, Duen Horng Chau
, Neil Bright, Mehmet Belgin:
Building a research data science platform from industrial machines. 2270-2275 - Lauritz Thamsen, Thomas Renner, Marvin Byfeld, Markus Paeschke, Daniel Schroder, Felix Bohm:
Visually programming dataflows for distributed data analytics. 2276-2285 - Peter Xenopoulos, Jamison Daniel, Michael A. Matheson, Sreenivas R. Sukumar:
Big data analytics on HPC architectures: Performance and cost. 2286-2295 - Weijia Xu, Natalia Ruiz-Juri, Amit Gupta, Amanda Deering, Chandra R. Bhat
, James Kuhr, Jackson Archer:
Supporting large scale connected vehicle data analysis using HIVE. 2296-2304 - Lina Yu, Hongfeng Yu:
Legion-based scientific data analytics on heterogeneous processors. 2305-2314 - Juan Lin, Di Zhong, Yiwen Zhong, Hui Zhang:
Accelerating mathematical knot simulations with R on the web. 2315-2321 - Yanfu Zhou, Jieting Wu, Lina Yu, Hongfeng Yu, Zhenghong Tang:
A geohydrologie data visualization framework with an extendable user interface design. 2322-2331 - Jian Zou, Chuqin Huang:
Efficient portfolio allocation with sparse volatility estimation for high-frequency financial data. 2332-2341 - James Crist:
Dask & Numba: Simple libraries for optimizing scientific python code. 2342-2343 - Vishnu Gowda Harish, Vinay Kumar Bingi, John A. Miller
:
A big data platform integrating compressed linear algebra with columnar databases. 2344-2352 - Ruoqian Liu, Diana Palsetia, Arindam Paul, Reda Al-Bahrani, Dipendra Jha, Wei-keng Liao
, Ankit Agrawal
, Alok N. Choudhary:
PinterNet: A thematic label curation tool for large image datasets. 2353-2362 - Geoffrey Mon
, Milad Makkie, Xiang Li
, Tianming Liu, Shannon Quinn:
Implementing dictionary learning in Apache Flink, Or: How I learned to relax and love iterations. 2363-2367 - Hatef Monajemi, David L. Donoho, Victoria Stodden:
Making massive computational experiments painless. 2368-2373 - Ella Peltonen
, Eemil Lagerspetz
, Petteri Nurmi
, Sasu Tarkoma:
Too big to mail: On the way to publish large-scale mobile analytics data. 2374-2377 - Zhou Xing, Marzieh Parandehgheibi, Fei Xiao, Nilesh Kulkarni, Chris Pouliot:
Content-based recommendation for podcast audio-items using natural language processing techniques. 2378-2383 - Sylvain Hallé
, Sébastien Gaboury, Raphaël Khoury:
A glue language for event stream processing. 2384-2391 - Christopher Hillman, Karen E. Petrie, Andrew Cobley, Mark Whitehorn:
Real-time processing of proteomics data: The internet of things and the connected laboratory. 2392-2399 - Yaser Keneshloo, Shuguang Wang, Eui-Hong Sam Han, Naren Ramakrishnan
:
Predicting the shape and peak time of news article views. 2400-2409 - Kohei Nakamura, Ami Hayashi, Hiroki Matsutani:
An FPGA-based low-latency network processing for spark streaming. 2410-2415 - Joshua Plasse, Niall M. Adams:
Handling delayed labels in temporally evolving data streams. 2416-2424 - Athena Vakali, Paschalis Korosoglou, Pavlos Daoglou:
A multi-layer software architecture framework for adaptive real-time analytics. 2425-2430 - Yongyi Xian, Chuanfei Xu, Yan Liu:
Implementing trajectory data stream analysis in parallel. 2431-2436 - Jaime Alonso-Lorenzo, Enrique Costa-Montenegro
, Milagros Fernández Gavilanes
:
Language independent big-data system for the prediction of user location on Twitter. 2437-2446 - Linda Camilla Boldt, Vinothan Vinayagamoorthy, Florian Winder, Melanie Schnittger, Mats Ekran, Raghava Rao Mukkamala
, Niels Buus Lassen, Benjamin Flesch, Abid Hussain, Ravi Vatrapu
:
Forecasting Nike's sales using Facebook data. 2447-2456 - Seung-Woo Choi, Aviv Segev:
Finding informative comments for video viewing. 2457-2465 - Anahita Davoudi, Mainak Chatterjee:
Prediction of information diffusion in social networks using dynamic carrying capacity. 2466-2469 - Yang Feng, Jiebo Luo
:
When do luxury cars hit the road? Findings by a big data approach. 2470-2474 - David Watts, K. M. George, Ashwin Kumar T. K, Zenia Arora:
Tweet sentiment as proxy for political campaign momentum. 2475-2484 - Ryohei Hisano:
A new approach to building the interindustry input-output table using block estimation techniques. 2485-2494 - Atushi Ishikawa, Shouji Fujimoto
, Takayuki Mizuno:
Nowcast of firm sales using POS data toward stock market stability. 2495-2499 - Yuka Kamiko, Mitsuo Yoshida
, Hirotada Ohashi, Fujio Toriumi
:
Uncovering information flow among users by time-series retweet data: Who is a friend of whom on Twitter? 2500-2504 - Rishemjit Kaur, Kazutoshi Sasahara
:
Quantifying moral foundations from various topics on Twitter conversations. 2505-2512 - Yasuko Kawahata, Tamio Koyama:
Application of an integer-valued autoregressive model to hit phenomena. 2513-2517 - Hirotaka Kawazu, Fujio Toriumi
, Masanori Takano
, Kazuya Wada, Ichiro Fukuda:
Analytical method of web user behavior using Hidden Markov Model. 2518-2524 - Eyad Makki, Lin-Ching Chang
:
Leveraging social big data for performance evaluation of E-commerce websites. 2525-2534 - Rubén Tous
, Otto Wüst, Mauro Gomez, Jonatan Poveda, Marc Elena, Jordi Torres, Mouna Makni, Eduard Ayguadé:
User-generated content curation with deep convolutional neural networks. 2535-2540 - Yu Wang, Yang Feng, Jiebo Luo
, Xiyang Zhang:
Pricing the woman card: Gender politics between hillary clinton and donald trump. 2541-2544 - Daniel Xie, Jiejun Xu, Tsai-Ching Lu:
Automated classification of extremist Twitter accounts using content-based and network-based features. 2545-2549 - Edmon Begoli
, Derek Kistler, Jack Bates:
Towards a heterogeneous, polystore-like data architecture for the US Department of Veteran Affairs (VA) enterprise analytics. 2550-2554 - Subhasis Dasgupta
, Kevin L. Coakley
, Amarnath Gupta:
Analytics-driven data ingestion and derivation in the AWESOME polystore. 2555-2564 - Evgeny Kharlamov, Theofilos P. Mailis, Konstantina Bereta
, Dimitris Bilidas, Sebastian Brandt, Ernesto Jiménez-Ruiz
, Steffen Lamparter, Christian Neuenstadt, Özgür L. Özçep, Ahmet Soylu
, Christoforos Svingos, Guohui Xiao
, Dmitriy Zheleznyakov, Diego Calvanese, Ian Horrocks, Martin Giese, Yannis E. Ioannidis, Yannis Kotidis, Ralf Möller
, Arild Waaler:
A semantic approach to polystores. 2565-2573 - Boyan Kolev, Raquel Pau, Oleksandra Levchenko, Patrick Valduriez, Ricardo Jiménez-Peris, José Pereira
:
Benchmarking polystores: The CloudMdsQL experience. 2574-2579 - Vasilis Spyropoulos, Christina Vasilakopoulou
, Yannis Kotidis:
Digree: A middleware for a graph databases polystore. 2580-2589 - Abdeltawab M. Hendawi, Fatemah Alali, Xiaoyu Wang, Yunfei Guan, Tianshu Zhou, Xiao Liu, Nada Basit, John A. Stankovic:
Hobbits: Hadoop and Hive based Internet traffic analysis. 2590-2599 - Sangkeun Lee, Liangzhe Chen, Sisi Duan, Supriya Chinthavali, Mallikarjun Shankar
, B. Aditya Prakash:
URBAN-NET: A network-based infrastructure monitoring and analysis system for emergency management and public safety. 2600-2609 - Gandhi Sivakumar, Drew Johnson, Rashida Hodge:
Unravelling the Myth of big data and artificial intelligence in sustainable natural resource development. 2610-2615 - Joya A. Deri, Franz Franchetti, José M. F. Moura:
Big data computation of taxi movement in New York City. 2616-2625 - Holly Ferguson, Charles Vardeman, Jarek Nabrzyski:
Linked data view methodology and application to BIM alignment and interoperability. 2626-2635 - Rafal A. Angryk
, Douglas E. Galarus:
The SMART approach to comprehensive quality assessment of site-based spatial-temporal data. 2636-2645 - Upa Gupta, Kulsawasd Jitkajornwanich
, Ramez Elmasri, Leonidas Fegaras:
Adapting K-means clustering to identify spatial patterns in storms. 2646-2654 - Behnam Hedayatnia, Mehrdad Yazdani, Mai H. Nguyen, Jessica Block, Ilkay Altintas:
Determining feature extractors for unsupervised learning on satellite images. 2655-2663 - Andrew Hulbert, Thomas Kunicki, James N. Hughes, Anthony D. Fox, Christopher N. Eichelberger:
An experimental study of big spatial data systems. 2664-2671 - Siyuan Lu, Xiaoyan Shao, Marcus Freitag, Levente J. Klein, Jason D. Renwick, Fernando J. Marianno, Conrad M. Albrecht
, Hendrik F. Hamann:
IBM PAIRS curated big data service for accelerated geospatial data analytics and discovery. 2672-2675 - Chengcheng Mou, Shaoping Chen, Yi-Cheng Tu:
A comparative study of dual-tree algorithm implementations for computing 2-body statistics in spatial data. 2676-2685 - Ivens Portugal, Paulo S. C. Alencar, Donald D. Cowan:
Towards a provenance-aware spatial-temporal architectural framework for massive data integration and analysis. 2686-2691 - Alan Woodley
, Ling-Xiang Tang, Shlomo Geva
, Richi Nayak
, Timothy Chappell:
Using parallel hierarchical clustering to address spatial big data challenges. 2692-2698 - Chien-Heng Wu, Franco Lin, Wen-Yi Chang, Whey-Fone Tsai, Hsi-Ching Lin, Chao-Tung Yang
:
Big data development platform for engineering applications. 2699-2702 - Jiangye Yuan, Hsiu-Han Lexie Yang
, Olufemi A. Omitaomu, Budhendra L. Bhaduri:
Large-scale solar panel mapping from aerial images using deep convolutional networks. 2703-2708 - Yu Zhuang:
Symmetric repositioning of bisecting K-means centers for increased reduction of distance calculations for big data clustering. 2709-2715 - Anton Gulenko, Marcel Wallschläger, Florian Schmidt, Odej Kao, Feng Liu:
Evaluating machine learning algorithms for anomaly detection in clouds. 2716-2721 - Teemu Kanstrén, Jussi Liikka, Jukka Mäkelä, Markus Luoto, Jarmo Prokkola:
Preliminary big data in a 5G test network. 2722-2727 - Yiming Kong, Hui Zang, Xiaoli Ma:
Quick model fitting using a classifying engine. 2728-2733 - Ruilin Liu, Kai Yang, Yanjia Sun
, Tao Quan, Jin Yang:
Spark-based rare association rule mining for big datasets. 2734-2739 - Martino Trevisan
, Idilio Drago
, Marco Mellia
, Han Hee Song, Mario Baldi:
WHAT: A big data approach for accounting of modern web services. 2740-2745 - Azadeh Eftekhari, Farhana H. Zulkernine, Patrick Martin:
BINARY: A framework for big data integration for ad-hoc querying. 2746-2753 - Ellis R. Giles:
Container-based virtualization for byte-addressable NVM data storage. 2754-2763 - Meike Klettke, Uta Störl
, Manuel Shenavai, Stefanie Scherzinger:
NoSQL schema evolution and big data migration at scale. 2764-2774 - Aravind Mohan, Mahdi Ebrahimi, Shiyong Lu, Alexander Kotov
:
Scheduling big data workflows in the cloud under budget constraints. 2775-2784 - Daniel Playfair, Amitabh Trehan
, Barry McLarnon, Dimitrios S. Nikolopoulos
:
Big data availability: Selective partial checkpointing for in-memory database queries. 2785-2794 - Nico Rödder, David Dauer, Kevin Laubis, Paul Karaenke, Christof Weinhardt
:
The digital transformation and smart data analytics: An overview of enabling developments and application areas. 2795-2802 - Matthieu-P. Schapranow
, Matthias Uflacker, Murat Sariyar, Sebastian C. Semler, Johannes Klaus Fichte, Dietmar Schielke, Kismet Ekinci, Thomas Zahn:
Towards an integrated health research process: A cloud-based approach. 2813-2818 - Merlijn Sebrechts
, Sander Borny, Thomas Vanhove
, Gregory van Seghbroeck
, Tim Wauters, Bruno Volckaert, Filip De Turck:
Model-driven deployment and management of workflows on analytics frameworks. 2819-2826 - Daniel Seybold, Nicolas Wagner, Benjamin Erb, Jörg Domaschka:
Is elasticity of scalable databases a Myth? 2827-2836 - Alexander Stiemer
, Ilir Fetai, Heiko Schuldt
:
Analyzing the performance of data replication and data partitioning in the cloud: The BEOWULF approach. 2837-2846 - Miguel G. Xavier, Kassiano J. Matteussi, Fabian Lorenzo, César A. F. De Rose:
Understanding performance interference in multi-tenant cloud databases and web applications. 2847-2852 - Bonnie J. Dorr
, Peter C. Fontana, Craig S. Greenberg, Marion Le Bras, Mark A. Przybocki:
Evaluation-driven research in data science: Leveraging cross-field methodologies. 2853-2862 - Frank S. Haug:
Bad big data science. 2863-2871 - Jeffrey S. Saltz
, Ivan Shamshurin:
Big data team process methodologies: A literature review and the identification of key factors for a project's success. 2872-2879 - Pankush Kalgotra
, Ramesh Sharda
:
Progression analysis of signals: Extending CRISP-DM to stream analytics. 2880-2885 - Vijay Dipti Kumar, Paulo S. C. Alencar:
Software engineering for big data projects: Domains, methodologies and gaps. 2886-2895 - Sohini Roychowdhury, Johnny Ren:
Non-deep CNN for multi-modal image classification and feature learning: An Azure-based model. 2893-2812 - Jeffrey S. Saltz
, Sibel Yilmazel, Özgür Yilmazel:
Not all software engineers can become good data engineers. 2896-2901 - Toshiyuki Shimono:
A hacking toolset for big tabular files (Codenames: Bin4tsv, Kabutomushi). 2902-2910 - Sandro Fiore
, Marcin Plóciennik
, Charles M. Doutriaux, Cosimo Palazzo
, J. Boutte, Tomasz Zok
, Donatello Elia
, Michal Owsiak, Alessandro D'Anca
, Z. Shaheen, Riccardo Bruno, Marco Fargetta
, Miguel Caballer, Germán Moltó, Ignacio Blanquer, Roberto Barbera
, Mário David
, Giacinto Donvito
, Dean N. Williams, Valentine Anantharaj, Davide Salomoni, Giovanni Aloisio
:
Distributed and cloud-based multi-model analytics experiments on large volumes of climate change data in the earth system grid federation eco-system. 2911-2918 - Jason Laura, Robin L. Fergason:
Modeling martian thermal inertia in a distributed memory high performance computing environment. 2919-2928 - Adam M. Leadbetter, Damian Smyth, Robert Fuller, Eoin O'Grady, Adam Shepherd
:
Where big data meets linked data: Applying standard data models to environmental data streams. 2929-2937 - Ryuya Mitsuhashi, Hideyuki Kawashima, Takahiro Nishimichi
, Osamu Tatebe:
Three-dimensional spatial join count exploiting CPU optimized STR R-tree. 2938-2947 - Amidu Oloso, Kwo-Sen Kuo, Thomas L. Clune, Paul Brown, Alex Poliakov, Hongfeng Yu:
Implementing connected component labeling as a user defined operator for SciDB. 2948-2952 - Kevin Paul, Sheri A. Mickelson, John M. Dennis:
A new parallel python tool for the standardization of earth system model data. 2953-2959 - Michael Requa, Garrison Vaughan, John David
, Ben Cotton:
Using cloud bursting to count trees and shrubs in Sub-Saharan Africa. 2960-2963 - Brian Wilson, Rahul Palamuttam, Kim Whitehall, Chris Mattmann, Alex Goodman, Maziyar Boustani, Sujen Shah, Paul Zimdars, Paul M. Ramirez:
SciSpark: Highly interactive in-memory science data analytics. 2964-2973 - Shujia Zhou, Xiaowen Li, Toshihisa Matsui, Wei-Kuo Tao:
Visualization and diagnosis of earth science data through Hadoop and Spark. 2974-2980 - Ellis Giles, Kshitij A. Doshi, Peter J. Varman:
Persisting in-memory databases using SCM. 2981-2990 - Zhihao Huang, Hui Li, Xin Li, Wei He:
SS-dedup: A high throughput stateful data routing algorithm for cluster deduplication system. 2991-2995 - Xin Li, Hui Li, Zhihao Huang, Bing Zhu, Jiawei Cai:
EStore: An effective optimized data placement structure for Hive. 2996-3001 - Si Liu, Eun-Sung Jung
, Rajkumar Kettimuthu, Xian-He Sun, Michael E. Papka
:
Towards optimizing large-scale data transfers with end-to-end integrity verification. 3002-3007 - Thomas Renner, Lauritz Thamsen, Odej Kao:
CoLoc: Distributed data and container colocation for data-intensive applications. 3008-3015 - Holly Ferguson, Charles Vardeman, Jarek Nabrzyski:
Linked data platform for building cloud-based smart applications and connecting API access points with data discovery techniques. 3016-3025 - Ajinkya Prabhune, Hasebullah Ansari, Anil Keshav, Rainer Stotzka, Michael Gertz, Jürgen Hesser:
MetaStore: A metadata framework for scientific data repositories. 3026-3035 - Ulrich Schwardmann
:
Automated schema extraction for PID information types. 3036-3044 - Priyaa Thavasimani
, Paolo Missier
:
Facilitating reproducible research by investigating computational metadata. 3045-3051 - Sudharshan S. Vazhkudai, John Harney, Raghul Gunasekaran, Dale Stansberry, Seung-Hwan Lim, Tom Barron, Andrew Nash, Arvind Ramanathan:
Constellation: A science graph network for scalable data and knowledge discovery in extreme-scale scientific collaborations. 3052-3061 - Guangxia Xu, Jin Qi, Deling Huang, Mahmoud Daneshmand:
Detecting spammers on social networks based on a hybrid model. 3062-3068 - Liudong Zuo
, Mengxia Michelle Zhu:
Bandwidth provision strategies for reliable data movements in dedicated networks. 3069-3078 - Radhakrishnan Angamuthu Chinnathambi, Prakash Ranganathan:
Investigation of forecasting methods for the hourly spot price of the day-ahead electric power markets. 3079-3086 - Hông-Ân Cao, Felix Rauchenstein, Tri Kurniawan Wijaya, Karl Aberer, Nuno Nunes
:
Leveraging user expertise in collaborative systems for annotating energy datasets. 3087-3096 - Hông-Ân Cao, Tri Kurniawan Wijaya, Karl Aberer, Nuno Nunes
:
Temporal association rules for electrical activity detection in residential homes. 3097-3106 - Saman Mostafavi, Benjamin Futrell, John Troxler, Robert W. Cox:
Leveraging cloud computing to convert the non-intrusive load monitor into a powerful framework for grid-responsive buildings. 3107-3114 - Shady S. Refaat, Haitham Abu-Rub, Amira Mohamed:
Big data, better energy management and control decisions for distribution systems in smart grid. 3115-3120 - Viktor Botev, Magnus Almgren
, Vincenzo Gulisano
, Olaf Landsiedel, Marina Papatriantafilou
, Joris van Rooij:
Detecting non-technical energy losses through structural periodic patterns in AMI data. 3121-3130 - Andreas Unterweger, Dominik Engel
:
Lossless compression of high-frequency voltage and current data in smart grids. 3131-3139 - Berkay Aydin
, Ahmet Küçük, Rafal A. Angryk
:
Indexing spatiotemporal relations in solar event datasets. 3140-3148 - Soukaina Filali Boubrahimi, Berkay Aydin
, Dustin Kempton
, Rafal A. Angryk
:
Spatio-temporal interpolation methods for solar events metadata. 3149-3157 - Jon M. Jenkins:
Processing and managing the Kepler mission's treasure trove of stellar and exoplanet data. 3158-3167 - Dustin J. Kempton
, Michael A. Schuh, Rafal A. Angryk
:
Describing solar images with sparse coding for similarity search. 3168-3176 - Ruizhe Ma, Rafal A. Angryk
, Pete Riley:
A data-driven analysis of interplanetary coronal mass ejecta and magnetic flux ropes. 3177-3186 - Simon Marcin, André Csillaghy
:
Running scientific algorithms as array database operators: Bringing the processing power to the data. 3187-3193 - Andrés Muñoz-Jaramillo
, Z. A. Werginz, J. P. Vargas-Acosta, M. D. DeLuca, J. C. Windmueller, J. Zhang, D. W. Longcope, Derek A. Lamb, C. E. DeForest, S. Vargas-Dominguez, J. W. Harvey, P. C. H. Martens:
The best of both worlds: Using automatic detection and limited human supervision to create a homogenous magnetic catalog spanning four solar cycles. 3194-3203 - Ryan J. Oelkers, Keivan G. Stassun, Joshua A. Pepper, Nathan M. De Lee
, Martin A. Paegert
:
An input catalog and target selection for the transiting exoplanet survey satellite. 3204-3213 - N. Olspert, M. J. Kapyla, J. Pelti:
Method for estimating cycle lengths from multidimensional time series: Test cases and application to a massive "in silico" dataset. 3214-3223 - Bennett B. Borden, Jason R. Baron:
Opening up dark digital archives through the use of analytics to identify sensitive content. 3224-3229 - Marco Büchler, Greta Franzini
, Emily Franzini, Thomas Eckart:
Mining and analysing one billion requests to linguistic services. 3230-3239 - Jenny Bunn:
Mind the explanatory gap: Quality from quantity. 3240-3244 - Simon Hengchen, Mathias Coeckelbergs, Seth van Hooland, Ruben Verborgh, Thomas Steiner:
Exploring archives with probabilistic models: Topic modelling for the valorisation of digitised archives of the European Commission. 3245-3249 - Emily Maemura
, Christoph Becker, Ian Milligan
:
Understanding computational web archives research methods using research objects. 3250-3259 - Sonia Ranade
:
Traces through time: A probabilistic approach to connected archival data. 3260-3265 - Robert J. Sandusky:
Computational provenance: DataONE and implications for cultural heritage institutions. 3266-3271 - Michael Shallcross:
Appraising digital archives with Archivematica. 3272-3276 - Kenneth Thibodeau:
Breaking down the invisible wall to enrich archival science and practice. 3277-3282 - Weijia Xu, Ruizhu Huang, Maria Esteva
, Jawon Song
, Ramona L. Walls:
Content-based comparison for collections identification. 3283-3289 - Stephen Bonner
, John Brennan
, Georgios Theodoropoulos
, Ibad Kureshi
, Andrew Stephen McGough:
Deep topology classification: A new approach for massive graph classification. 3290-3297 - Stephen Bonner
, John Brennan
, Georgios Theodoropoulos
, Ibad Kureshi
, Andrew Stephen McGough:
GFP-X: A parallel approach to massive graph comparison using spark. 3298-3307 - Thibault Debatty, Fabio Pulvirenti, Pietro Michiardi, Wim Mees:
Fast distributed k-nn graph update. 3308-3317 - Hiroki Kanezashi, Toyotaro Suzumura:
An incremental local-first community detection method for dynamic graphs. 3318-3325 - Bryan Rainey, David F. Gleich
:
Massive graph processing on nanocomputers. 3326-3335 - Sara Riazi, Boyana Norris
:
GraphFlow: Workflow-based big graph processing. 3336-3343 - W. Sean Kennedy, Iraj Saniee, Onuttom Narayan:
On the hyperbolicity of large-scale networks and its estimation. 3344-3351 - Nilothpal Talukder, Mohammed J. Zaki:
Parallel graph mining with dynamic load balancing. 3352-3359 - Charith Wickramaarachchi, Rajgopal Kannan, Charalampos Chelmis
, Viktor K. Prasanna:
Distributed exact subgraph matching in small diameter dynamic graphs. 3360-3369 - Duncan Yung, Shi-Kuo Chang:
Fast reachability query computation on big attributed graphs. 3370-3380 - Fang Du, Ting Li, Yingjie Shi, Lijuan Song, Xiaojun Gu:
Drug target path discovery on semantic biomedical big data. 3381-3386 - Muhammad Kamran Lodhi, Rashid Ansari, Yingwei Yao, Gail M. Keenan, Diana J. Wilkie
, Ashfaq Khokhar:
A framework to predict outcome for cancer patients using data from a nursing EHR. 3387-3395 - Milad Makkie, Xiang Li
, Tianming Liu, Shannon Quinn, Binbin Lin, Jieping Ye:
Distributed rank-1 dictionary learning: Towards fast and scalable solutions for fMRI big data analytics. 3396-3403 - Mohammad Mehedy Masud, Abdel Rahman Al Harahsheh:
Mortality prediction of ICU patients using lab test data by feature vector compaction & classification. 3404-3411 - Vasundhara Misal, Vandana P. Janeja, Sai C. Pallaprolu, Yelena Yesha, Raghu Chintalapati:
Iterative unified clustering in big data. 3412-3421 - Maitham D. Naeemi, Johnny Ren, Nathan Hollcroft, Adam M. Alessio, Sohini Roychowdhury:
Application of big data analytics for automated estimation of CT image quality. 3422-3431 - Jianwu Wang, Zhichuan Huang, Wenbin Zhang, Ankita Patil, Ketan Patil, Ting Zhu, Eric J. Shiroma, Mitchell A. Schepps, Tamara B. Harris:
Wearable sensor based human posture recognition. 3432-3438 - Takuya Yoshida, M. Emre Celebi, Gerald Schaefer, Hitoshi Iyatomi:
Simple and effective pre-processing for automated melanoma discrimination based on cytological findings. 3439-3442 - Weider D. Yu, Jaspal Singh Gill, Maulin Dalal, Piyush Jha, Sajan Shah:
Big data approach in healthcare used for intelligent design - Software as a service. 3443-3449 - Mansurul Alam Bhuiyan, Mohammad Al Hasan:
Interactive personalized interesting pattern discovery. 3450-3456 - Jordan DeLoach, Doina Caragea
, Xinming Ou:
Android malware detection with weak ground truth data. 3457-3464 - Chenxiao Dou, Daniel Sun, Yi-Cheng Chen, Guoqiang Li, Jianquan Liu:
Probabilistic parallelisation of blocking non-matched records for big data. 3465-3473 - Anders Høst-Madsen, Elyas Sabeti, Chad Walton, Su Jun Lim:
Universal data discovery using atypicality. 3474-3483 - Elham Sahebkar Khorasani, Zhao Zhenge, John Champaign:
A Markov chain collaborative filtering model for course enrollment recommendations. 3484-3490 - Hsu-Chao Lai, Wen-Yueh Shih, Jiun-Long Huang, Yi-Cheng Chen:
Predicting traffic of online advertising in real-time bidding systems from perspective of demand-side platforms. 3491-3498 - Nicholas A. James, Arun Kejariwal, David S. Matteson
:
Leveraging cloud data to mitigate user experience from 'breaking bad'. 3499-3508 - Max Menenberg, Surya Pathak, Hari P. Udyapuram, Srinagesh Gavirneni
, Sohini Roychowdhury:
Topic modeling for management sciences: A network-based approach. 3509-3518 - Izabela Moise:
The technical hashtag in Twitter data: A hadoop experience. 3519-3528 - Kingsley Okoye
, Abdel-Rahman H. Tawil
, Usman Naeem
, Syed Islam, Elyes Lamine
:
Using semantic-based approach to manage perspectives of process mining: Application on improving learning process domain data. 3529-3538 - Sai C. Pallaprolu, Josephine M. Namayanja, Vandana P. Janeja, C. T. Sai Adithya:
Label propagation in big data to detect remote access Trojans. 3539-3547 - Fuad Rahman
, Marvin J. Slepian, Ari Mitra:
A novel big-data processing framwork for healthcare applications: Big-data-healthcare-in-a-box. 3548-3555 - Yao-Ming Yang, Chang-Dong Wang, Jian-Huang Lai:
An efficient parallel topic-sensitive expert finding algorithm using spark. 3556-3562 - Linlin You
, Bige Tunçer
:
Exploring the utilization of places through a scalable "Activities in Places" analysis mechanism. 3563-3572 - Jun He, Yue Zhang, Jiye Wang, Nan Zeng, Hanyong Hao:
Robust K-subspaces recovery with combinatorial initialization. 3573-3582 - Supun Kamburugamuve, Pulasthi Wickramasinghe, Saliya Ekanayake, Chathuri Wimalasena, Milinda Pathirage, Geoffrey C. Fox:
TSmap3D: Browser visualization of high dimensional time series data. 3583-3592 - Michael A. Schuh, Rafal A. Angryk
:
On the theory and practice of high-dimensional data indexing with iDistance. 3593-3600 - Michael Wojnowicz, Ben Cruz, Xuan Zhao, Brian Wallace, Matt Wolff, Jay Luan, Caleb Crable:
"Influence sketching": Finding influential samples in large-scale regressions. 3601-3612 - Katie R. Yates, Nicos G. Pavlidis
:
Minimum density hyperplanes in the feature space. 3613-3618 - Bo Zhang, Liwei Wang:
Structure preserving dimension reduction with 2D images as predictors. 3619-3624 - Santosh Aditham, Nagarajan Ranganathan, Srinivas Katkoori
:
Memory access pattern based insider threat detection in big data systems. 3625-3628 - Khudran Alzhrani
, Ethan M. Rudd, C. Edward Chow, Terrance E. Boult:
Automated big security text pruning and classification. 3629-3637 - Claudio A. Ardagna
, Paolo Ceravolo
, Ernesto Damiani:
Big data analytics as-a-service: Issues and challenges. 3638-3644 - Elisa Bertino:
Data privacy for IoT systems: Concepts, approaches, and research directions. 3645-3647 - Chia-Tien Dan Lo, Pablo Ordóñez, Carlos Cepeda Mora:
Towards an effective and efficient malware detection system. 3648-3655 - Alfredo Cuzzocrea, Carlo Mastroianni, Giorgio Mario Grasso
:
Private databases on the cloud: Models, issues and research perspectives. 3656-3661 - Philip Derbeko, Shlomi Dolev
, Ehud Gudes, Jeffrey D. Ullman:
Concise essence-preserving big data representation. 3662-3665 - Sushil Jajodia
, Witold Litwin, Thomas J. E. Schwarz:
Trusted cloud SQL DBS with on-the-fly AES decryption/encryption. 3666-3675 - Soo-Hyung Kim, Changwook Jung, Yoon-Joon Lee:
An entropy-based analytic model for the privacy-preserving in open data. 3676-3684 - Xueni Li, Guanggang Geng
, Zhiwei Yan, Yong Chen, Xiaodong Lee:
Phishing detection based on newly registered domains. 3685-3692 - Boel Nelson
, Tomas Olovsson:
Security and privacy for big data: A systematic literature review. 3693-3702 - Mohammad Shafahi
, Leon Kempers, Hamideh Afsarmanesh
:
Phishing through social bots on Twitter. 3703-3712 - Hippolyte Djonon Tsague, Bheki Twala
:
Reverse engineering smart card malware using side channel analysis with machine learning techniques. 3713-3721 - Jason W. Woodworth, Mohsen Amini Salehi, Vijay Raghavan:
S3C: An architecture for space-efficient semantic search over encrypted data in the cloud. 3722-3731 - Tomohiro Fukui:
A systems approach to big data technology applied to supply chain. 3732-3736 - Gary S. W. Goh, Andy J. L. Ang, Allan N. Zhang
:
Optimizing performance of sentiment analysis through design of experiments. 3737-3742 - Vahid Kayvanfar
, S. M. Moattar Husseini, Behrooz Karimi, Mohsen S. Sajadieh
, Tan Wen Jun:
Analysis for supply hub in industrial cluster: Classic vs. new perspective. 3743-3748 - Jasmine J. Lim, Allan N. Zhang
:
A DEA approach for Supplier Selection with AHP and risk consideration. 3749-3758 - André Luckow, Matthew Cook, Nathan Ashcraft, Edwin Weill, Emil Djerekarov, Bennie Vorster:
Deep learning in the automotive industry: Applications and tools. 3759-3768 - Kazumasa Mori, Takuya Ohmori:
The Bayesian estimators of polytomous item response theory models with approximated conditional likelihood and their mathematical optimalities. 3769-3772 - B. Y. Ong, Rong Wen, Allan N. Zhang
:
Data blending in manufacturing and supply chains. 3773-3778 - Wen Jun Tan, Wentong Cai
, Zhengping Li:
Adaptive resilient strategies for supply chain networks. 3779-3784 - Takuya Watanabe, Hiroaki Muroi, Motoki Naruke, Kyoto Yono, Gen Kobayashi, Masanori Yamasaki:
Prediction of regional goods demand incorporating the effect of weather. 3785-3791 - Rong Wen, Wenjing Yan, Allan N. Zhang
:
Weighted clustering of spatial pattern for optimal logistics hub deployment. 3792-3797 - Wenjing Yan, Rong Wen, Allan N. Zhang
, Dazhi Yang
:
Vessel movement analysis and pattern discovery using density-based clustering approach. 3798-3806 - Dazhi Yang
, Gary S. W. Goh, Siwei Jiang, Allan N. Zhang
:
Spatial data dimension reduction using quadtree: A case study on satellite-derived solar radiation. 3807-3812 - Dazhi Yang
, Gary S. W. Goh, Siwei Jiang, Allan N. Zhang
:
Forecast UPC-level FMCG demand, Part III: Grouped reconciliation. 3813-3819 - A. Aziz Altowayan, Lixin Tao:
Word embeddings for Arabic sentiment analysis. 3820-3825 - Michael Bentley, Soumya Batra:
Giving voice to office customers: Best practices in how office handles verbatim text feedback. 3826-3832 - Xiangfeng Dai, Robert Prout:
Unlock big data emotions: Weighted word embeddings for sentiment classification. 3833-3838 - Anna Hennig, Anne-Sofie Amodt, Henrik Hernes, Helene Mejer Nygardsmoen, Peter Arenfeldt Larsen, Raghava Rao Mukkamala
, Benjamin Flesch, Abid Hussain, Ravi Vatrapu
:
Big social data analytics of changes in consumer behaviour and opinion of a TV broadcaster. 3839-3848 - Henrikke Hovda Larsen, Johanna Margareta Forsberg, Sigrid Viken Hemstad, Raghava Rao Mukkamala
, Abid Hussain, Ravi Vatrapu
:
TV ratings vs. social media engagement: Big social data analytics of the Scandinavian TV talk show Skavlan. 3849-3858 - Tayfun Pay
:
Totally automated keyword extraction. 3859-3863 - Belainine Billal, Alexsandro Fonseca, Fatiha Sadat:
Efficient natural language pre-processing for analyzing large data sets. 3864-3871 - Jihun Choi, Jonghem Youn, Sang-goo Lee:
A grapheme-level approach for constructing a Korean morphological analyzer without linguistic knowledge. 3872-3879 - Matthew Coole
, Paul Rayson
, John A. Mariani
:
lexiDB: A scalable corpus database management system. 3880-3884 - Pradipto Das, Yandi Xia, Aaron Levine, Giuseppe Di Fabbrizio, Ankur Datta:
Large-scale taxonomy categorization for noisy product listings. 3885-3894 - Georg Heigold, Josef van Genabith, Günter Neumann:
Scaling character-based morphological tagging to fourteen languages. 3895-3902 - Avinash Kumar, Dhaval Patel, Nikita Jain:
Lightweight system for NE-tagged news headlines corpus creation. 3903-3912 - Yunfei Long, Qin Lu
, Yue Xiao, Minglei Li, Chu-Ren Huang
:
Domain-specific user preference prediction based on multiple user activities. 3913-3921 - Daiki Shimada, Ryunosuke Kotani, Hitoshi Iyatomi:
Document classification through image-based character embedding and wildcard training. 3922-3927 - Alexey Svyatkovskiy, Kosuke Imai, Mary Kroeger, Yuki Shiraito
:
Large-scale text processing pipeline with Apache Spark. 3928-3935 - Hoseong Yang, Hye Jin Lee, Sungzoon Cho, Eugene Cho:
Automatic classification of securities using hierarchical clustering of the 10-Ks. 3936-3943 - Katchaguy Areekijseree, Ricky Laishram, Sucheta Soundarajan:
Max-node sampling: An expansion-densification algorithm for data collection. 3944-3946 - Adel Saad Assiri, Ahmed Z. Emam
, Hmood Al-Dossari:
Real-time sentiment analysis of Saudi dialect tweets using SPARK. 3947-3950 - Peter Bajcsy, Soweon Yoon, Mylene Simon, Mary Brady, Ram D. Sriram, Nathan Hotaling, Nicholas Schaub, Carl G. Simon, Piotr M. Szczypinski
, Stephen J. Florczyk
:
Modeling, validation and verification of cell-scaffold contact measurements over terabyte-sized 3D image collection. 3951-3953 - Raja Sarath Kumar Boddu
:
An integrated assessment approach to different collaborative filtering algorithms. 3954-3956 - Shaunak D. Bopardikar, George S. Eskander Ekladious:
Sequential randomized matrix factorization for Gaussian processes. 3957-3959 - Vy Bui, Lin-Ching Chang
, Dunling Li, Li-yueh Hsu
, Marcus Y. Chen:
Comparison of lossless video and image compression codecs for medical computed tomography datasets. 3960-3962 - Sunghwan Cho, Sunghal Hong, Changsoo Lee:
ORANGE: Spatial big data analysis platform. 3963-3965 - Ranjeet Devarakonda
, Yaxing Wei
, Michele Thornton:
Accessing and distributing large volumes of NetCDF data. 3966-3967 - Ranjeet Devarakonda
, Kyle Dumas, Sheman Beus, Everett Neil Rush
, Bhargavi Krishna, Rob Records, Giri Prakash:
Next-gen tools for big scientific data: ARM data center example. 3968-3970 - Srabasti Dutta, Sumantro Ray, S. Roy:
Correlation between weather and weather-related tweets - A preliminary study. 3971-3973 - Austin Harris, Hanna True, Zhen Hu, Jin Cho, Nancy Fell, Mina Sartipi:
Fall recognition using wearable technologies and machine learning algorithms. 3974-3976 - Ling He, Jiebo Luo
:
"What makes a pro eating disorder hashtag": Using hashtags to identify pro eating disorder tumblr posts and Twitter users. 3977-3979 - Ayae Ichinose, Masato Oguchi, Atsuko Takefusa, Hidemoto Nakada
:
Evaluation of distributed processing of caffe framework using poor performance device. 3980-3982 - Hiroki Imabayashi, Yu Ishimaki, Akira Umayabara, Hayato Yamana
:
Fast and space-efficient secure frequent pattern mining by FHE. 3983-3985 - Akira Ishii, Masanori Ajito, Yasuko Kawahata:
Analysis of Pokémon GO using sociophysics approach. 3986-3988 - Yu Ishimaki, Hiroki Imabayashi, Kana Shimizu
, Hayato Yamana
:
Privacy-preserving string search for genome sequences with FHE bootstrapping optimization. 3989-3991 - Jeffrey Jenkins, Lin-Ching Chang
, Elizabeth B. Hutchinson
, M. Okan Irfanoglu, Carlo Pierpaoli:
Harmonization of methods to facilitate reproducibility in medical data processing: Applications to diffusion tensor magnetic resonance imaging. 3992-3994 - Seungwoo Jeon, Jaegi Hong, Bonghee Hong, Chumsu Kim:
TPR∗-tree Performance improvement for big tactical moving objects. 3995-3997 - Xiaoxia Jia, Peng Cheng, Jiming Chen:
A data analysis and visualization system for large-scale e-bike data. 3998-4000 - Priyanka Kale, Shilpa Balan:
Big data application in job trend analysis. 4001-4003 - David Kimmey, Jin Soung Yoo:
Nowcasting with social media data. 4004 - Vivian Lai, Kyong Jin Shim
, Richard Jayadi Oentaryo, Philips Kokoh Prasetyo, Casey Vu, Ee-Peng Lim
, David Lo
:
CareerMapper: An automated resume evaluation tool. 4005-4007 - Ricky Laishram, Katchaguy Areekijseree, Sucheta Soundarajan:
Predicted max degree sampling: Sampling in directed networks to maximize node coverage through crawling. 4008-4010 - Jiwan Lee, Jaegi Hong, Bonghee Hong, Jinsu Ahn:
A generator of test data set for tactical moving objects based on velocity. 4011-4013 - Quanzhi Li, Sameena Shah, Mohammad Mahdi Ghassemi, Rui Fang, Armineh Nourbakhsh, Xiaomo Liu:
Using paraphrases to improve tweet classification: Comparing WordNet and word embedding approaches. 4014-4016 - Xiaomeng Liang, Lin-Ching Chang
, Arash Massoudieh:
A framework for large-scale bacterial motility behavior analysis. 4017-4019 - Ankur Padia, Konstantinos Kalpakis, Tim Finin:
Inferring relations in knowledge graphs with tensor decompositions. 4020-4022 - Benito O. Perez, Yiwei Ma, Mengran Wang, Xiaomeng Liang, Negin Askarzadeh:
Towards a more meterless parking system: Understanding meter payment behavior and trends in Washington, DC. 4023-4025 - Giri Prakash, Jitendra Kumar
, Everett Neil Rush
, Robert Records, Anthony Clodfelter, Jimmy W. Voyles:
HPC infrastructure to support the next-generation ARM facility data operations. 4026-4028 - Jonathan M. Rogers, Soumya S. Dey, Richard Retting, Rahul Jain, Xiaomeng Liang, Negin Askarzadeh:
Using automated enforcement data to achieve vision zero goals: A case study. 4029-4031 - Antonette Shibani
, Elizabeth Koh, Vivian Lai, Kyong Jin Shim
:
Analysis of teamwork dialogue: A data mining approach. 4032-4034 - Kenneth David Strang
, Zhaohao Sun
:
Meta-analysis of big data security and privacy: Scholarly literature gaps. 4035-4037 - Xingang Wang, Zhigang Gai, Suiping Qi:
An approach for extracting big micro-scale severe weather region trajectories automatically from meteorological radar data. 4038-4039 - Guangxia Xu, Jingteng Zhao, Deling Huang:
An improved social spammer detection based on tri-training. 4040-4042

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.