default search action
ACM SIGMOD Conference 2020: online [Portland, OR, USA]
- David Maier, Rachel Pottinger, AnHai Doan, Wang-Chiew Tan, Abdussalam Alawini, Hung Q. Ngo:
Proceedings of the 2020 International Conference on Management of Data, SIGMOD Conference 2020, online conference [Portland, OR, USA], June 14-19, 2020. ACM 2020, ISBN 978-1-4503-6735-6
SIGMOD Keynote 1
- Ion Stoica:
Systems and ML: When the Sum is Greater than Its Parts. 1
Research 1: Crowdsourcing and Visualization
- Dong Wei, Senjuti Basu Roy, Sihem Amer-Yahia:
Recommending Deployment Strategies for Collaborative Tasks. 3-17 - Chengliang Chai, Lei Cao, Guoliang Li, Jian Li, Yuyu Luo, Samuel Madden:
Human-in-the-loop Outlier Detection. 19-33 - Tsz Nam Chan, Reynold Cheng, Man Lung Yiu:
QUAD: Quadratic-Bound-based Kernel Density Visualization. 35-50 - Tarique Siddiqui, Paul Luh, Zesheng Wang, Karrie Karahalios, Aditya G. Parameswaran:
ShapeSearch: A Flexible and Efficient System for Shape-based Exploration of Trendlines. 51-65 - Liming Dong, Qiushi Bai, Taewoo Kim, Taiji Chen, Weidong Liu, Chen Li:
Marviq: Quality-Aware Geospatial Visualization of Range-Selection Queries Using Materialization. 67-82
Research 2: Serverless and Cloud Data Management
- Chenggang Wu, Vikram Sreekanti, Joseph M. Hellerstein:
Transactional Causal Consistency for Serverless Computing. 83-97 - Tarique Siddiqui, Alekh Jindal, Shi Qiao, Hiren Patel, Wangchao Le:
Cost Models for Big Data Query Processing: Learning, Retrofitting, and Our Findings. 99-113 - Ingo Müller, Renato Marroquín, Gustavo Alonso:
Lambada: Interactive Data Analytics on Cold Data Using Serverless Cloud Infrastructure. 115-130 - Matthew Perron, Raul Castro Fernandez, David J. DeWitt, Samuel Madden:
Starling: A Scalable Query Engine on Cloud Functions. 131-141 - Benjamin Hilprecht, Carsten Binnig, Uwe Röhm:
Learning a Partitioning Advisor for Cloud Databases. 143-157
Research 3: Machine Learning for Databases I
- Matthias Jasny, Tobias Ziegler, Tim Kraska, Uwe Röhm, Carsten Binnig:
DB4ML - An In-Memory Database Kernel with Machine Learning Support. 159-173 - Lin Ma, Bailu Ding, Sudipto Das, Adith Swaminathan:
Active Learning for ML Enhanced Database Systems. 175-191 - Zongheng Yang, Badrish Chandramouli, Chi Wang, Johannes Gehrke, Yinan Li, Umar Farooq Minhas, Per-Åke Larson, Donald Kossmann, Rajeev Acharya:
Qd-tree: Learning Data Layouts for Big Data Analytics. 193-208 - Zainab Zolaktaf, Mostafa Milani, Rachel Pottinger:
Facilitating SQL Query Composition and Analysis. 209-224 - Sourav Sikdar, Chris Jermaine:
MONSOON: Multi-Step Optimization and Execution of Queries with Partially Obscured Predicates. 225-240
Research 4: Uncertain, Probabilistic, and Approximate Data
- Babak Salimi, Harsh Parikh, Moe Kayali, Lise Getoor, Sudeepa Roy, Dan Suciu:
Causal Relational Learning. 241-256 - Laurel J. Orr, Magdalena Balazinska, Dan Suciu:
Sample Debiasing in the Themis Open World Database System. 257-268 - Matteo Brucato, Nishant Yadav, Azza Abouzied, Peter J. Haas, Alexandra Meliou:
Stochastic Package Queries in Probabilistic Databases. 269-283 - Xi Liang, Zechao Shang, Sanjay Krishnan, Aaron J. Elmore, Michael J. Franklin:
Fast and Reliable Missing Data Contingency Analysis with Predicate-Constraints. 285-295 - Batya Kenig, Pranay Mundra, Guna Prasaad, Babak Salimi, Dan Suciu:
Mining Approximate Acyclic Schemes from Relations. 297-312
Industry 1: Graph Databases and Knowledge Bases
- Xusheng Luo, Luxin Liu, Yonghua Yang, Le Bo, Yuanpeng Cao, Jinghang Wu, Qiang Li, Keping Yang, Kenny Q. Zhu:
AliCoCo: Alibaba E-commerce Cognitive Concept Net. 313-327 - Chiranjeeb Buragohain, Knut Magne Risvik, Paul Brett, Miguel Castro, Wonhee Cho, Joshua Cowhig, Nikolas Gloy, Karthik Kalyanaraman, Richendra Khanna, John Pao, Matthew Renzelmann, Alex Shamis, Timothy Tan, Shuheng Zheng:
A1: A Distributed In-Memory Graph Database. 329-344 - Yuanyuan Tian, En Liang Xu, Wei Zhao, Mir Hamid Pirahesh, Suijun Tong, Wen Sun, Thomas Kolanko, Md. Shahidul Haque Apu, Huijuan Peng:
IBM Db2 Graph: Supporting Synergistic and Retrofittable Graph Queries Inside IBM Db2. 345-359 - Abdul Quamar, Chuan Lei, Dorian Miller, Fatma Ozcan, Jeffrey T. Kreulen, Robert J. Moore, Vasilis Efthymiou:
An Ontology-Based Conversation System for Knowledge Bases. 361-376 - Alin Deutsch, Yu Xu, Mingxi Wu, Victor E. Lee:
Aggregation Support for Modern Graph Analytics in TigerGraph. 377-392 - Bang Liu, Weidong Guo, Di Niu, Jinwen Luo, Chaoyue Wang, Zhen Wen, Yu Xu:
GIANT: Scalable Creation of a Web-scale Ontology. 393-409
SIGMOD Panel
- Magda Balazinska, Surajit Chaudhuri, Anastasia Ailamaki, Juliana Freire, Sailesh Krishnamurthy, Michael Stonebraker:
The Next 5 Years: What Opportunities Should the Database Community Seize to Maximize its Impact? 411-414
Research 5: Data Provenance
- Pierre Bourhis, Daniel Deutch, Yuval Moskovitch:
Equivalence-Invariant Algebraic Provenance for Hyperplane Update Queries. 415-429 - Anna Fariha, Suman Nath, Alexandra Meliou:
Causality-Guided Adaptive Interventional Debugging. 431-446 - Yinjun Wu, Val Tannen, Susan B. Davidson:
PrIU: A Provenance-Based Approach for Incrementally Updating Regression Models. 447-462 - Raoni Lourenço, Juliana Freire, Dennis E. Shasha:
BugDoc: Algorithms to Debug Computational Processes. 463-478 - Yuchao Tao, Xi He, Ashwin Machanavajjhala, Sudeepa Roy:
Computing Local Sensitivities of Counting Queries with Joins. 479-494
Research 6: Transaction Processing and Query Optimization
- Jong-Bin Kim, Hyunsoo Cho, Kihwang Kim, Jaeseon Yu, Sooyong Kang, Hyungsoo Jung:
Long-lived Transactions Made Less Harmful. 495-510 - Erfan Zamanian, Julian Shun, Carsten Binnig, Tim Kraska:
Chiller: Contention-centric Transaction Execution and Data Partitioning for Modern Networks. 511-526 - Guna Prasaad, Alvin Cheung, Dan Suciu:
Handling Highly Contended OLTP Workloads Using Fast Dynamic Partitioning. 527-542 - Pingcheng Ruan, Dumitrel Loghin, Quang-Trung Ta, Meihui Zhang, Gang Chen, Beng Chin Ooi:
A Transactional Perspective on Execute-order-validate Blockchains. 543-557 - Surabhi Gupta, Sanket Purandare, Karthik Ramachandra:
Aggify: Lifting the Curse of Cursor Loops using Custom Aggregates. 559-573
Research 7: Security, Privacy, and Blockchain
- Yang Cao, Wenfei Fan, Yanghao Wang, Ke Yi:
Querying Shared Data with Security Heterogeneity. 575-585 - Timon Hackenjos, Florian Hahn, Florian Kerschbaum:
SAGMA: Secure Aggregation Grouped by Multiple Attributes. 587-601 - Amrita Roy Chowdhury, Chenghong Wang, Xi He, Ashwin Machanavajjhala, Somesh Jha:
Crypt?: Crypto-Assisted Differential Privacy on Untrusted Servers. 603-619 - Zitao Li, Tianhao Wang, Milan Lopuhaä-Zwakenberg, Ninghui Li, Boris Skoric:
Estimating Numerical Distributions under Local Differential Privacy. 621-635 - Yanqing Peng, Min Du, Feifei Li, Raymond Cheng, Dawn Song:
FalconDB: Blockchain-based Collaborative Database. 637-652
Research 8: Graph Query Processing
- Hanzhi Wang, Zhewei Wei, Ye Yuan, Xiaoyong Du, Ji-Rong Wen:
Exact Single-Source SimRank Computation on Large Graphs. 653-663 - Ziqiang Yu, Xiaohui Yu, Nick Koudas, Yang Liu, Yifan Li, Yueting Chen, Dingyu Yang:
Distributed Processing of k Shortest Path Queries over Dynamic Road Networks. 665-679 - Louis Jachiet, Pierre Genevès, Nils Gesbert, Nabil Layaïda:
On the Optimization of Recursive Relational Queries: Application to Graph Queries. 681-697 - Tangwei Ying, Hanhua Chen, Hai Jin:
Pensieve: Skewness-Aware Version Switching for Efficient Graph Processing. 699-713 - Grace Fan, Wenfei Fan, Yuanhao Li, Ping Lu, Chao Tian, Jingren Zhou:
Extending Graph Patterns with Conditions. 715-729
Industry 2: Machine Learning and Analytics
- Edo Liberty, Zohar S. Karnin, Bing Xiang, Laurence Rouesnel, Baris Coskun, Ramesh Nallapati, Julio Delgado, Amir Sadoughi, Yury Astashonok, Piali Das, Can Balioglu, Saswata Chakravarty, Madhav Jha, Philip Gautier, David Arpin, Tim Januschowski, Valentin Flunkert, Yuyang Wang, Jan Gasthaus, Lorenzo Stella, Syama Sundar Rangapuram, David Salinas, Sebastian Schelter, Alex Smola:
Elastic Machine Learning Algorithms in Amazon SageMaker. 731-737 - Wei Cao, Yusong Gao, Feifei Li, Sheng Wang, Bingchen Lin, Ke Xu, Xiaojie Feng, Yucong Wang, Zhenjun Liu, Gejin Zhang:
Timon: A Timestamped Event Database for Efficient Telemetry Data Processing and Analytics. 739-753 - Arash Fard, Anh Le, George Larionov, Waqas Dhillon, Chuck Bear:
Vertica-ML: Distributed Machine Learning in Vertica Database. 755-768 - Antony S. Higginson, Mihaela Dediu, Octavian Arsene, Norman W. Paton, Suzanne M. Embury:
Database Workload Capacity Planning using Time Series Analysis and Machine Learning. 769-783 - Micah J. Smith, Carles Sala, James Max Kanter, Kalyan Veeramachaneni:
The Machine Learning Bazaar: Harnessing the ML Ecosystem for Effective System Development. 785-800
SIGMOD Keynote 2
- Natasha F. Noy:
When the Web is your Data Lake: Creating a Search Engine for Datasets on the Web. 801 - Awez Syed:
The Challenge of Building Effective, Enterprise-scale Data Lakes. 803
Research 9: Data Cleaning
- Stella Giannakopoulou, Manos Karpathiotakis, Anastasia Ailamaki:
Cleaning Denial Constraint Violations through Relaxation. 805-815 - Amir Gilad, Daniel Deutch, Sudeepa Roy:
On Multiple Semantics for Declarative Database Repairs. 817-831 - Ziheng Wei, Sven Hartmann, Sebastian Link:
Discovery Algorithms for Embedded Functional Dependencies. 833-843 - Jing Nathan Yan, Oliver Schulte, Mohan Zhang, Jiannan Wang, Reynold Cheng:
SCODED: Statistical Constraint Oriented Data Error Detection. 845-860 - Yunjia Zhang, Zhihan Guo, Theodoros Rekatsinas:
A Statistical Perspective on Discovering Functional Dependencies in Noisy Data. 861-876
Research 10: Storage and Indexing
- Michael Haubenschild, Caetano Sauer, Thomas Neumann, Viktor Leis:
Rethinking Logging, Checkpoints, and Recovery for High-Performance Storage Engines. 877-892 - Subhadeep Sarkar, Tarikul Islam Papon, Dimitris Staratzis, Manos Athanassoulis:
Lethe: A Tunable Delete-Aware LSM Engine. 893-908 - Linwei Li, Kai Zhang, Jiading Guo, Wen He, Zhenying He, Yinan Jing, Weili Han, X. Sean Wang:
BinDex: A Two-Layered Index for Fast and Robust Scans. 909-923 - Cong Yue, Zhongle Xie, Meihui Zhang, Gang Chen, Beng Chin Ooi, Sheng Wang, Xiaokui Xiao:
Analysis of Indexing Structures for Immutable Data. 925-935 - Harald Lang, Alexander Beischl, Viktor Leis, Peter A. Boncz, Thomas Neumann, Alfons Kemper:
Tree-Encoded Bitmaps. 937-967
Research 11: Machine Learning for Databases II
- Jialin Ding, Umar Farooq Minhas, Jia Yu, Chi Wang, Jaeyoung Do, Yinan Li, Hantian Zhang, Badrish Chandramouli, Johannes Gehrke, Donald Kossmann, David B. Lomet, Tim Kraska:
ALEX: An Updatable Adaptive Learned Index. 969-984 - Vikram Nathan, Jialin Ding, Mohammad Alizadeh, Tim Kraska:
Learning Multi-Dimensional Indexes. 985-1000 - Ani Kristo, Kapil Vaidya, Ugur Çetintemel, Sanchit Misra, Tim Kraska:
The Case for a Learned Sorting Algorithm. 1001-1016 - Yongjoo Park, Shucheng Zhong, Barzan Mozafari:
QuickSel: Quick Selectivity Learning with Mixture Models. 1017-1033 - Shohedul Hasan, Saravanan Thirumuruganathan, Jees Augustine, Nick Koudas, Gautam Das:
Deep Learning Models for Selectivity Estimation of Multi-Attribute Queries. 1035-1050
Research 12: Graph Matching and Discovery
- Chenhao Ma, Yixiang Fang, Reynold Cheng, Laks V. S. Lakshmanan, Wenjie Zhang, Xuemin Lin:
Efficient Algorithms for Densest Subgraph Discovery on Large Directed Graphs. 1051-1066 - Wentian Guo, Yuchen Li, Mo Sha, Bingsheng He, Xiaokui Xiao, Kian-Lee Tan:
GPU-Accelerated Subgraph Enumeration on Partitioned Graphs. 1067-1082 - Shixuan Sun, Qiong Luo:
In-Memory Subgraph Matching: An In-depth Study. 1083-1098 - Yeonsu Park, Seongyun Ko, Sourav S. Bhowmick, Kyoungmin Kim, Kijae Hong, Wook-Shin Han:
G-CARE: A Framework for Performance Benchmarking of Cardinality Estimation Techniques for Subgraph Matching. 1099-1114 - Tahsin Reza, Matei Ripeanu, Geoffrey Sanders, Roger Pearce:
Approximate Pattern Matching in Massive Graphs with Precision and Recall Guarantees. 1115-1131
Research 13: Data Matching
- Venkata Vamsikrishna Meduri, Lucian Popa, Prithviraj Sen, Mohamed Sarwat:
A Comprehensive Benchmark Framework for Active Learning Methods in Entity Matching. 1133-1147 - Renzhi Wu, Sanya Chaba, Saurabh Sawlani, Xu Chu, Saravanan Thirumuruganathan:
ZeroER: Entity Resolution using Zero Labeled Examples. 1149-1164 - Zhaoqiang Chen, Qun Chen, Boyi Hou, Zhanhuai Li, Guoliang Li:
Towards Interpretable and Learnable Risk Analysis for Entity Resolution. 1165-1180 - Fuat Basik, Hakan Ferhatosmanoglu, Bugra Gedik:
SLIM: Scalable Linkage of Mobility Data. 1181-1196 - Yaoshu Wang, Chuan Xiao, Jianbin Qin, Xin Cao, Yifang Sun, Wei Wang, Makoto Onizuka:
Monotonic Cardinality Estimation of Similarity Selection: A Deep Learning Approach. 1197-1212
Research 14: Query Optimization and Execution
- Shaleen Deep, Xiao Hu, Paraschos Koutris:
Fast Join Project Query Evaluation using Matrix Multiplication. 1213-1223 - Qichen Wang, Ke Yi:
Maintaining Acyclic Foreign-Key Joins under Updates. 1225-1239 - Dixin Tang, Zechao Shang, Aaron J. Elmore, Sanjay Krishnan, Michael J. Franklin:
Thrifty Query Execution via Incrementability. 1241-1256 - Wenjia He, Michael R. Anderson, Maxwell Strome, Michael J. Cafarella:
A Method for Optimizing Opaque Filter Queries. 1257-1272 - Christian Duta, Torsten Grust:
Functional-Style SQL UDFs With a Capital 'F'. 1273-1287
Research 15: Machine Learning for Cleaning, Integration, and Search
- Sebastian Schelter, Tammo Rukat, Felix Bießmann:
Learning to Validate the Predictions of Black Box Classifiers on Unseen Data. 1289-1299 - Jose Picado, John Davis, Arash Termehchy, Ga Young Lee:
Learning Over Dirty Data Without Cleaning. 1301-1316 - Weiyuan Wu, Lampros Flokas, Eugene Wu, Jiannan Wang:
Complaint-driven Training Data Debugging for Query 2.0. 1317-1334 - Riccardo Cappuzzo, Paolo Papotti, Saravanan Thirumuruganathan:
Creating Embeddings of Heterogeneous Relational Datasets for Data Integration Tasks. 1335-1349 - Shay Gershtein, Tova Milo, Gefen Morami, Slava Novgorodov:
Minimization of Classifier Construction Cost for Search Queries. 1351-1365
Research 16: Graph and Stream Processing
- Wentao Li, Miao Qiao, Lu Qin, Ying Zhang, Lijun Chang, Xuemin Lin:
Scaling Up Distance Labeling on Graphs with Core-Periphery Properties. 1367-1381 - Krishna Kumar P., Paul Langton, Wolfgang Gatterbauer:
Factorized Graph Representations for Semi-Supervised Learning from Sparse Data. 1383-1398 - Wentao Zhang, Xupeng Miao, Yingxia Shao, Jiawei Jiang, Lei Chen, Olivier Ruas, Bin Cui:
Reliable Data Distillation on Graph Convolutional Network. 1399-1414 - Anil Pacaci, Angela Bonifati, M. Tamer Özsu:
Regular Path Query Evaluation on Streaming Graphs. 1415-1430 - Prashant Pandey, Shikha Singh, Michael A. Bender, Jonathan W. Berry, Martin Farach-Colton, Rob Johnson, Thomas M. Kroeger, Cynthia A. Phillips:
Timely Reporting of Heavy Hitters using External Memory. 1431-1446
Industry 3: Cloud and Distributed Databases
- Mohamed A. Soliman, Lyublena Antova, Marc Sugiyama, Michael Duller, Amirhossein Aleyasen, Gourab Mitra, Ehab Abdelhamid, Mark Morcos, Michele Gage, Dmitri Korablev, Florian M. Waas:
A Framework for Emulating Database Operations in Cloud Data Warehouses. 1447-1461 - Alex Depoutovitch, Chong Chen, Jin Chen, Paul Larson, Shu Lin, Jack Ng, Wenlin Cui, Qiang Liu, Wei Huang, Yong Xiao, Yongjun He:
Taurus Database: How to be Fast, Available, and Frugal in the Cloud. 1463-1478 - Mathieu B. Demarne, Jim Gramling, Tomer Verona, Miso Cilimdzic:
Reliability Analytics for Cloud Based Distributed Databases. 1479-1492 - Rebecca Taft, Irfan Sharif, Andrei Matei, Nathan VanBenschoten, Jordan Lewis, Tobias Grieger, Kai Niemi, Andy Woods, Anne Birzin, Raphael Poss, Paul Bardea, Amruta Ranade, Ben Darnell, Bram Gruneir, Justin Jaffray, Lucy Zhang, Peter Mattis:
CockroachDB: The Resilient Geo-Distributed SQL Database. 1493-1509 - Panagiotis Antonopoulos, Arvind Arasu, Kunal D. Singh, Ken Eguro, Nitish Gupta, Rajat Jain, Raghav Kaushik, Hanuma Kodavalla, Donald Kossmann, Nikolas Ogg, Ravi Ramamurthy, Jakub Szymaszek, Jeffrey Trimmer, Kapil Vaswani, Ramarathnam Venkatesan, Mike Zwilling:
Azure SQL Database Always Encrypted. 1511-1525
Research 17: Data Exploration and Preparation
- Ori Bar El, Tova Milo, Amit Somech:
Automatically Generating Data Exploration Sessions Using Deep Reinforcement Learning. 1527-1537 - Cong Yan, Yeye He:
Auto-Suggest: Learning-to-Recommend Data Preparation Steps Using Data Science Notebooks. 1539-1554 - Philipp Eichmann, Emanuel Zgraggen, Carsten Binnig, Tim Kraska:
IDEBench: A Benchmark for Interactive Data Exploration. 1555-1569 - Leilani Battle, Philipp Eichmann, Marco Angelini, Tiziana Catarci, Giuseppe Santucci, Yukun Zheng, Carsten Binnig, Jean-Daniel Fekete, Dominik Moritz:
Database Benchmarking for Supporting Real-Time Interactive Querying of Large Data. 1571-1587 - Sajjadur Rahman, Kelly Mack, Mangesh Bendre, Ruilin Zhang, Karrie Karahalios, Aditya G. Parameswaran:
Benchmarking Spreadsheet Systems. 1589-1599
Research 18: Main Memory Databases and Modern Hardware
- Huanchen Zhang, Xiaoxuan Liu, David G. Andersen, Michael Kaminsky, Kimberly Keeton, Andrew Pavlo:
Order-Preserving Key Compression for In-Memory Search Trees. 1601-1615 - Anil Shanbhag, Samuel Madden, Xiangyao Yu:
A Study of the Fundamental Performance Characteristics of GPUs and CPUs for Database Analytics. 1617-1632 - Clemens Lutz, Sebastian Breß, Steffen Zeuch, Tilmann Rabl, Volker Markl:
Pump Up the Volume: Processing Large Data on GPUs with Fast Interconnects. 1633-1649 - Tiemo Bang, Ismail Oukid, Norman May, Ilia Petrov, Carsten Binnig:
Robust Performance of Main Memory Data Structures by Configuration. 1651-1666 - Mayuresh Kunjir, Shivnath Babu:
Black or White? How to Develop an AutoTuner for Memory-based Analytics. 1667-1683
Research 19: Machine Learning Systems and Applications
- Supun Nakandala, Arun Kumar:
Vista: Optimized System for Declarative Feature Transfer from Deep CNNs at Scale. 1685-1700 - Behrouz Derakhshan, Alireza Rezaei Mahdiraji, Ziawasch Abedjan, Tilmann Rabl, Volker Markl:
Optimizing Machine Learning Workloads in Collaborative Environments. 1701-1716 - Nilaksh Das, Sanya Chaba, Renzhi Wu, Sakshi Gandhi, Duen Horng Chau, Xu Chu:
GOGGLES: Automatic Image Labeling with Affinity Coding. 1717-1732 - Amir Ilkhechi, Andrew Crotty, Alex Galakatos, Yicong Mao, Grace Fan, Xiran Shi, Ugur Çetintemel:
DeepSqueeze: Deep Semantic Compression for Tabular Data. 1733-1746 - Kaiping Zheng, Shaofeng Cai, Horng Ruey Chua, Wei Wang, Kee Yuan Ngiam, Beng Chin Ooi:
TRACER: A Framework for Facilitating Accurate and Interpretable Analytics for High Stakes Applications. 1747-1763
Research 20: Graph Data Management and Analysis
- Wenfei Fan, Ruochun Jin, Muyang Liu, Ping Lu, Xiaojian Luo, Ruiqi Xu, Qiang Yin, Wenyuan Yu, Jingren Zhou:
Application Driven Graph Partitioning. 1765-1779 - Dian Ouyang, Dong Wen, Lu Qin, Lijun Chang, Ying Zhang, Xuemin Lin:
Progressive Top-K Nearest Neighbors Search in Large Road Networks. 1781-1795 - Yingxia Shao, Shiyue Huang, Xupeng Miao, Bin Cui, Lei Chen:
Memory-Aware Framework for Efficient Second-Order Random Walk on Large Graphs. 1797-1812 - Yikai Zhang, Jeffrey Xu Yu:
Hub Labeling for Shortest Path Counting. 1813-1828 - Hui Li, Hui Li, Sourav S. Bhowmick:
CHASSIS: Conformity Meets Online Information Diffusion. 1829-1840
Research 21: Spatial, Temporal, and Multimedia Data I
- Victor Junqiu Wei, Raymond Chi-Wing Wong, Cheng Long:
Architecture-Intact Oracle for Fastest Path and Time Queries on Dynamic Spatial Networks. 1841-1856 - Anna Gogolou, Theophanis Tsandilas, Karima Echihabi, Anastasia Bezerianos, Themis Palpanas:
Data Series Progressive Similarity Search with Probabilistic Quality Guarantees. 1857-1873 - Harish Doraiswamy, Juliana Freire:
A GPU-friendly Geometric Data Model and Algebra for Spatial Queries. 1875-1885 - John Paparrizos, Chunwei Liu, Aaron J. Elmore, Michael J. Franklin:
Debunking Four Long-Standing Misconceptions of Time-Series Distance Measures. 1887-1905 - Favyen Bastani, Songtao He, Arjun Balasingam, Karthik Gopalakrishnan, Mohammad Alizadeh, Hari Balakrishnan, Michael J. Cafarella, Tim Kraska, Sam Madden:
MIRIS: Fast Object Track Queries in Video. 1907-1921
Award Talks
- Jose M. Faleiro:
ACM SIGMOD Jim Gray Dissertation Award W Talk. 1923 - Silu Huang:
Effective Data Versioning for Collaborative Data Analytics. 1925-1938
Research 22: Data Lakes, Web, and Knowledge Graph
- Fatemeh Nargesian, Ken Q. Pu, Erkang Zhu, Bahar Ghadiri Bashardoost, Renée J. Miller:
Organizing Data Lakes for Navigation. 1939-1950 - Yi Zhang, Zachary G. Ives:
Finding Related Tables in Data Lakes for Interactive Data Science. 1951-1966 - Mohammad Raza, Sumit Gulwani:
Web Data Extraction using Hybrid Program Synthesis: A Combination of Top-down and Bottom-up Inference. 1967-1978 - Xun Jian, Yue Wang, Xiayu Lei, Libin Zheng, Lei Chen:
SPARQL Rewriting: Towards Desired Results. 1979-1993 - Farahnaz Akrami, Mohammed Samiul Saeef, Qingheng Zhang, Wei Hu, Chengkai Li:
Realistic Re-evaluation of Knowledge Graph Completion Methods: An Experimental Study. 1995-2010
Research 23: OLAP, Data Warehouses, and Key-Value Stores
- Bailu Ding, Surajit Chaudhuri, Vivek R. Narasayya:
Bitvector-aware Query Optimization for Decision Support Queries. 2011-2026 - Zhuoyue Zhao, Feifei Li, Yuxi Liu:
Efficient Join Synopsis Maintenance for Data Warehouse. 2027-2042 - Aunn Raza, Periklis Chrysogelos, Angelos-Christos G. Anadiotis, Anastasia Ailamaki:
Adaptive HTAP through Elastic Resource Scheduling. 2043-2054 - Yoon-Min Nam, Donghyoung Han, Min-Soo Kim:
SPRINTER: A Fast n-ary Join Query Processing Method for Complex OLAP Queries. 2055-2070 - Siqiang Luo, Subarna Chatterjee, Rafael Ketsetsidis, Niv Dayan, Wilson Qin, Stratos Idreos:
Rosetta: A Robust Space-Time Optimized Range Filter for Key-Value Stores. 2071-2086
Research 24: Spatial, Temporal, and Multimedia Data II
- Nikos Tsikoudis, Liuba Shrira:
RID: Deduplicating Snapshot Computations. 2087-2101 - Ruby Y. Tahboub, Tiark Rompf:
Architecting a Query Compiler for Spatial Workloads. 2103-2118 - Pengfei Li, Hua Lu, Qian Zheng, Long Yang, Gang Pan:
LISA: A Learned Index Structure for Spatial Data. 2119-2133 - Haitao Yuan, Guoliang Li, Zhifeng Bao, Ling Feng:
Effective Travel Time Estimation: When Historical Trajectories over Road Networks Matter. 2135-2149
Research 25: Social Network Analysis
- Naoto Ohsaka:
The Solution Distribution of Influence Maximization: A High-level Experimental Study on Three Algorithmic Approaches. 2151-2166 - Qintian Guo, Sibo Wang, Zhewei Wei, Ming Chen:
Influence Maximization Revisited: Efficient Reverse Reachable Set Generation with Bound Tightened. 2167-2181 - Qing Liu, Minjun Zhao, Xin Huang, Jianliang Xu, Yunjun Gao:
Truss-based Community Search over Large Directed Graphs. 2183-2197 - Junghoon Kim, Tao Guo, Kaiyu Feng, Gao Cong, Arijit Khan, Farhana Murtaza Choudhury:
Densely Connected User Community and Location Cluster Search in Location-Based Social Networks. 2199-2209 - Qingyuan Linghu, Fan Zhang, Xuemin Lin, Wenjie Zhang, Ying Zhang:
Global Reinforcement of Social Networks: The Anchored Coreness Problem. 2211-2226
Industry 4: Advanced Functionality
- Ying Yan, Changzheng Wei, Xuepeng Guo, Xuming Lu, Xiaofu Zheng, Qi Liu, Chenhui Zhou, Xuyang Song, Boran Zhao, Hui Zhang, Guofei Jiang:
Confidentiality Support over Financial Grade Consortium Blockchain. 2227-2240 - Wen Yang, Tao Li, Gai Fang, Hong Wei:
PASE: PostgreSQL Ultra-High-Dimensional Approximate Nearest Neighbor Search Extension. 2241-2253 - Fabio Maschi, Muhsen Owaida, Gustavo Alonso, Matteo Casalino, Anthony Hock-Koon:
Making Search Engines Faster by Lowering the Cost of Querying Business Rules Through FPGAs. 2255-2270 - Ke Wang, Avrilia Floratou, Ashvin Agrawal, Daniel Musgrave:
Spur: Mitigating Slow Instances in Large-Scale Streaming Pipelines. 2271-2285 - Yan Yan, Stephen Meyles, Aria Haghighi, Dan Suciu:
Entity Matching in the Wild: A Consistent and Versatile Framework to Unify Data in Industrial Applications. 2287-2301
Research 26: Usability and Natural Language User Interfaces
- Aristotelis Leventidis, Jiahui Zhang, Cody Dunne, Wolfgang Gatterbauer, H. V. Jagadish, Mirek Riedewald:
QueryVis: Logic-based Diagrams help Users Understand Complicated SQL Queries Faster. 2303-2318 - Christopher Baik, Zhongjun Jin, Michael J. Cafarella, H. V. Jagadish:
Duoquest: A Dual-Specification System for Expressive SQL Queries. 2319-2329 - Prashanth Dintyala, Arpit Narechania, Joy Arulraj:
SQLCheck: Automated Detection and Diagnosis of SQL Anti-Patterns. 2331-2345 - Nathaniel Weir, Prasetya Ajie Utama, Alex Galakatos, Andrew Crotty, Amir Ilkhechi, Shekar Ramaswamy, Rohin Bhushan, Nadja Geisler, Benjamin Hättasch, Steffen Eger, Ugur Çetintemel, Carsten Binnig:
DBPal: A Fully Pluggable NL2SQL Training Pipeline. 2347-2361 - Vraj Shah, Side Li, Arun Kumar, Lawrence K. Saul:
SpeakQL: Towards Speech-driven Multimodal Querying of Structured Data. 2363-2374
Research 27: Distributed and Parallel Processing
- Rundong Li, Wolfgang Gatterbauer, Mirek Riedewald:
Near-Optimal Distributed Band-Joins through Recursive Partitioning. 2375-2390 - Brad Glasbergen, Kyle Langendoen, Michael Abebe, Khuzaima Daudjee:
ChronoCache: Predictive and Adaptive Mid-Tier Query Result Caching. 2391-2406 - Muhammad Tirmazi, Ran Ben Basat, Jiaqi Gao, Minlan Yu:
Cheetah: Accelerating Database Queries with Switch Pruning. 2407-2422 - Yannis Chronis, Thanh Do, Goetz Graefe, Keith Peters:
External Merge Sort for Top-K Queries: Eager input filtering guided by histograms. 2423-2437 - Qiange Wang, Yanfeng Zhang, Hao Wang, Liang Geng, Rubao Lee, Xiaodong Zhang, Ge Yu:
Automating Incremental and Asynchronous Evaluation for Recursive Aggregate Data Processing. 2439-2454
Research 28: Stream Processing
- Ahmed S. Abdelhamid, Ahmed R. Mahmood, Anas Daghistani, Walid G. Aref:
Prompt: Dynamic Data-Partitioning for Distributed Micro-batch Stream Processing Systems. 2455-2469 - Bonaventura Del Monte, Steffen Zeuch, Tilmann Rabl, Volker Markl:
Rhino: Efficient Management of Very Large Distributed State for Stream Processing Engines. 2471-2486 - Philipp M. Grulich, Sebastian Breß, Steffen Zeuch, Jonas Traub, Janis von Bleichert, Zongxiong Chen, Tilmann Rabl, Volker Markl:
Grizzly: Efficient Stream Processing Through Adaptive Query Compilation. 2487-2503 - Georgios Theodorakis, Alexandros Koliousis, Peter R. Pietzuch, Holger Pirk:
LightSaber: Efficient Window Aggregation on Multi-core Processors. 2505-2521 - Amirhesam Shahvarani, Hans-Arno Jacobsen:
Parallel Index-based Stream Join on a Multicore CPU. 2523-2537
Research 29: Data Mining and Similarity Search
- Conglong Li, Minjia Zhang, David G. Andersen, Yuxiong He:
Improving Approximate Nearest Neighbor Search through Learned Adaptive Early Termination. 2539-2554 - Yiqiu Wang, Yan Gu, Julian Shun:
Theoretically-Efficient and Practical Parallel DBSCAN. 2555-2571 - Oksana Dolmatova, Nikolaus Augsten, Michael H. Böhlen:
A Relational Matrix Algebra and its Implementation in a Column Store. 2573-2587 - Yifan Lei, Qiang Huang, Mohan S. Kankanhalli, Anthony K. H. Tung:
Locality-Sensitive Hashing Scheme based on Longest Circular Co-Substring. 2589-2599 - Huayi Zhang, Lei Cao, Yizhou Yan, Samuel Madden, Elke A. Rundensteiner:
Continuously Adaptive Similarity Search. 2601-2616
Tutorials
- Tova Milo, Amit Somech:
Automating Exploratory Data Analysis via Machine Learning: An Overview. 2617-2622 - Alexey Drutsa, Valentina Fedorova, Dmitry Ustalov, Olga Megorskaya, Evfrosiniya Zerminova, Daria Baidakova:
Crowdsourcing Practice for Efficient Data Labeling: Aggregation, Incremental Relabeling, and Pricing. 2623-2627 - Fatma Ozcan, Abdul Quamar, Jaydeep Sen, Chuan Lei, Vasilis Efthymiou:
State of the Art and Open Challenges in Natural Language Interfaces to Data. 2629-2636 - Nihar B. Shah, Zachary C. Lipton:
SIGMOD 2020 Tutorial on Fairness and Bias in Peer Review and Other Sociotechnical Intelligent Systems. 2637-2640 - Anurag Khandelwal, Arun Kejariwal, Karthikeyan Ramasamy:
Le Taureau: Deconstructing the Serverless Landscape & A Look Forward. 2641-2650 - Paris Carbone, Marios Fragkoulis, Vasiliki Kalavri, Asterios Katsifodimos:
Beyond Analytics: The Evolution of Stream Processing Systems. 2651-2658 - Nikolaos Tziavelis, Wolfgang Gatterbauer, Mirek Riedewald:
Optimal Join Algorithms Meet Top-k. 2659-2665 - Stratos Idreos, Mark Callaghan:
Key-Value Storage Engines. 2667-2672
Demonstrations
- Jin Wang, Guorui Xiao, Jiaqi Gu, Jiacheng Wu, Carlo Zaniolo:
RASQL: A Powerful Language and its System for Big Data Applications. 2673-2676 - Denis Hirn, Torsten Grust:
PL/SQL Without the PL. 2677-2680 - Theofilos Belmpas, Orest Gkini, Georgia Koutrika:
Analysis of Database Search Systems with THOR. 2681-2684 - Yinglong Song, Huey-Eng Chua, Sourav S. Bhowmick, Byron Choi, Shuigeng Zhou:
BOOMER: A Tool for Blending Visual P-Homomorphic Queries on Large Networks. 2685-2688 - Sourav S. Bhowmick, Kai Huang, Huey-Eng Chua, Zifeng Yuan, Byron Choi, Shuigeng Zhou:
AURORA: Data-driven Construction of Visual Graph Query Interfaces for Graph Databases. 2689-2692 - Haixin Wang, Cheng Xu, Ce Zhang, Jianliang Xu:
vChain: A Blockchain System Ensuring Query Integrity. 2693-2696 - Wentao Hu, Dongxiang Zhang, Dawei Jiang, Sai Wu, Ke Chen, Kian-Lee Tan, Gang Chen:
AUDITOR: A System Designed for Automatic Discovery of Complex Integrity Constraints in Relational Databases. 2697-2700 - Angela Bonifati, Wim Martens, Thomas Timm:
SHARQL: Shape Analysis of Recursive SPARQL Queries. 2701-2704 - Hongzhi Chen, Bowen Wu, Shiyuan Deng, Chenghuan Huang, Changji Li, Yichao Li, James Cheng:
High Performance Distributed OLAP on Property Graphs with Grasper. 2705-2708 - Kisung Park, Taeyoung Jeong, Chanho Jeong, Jaeha Lee, Dong-Hun Lee, Young-Koo Lee:
ProcAnalyzer: Effective Code Analyzer for Tuning Imperative Programs in SAP HANA. 2709-2712 - Sean Tan, Sourav S. Bhowmick, Huey-Eng Chua, Xiaokui Xiao:
LATTE: Visual Construction of Smart Contracts. 2713-2716 - Theodoros Toliopoulos, Christos Bellas, Anastasios Gounaris, Apostolos Papadopoulos:
PROUD: PaRallel OUtlier Detection for Streams. 2717-2720 - Zhongjun Jin, Mengjing Xu, Chenkai Sun, Abolfazl Asudeh, H. V. Jagadish:
MithraCoverage: A System for Investigating Population Bias for Intersectional Fairness. 2721-2724 - Shay Gershtein, Tova Milo, Gefen Morami, Slava Novgorodov:
MC3: A System for Minimization of Classifier Construction Cost. 2725-2728 - Brad Glasbergen, Michael Abebe, Khuzaima Daudjee, Daniel Vogel, Jian Zhao:
Sentinel: Understanding Data Systems. 2729-2732 - Raoni Lourenço, Juliana Freire, Dennis E. Shasha:
BugDoc: A System for Debugging Computational Pipelines. 2733-2736 - Yueting Chen, Xiaohui Yu, Nick Koudas:
TQVS: Temporal Queries over Video Streams in Action. 2737-2740 - Anna Fariha, Ashish Tiwari, Arjun Radhakrishna, Sumit Gulwani:
ExTuNe: Explaining Tuple Non-conformance. 2741-2744 - Xuedi Qin, Chengliang Chai, Yuyu Luo, Nan Tang, Guoliang Li:
Interactively Discovering and Ranking Desired Tuples without Writing SQL Queries. 2745-2748 - Miro Mannino, Azza Abouzied:
Synner: Generating Realistic Synthetic Data. 2749-2752 - Roee Shraga, Coral Scharf, Rakefet Ackerman, Avigdor Gal:
InCognitoMatch: Cognitive-aware Matching via Crowdsourcing. 2753-2756 - Mashaal Musleh, Mourad Ouzzani, Nan Tang, AnHai Doan:
CoClean: Collaborative Data Cleaning. 2757-2760 - Zhida Chen, Gao Cong, Walid G. Aref:
STAR: A Distributed Stream Warehouse System for Spatial Data. 2761-2764 - Daniel Deutch, Nave Frost, Amir Gilad, Oren Sheffer:
T-REx: Table Repair Explanations. 2765-2768 - Daren Chao, Nick Koudas, Ioannis Xarchakos:
SVQ++: Querying for Object Interactions in Video Streams. 2769-2772 - Milos Nikolic, Haozhe Zhang, Ahmet Kara, Dan Olteanu:
F-IVM: Learning over Fast-Evolving Relational Data. 2773-2776 - Ziquan Fang, Yunjun Gao, Lu Pan, Lu Chen, Xiaoye Miao, Christian S. Jensen:
CoMing: A Real-time Co-Movement Mining System for Streaming Trajectories. 2777-2780 - Nemanja Boric, Hinnerk Gildhoff, Menelaos Karavelas, Ippokratis Pandis, Ioanna Tsalouchidou:
Unified Spatial Analytics from Heterogeneous Sources with Amazon Redshift. 2781-2784 - Liang Zhang, Noura Alghamdi, Mohamed Y. Eltabakh, Elke A. Rundensteiner:
Big Data Series Analytics Using TARDIS and its Exploitation in Geospatial Applications. 2785-2788 - Ryan Marcus, Emily Zhang, Tim Kraska:
CDFShop: Exploring and Optimizing Learned Index Structures. 2789-2792 - Emily Caveness, Paul Suganthan G. C., Zhuo Peng, Neoklis Polyzotis, Sudip Roy, Martin Zinkevich:
TensorFlow Data Validation: Data Analysis and Validation in Continuous ML Pipelines. 2793-2796 - Zuozhi Wang, Kai Zeng, Botong Huang, Wei Chen, Xiaozong Cui, Bo Wang, Ji Liu, Liya Fan, Dachuan Qu, Zhenyu Hou, Tao Guan, Chen Li, Jingren Zhou:
Grosbeak: A Data Warehouse Supporting Resource-Aware Incremental Computing. 2797-2800 - Saehan Jo, Immanuel Trummer:
Demonstration of BitGourmet: Data Analysis via Deterministic Approximation. 2801-2804 - Eliana Pastor, Elena Baralis:
Bring Your Own Data to X-PLAIN. 2805-2808 - Lana Ramjit, Zhaoning Kong, Ravi Netravali, Eugene Wu:
Physical Visualization Design. 2809-2812 - Mingwei Samuel, Cong Yan, Alvin Cheung:
Demonstration of Chestnut: An In-memory Data Layout Designer for Database Applications. 2813-2816
Student Abstracts
- Chen Luo:
Breaking Down Memory Walls in LSM-based Storage Systems. 2817-2819 - Yuliya Susanina:
Context-Free Path Querying via Matrix Equations. 2821-2823 - Xiaoshuang Chen:
Simulation-based Approximate Graph Pattern Matching. 2825-2827 - Karima Echihabi:
High-Dimensional Vector Similarity Search: From Time Series to Deep Network Embeddings. 2829-2832 - Hendrik Makait:
Rethinking Message Brokers on RDMA and NVM. 2833-2835 - Yiru Chen:
Monte Carlo Tree Search for Generating Interactive Data Analysis Interfaces. 2837-2839 - Haneen Mohammed:
Continuous Prefetch for Interactive Data Applications. 2841-2843 - Shiva Jahangiri:
Re-evaluating the Performance Trade-offs for Hash-Based Multi-Join Queries. 2845-2847 - Xiaozhong Zhang:
Interactive View Recommendation. 2849-2851 - Meena Jagadeesan, Garrett Tanzer:
From Worst-Case to Average-Case Analysis: Accurate Latency Predictions for Key-Value Storage Engines. 2853-2855 - Kongzhang Hao, Longbin Lai:
Towards the Scheduling of Vertex-constrained Multi Subgraph Matching Query. 2857-2859 - William W. Ma:
Serverless Query Processing on a Budget. 2861-2863 - Noah Slavitch:
Workload-Aware Column Imprints. 2865-2867 - Justus Adam:
Towards Scalable UDTFs in Noria. 2869-2871 - Jia Shi:
Column Partition and Permutation for Run Length Encoding in Columnar Databases. 2873-2874 - Wanxin Li:
Supporting Database Constraints in Synthetic Data Generation based on Generative Adversarial Networks. 2875-2877 - Jacob Spiegel:
An Evaluation of Methods of Compressing Doubles. 2879-2881 - Neil Band:
MemFlow: Memory-Aware Distributed Deep Learning. 2883-2885 - Kunal Waghray:
JSON Schema Matching: Empirical Observations. 2887-2889
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.