default search action
Steven Euijong Whang
Person information
- affiliation: Korea Advanced Institute of Science and Technology (KAIST), Korea
- affiliation (former): Stanford University, CA, USA
Other persons with the same name
- Steven Whang 0002 — Langara College, Vancouver, Canada
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j24]Steven Euijong Whang:
Letter from the Special Issue Editor. IEEE Data Eng. Bull. 47(1): 2 (2024) - [j23]Ki Hyun Tae, Hantian Zhang, Jaeyoung Park, Kexin Rong, Steven Euijong Whang:
Falcon: Fair Active Learning using Multi-armed Bandits. Proc. VLDB Endow. 17(5): 952-965 (2024) - [c29]Minsu Kim, Seonghyeon Hwang, Steven Euijong Whang:
Quilt: Robust Data Segment Selection against Concept Drifts. AAAI 2024: 21249-21257 - [c28]Yuji Roh, Qingyun Liu, Huan Gui, Zhe Yuan, Yujin Tang, Steven Euijong Whang, Liang Liu, Shuchao Bi, Lichan Hong, Ed H. Chi, Zhe Zhao:
LEVI: Generalizable Fine-tuning via Layer-wise Ensemble of Different Views. ICML 2024 - [c27]Seonghyeon Hwang, Minsu Kim, Steven Euijong Whang:
RC-Mixup: A Data Augmentation Strategy against Noisy Data for Regression Tasks. KDD 2024: 1155-1165 - [i23]Ki Hyun Tae, Hantian Zhang, Jaeyoung Park, Kexin Rong, Steven Euijong Whang:
Falcon: Fair Active Learning using Multi-armed Bandits. CoRR abs/2401.12722 (2024) - [i22]Yuji Roh, Qingyun Liu, Huan Gui, Zhe Yuan, Yujin Tang, Steven Euijong Whang, Liang Liu, Shuchao Bi, Lichan Hong, Ed H. Chi, Zhe Zhao:
LEVI: Generalizable Fine-tuning via Layer-wise Ensemble of Different Views. CoRR abs/2402.04644 (2024) - [i21]Jio Oh, Soyeon Kim, Junseok Seo, Jindong Wang, Ruochen Xu, Xing Xie, Steven Euijong Whang:
ERBench: An Entity-Relationship based Automatically Verifiable Hallucination Benchmark for Large Language Models. CoRR abs/2403.05266 (2024) - [i20]Seonghyeon Hwang, Minsu Kim, Steven Euijong Whang:
RC-Mixup: A Data Augmentation Strategy against Noisy Data for Regression Tasks. CoRR abs/2405.17938 (2024) - [i19]Jaeyoung Park, Minsu Kim, Steven Euijong Whang:
Fair Class-Incremental Learning using Sample Weighting. CoRR abs/2410.01324 (2024) - [i18]Soyeon Kim, Yuji Roh, Geon Heo, Steven Euijong Whang:
PFGuard: A Generative Framework with Privacy and Fairness Safeguards. CoRR abs/2410.02246 (2024) - 2023
- [j22]Hantian Zhang, Ki Hyun Tae, Jaeyoung Park, Xu Chu, Steven Euijong Whang:
iFlipper: Label Flipping for Individual Fairness. Proc. ACM Manag. Data 1(1): 8:1-8:26 (2023) - [j21]Georgia Koutrika, Jun Yang, Manos Athanassoulis, Kostas Stefanidis, Ju Fan, Abdul Quamar, Yuanyan Tian, Alekh Jindal, Carsten Binnig, Jennie Rogers, Senjuti Basu Roy, Steven Euijong Whang, Matthias Boehm, Aaron J. Elmore, Vasilis Efthymiou, Xiao Hu, Xiaofang Zhou, Alan D. Fekete:
Front Matter. Proc. VLDB Endow. 16(12) (2023) - [j20]Yuji Roh, Weili Nie, De-An Huang, Steven Euijong Whang, Arash Vahdat, Anima Anandkumar:
Dr-Fairness: Dynamic Data Ratio Adjustment for Fair Training on Real and Generated Data. Trans. Mach. Learn. Res. 2023 (2023) - [j19]Steven Euijong Whang, Yuji Roh, Hwanjun Song, Jae-Gil Lee:
Data collection and quality challenges in deep learning: a data-centric AI perspective. VLDB J. 32(4): 791-813 (2023) - [c26]Hyunseung Hwang, Steven Euijong Whang:
XClusters: Explainability-First Clustering. AAAI 2023: 7962-7970 - [c25]Geon Heo, Steven Euijong Whang:
Redactor: A Data-Centric and Individualized Defense against Inference Attacks. AAAI 2023: 14874-14882 - [c24]Yuji Roh, Kangwook Lee, Steven Euijong Whang, Changho Suh:
Improving Fair Training under Correlation Shifts. ICML 2023: 29179-29209 - [i17]Yuji Roh, Kangwook Lee, Steven Euijong Whang, Changho Suh:
Improving Fair Training under Correlation Shifts. CoRR abs/2302.02323 (2023) - [i16]Geon Heo, Junseok Seo, Steven Euijong Whang:
Personalized DP-SGD using Sampling Mechanisms. CoRR abs/2305.15165 (2023) - [i15]Minsu Kim, Seonghyeon Hwang, Steven Euijong Whang:
Quilt: Robust Data Segment Selection against Concept Drifts. CoRR abs/2312.09691 (2023) - 2022
- [i14]Geon Heo, Steven Euijong Whang:
Redactor: Targeted Disinformation Generation using Probabilistic Decision Boundaries. CoRR abs/2202.02902 (2022) - [i13]Hantian Zhang, Ki Hyun Tae, Jaeyoung Park, Xu Chu, Steven Euijong Whang:
iFlipper: Label Flipping for Individual Fairness. CoRR abs/2209.07047 (2022) - [i12]Hyunseung Hwang, Steven Euijong Whang:
XClusters: Explainability-first Clustering. CoRR abs/2209.10956 (2022) - 2021
- [j18]Steven Euijong Whang, Ki Hyun Tae, Yuji Roh, Geon Heo:
Responsible AI Challenges in End-to-end Machine Learning. IEEE Data Eng. Bull. 44(1): 79-91 (2021) - [j17]Yuji Roh, Geon Heo, Steven Euijong Whang:
A Survey on Data Collection for Machine Learning: A Big Data - AI Integration Perspective. IEEE Trans. Knowl. Data Eng. 33(4): 1328-1347 (2021) - [c23]Yuji Roh, Kangwook Lee, Steven Euijong Whang, Changho Suh:
FairBatch: Batch Selection for Model Fairness. ICLR 2021 - [c22]Jae-Gil Lee, Yuji Roh, Hwanjun Song, Steven Euijong Whang:
Machine Learning Robustness, Fairness, and their Convergence. KDD 2021: 4046-4047 - [c21]Yuji Roh, Kangwook Lee, Steven Whang, Changho Suh:
Sample Selection for Fair and Robust Training. NeurIPS 2021: 815-827 - [c20]Ki Hyun Tae, Steven Euijong Whang:
Slice Tuner: A Selective Data Acquisition Framework for Accurate and Fair Machine Learning Models. SIGMOD Conference 2021: 1771-1783 - [e6]Matthias Boehm, Julia Stoyanovich, Steven Whang:
Proceedings of the Fifth Workshop on Data Management for End-To-End Machine Learning, In conjunction with the 2021 ACM SIGMOD/PODS Conference, DEEM@SIGMOD 2021, Virtual Event, China, 20 June, 2021. ACM 2021, ISBN 978-1-4503-8486-5 [contents] - [i11]Steven Euijong Whang, Ki Hyun Tae, Yuji Roh, Geon Heo:
Responsible AI Challenges in End-to-end Machine Learning. CoRR abs/2101.05967 (2021) - [i10]Seonghyeon Hwang, Steven Euijong Whang:
MixRL: Data Mixing Augmentation for Regression using Reinforcement Learning. CoRR abs/2106.03374 (2021) - [i9]Yuji Roh, Kangwook Lee, Steven Euijong Whang, Changho Suh:
Sample Selection for Fair and Robust Training. CoRR abs/2110.14222 (2021) - [i8]Steven Euijong Whang, Yuji Roh, Hwanjun Song, Jae-Gil Lee:
Data Collection and Quality Challenges in Deep Learning: A Data-Centric AI Perspective. CoRR abs/2112.06409 (2021) - 2020
- [j16]Steven Whang, Jae-Gil Lee:
Data Collection and Quality Challenges for Deep Learning. Proc. VLDB Endow. 13(12): 3429-3432 (2020) - [j15]Geon Heo, Yuji Roh, Seonghyeon Hwang, Dayun Lee, Steven Whang:
Inspector Gadget: A Data Programming-based Labeling System for Industrial Images. Proc. VLDB Endow. 14(1): 28-36 (2020) - [j14]Yeounoh Chung, Tim Kraska, Neoklis Polyzotis, Ki Hyun Tae, Steven Euijong Whang:
Automated Data Slicing for Model Validation: A Big Data - AI Integration Approach. IEEE Trans. Knowl. Data Eng. 32(12): 2284-2296 (2020) - [c19]Hyunseung Hwang, Steven Euijong Whang:
Open-World COVID-19 Data Visualization [Extended Abstract]. Poly/DMAH@VLDB 2020: 81-84 - [c18]Yuji Roh, Kangwook Lee, Steven Whang, Changho Suh:
FR-Train: A Mutual Information-Based Approach to Fair and Robust Training. ICML 2020: 8147-8157 - [e5]Yunmook Nah, Bin Cui, Sang-Won Lee, Jeffrey Xu Yu, Yang-Sae Moon, Steven Euijong Whang:
Database Systems for Advanced Applications - 25th International Conference, DASFAA 2020, Jeju, South Korea, September 24-27, 2020, Proceedings, Part I. Lecture Notes in Computer Science 12112, Springer 2020, ISBN 978-3-030-59409-1 [contents] - [e4]Yunmook Nah, Bin Cui, Sang-Won Lee, Jeffrey Xu Yu, Yang-Sae Moon, Steven Euijong Whang:
Database Systems for Advanced Applications - 25th International Conference, DASFAA 2020, Jeju, South Korea, September 24-27, 2020, Proceedings, Part II. Lecture Notes in Computer Science 12113, Springer 2020, ISBN 978-3-030-59415-2 [contents] - [e3]Yunmook Nah, Bin Cui, Sang-Won Lee, Jeffrey Xu Yu, Yang-Sae Moon, Steven Euijong Whang:
Database Systems for Advanced Applications - 25th International Conference, DASFAA 2020, Jeju, South Korea, September 24-27, 2020, Proceedings, Part III. Lecture Notes in Computer Science 12114, Springer 2020, ISBN 978-3-030-59418-3 [contents] - [e2]Yunmook Nah, Chulyun Kim, Seon Ho Kim, Yang-Sae Moon, Steven Euijong Whang:
Database Systems for Advanced Applications. DASFAA 2020 International Workshops - BDMS, SeCoP, BDQM, GDMA, and AIDE, Jeju, South Korea, September 24-27, 2020, Proceedings. Lecture Notes in Computer Science 12115, Springer 2020, ISBN 978-3-030-59412-1 [contents] - [e1]Sebastian Schelter, Steven Whang, Julia Stoyanovich:
Proceedings of the Fourth Workshop on Data Management for End-To-End Machine Learning, In conjunction with the 2020 ACM SIGMOD/PODS Conference, DEEM@SIGMOD 2020, Portland, OR, USA, June 14, 2020. ACM 2020, ISBN 978-1-4503-8023-2 [contents] - [i7]Yuji Roh, Kangwook Lee, Steven Euijong Whang, Changho Suh:
FR-Train: A mutual information-based approach to fair and robust training. CoRR abs/2002.10234 (2020) - [i6]Ki Hyun Tae, Steven Euijong Whang:
Slice Tuner: A Selective Data Collection Framework for Accurate and Fair Machine Learning Models. CoRR abs/2003.04549 (2020) - [i5]Geon Heo, Yuji Roh, Seonghyeon Hwang, Dayun Lee, Steven Euijong Whang:
Inspector Gadget: A Data Programming-based Labeling System for Industrial Images. CoRR abs/2004.03264 (2020) - [i4]Yuji Roh, Kangwook Lee, Steven Euijong Whang, Changho Suh:
FairBatch: Batch Selection for Model Fairness. CoRR abs/2012.01696 (2020)
2010 – 2019
- 2019
- [c17]Yeounoh Chung, Tim Kraska, Neoklis Polyzotis, Ki Hyun Tae, Steven Euijong Whang:
Slice Finder: Automated Data Slicing for Model Validation. ICDE 2019: 1550-1553 - [c16]Eric Breck, Neoklis Polyzotis, Sudip Roy, Steven Whang, Martin Zinkevich:
Data Validation for Machine Learning. SysML 2019 - [c15]Ki Hyun Tae, Yuji Roh, Young Hun Oh, Hyunsu Kim, Steven Euijong Whang:
Data Cleaning for Accurate, Fair, and Robust Models: A Big Data - AI Integration Approach. DEEM@SIGMOD 2019: 5:1-5:4 - [i3]Ki Hyun Tae, Yuji Roh, Young Hun Oh, Hyunsu Kim, Steven Euijong Whang:
Data Cleaning for Accurate, Fair, and Robust Models: A Big Data - AI Integration Approach. CoRR abs/1904.10761 (2019) - 2018
- [j13]Neoklis Polyzotis, Sudip Roy, Steven Euijong Whang, Martin Zinkevich:
Data Lifecycle Challenges in Production Machine Learning: A Survey. SIGMOD Rec. 47(2): 17-28 (2018) - [i2]Yeounoh Chung, Tim Kraska, Neoklis Polyzotis, Steven Euijong Whang:
Slice Finder: Automated Data Slicing for Model Validation. CoRR abs/1807.06068 (2018) - [i1]Yuji Roh, Geon Heo, Steven Euijong Whang:
A Survey on Data Collection for Machine Learning: a Big Data - AI Integration Perspective. CoRR abs/1811.03402 (2018) - 2017
- [c14]Denis Baylor, Eric Breck, Heng-Tze Cheng, Noah Fiedel, Chuan Yu Foo, Zakaria Haque, Salem Haykal, Mustafa Ispir, Vihan Jain, Levent Koc, Chiu Yuen Koo, Lukasz Lew, Clemens Mewald, Akshay Naresh Modi, Neoklis Polyzotis, Sukriti Ramesh, Sudip Roy, Steven Euijong Whang, Martin Wicke, Jarek Wilkiewicz, Xin Zhang, Martin Zinkevich:
TFX: A TensorFlow-Based Production-Scale Machine Learning Platform. KDD 2017: 1387-1395 - [c13]Neoklis Polyzotis, Sudip Roy, Steven Euijong Whang, Martin Zinkevich:
Data Management Challenges in Production Machine Learning. SIGMOD Conference 2017: 1723-1726 - 2016
- [j12]Alon Y. Halevy, Flip Korn, Natalya Fridman Noy, Christopher Olston, Neoklis Polyzotis, Sudip Roy, Steven Euijong Whang:
Managing Google's data lake: an overview of the Goods system. IEEE Data Eng. Bull. 39(3): 5-14 (2016) - [c12]Mina H. Farid, Ihab F. Ilyas, Steven Euijong Whang, Cong Yu:
LONLIES: Estimating Property Values for Long Tail Entities. SIGIR 2016: 1125-1128 - [c11]Alon Y. Halevy, Flip Korn, Natalya Fridman Noy, Christopher Olston, Neoklis Polyzotis, Sudip Roy, Steven Euijong Whang:
Goods: Organizing Google's Datasets. SIGMOD Conference 2016: 795-806 - [c10]Alon Y. Halevy, Natalya Fridman Noy, Sunita Sarawagi, Steven Euijong Whang, Xiao Yu:
Discovering Structure in the Universe of Attribute Names. WWW 2016: 939-949 - 2015
- [c9]Dana Movshovitz-Attias, Steven Euijong Whang, Natalya Fridman Noy, Alon Y. Halevy:
Discovering Subsumption Relationships for Web-Based Ontologies. WebDB 2015: 62-69 - 2014
- [j11]Rahul Gupta, Alon Y. Halevy, Xuezhi Wang, Steven Euijong Whang, Fei Wu:
Biperpedia: An Ontology for Search Applications. Proc. VLDB Endow. 7(7): 505-516 (2014) - [j10]Steven Euijong Whang, Hector Garcia-Molina:
Incremental entity resolution on rules and data. VLDB J. 23(1): 77-102 (2014) - [c8]Mohamed Yahya, Steven Whang, Rahul Gupta, Alon Y. Halevy:
ReNoun: Fact Extraction for Nominal Attributes. EMNLP 2014: 325-335 - 2013
- [j9]Steven Euijong Whang, Peter Lofgren, Hector Garcia-Molina:
Question Selection for Crowd Entity Resolution. Proc. VLDB Endow. 6(6): 349-360 (2013) - [j8]Steven Euijong Whang, David Marmaros, Hector Garcia-Molina:
Pay-As-You-Go Entity Resolution. IEEE Trans. Knowl. Data Eng. 25(5): 1111-1124 (2013) - [j7]Steven Euijong Whang, Hector Garcia-Molina:
Joint entity resolution on multiple datasets. VLDB J. 22(6): 773-795 (2013) - [c7]Steven Euijong Whang, Hector Garcia-Molina:
Disinformation techniques for entity resolution. CIKM 2013: 715-720 - 2012
- [b1]Steven Euijong Whang:
Data analytics: integration and privacy. Stanford University, USA, 2012 - [c6]Steven Euijong Whang, Hector Garcia-Molina:
Joint Entity Resolution. ICDE 2012: 294-305 - [c5]Steven Euijong Whang, Hector Garcia-Molina:
A Model for Quantifying Information Leakage. Secure Data Management 2012: 25-44 - 2011
- [j6]Steven Euijong Whang, Hector Garcia-Molina:
Developments in Generic Entity Resolution. IEEE Data Eng. Bull. 34(3): 51-59 (2011) - [c4]Steven Whang, Hector Garcia-Molina:
Managing Information Leakage. CIDR 2011: 79-84 - 2010
- [j5]David Menestrina, Steven Whang, Hector Garcia-Molina:
Evaluating Entity Resolution Results. Proc. VLDB Endow. 3(1): 208-219 (2010) - [j4]Steven Whang, Hector Garcia-Molina:
Entity Resolution with Evolving Rules. Proc. VLDB Endow. 3(1): 1326-1337 (2010)
2000 – 2009
- 2009
- [j3]Steven Whang, Chad Brower, Jayavel Shanmugasundaram, Sergei Vassilvitskii, Erik Vee, Ramana Yerneni, Hector Garcia-Molina:
Indexing Boolean Expressions. Proc. VLDB Endow. 2(1): 37-48 (2009) - [j2]Omar Benjelloun, Hector Garcia-Molina, David Menestrina, Qi Su, Steven Euijong Whang, Jennifer Widom:
Swoosh: a generic approach to entity resolution. VLDB J. 18(1): 255-276 (2009) - [j1]Steven Euijong Whang, Omar Benjelloun, Hector Garcia-Molina:
Generic entity resolution with negative rules. VLDB J. 18(6): 1261-1277 (2009) - [c3]Malú Castellanos, Ivo Jimenez, Neal Coddington, Hans Zeller, Steven Whang, Umeshwar Dayal:
QuickStart: An Upfront Client-Based Design Advisor for Parallel Data Warehouses. ICDE 2009: 1543-1546 - [c2]Steven Euijong Whang, David Menestrina, Georgia Koutrika, Martin Theobald, Hector Garcia-Molina:
Entity resolution with iterative blocking. SIGMOD Conference 2009: 219-232 - 2006
- [c1]Ki-Hoon Lee, Seoyoung Kim, Euijong Whang, Jae-Gil Lee:
A Practitioner's Approach to Normalizing XQuery Expressions. DASFAA 2006: 437-453
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-11-14 00:56 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint