![](https://dblp.uni-trier.de./img/logo.320x120.png)
![search dblp search dblp](https://dblp.uni-trier.de./img/search.dark.16x16.png)
![search dblp](https://dblp.uni-trier.de./img/search.dark.16x16.png)
default search action
Zhengxuan Wu
Person information
Refine list
![note](https://dblp.uni-trier.de./img/note-mark.dark.12x12.png)
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c25]Jing Huang, Zhengxuan Wu, Christopher Potts, Mor Geva, Atticus Geiger:
RAVEL: Evaluating Interpretability Methods on Disentangling Language Model Representations. ACL (1) 2024: 8669-8687 - [c24]Atticus Geiger, Zhengxuan Wu, Christopher Potts, Thomas Icard, Noah D. Goodman:
Finding Alignments Between Interpretable Causal Variables and Distributed Neural Representations. CLeaR 2024: 160-187 - [c23]Zhengxuan Wu, Yuhao Zhang, Peng Qi, Yumo Xu, Rujun Han, Yian Zhang, Jifan Chen, Bonan Min, Zhiheng Huang:
Dancing in Chains: Reconciling Instruction Following and Faithfulness in Language Models. EMNLP 2024: 3942-3965 - [c22]Shiqi Chen, Miao Xiong, Junteng Liu, Zhengxuan Wu, Teng Xiao, Siyang Gao, Junxian He:
In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation. ICML 2024 - [c21]Zhengxuan Wu, Atticus Geiger, Aryaman Arora, Jing Huang
, Zheng Wang, Noah D. Goodman, Christopher D. Manning, Christopher Potts:
pyvene: A Library for Understanding and Improving PyTorch Models via Interventions. NAACL (Demonstrations) 2024: 158-165 - [i32]Zhengxuan Wu, Atticus Geiger, Jing Huang, Aryaman Arora, Thomas Icard, Christopher Potts, Noah D. Goodman:
A Reply to Makelov et al. (2023)'s "Interpretability Illusion" Arguments. CoRR abs/2401.12631 (2024) - [i31]Jing Huang, Zhengxuan Wu, Christopher Potts, Mor Geva, Atticus Geiger:
RAVEL: Evaluating Interpretability Methods on Disentangling Language Model Representations. CoRR abs/2402.17700 (2024) - [i30]Shiqi Chen, Miao Xiong, Junteng Liu, Zhengxuan Wu, Teng Xiao, Siyang Gao, Junxian He:
In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation. CoRR abs/2403.01548 (2024) - [i29]Zhengxuan Wu, Atticus Geiger, Aryaman Arora, Jing Huang, Zheng Wang, Noah D. Goodman, Christopher D. Manning, Christopher Potts:
pyvene: A Library for Understanding and Improving PyTorch Models via Interventions. CoRR abs/2403.07809 (2024) - [i28]Weixin Liang, Yaohui Zhang, Zhengxuan Wu, Haley Lepp, Wenlong Ji, Xuandong Zhao, Hancheng Cao, Sheng Liu, Siyu He, Zhi Huang, Diyi Yang, Christopher Potts, Christopher D. Manning, James Y. Zou:
Mapping the Increasing Use of LLMs in Scientific Papers. CoRR abs/2404.01268 (2024) - [i27]Zhengxuan Wu, Aryaman Arora, Zheng Wang, Atticus Geiger, Dan Jurafsky, Christopher D. Manning, Christopher Potts:
ReFT: Representation Finetuning for Language Models. CoRR abs/2404.03592 (2024) - [i26]Zhengxuan Wu, Yuhao Zhang, Peng Qi, Yumo Xu, Rujun Han, Yian Zhang, Jifan Chen, Bonan Min, Zhiheng Huang:
Dancing in Chains: Reconciling Instruction Following and Faithfulness in Language Models. CoRR abs/2407.21417 (2024) - 2023
- [j4]Zhengxuan Wu, Christopher D. Manning
, Christopher Potts:
ReCOGS: How Incidental Details of a Logical Form Overshadow an Evaluation of Semantic Interpretation. Trans. Assoc. Comput. Linguistics 11: 1719-1733 (2023) - [c20]Jing Huang
, Zhengxuan Wu, Kyle Mahowald, Christopher Potts:
Inducing Character-level Structure in Subword-based Language Models with Type-level Interchange Intervention Training. ACL (Findings) 2023: 12163-12180 - [c19]Jing Huang
, Atticus Geiger, Karel D'Oosterlinck, Zhengxuan Wu, Christopher Potts:
Rigorously Assessing Natural Language Explanations of Neurons. BlackboxNLP@EMNLP 2023: 317-331 - [c18]Zhengxuan Wu, Alex Tamkin, Isabel Papadimitriou:
Oolong: Investigating What Makes Transfer Learning Hard with Controlled Studies. EMNLP 2023: 3280-3289 - [c17]Zexuan Zhong, Zhengxuan Wu, Christopher D. Manning, Christopher Potts, Danqi Chen:
MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions. EMNLP 2023: 15686-15702 - [c16]Zhengxuan Wu, Karel D'Oosterlinck, Atticus Geiger, Amir Zur, Christopher Potts:
Causal Proxy Models for Concept-based Model Explanations. ICML 2023: 37313-37334 - [c15]Zhengxuan Wu, Atticus Geiger, Thomas Icard, Christopher Potts, Noah D. Goodman:
Interpretability at Scale: Identifying Causal Mechanisms in Alpaca. NeurIPS 2023 - [i25]Atticus Geiger, Zhengxuan Wu, Christopher Potts, Thomas Icard, Noah D. Goodman:
Finding Alignments Between Interpretable Causal Variables and Distributed Neural Representations. CoRR abs/2303.02536 (2023) - [i24]Zhengxuan Wu, Christopher D. Manning, Christopher Potts:
ReCOGS: How Incidental Details of a Logical Form Overshadow an Evaluation of Semantic Interpretation. CoRR abs/2303.13716 (2023) - [i23]Zhengxuan Wu, Atticus Geiger, Christopher Potts, Noah D. Goodman:
Interpretability at Scale: Identifying Causal Mechanisms in Alpaca. CoRR abs/2305.08809 (2023) - [i22]Zexuan Zhong, Zhengxuan Wu, Christopher D. Manning, Christopher Potts, Danqi Chen:
MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions. CoRR abs/2305.14795 (2023) - [i21]Jing Huang, Atticus Geiger, Karel D'Oosterlinck, Zhengxuan Wu, Christopher Potts:
Rigorously Assessing Natural Language Explanations of Neurons. CoRR abs/2309.10312 (2023) - 2022
- [c14]Atticus Geiger, Zhengxuan Wu, Hanson Lu, Josh Rozner, Elisa Kreiss, Thomas Icard, Noah D. Goodman, Christopher Potts:
Inducing Causal Structure for Interpretable Neural Networks. ICML 2022: 7324-7338 - [c13]Zhengxuan Wu, Atticus Geiger, Joshua Rozner, Elisa Kreiss, Hanson Lu, Thomas Icard, Christopher Potts, Noah D. Goodman:
Causal Distillation for Language Models. NAACL-HLT 2022: 4288-4295 - [c12]Eldar David Abraham, Karel D'Oosterlinck, Amir Feder, Yair Ori Gat, Atticus Geiger, Christopher Potts, Roi Reichart, Zhengxuan Wu:
CEBaB: Estimating the Causal Effects of Real-World Concepts on NLP Model Behavior. NeurIPS 2022 - [c11]Tailin Wu, Megan Tjandrasuwita, Zhengxuan Wu, Xuelin Yang, Kevin Liu, Rok Sosic, Jure Leskovec:
ZeroC: A Neuro-Symbolic Model for Zero-shot Concept Recognition and Acquisition at Inference Time. NeurIPS 2022 - [c10]Zhengxuan Wu, Nelson F. Liu, Christopher Potts:
Identifying the Limits of Cross-Domain Knowledge Transfer for Pretrained Models. RepL4NLP@ACL 2022: 100-110 - [i20]Zhengxuan Wu, Isabel Papadimitriou, Alex Tamkin:
Oolong: Investigating What Makes Crosslingual Transfer Hard with Controlled Studies. CoRR abs/2202.12312 (2022) - [i19]Eldar David Abraham, Karel D'Oosterlinck, Amir Feder, Yair Ori Gat, Atticus Geiger, Christopher Potts, Roi Reichart, Zhengxuan Wu:
CEBaB: Estimating the Causal Effects of Real-World Concepts on NLP Model Behavior. CoRR abs/2205.14140 (2022) - [i18]Tailin Wu, Megan Tjandrasuwita
, Zhengxuan Wu, Xuelin Yang, Kevin Liu, Rok Sosic, Jure Leskovec:
ZeroC: A Neuro-Symbolic Model for Zero-shot Concept Recognition and Acquisition at Inference Time. CoRR abs/2206.15049 (2022) - [i17]Zhengxuan Wu, Karel D'Oosterlinck, Atticus Geiger, Amir Zur, Christopher Potts:
Causal Proxy Models for Concept-Based Model Explanations. CoRR abs/2209.14279 (2022) - [i16]Jing Huang, Zhengxuan Wu, Kyle Mahowald, Christopher Potts:
Inducing Character-level Structure in Subword-based Language Models with Type-level Interchange Intervention Training. CoRR abs/2212.09897 (2022) - 2021
- [j3]Thanh-Son Nguyen
, Zhengxuan Wu
, Desmond C. Ong
:
Attention uncovers task-relevant semantics in emotional narrative understanding. Knowl. Based Syst. 226: 107162 (2021) - [j2]Desmond C. Ong
, Zhengxuan Wu, Zhi-Xuan Tan
, Marianne Reddan
, Isabella Kahhale
, Alison Mattek, Jamil Zaki:
Modeling Emotion in Complex Stories: The Stanford Emotional Narratives Dataset. IEEE Trans. Affect. Comput. 12(3): 579-594 (2021) - [c9]Zhengxuan Wu, Desmond C. Ong:
Context-Guided BERT for Targeted Aspect-Based Sentiment Analysis. AAAI 2021: 14094-14102 - [c8]Christopher Potts, Zhengxuan Wu, Atticus Geiger, Douwe Kiela:
DynaSent: A Dynamic Benchmark for Sentiment Analysis. ACL/IJCNLP (1) 2021: 2388-2404 - [c7]Geza Kovacs, Zhengxuan Wu, Michael S. Bernstein:
Not Now, Ask Later: Users Weaken Their Behavior Change Regimen Over Time, But Expect To Re-Strengthen It Imminently. CHI 2021: 229:1-229:14 - [c6]Douwe Kiela, Max Bartolo, Yixin Nie, Divyansh Kaushik, Atticus Geiger, Zhengxuan Wu, Bertie Vidgen, Grusha Prasad, Amanpreet Singh, Pratik Ringshia, Zhiyi Ma, Tristan Thrush, Sebastian Riedel, Zeerak Waseem
, Pontus Stenetorp, Robin Jia
, Mohit Bansal, Christopher Potts, Adina Williams
:
Dynabench: Rethinking Benchmarking in NLP. NAACL-HLT 2021: 4110-4124 - [c5]Zhengxuan Wu, Elisa Kreiss, Desmond C. Ong, Christopher Potts:
ReaSCAN: Compositional Reasoning in Language Grounding. NeurIPS Datasets and Benchmarks 2021 - [i15]Zhengxuan Wu, Desmond C. Ong:
On Explaining Your Explanations of BERT: An Empirical Study with Sequence Classification. CoRR abs/2101.00196 (2021) - [i14]Geza Kovacs, Zhengxuan Wu, Michael S. Bernstein:
Not Now, Ask Later: Users Weaken Their Behavior Change Regimen Over Time, But Expect To Re-Strengthen It Imminently. CoRR abs/2101.11743 (2021) - [i13]Zhengxuan Wu, Nelson F. Liu, Christopher Potts:
Identifying the Limits of Cross-Domain Knowledge Transfer for Pretrained Models. CoRR abs/2104.08410 (2021) - [i12]Douwe Kiela, Max Bartolo, Yixin Nie, Divyansh Kaushik, Atticus Geiger, Zhengxuan Wu, Bertie Vidgen, Grusha Prasad, Amanpreet Singh, Pratik Ringshia, Zhiyi Ma, Tristan Thrush, Sebastian Riedel, Zeerak Waseem, Pontus Stenetorp, Robin Jia, Mohit Bansal, Christopher Potts, Adina Williams:
Dynabench: Rethinking Benchmarking in NLP. CoRR abs/2104.14337 (2021) - [i11]Zhengxuan Wu, Elisa Kreiss, Desmond C. Ong, Christopher Potts:
ReaSCAN: Compositional Reasoning in Language Grounding. CoRR abs/2109.08994 (2021) - [i10]Atticus Geiger, Zhengxuan Wu, Hanson Lu, Josh Rozner, Elisa Kreiss, Thomas Icard, Noah D. Goodman, Christopher Potts:
Inducing Causal Structure for Interpretable Neural Networks. CoRR abs/2112.00826 (2021) - [i9]Zhengxuan Wu, Atticus Geiger, Josh Rozner, Elisa Kreiss, Hanson Lu, Thomas Icard, Christopher Potts, Noah D. Goodman:
Causal Distillation for Language Models. CoRR abs/2112.02505 (2021) - 2020
- [c4]Zhengxuan Wu, Thanh-Son Nguyen, Desmond C. Ong:
Structured Self-AttentionWeights Encode Semantics in Sentiment Analysis. BlackboxNLP@EMNLP 2020: 255-264 - [i8]Zhengxuan Wu, Desmond C. Ong:
Pragmatically Informative Color Generation by Grounding Contextual Modifiers. CoRR abs/2010.04372 (2020) - [i7]Zhengxuan Wu, Thanh-Son Nguyen, Desmond C. Ong:
Structured Self-Attention Weights Encode Semantics in Sentiment Analysis. CoRR abs/2010.04922 (2020) - [i6]Zhengxuan Wu, Desmond C. Ong:
Context-Guided BERT for Targeted Aspect-Based Sentiment Analysis. CoRR abs/2010.07523 (2020) - [i5]Christopher Potts, Zhengxuan Wu, Atticus Geiger, Douwe Kiela:
DynaSent: A Dynamic Benchmark for Sentiment Analysis. CoRR abs/2012.15349 (2020)
2010 – 2019
- 2019
- [c3]Zhengxuan Wu, Xiyu Zhang, Zhi-Xuan Tan, Jamil Zaki, Desmond C. Ong
:
Attending to Emotional Narratives. ACII 2019: 648-654 - [c2]Geza Kovacs, Drew Mylander Gregory, Zilin Ma, Zhengxuan Wu, Golrokh Emami, Jacob Ray, Michael S. Bernstein
:
Conservation of Procrastination: Do Productivity Interventions Save Time Or Just Redistribute It? CHI 2019: 330 - [c1]Zhengxuan Wu
, Yueyi Jiang
:
Disentangling Latent Emotions of Word Embeddings on Complex Emotional Narratives. NLPCC (2) 2019: 587-595 - [i4]Zhengxuan Wu, Jason Luo, Xiyu Zhang:
Uncovering Political Promotion in China: A Network Analysis of Patronage Relationship in Autocracy. CoRR abs/1902.00625 (2019) - [i3]Zhengxuan Wu, Xiyu Zhang, Zhi-Xuan Tan, Jamil Zaki, Desmond C. Ong:
Attending to Emotional Narratives. CoRR abs/1907.04197 (2019) - [i2]Zhengxuan Wu, Yueyi Jiang:
Disentangling Latent Emotions of Word Embeddings on Complex Emotional Narratives. CoRR abs/1908.07817 (2019) - [i1]Desmond C. Ong, Zhengxuan Wu, Zhi-Xuan Tan, Marianne Reddan, Isabella Kahhale, Alison Mattek, Jamil Zaki:
Modeling emotion in complex stories: the Stanford Emotional Narratives Dataset. CoRR abs/1912.05008 (2019) - 2018
- [j1]Geza Kovacs, Zhengxuan Wu, Michael S. Bernstein:
Rotating Online Behavior Change Interventions Increases Effectiveness But Also Increases Attrition. Proc. ACM Hum. Comput. Interact. 2(CSCW): 95:1-95:25 (2018)
Coauthor Index
![](https://dblp.uni-trier.de./img/cog.dark.24x24.png)
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from ,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2025-01-21 00:23 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint