


Samuel R. Bowman
Person information
- affiliation: New York University, Department of Linguistics, USA
- affiliation (former): Stanford University, Department of Linguistics
2020 – today
- 2025
- [i105]Mrinank Sharma, Meg Tong, Jesse Mu, Jerry Wei, Jorrit Kruthoff, Scott Goodfriend, Euan Ong, Alwin Peng, Raj Agarwal, Cem Anil, Amanda Askell, Nathan Bailey, Joe Benton, Emma Bluemke, Samuel R. Bowman, Eric Christiansen, Hoagy Cunningham, Andy Dau, Anjali Gopal, Rob Gilson, Logan Graham, Logan Howard, Nimit Kalra, Taesung Lee, Kevin Lin, Peter Lofgren, Francesco Mosconi, Clare O'Hara, Catherine Olsson, Linda Petrini, Samir Rajani, Nikhil Saxena, Alex Silverstein, Tanya Singh, Theodore R. Sumers, Leonard Tang, Kevin K. Troy, Constantin Weisser, Ruiqi Zhong, Giulio Zhou, Jan Leike, Jared Kaplan, Ethan Perez:
Constitutional Classifiers: Defending against Universal Jailbreaks across Thousands of Hours of Red Teaming. CoRR abs/2501.18837 (2025) - 2024
- [j9]Angelica Chen, Jason Phang, Alicia Parrish, Vishakh Padmakumar, Chen Zhao, Samuel R. Bowman, Kyunghyun Cho:
Two Failures of Self-Consistency in the Multi-Step Reasoning of LLMs. Trans. Mach. Learn. Res. 2024 (2024) - [j8]Angelica Chen, Jérémy Scheurer, Jon Ander Campos, Tomasz Korbak, Jun Shern Chan, Samuel R. Bowman, Kyunghyun Cho, Ethan Perez:
Learning from Natural Language Feedback. Trans. Mach. Learn. Res. 2024 (2024) - [c77]Mrinank Sharma, Meg Tong, Tomasz Korbak, David Duvenaud, Amanda Askell, Samuel R. Bowman, Esin Durmus, Zac Hatfield-Dodds, Scott R. Johnston, Shauna Kravec, Timothy Maxwell, Sam McCandlish, Kamal Ndousse, Oliver Rausch, Nicholas Schiefer, Da Yan, Miranda Zhang, Ethan Perez:
Towards Understanding Sycophancy in Language Models. ICLR 2024 - [c76]Akbir Khan, John Hughes, Dan Valentine, Laura Ruis, Kshitij Sachan, Ansh Radhakrishnan, Edward Grefenstette, Samuel R. Bowman, Tim Rocktäschel, Ethan Perez:
Debating with More Persuasive LLMs Leads to More Truthful Answers. ICML 2024 - [c75]Cem Anil, Esin Durmus, Nina Panickssery, Mrinank Sharma, Joe Benton, Sandipan Kundu, Joshua Batson, Meg Tong, Jesse Mu, Daniel Ford, Francesco Mosconi, Rajashree Agrawal, Rylan Schaeffer, Naomi Bashkansky, Samuel Svenningsen, Mike Lambert, Ansh Radhakrishnan, Carson Denison, Evan Hubinger, Yuntao Bai, Trenton Bricken, Timothy Maxwell, Nicholas Schiefer, James Sully, Alex Tamkin, Tamera Lanham, Karina Nguyen, Tomek Korbak, Jared Kaplan, Deep Ganguli, Samuel R. Bowman, Ethan Perez, Roger B. Grosse, David Kristjanson Duvenaud:
Many-shot Jailbreaking. NeurIPS 2024 - [c74]Arjun Panickssery, Samuel R. Bowman, Shi Feng:
LLM Evaluators Recognize and Favor Their Own Generations. NeurIPS 2024 - [i104]Evan Hubinger, Carson Denison, Jesse Mu, Mike Lambert, Meg Tong, Monte MacDiarmid, Tamera Lanham, Daniel M. Ziegler, Tim Maxwell, Newton Cheng, Adam S. Jermyn, Amanda Askell, Ansh Radhakrishnan, Cem Anil, David Duvenaud, Deep Ganguli, Fazl Barez, Jack Clark, Kamal Ndousse, Kshitij Sachan, Michael Sellitto, Mrinank Sharma, Nova DasSarma, Roger Grosse, Shauna Kravec, Yuntao Bai, Zachary Witten, Marina Favaro, Jan Brauner, Holden Karnofsky, Paul F. Christiano, Samuel R. Bowman, Logan Graham, Jared Kaplan, Sören Mindermann, Ryan Greenblatt, Buck Shlegeris, Nicholas Schiefer, Ethan Perez:
Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training. CoRR abs/2401.05566 (2024) - [i103]Akbir Khan, John Hughes, Dan Valentine, Laura Ruis, Kshitij Sachan, Ansh Radhakrishnan, Edward Grefenstette, Samuel R. Bowman, Tim Rocktäschel, Ethan Perez:
Debating with More Persuasive LLMs Leads to More Truthful Answers. CoRR abs/2402.06782 (2024) - [i102]James Chua, Edward Rees, Hunar Batra, Samuel R. Bowman, Julian Michael, Ethan Perez, Miles Turpin:
Bias-Augmented Consistency Training Reduces Biased Reasoning in Chain-of-Thought. CoRR abs/2403.05518 (2024) - [i101]Arjun Panickssery, Samuel R. Bowman, Shi Feng:
LLM Evaluators Recognize and Favor Their Own Generations. CoRR abs/2404.13076 (2024) - [i100]Jacob Pfau, William Merrill, Samuel R. Bowman:
Let's Think Dot by Dot: Hidden Computation in Transformer Language Models. CoRR abs/2404.15758 (2024) - [i99]Carson Denison, Monte MacDiarmid, Fazl Barez, David Duvenaud, Shauna Kravec, Samuel Marks, Nicholas Schiefer, Ryan Soklaski, Alex Tamkin, Jared Kaplan, Buck Shlegeris, Samuel R. Bowman, Ethan Perez, Evan Hubinger:
Sycophancy to Subterfuge: Investigating Reward-Tampering in Large Language Models. CoRR abs/2406.10162 (2024) - [i98]Asa Cooper Stickland, Alexander Lyzhov, Jacob Pfau, Salsabila Mahdi, Samuel R. Bowman:
Steering Without Side Effects: Improving Post-Deployment Control of Language Models. CoRR abs/2406.15518 (2024) - [i97]Sara Price, Arjun Panickssery, Samuel R. Bowman, Asa Cooper Stickland:
Future Events as Backdoor Triggers: Investigating Temporal Vulnerabilities in LLMs. CoRR abs/2407.04108 (2024) - [i96]Jane Pan, He He, Samuel R. Bowman, Shi Feng:
Spontaneous Reward Hacking in Iterative Self-Refinement. CoRR abs/2407.04549 (2024) - [i95]Jiaxin Wen, Ruiqi Zhong, Akbir Khan, Ethan Perez, Jacob Steinhardt, Minlie Huang, Samuel R. Bowman, He He, Shi Feng:
Language Models Learn to Mislead Humans via RLHF. CoRR abs/2409.12822 (2024) - [i94]Joe Benton, Misha Wagner, Eric Christiansen, Cem Anil, Ethan Perez, Jai Srivastav, Esin Durmus, Deep Ganguli, Shauna Kravec, Buck Shlegeris, Jared Kaplan, Holden Karnofsky, Evan Hubinger, Roger Grosse, Samuel R. Bowman, David Duvenaud:
Sabotage Evaluations for Frontier Models. CoRR abs/2410.21514 (2024) - [i93]Ryan Greenblatt, Carson Denison, Benjamin Wright, Fabien Roger, Monte MacDiarmid, Samuel Marks, Johannes Treutlein, Tim Belonax, Jack Chen, David Duvenaud, Akbir Khan, Julian Michael, Sören Mindermann, Ethan Perez, Linda Petrini, Jonathan Uesato, Jared Kaplan, Buck Shlegeris, Samuel R. Bowman, Evan Hubinger:
Alignment faking in large language models. CoRR abs/2412.14093 (2024) - 2023
- [j7]Ian R. McKenzie, Alexander Lyzhov, Michael Pieler, Alicia Parrish, Aaron Mueller, Ameya Prabhu, Euan McLean, Aaron Kirtland, Alexis Ross, Alisa Liu, Andrew Gritsevskiy, Daniel Wurgaft, Derik Kauffman, Gabriel Recchia, Jiacheng Liu, Joe Cavanagh, Max Weiss, Sicong Huang, The Floating Droid, Tom Tseng, Tomasz Korbak, Xudong Shen, Yuhui Zhang, Zhengping Zhou, Najoung Kim, Samuel R. Bowman, Ethan Perez:
Inverse Scaling: When Bigger Isn't Better. Trans. Mach. Learn. Res. 2023 (2023) - [j6]Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza, Ambrose Slone, Ameet Rahane, Anantharaman S. Iyer, Anders Andreassen, Andrea Madotto, Andrea Santilli, Andreas Stuhlmüller, Andrew M. Dai, Andrew La, Andrew K. Lampinen, Andy Zou, Angela Jiang, Angelica Chen, Anh Vuong, Animesh Gupta, Anna Gottardi, Antonio Norelli, Anu Venkatesh, Arash Gholamidavoodi, Arfa Tabassum, Arul Menezes, Arun Kirubarajan, Asher Mullokandov, Ashish Sabharwal, Austin Herrick, Avia Efrat, Aykut Erdem, Ayla Karakas, B. Ryan Roberts, Bao Sheng Loe, Barret Zoph, Bartlomiej Bojanowski, Batuhan Özyurt, Behnam Hedayatnia, Behnam Neyshabur, Benjamin Inden, Benno Stein, Berk Ekmekci, Bill Yuchen Lin, Blake Howald, Bryan Orinion, Cameron Diao, Cameron Dour, Catherine Stinson, Cedrick Argueta, Cèsar Ferri Ramírez, Chandan Singh, Charles Rathkopf, Chenlin Meng, Chitta Baral, Chiyu Wu, Chris Callison-Burch, Chris Waites, Christian Voigt, Christopher D. Manning, Christopher Potts, Cindy Ramirez, Clara E. 
Rivera, Clemencia Siro, Colin Raffel, Courtney Ashcraft, Cristina Garbacea, Damien Sileo, Dan Garrette, Dan Hendrycks, Dan Kilman, Dan Roth, Daniel Freeman, Daniel Khashabi, Daniel Levy, Daniel Moseguí González, Danielle Perszyk, Danny Hernandez, Danqi Chen, Daphne Ippolito, Dar Gilboa, David Dohan, David Drakard, David Jurgens, Debajyoti Datta, Deep Ganguli, Denis Emelin, Denis Kleyko, Deniz Yuret, Derek Chen, Derek Tam, Dieuwke Hupkes, Diganta Misra, Dilyar Buzan, Dimitri Coelho Mollo, Diyi Yang, Dong-Ho Lee, Dylan Schrader, Ekaterina Shutova, Ekin Dogus Cubuk, Elad Segal, Eleanor Hagerman, Elizabeth Barnes, Elizabeth Donoway, Ellie Pavlick, Emanuele Rodolà, Emma Lam, Eric Chu, Eric Tang, Erkut Erdem, Ernie Chang, Ethan A. Chi, Ethan Dyer, Ethan J. Jerzak, Ethan Kim, Eunice Engefu Manyasi, Evgenii Zheltonozhskii, Fanyue Xia, Fatemeh Siar, Fernando Martínez-Plumed, Francesca Happé, François Chollet, Frieda Rong, Gaurav Mishra, Genta Indra Winata, Gerard de Melo, Germán Kruszewski, Giambattista Parascandolo, Giorgio Mariani, Gloria Wang, Gonzalo Jaimovitch-López, Gregor Betz, Guy Gur-Ari, Hana Galijasevic, Hannah Kim, Hannah Rashkin, Hannaneh Hajishirzi, Harsh Mehta, Hayden Bogar, Henry Shevlin, Hinrich Schütze, Hiromu Yakura, Hongming Zhang, Hugh Mee Wong, Ian Ng, Isaac Noble, Jaap Jumelet, Jack Geissinger, Jackson Kernion, Jacob Hilton, Jaehoon Lee, Jaime Fernández Fisac, James B. Simon, James Koppel, James Zheng, James Zou, Jan Kocon, Jana Thompson, Janelle Wingfield, Jared Kaplan, Jarema Radom, Jascha Sohl-Dickstein, Jason Phang, Jason Wei, Jason Yosinski, Jekaterina Novikova, Jelle Bosscher, Jennifer Marsh, Jeremy Kim, Jeroen Taal, Jesse H. Engel, Jesujoba Alabi, Jiacheng Xu, Jiaming Song, Jillian Tang, Joan Waweru, John Burden, John Miller, John U. Balis, Jonathan Batchelder, Jonathan Berant, Jörg Frohberg, Jos Rozen, José Hernández-Orallo, Joseph Boudeman, Joseph Guerr, Joseph Jones, Joshua B. Tenenbaum, Joshua S. 
Rule, Joyce Chua, Kamil Kanclerz, Karen Livescu, Karl Krauth, Karthik Gopalakrishnan, Katerina Ignatyeva, Katja Markert, Kaustubh D. Dhole, Kevin Gimpel, Kevin Omondi, Kory W. Mathewson, Kristen Chiafullo, Ksenia Shkaruta, Kumar Shridhar, Kyle McDonell, Kyle Richardson, Laria Reynolds, Leo Gao, Li Zhang, Liam Dugan, Lianhui Qin, Lidia Contreras Ochando, Louis-Philippe Morency, Luca Moschella, Lucas Lam, Lucy Noble, Ludwig Schmidt, Luheng He, Luis Oliveros Colón, Luke Metz, Lütfi Kerem Senel, Maarten Bosma, Maarten Sap, Maartje ter Hoeve, Maheen Farooqi, Manaal Faruqui, Mantas Mazeika, Marco Baturan, Marco Marelli, Marco Maru, María José Ramírez-Quintana, Marie Tolkiehn, Mario Giulianelli, Martha Lewis, Martin Potthast, Matthew L. Leavitt, Matthias Hagen, Mátyás Schubert, Medina Baitemirova, Melody Arnaud, Melvin McElrath, Michael A. Yee, Michael Cohen, Michael Gu, Michael I. Ivanitskiy, Michael Starritt, Michael Strube, Michal Swedrowski, Michele Bevilacqua, Michihiro Yasunaga, Mihir Kale, Mike Cain, Mimee Xu, Mirac Suzgun, Mitch Walker, Mo Tiwari, Mohit Bansal, Moin Aminnaseri, Mor Geva, Mozhdeh Gheini, Mukund Varma T., Nanyun Peng, Nathan A. Chi, Nayeon Lee, Neta Gur-Ari Krakover, Nicholas Cameron, Nicholas Roberts, Nick Doiron, Nicole Martinez, Nikita Nangia, Niklas Deckers, Niklas Muennighoff, Nitish Shirish Keskar, Niveditha Iyer, Noah Constant, Noah Fiedel, Nuan Wen, Oliver Zhang, Omar Agha, Omar Elbaghdadi, Omer Levy, Owain Evans, Pablo Antonio Moreno Casares, Parth Doshi, Pascale Fung, Paul Pu Liang, Paul Vicol, Pegah Alipoormolabashi, Peiyuan Liao, Percy Liang, Peter Chang, Peter Eckersley, Phu Mon Htut, Pinyu Hwang, Piotr Milkowski, Piyush Patil, Pouya Pezeshkpour, Priti Oli, Qiaozhu Mei, Qing Lyu, Qinlang Chen, Rabin Banjade, Rachel Etta Rudolph, Raefer Gabriel, Rahel Habacker, Ramon Risco, Raphaël Millière, Rhythm Garg, Richard Barnes, Rif A. 
Saurous, Riku Arakawa, Robbe Raymaekers, Robert Frank, Rohan Sikand, Roman Novak, Roman Sitelew, Ronan LeBras, Rosanne Liu, Rowan Jacobs, Rui Zhang, Ruslan Salakhutdinov, Ryan Chi, Ryan Lee, Ryan Stovall, Ryan Teehan, Rylan Yang, Sahib Singh, Saif M. Mohammad, Sajant Anand, Sam Dillavou, Sam Shleifer, Sam Wiseman, Samuel Gruetter, Samuel R. Bowman, Samuel S. Schoenholz, Sanghyun Han, Sanjeev Kwatra, Sarah A. Rous, Sarik Ghazarian, Sayan Ghosh, Sean Casey, Sebastian Bischoff, Sebastian Gehrmann, Sebastian Schuster, Sepideh Sadeghi, Shadi Hamdan, Sharon Zhou, Shashank Srivastava, Sherry Shi, Shikhar Singh, Shima Asaadi, Shixiang Shane Gu, Shubh Pachchigar, Shubham Toshniwal, Shyam Upadhyay, Shyamolima (Shammie) Debnath, Siamak Shakeri, Simon Thormeyer, Simone Melzi, Siva Reddy, Sneha Priscilla Makini, Soo-Hwan Lee, Spencer Torene, Sriharsha Hatwar, Stanislas Dehaene, Stefan Divic, Stefano Ermon, Stella Biderman, Stephanie Lin, Stephen Prasad, Steven T. Piantadosi, Stuart M. Shieber, Summer Misherghi, Svetlana Kiritchenko, Swaroop Mishra, Tal Linzen, Tal Schuster, Tao Li, Tao Yu, Tariq Ali, Tatsu Hashimoto, Te-Lin Wu, Théo Desbordes, Theodore Rothschild, Thomas Phan, Tianle Wang, Tiberius Nkinyili, Timo Schick, Timofei Kornev, Titus Tunduny, Tobias Gerstenberg, Trenton Chang, Trishala Neeraj, Tushar Khot, Tyler Shultz, Uri Shaham, Vedant Misra, Vera Demberg, Victoria Nyamai, Vikas Raunak, Vinay V. Ramasesh, Vinay Uday Prabhu, Vishakh Padmakumar, Vivek Srikumar, William Fedus, William Saunders, William Zhang, Wout Vossen, Xiang Ren, Xiaoyu Tong, Xinran Zhao, Xinyi Wu, Xudong Shen, Yadollah Yaghoobzadeh, Yair Lakretz, Yangqiu Song, Yasaman Bahri, Yejin Choi, Yichi Yang, Yiding Hao, Yifu Chen, Yonatan Belinkov, Yu Hou, Yufang Hou, Yuntao Bai, Zachary Seid, Zhuoye Zhao, Zijian Wang, Zijie J. Wang, Zirui Wang, Ziyi Wu:
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models. Trans. Mach. Learn. Res. 2023 (2023) - [c73]Jingyuan Selena She, Christopher Potts, Samuel R. Bowman, Atticus Geiger:
ScoNe: Benchmarking Negation Reasoning in Language Models With Fine-Tuning and In-Context Learning. ACL (2) 2023: 1803-1821 - [c72]Or Honovich, Uri Shaham, Samuel R. Bowman, Omer Levy:
Instruction Induction: From Few Examples to Natural Language Task Descriptions. ACL (1) 2023: 1935-1952 - [c71]Najoung Kim, Phu Mon Htut, Samuel R. Bowman, Jackson Petty:
(QA)²: Question Answering with Questionable Assumptions. ACL (1) 2023: 8466-8487 - [c70]Ethan Perez, Sam Ringer, Kamile Lukosiute, Karina Nguyen, Edwin Chen, Scott Heiner, Craig Pettit, Catherine Olsson, Sandipan Kundu, Saurav Kadavath, Andy Jones, Anna Chen, Benjamin Mann, Brian Israel, Bryan Seethor, Cameron McKinnon, Christopher Olah, Da Yan, Daniela Amodei, Dario Amodei, Dawn Drain, Dustin Li, Eli Tran-Johnson, Guro Khundadze, Jackson Kernion, James Landis, Jamie Kerr, Jared Mueller, Jeeyoon Hyun, Joshua Landau, Kamal Ndousse, Landon Goldberg, Liane Lovitt, Martin Lucas, Michael Sellitto, Miranda Zhang, Neerav Kingsland, Nelson Elhage, Nicholas Joseph, Noemí Mercado, Nova DasSarma, Oliver Rausch, Robin Larson, Sam McCandlish, Scott Johnston, Shauna Kravec, Sheer El Showk, Tamera Lanham, Timothy Telleen-Lawton, Tom Brown, Tom Henighan, Tristan Hume, Yuntao Bai, Zac Hatfield-Dodds, Jack Clark, Samuel R. Bowman, Amanda Askell, Roger Grosse, Danny Hernandez, Deep Ganguli, Evan Hubinger, Nicholas Schiefer, Jared Kaplan:
Discovering Language Model Behaviors with Model-Written Evaluations. ACL (Findings) 2023: 13387-13434 - [c69]Julian Michael, Ari Holtzman, Alicia Parrish, Aaron Mueller, Alex Wang, Angelica Chen, Divyam Madaan, Nikita Nangia, Richard Yuanzhe Pang, Jason Phang, Samuel R. Bowman:
What Do NLP Researchers Believe? Results of the NLP Community Metasurvey. ACL (1) 2023: 16334-16368 - [c68]Tomasz Korbak, Kejian Shi, Angelica Chen, Rasika Vinayak Bhalerao, Christopher L. Buckley, Jason Phang, Samuel R. Bowman, Ethan Perez:
Pretraining Language Models with Human Preferences. ICML 2023: 17506-17533 - [c67]Miles Turpin, Julian Michael, Ethan Perez, Samuel R. Bowman:
Language Models Don't Always Say What They Think: Unfaithful Explanations in Chain-of-Thought Prompting. NeurIPS 2023 - [i92]Deep Ganguli, Amanda Askell, Nicholas Schiefer, Thomas I. Liao, Kamile Lukosiute, Anna Chen, Anna Goldie, Azalia Mirhoseini, Catherine Olsson, Danny Hernandez, Dawn Drain, Dustin Li, Eli Tran-Johnson, Ethan Perez, Jackson Kernion, Jamie Kerr, Jared Mueller, Joshua Landau, Kamal Ndousse, Karina Nguyen, Liane Lovitt, Michael Sellitto, Nelson Elhage, Noemí Mercado, Nova DasSarma, Oliver Rausch, Robert Lasenby, Robin Larson, Sam Ringer, Sandipan Kundu, Saurav Kadavath, Scott Johnston, Shauna Kravec, Sheer El Showk, Tamera Lanham, Timothy Telleen-Lawton, Tom Henighan, Tristan Hume, Yuntao Bai, Zac Hatfield-Dodds, Ben Mann, Dario Amodei, Nicholas Joseph, Sam McCandlish, Tom Brown, Christopher Olah, Jack Clark, Samuel R. Bowman, Jared Kaplan:
The Capacity for Moral Self-Correction in Large Language Models. CoRR abs/2302.07459 (2023) - [i91]Tomasz Korbak, Kejian Shi, Angelica Chen, Rasika Bhalerao, Christopher L. Buckley, Jason Phang, Samuel R. Bowman, Ethan Perez:
Pretraining Language Models with Human Preferences. CoRR abs/2302.08582 (2023) - [i90]Angelica Chen, Jérémy Scheurer, Tomasz Korbak, Jon Ander Campos, Jun Shern Chan, Samuel R. Bowman, Kyunghyun Cho, Ethan Perez:
Improving Code Generation by Training with Natural Language Feedback. CoRR abs/2303.16749 (2023) - [i89]Samuel R. Bowman:
Eight Things to Know about Large Language Models. CoRR abs/2304.00612 (2023) - [i88]Miles Turpin, Julian Michael, Ethan Perez, Samuel R. Bowman:
Language Models Don't Always Say What They Think: Unfaithful Explanations in Chain-of-Thought Prompting. CoRR abs/2305.04388 (2023) - [i87]Angelica Chen, Jason Phang, Alicia Parrish, Vishakh Padmakumar, Chen Zhao, Samuel R. Bowman, Kyunghyun Cho:
Two Failures of Self-Consistency in the Multi-Step Reasoning of LLMs. CoRR abs/2305.14279 (2023) - [i86]Jingyuan Selena She, Christopher Potts, Samuel R. Bowman, Atticus Geiger:
ScoNe: Benchmarking Negation Reasoning in Language Models With Fine-Tuning and In-Context Learning. CoRR abs/2305.19426 (2023) - [i85]Ian R. McKenzie, Alexander Lyzhov, Michael Pieler, Alicia Parrish, Aaron Mueller, Ameya Prabhu, Euan McLean, Aaron Kirtland, Alexis Ross, Alisa Liu, Andrew Gritsevskiy, Daniel Wurgaft, Derik Kauffman, Gabriel Recchia, Jiacheng Liu, Joe Cavanagh, Max Weiss, Sicong Huang, The Floating Droid, Tom Tseng, Tomasz Korbak, Xudong Shen, Yuhui Zhang, Zhengping Zhou, Najoung Kim, Samuel R. Bowman, Ethan Perez:
Inverse Scaling: When Bigger Isn't Better. CoRR abs/2306.09479 (2023) - [i84]Ansh Radhakrishnan, Karina Nguyen, Anna Chen, Carol Chen, Carson Denison, Danny Hernandez, Esin Durmus, Evan Hubinger, Jackson Kernion, Kamile Lukosiute, Newton Cheng, Nicholas Joseph, Nicholas Schiefer, Oliver Rausch, Sam McCandlish, Sheer El Showk, Tamera Lanham, Tim Maxwell, Venkatesa Chandrasekaran, Zac Hatfield-Dodds, Jared Kaplan, Jan Brauner, Samuel R. Bowman, Ethan Perez:
Question Decomposition Improves the Faithfulness of Model-Generated Reasoning. CoRR abs/2307.11768 (2023) - [i83]Tamera Lanham, Anna Chen, Ansh Radhakrishnan, Benoit Steiner, Carson Denison, Danny Hernandez, Dustin Li, Esin Durmus, Evan Hubinger, Jackson Kernion, Kamile Lukosiute, Karina Nguyen, Newton Cheng, Nicholas Joseph, Nicholas Schiefer, Oliver Rausch, Robin Larson, Sam McCandlish, Sandipan Kundu, Saurav Kadavath, Shannon Yang, Thomas Henighan, Timothy Maxwell, Timothy Telleen-Lawton, Tristan Hume, Zac Hatfield-Dodds, Jared Kaplan, Jan Brauner, Samuel R. Bowman, Ethan Perez:
Measuring Faithfulness in Chain-of-Thought Reasoning. CoRR abs/2307.13702 (2023) - [i82]Roger B. Grosse, Juhan Bae, Cem Anil, Nelson Elhage, Alex Tamkin, Amirhossein Tajdini, Benoit Steiner, Dustin Li, Esin Durmus, Ethan Perez, Evan Hubinger, Kamile Lukosiute, Karina Nguyen, Nicholas Joseph, Sam McCandlish, Jared Kaplan, Samuel R. Bowman:
Studying Large Language Model Generalization with Influence Functions. CoRR abs/2308.03296 (2023) - [i81]Mrinank Sharma, Meg Tong, Tomasz Korbak, David Duvenaud, Amanda Askell, Samuel R. Bowman, Newton Cheng, Esin Durmus, Zac Hatfield-Dodds, Scott R. Johnston, Shauna Kravec, Timothy Maxwell, Sam McCandlish, Kamal Ndousse, Oliver Rausch, Nicholas Schiefer, Da Yan, Miranda Zhang, Ethan Perez:
Towards Understanding Sycophancy in Language Models. CoRR abs/2310.13548 (2023) - [i80]Julian Michael, Salsabila Mahdi, David Rein, Jackson Petty, Julien Dirani, Vishakh Padmakumar, Samuel R. Bowman:
Debate Helps Supervise Unreliable Experts. CoRR abs/2311.08702 (2023) - [i79]David Rein, Betty Li Hou, Asa Cooper Stickland, Jackson Petty, Richard Yuanzhe Pang, Julien Dirani, Julian Michael, Samuel R. Bowman:
GPQA: A Graduate-Level Google-Proof Q&A Benchmark. CoRR abs/2311.12022 (2023) - 2022
- [c66]Alicia Parrish, Angelica Chen, Nikita Nangia, Vishakh Padmakumar, Jason Phang, Jana Thompson, Phu Mon Htut, Samuel R. Bowman:
BBQ: A hand-built bias benchmark for question answering. ACL (Findings) 2022: 2086-2105 - [c65]Saku Sugawara, Nikita Nangia, Alex Warstadt, Samuel R. Bowman:
What Makes Reading Comprehension Questions Difficult? ACL (1) 2022: 6951-6971 - [c64]Samuel R. Bowman:
The Dangers of Underclaiming: Reasons for Caution When Reporting How NLP Systems Fail. ACL (1) 2022: 7484-7499 - [c63]Alex Wang, Richard Yuanzhe Pang, Angelica Chen, Jason Phang, Samuel R. Bowman:
SQuALITY: Building a Long-Document Summarization Dataset the Hard Way. EMNLP 2022: 1139-1156 - [c62]Anne Lauscher, Federico Bianchi, Samuel R. Bowman, Dirk Hovy:
SocioProbe: What, When, and Where Language Models Learn about Sociodemographics. EMNLP 2022: 7901-7918 - [c61]Richard Yuanzhe Pang, Alicia Parrish, Nitish Joshi, Nikita Nangia, Jason Phang, Angelica Chen, Vishakh Padmakumar, Johnny Ma, Jana Thompson, He He, Samuel R. Bowman:
QuALITY: Question Answering with Long Input Texts, Yes! NAACL-HLT 2022: 5336-5358 - [i78]Saku Sugawara, Nikita Nangia, Alex Warstadt, Samuel R. Bowman:
What Makes Reading Comprehension Questions Difficult? CoRR abs/2203.06342 (2022) - [i77]Alicia Parrish, Harsh Trivedi, Ethan Perez, Angelica Chen, Nikita Nangia, Jason Phang, Samuel R. Bowman:
Single-Turn Debate Does Not Help Humans Answer Hard Reading-Comprehension Questions. CoRR abs/2204.05212 (2022) - [i76]Or Honovich, Uri Shaham, Samuel R. Bowman, Omer Levy:
Instruction Induction: From Few Examples to Natural Language Task Descriptions. CoRR abs/2205.10782 (2022) - [i75]Alex Wang, Richard Yuanzhe Pang, Angelica Chen, Jason Phang, Samuel R. Bowman:
SQuALITY: Building a Long-Document Summarization Dataset the Hard Way. CoRR abs/2205.11465 (2022) - [i74]Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power
, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza, Ambrose Slone, Ameet Rahane, Anantharaman S. Iyer, Anders Andreassen, Andrea Madotto, Andrea Santilli, Andreas Stuhlmüller, Andrew M. Dai, Andrew La, Andrew K. Lampinen, Andy Zou, Angela Jiang, Angelica Chen, Anh Vuong, Animesh Gupta, Anna Gottardi, Antonio Norelli, Anu Venkatesh, Arash Gholamidavoodi, Arfa Tabassum, Arul Menezes, Arun Kirubarajan, Asher Mullokandov, Ashish Sabharwal, Austin Herrick, Avia Efrat, Aykut Erdem, Ayla Karakas, B. Ryan Roberts, Bao Sheng Loe, Barret Zoph, Bartlomiej Bojanowski, Batuhan Özyurt, Behnam Hedayatnia, Behnam Neyshabur, Benjamin Inden, Benno Stein, Berk Ekmekci, Bill Yuchen Lin, Blake Howald, Bryan Orinion, Cameron Diao, Cameron Dour, Catherine Stinson, Cedrick Argueta, Cèsar Ferri Ramírez, Chandan Singh, Charles Rathkopf, Chenlin Meng, Chitta Baral, Chiyu Wu, Chris Callison-Burch, Chris Waites, Christian Voigt, Christopher D. Manning, Christopher Potts, Cindy Ramirez, Clara E. Rivera, Clemencia Siro, Colin Raffel, Courtney Ashcraft, Cristina Garbacea, Damien Sileo
, Dan Garrette, Dan Hendrycks, Dan Kilman, Dan Roth, Daniel Freeman, Daniel Khashabi, Daniel Levy, Daniel Moseguí González, Danielle Perszyk, Danny Hernandez, Danqi Chen, Daphne Ippolito, Dar Gilboa, David Dohan, David Drakard, David Jurgens, Debajyoti Datta, Deep Ganguli, Denis Emelin, Denis Kleyko, Deniz Yuret, Derek Chen, Derek Tam, Dieuwke Hupkes, Diganta Misra, Dilyar Buzan, Dimitri Coelho Mollo, Diyi Yang, Dong-Ho Lee, Dylan Schrader, Ekaterina Shutova, Ekin Dogus Cubuk, Elad Segal, Eleanor Hagerman, Elizabeth Barnes, Elizabeth Donoway, Ellie Pavlick, Emanuele Rodolà, Emma Lam, Eric Chu, Eric Tang, Erkut Erdem, Ernie Chang, Ethan A. Chi, Ethan Dyer, Ethan J. Jerzak, Ethan Kim, Eunice Engefu Manyasi, Evgenii Zheltonozhskii, Fanyue Xia, Fatemeh Siar, Fernando Martínez-Plumed, Francesca Happé, François Chollet, Frieda Rong, Gaurav Mishra, Genta Indra Winata, Gerard de Melo, Germán Kruszewski, Giambattista Parascandolo, Giorgio Mariani, Gloria Wang, Gonzalo Jaimovitch-López, Gregor Betz, Guy Gur-Ari, Hana Galijasevic, Hannah Kim, Hannah Rashkin, Hannaneh Hajishirzi, Harsh Mehta, Hayden Bogar, Henry Shevlin, Hinrich Schütze, Hiromu Yakura, Hongming Zhang, Hugh Mee Wong, Ian Ng, Isaac Noble, Jaap Jumelet, Jack Geissinger, Jackson Kernion, Jacob Hilton, Jaehoon Lee, Jaime Fernández Fisac, James B. Simon, James Koppel, James Zheng, James Zou, Jan Kocon, Jana Thompson, Janelle Wingfield, Jared Kaplan, Jarema Radom, Jascha Sohl-Dickstein, Jason Phang, Jason Wei, Jason Yosinski, Jekaterina Novikova, Jelle Bosscher, Jennifer Marsh, Jeremy Kim, Jeroen Taal, Jesse H. Engel, Jesujoba Alabi, Jiacheng Xu, Jiaming Song, Jillian Tang, Joan Waweru, John Burden
, John Miller, John U. Balis, Jonathan Batchelder, Jonathan Berant, Jörg Frohberg, Jos Rozen, José Hernández-Orallo, Joseph Boudeman, Joseph Guerr, Joseph Jones, Joshua B. Tenenbaum, Joshua S. Rule, Joyce Chua, Kamil Kanclerz, Karen Livescu, Karl Krauth, Karthik Gopalakrishnan, Katerina Ignatyeva, Katja Markert, Kaustubh D. Dhole, Kevin Gimpel, Kevin Omondi, Kory W. Mathewson, Kristen Chiafullo, Ksenia Shkaruta, Kumar Shridhar, Kyle McDonell, Kyle Richardson, Laria Reynolds, Leo Gao, Li Zhang, Liam Dugan
, Lianhui Qin, Lidia Contreras Ochando, Louis-Philippe Morency, Luca Moschella, Lucas Lam, Lucy Noble, Ludwig Schmidt, Luheng He, Luis Oliveros Colón, Luke Metz, Lütfi Kerem Senel, Maarten Bosma, Maarten Sap, Maartje ter Hoeve, Maheen Farooqi, Manaal Faruqui, Mantas Mazeika, Marco Baturan, Marco Marelli, Marco Maru, María José Ramírez-Quintana, Marie Tolkiehn, Mario Giulianelli, Martha Lewis, Martin Potthast, Matthew L. Leavitt, Matthias Hagen, Mátyás Schubert, Medina Baitemirova, Melody Arnaud, Melvin McElrath, Michael A. Yee, Michael Cohen, Michael Gu, Michael I. Ivanitskiy, Michael Starritt, Michael Strube, Michal Swedrowski, Michele Bevilacqua, Michihiro Yasunaga, Mihir Kale, Mike Cain, Mimee Xu, Mirac Suzgun, Mitch Walker, Mo Tiwari, Mohit Bansal, Moin Aminnaseri, Mor Geva, Mozhdeh Gheini, Mukund Varma T., Nanyun Peng, Nathan A. Chi, Nayeon Lee, Neta Gur-Ari Krakover, Nicholas Cameron, Nicholas Roberts, Nick Doiron, Nicole Martinez, Nikita Nangia, Niklas Deckers, Niklas Muennighoff, Nitish Shirish Keskar, Niveditha Iyer, Noah Constant, Noah Fiedel, Nuan Wen, Oliver Zhang, Omar Agha, Omar Elbaghdadi, Omer Levy, Owain Evans, Pablo Antonio Moreno Casares, Parth Doshi, Pascale Fung, Paul Pu Liang, Paul Vicol, Pegah Alipoormolabashi, Peiyuan Liao, Percy Liang, Peter Chang, Peter Eckersley, Phu Mon Htut, Pinyu Hwang, Piotr Milkowski, Piyush Patil, Pouya Pezeshkpour, Priti Oli, Qiaozhu Mei, Qing Lyu, Qinlang Chen, Rabin Banjade, Rachel Etta Rudolph, Raefer Gabriel, Rahel Habacker, Ramon Risco, Raphaël Millière, Rhythm Garg, Richard Barnes, Rif A. Saurous, Riku Arakawa, Robbe Raymaekers, Robert Frank, Rohan Sikand, Roman Novak, Roman Sitelew, Ronan LeBras, Rosanne Liu, Rowan Jacobs, Rui Zhang, Ruslan Salakhutdinov, Ryan Chi, Ryan Lee, Ryan Stovall, Ryan Teehan, Rylan Yang, Sahib Singh, Saif M. Mohammad, Sajant Anand, Sam Dillavou, Sam Shleifer, Sam Wiseman, Samuel Gruetter, Samuel R. Bowman, Samuel S. Schoenholz, Sanghyun Han, Sanjeev Kwatra, Sarah A. 
Rous, Sarik Ghazarian, Sayan Ghosh, Sean Casey, Sebastian Bischoff, Sebastian Gehrmann, Sebastian Schuster, Sepideh Sadeghi, Shadi Hamdan, Sharon Zhou, Shashank Srivastava, Sherry Shi, Shikhar Singh, Shima Asaadi, Shixiang Shane Gu, Shubh Pachchigar, Shubham Toshniwal, Shyam Upadhyay, Shyamolima (Shammie) Debnath, Siamak Shakeri, Simon Thormeyer, Simone Melzi, Siva Reddy, Sneha Priscilla Makini, Soo-Hwan Lee, Spencer Torene, Sriharsha Hatwar, Stanislas Dehaene, Stefan Divic, Stefano Ermon, Stella Biderman, Stephanie Lin, Stephen Prasad, Steven T. Piantadosi, Stuart M. Shieber, Summer Misherghi, Svetlana Kiritchenko, Swaroop Mishra, Tal Linzen, Tal Schuster
, Tao Li, Tao Yu, Tariq Ali, Tatsu Hashimoto, Te-Lin Wu, Théo Desbordes, Theodore Rothschild, Thomas Phan, Tianle Wang, Tiberius Nkinyili, Timo Schick, Timofei Kornev, Titus Tunduny, Tobias Gerstenberg, Trenton Chang, Trishala Neeraj, Tushar Khot, Tyler Shultz, Uri Shaham, Vedant Misra, Vera Demberg, Victoria Nyamai, Vikas Raunak, Vinay V. Ramasesh, Vinay Uday Prabhu, Vishakh Padmakumar, Vivek Srikumar, William Fedus, William Saunders, William Zhang, Wout Vossen, Xiang Ren, Xiaoyu Tong, Xinran Zhao, Xinyi Wu, Xudong Shen, Yadollah Yaghoobzadeh, Yair Lakretz, Yangqiu Song, Yasaman Bahri, Yejin Choi, Yichi Yang, Yiding Hao, Yifu Chen, Yonatan Belinkov, Yu Hou, Yufang Hou, Yuntao Bai, Zachary Seid, Zhuoye Zhao, Zijian Wang, Zijie J. Wang, Zirui Wang, Ziyi Wu:
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models. CoRR abs/2206.04615 (2022) - [i73]Saurav Kadavath, Tom Conerly, Amanda Askell, Tom Henighan, Dawn Drain, Ethan Perez, Nicholas Schiefer, Zac Hatfield-Dodds, Nova DasSarma, Eli Tran-Johnson, Scott Johnston, Sheer El Showk, Andy Jones, Nelson Elhage, Tristan Hume, Anna Chen, Yuntao Bai, Sam Bowman, Stanislav Fort, Deep Ganguli, Danny Hernandez, Josh Jacobson, Jackson Kernion, Shauna Kravec, Liane Lovitt, Kamal Ndousse, Catherine Olsson, Sam Ringer, Dario Amodei, Tom Brown, Jack Clark, Nicholas Joseph, Ben Mann, Sam McCandlish, Chris Olah, Jared Kaplan:
Language Models (Mostly) Know What They Know. CoRR abs/2207.05221 (2022) - [i72]Alex Warstadt, Samuel R. Bowman:
What Artificial Neural Networks Can Tell Us About Human Language Acquisition. CoRR abs/2208.07998 (2022) - [i71]Julian Michael, Ari Holtzman, Alicia Parrish, Aaron Mueller, Alex Wang, Angelica Chen, Divyam Madaan, Nikita Nangia, Richard Yuanzhe Pang, Jason Phang, Samuel R. Bowman:
What Do NLP Researchers Believe? Results of the NLP Community Metasurvey. CoRR abs/2208.12852 (2022) - [i70]Deep Ganguli, Liane Lovitt, Jackson Kernion, Amanda Askell, Yuntao Bai, Saurav Kadavath, Ben Mann, Ethan Perez, Nicholas Schiefer, Kamal Ndousse, Andy Jones, Sam Bowman, Anna Chen, Tom Conerly, Nova DasSarma, Dawn Drain, Nelson Elhage, Sheer El Showk, Stanislav Fort, Zac Hatfield-Dodds, Tom Henighan, Danny Hernandez, Tristan Hume, Josh Jacobson, Scott Johnston, Shauna Kravec, Catherine Olsson, Sam Ringer, Eli Tran-Johnson, Dario Amodei, Tom Brown, Nicholas Joseph, Sam McCandlish, Chris Olah, Jared Kaplan, Jack Clark:
Red Teaming Language Models to Reduce Harms: Methods, Scaling Behaviors, and Lessons Learned. CoRR abs/2209.07858 (2022) - [i69]Alicia Parrish, Harsh Trivedi, Nikita Nangia, Vishakh Padmakumar, Jason Phang, Amanpreet Singh Saimbhi, Samuel R. Bowman:
Two-Turn Debate Doesn't Help Humans Answer Hard Reading Comprehension Questions. CoRR abs/2210.10860 (2022) - [i68]Samuel R. Bowman, Jeeyoon Hyun, Ethan Perez, Edwin Chen, Craig Pettit, Scott Heiner, Kamile Lukosiute, Amanda Askell, Andy Jones, Anna Chen, Anna Goldie, Azalia Mirhoseini, Cameron McKinnon, Christopher Olah, Daniela Amodei, Dario Amodei, Dawn Drain, Dustin Li, Eli Tran-Johnson, Jackson Kernion, Jamie Kerr, Jared Mueller, Jeffrey Ladish, Joshua Landau, Kamal Ndousse, Liane Lovitt, Nelson Elhage, Nicholas Schiefer, Nicholas Joseph, Noemí Mercado, Nova DasSarma, Robin Larson, Sam McCandlish, Sandipan Kundu, Scott Johnston, Shauna Kravec, Sheer El Showk, Stanislav Fort, Timothy Telleen-Lawton, Tom Brown, Tom Henighan, Tristan Hume, Yuntao Bai, Zac Hatfield-Dodds, Ben Mann, Jared Kaplan:
Measuring Progress on Scalable Oversight for Large Language Models. CoRR abs/2211.03540 (2022) - [i67]Anne Lauscher, Federico Bianchi, Samuel R. Bowman, Dirk Hovy:
SocioProbe: What, When, and Where Language Models Learn about Sociodemographics. CoRR abs/2211.04281 (2022) - [i66]Yuntao Bai, Saurav Kadavath, Sandipan Kundu, Amanda Askell, Jackson Kernion, Andy Jones, Anna Chen, Anna Goldie, Azalia Mirhoseini, Cameron McKinnon, Carol Chen, Catherine Olsson, Christopher Olah, Danny Hernandez, Dawn Drain, Deep Ganguli, Dustin Li, Eli Tran-Johnson, Ethan Perez, Jamie Kerr, Jared Mueller, Jeffrey Ladish, Joshua Landau, Kamal Ndousse, Kamile Lukosiute, Liane Lovitt, Michael Sellitto, Nelson Elhage, Nicholas Schiefer, Noemí Mercado, Nova DasSarma, Robert Lasenby, Robin Larson, Sam Ringer, Scott Johnston, Shauna Kravec, Sheer El Showk, Stanislav Fort, Tamera Lanham, Timothy Telleen-Lawton, Tom Conerly, Tom Henighan, Tristan Hume, Samuel R. Bowman, Zac Hatfield-Dodds, Ben Mann, Dario Amodei, Nicholas Joseph, Sam McCandlish, Tom Brown, Jared Kaplan:
Constitutional AI: Harmlessness from AI Feedback. CoRR abs/2212.08073 (2022) - [i65]Ethan Perez, Sam Ringer, Kamile Lukosiute, Karina Nguyen, Edwin Chen, Scott Heiner, Craig Pettit, Catherine Olsson, Sandipan Kundu, Saurav Kadavath, Andy Jones, Anna Chen, Ben Mann, Brian Israel, Bryan Seethor, Cameron McKinnon, Christopher Olah, Da Yan, Daniela Amodei, Dario Amodei, Dawn Drain, Dustin Li, Eli Tran-Johnson, Guro Khundadze, Jackson Kernion, James Landis, Jamie Kerr, Jared Mueller, Jeeyoon Hyun, Joshua Landau, Kamal Ndousse, Landon Goldberg, Liane Lovitt, Martin Lucas, Michael Sellitto, Miranda Zhang, Neerav Kingsland, Nelson Elhage, Nicholas Joseph, Noemí Mercado, Nova DasSarma, Oliver Rausch, Robin Larson, Sam McCandlish, Scott Johnston, Shauna Kravec, Sheer El Showk, Tamera Lanham, Timothy Telleen-Lawton, Tom Brown, Tom Henighan, Tristan Hume, Yuntao Bai, Zac Hatfield-Dodds, Jack Clark, Samuel R. Bowman, Amanda Askell, Roger Grosse, Danny Hernandez, Deep Ganguli, Evan Hubinger, Nicholas Schiefer, Jared Kaplan:
Discovering Language Model Behaviors with Model-Written Evaluations. CoRR abs/2212.09251 (2022) - [i64]Najoung Kim, Phu Mon Htut, Samuel R. Bowman, Jackson Petty:
(QA)²: Question Answering with Questionable Assumptions. CoRR abs/2212.10003 (2022) - 2021
- [c60]Yian Zhang, Alex Warstadt, Xiaocheng Li, Samuel R. Bowman:
When Do You Need Billions of Words of Pretraining Data? ACL/IJCNLP (1) 2021: 1112-1125 - [c59]Clara Vania, Phu Mon Htut, William Huang, Dhara A. Mungra, Richard Yuanzhe Pang, Jason Phang, Haokun Liu, Kyunghyun Cho, Samuel R. Bowman:
Comparing Test Sets with Item Response Theory. ACL/IJCNLP (1) 2021: 1141-1158 - [c58]Nikita Nangia, Saku Sugawara, Harsh Trivedi, Alex Warstadt, Clara Vania, Samuel R. Bowman:
What Ingredients Make for an Effective Crowdsourcing Protocol for Difficult NLU Data Collection Tasks? ACL/IJCNLP (1) 2021: 1221-1235 - [c57]Jason Phang, Haokun Liu, Samuel R. Bowman:
Fine-Tuned Transformers Show Clusters of Similar Representations Across Layers. BlackboxNLP@EMNLP 2021: 529-538 - [c56]Alicia Parrish, Sebastian Schuster, Alex Warstadt, Omar Agha, Soo-Hwan Lee, Zhuoye Zhao, Samuel R. Bowman, Tal Linzen:
NOPE: A Corpus of Naturally-Occurring Presuppositions in English. CoNLL 2021: 349-366 - [c55]Alicia Parrish, William Huang, Omar Agha, Soo-Hwan Lee, Nikita Nangia, Alex Warstadt, Karmanya Aggarwal, Emily Allaway, Tal Linzen, Samuel R. Bowman:
Does Putting a Linguist in the Loop Improve NLU Data Collection? EMNLP (Findings) 2021: 4886-4901 - [c54]Samuel R. Bowman, George E. Dahl:
What Will it Take to Fix Benchmarking in Natural Language Understanding? NAACL-HLT 2021: 4843-4855 - [i63]Samuel R. Bowman, George E. Dahl:
What Will it Take to Fix Benchmarking in Natural Language Understanding? CoRR abs/2104.02145 (2021) - [i62]Alicia Parrish, William Huang, Omar Agha, Soo-Hwan Lee, Nikita Nangia, Alex Warstadt, Karmanya Aggarwal, Emily Allaway, Tal Linzen, Samuel R. Bowman:
Does Putting a Linguist in the Loop Improve NLU Data Collection? CoRR abs/2104.07179 (2021) - [i61]Nikita Nangia, Saku Sugawara, Harsh Trivedi, Alex Warstadt, Clara Vania, Samuel R. Bowman:
What Ingredients Make for an Effective Crowdsourcing Protocol for Difficult NLU Data Collection Tasks? CoRR abs/2106.00794 (2021) - [i60]Clara Vania, Phu Mon Htut, William Huang, Dhara A. Mungra, Richard Yuanzhe Pang, Jason Phang, Haokun Liu, Kyunghyun Cho, Samuel R. Bowman:
Comparing Test Sets with Item Response Theory. CoRR abs/2106.00840 (2021) - [i59]Alicia Parrish, Sebastian Schuster, Alex Warstadt, Omar Agha, Soo-Hwan Lee, Zhuoye Zhao, Samuel R. Bowman, Tal Linzen:
NOPE: A Corpus of Naturally-Occurring Presuppositions in English. CoRR abs/2109.06987 (2021) - [i58]Jason Phang, Haokun Liu, Samuel R. Bowman:
Fine-Tuned Transformers Show Clusters of Similar Representations Across Layers. CoRR abs/2109.08406 (2021) - [i57]Alicia Parrish, Angelica Chen, Nikita Nangia, Vishakh Padmakumar, Jason Phang, Jana Thompson, Phu Mon Htut, Samuel R. Bowman:
BBQ: A Hand-Built Bias Benchmark for Question Answering. CoRR abs/2110.08193 (2021) - [i56]Samuel R. Bowman:
When Combating Hype, Proceed with Caution. CoRR abs/2110.08300 (2021) - [i55]Derek Chen, Zhou Yu, Samuel R. Bowman:
Learning with Noisy Labels by Targeted Relabeling. CoRR abs/2110.08355 (2021) - [i54]Jason Phang, Angelica Chen, William Huang, Samuel R. Bowman:
Adversarially Constructed Evaluation Sets Are More Challenging, but May Not Be Fair. CoRR abs/2111.08181 (2021) - [i53]Richard Yuanzhe Pang, Alicia Parrish, Nitish Joshi, Nikita Nangia, Jason Phang, Angelica Chen, Vishakh Padmakumar, Johnny Ma, Jana Thompson, He He, Samuel R. Bowman:
QuALITY: Question Answering with Long Input Texts, Yes! CoRR abs/2112.08608 (2021) - 2020
- [j5]Alex Warstadt, Alicia Parrish, Haokun Liu, Anhad Mohananey, Wei Peng, Sheng-Fu Wang, Samuel R. Bowman:
BLiMP: The Benchmark of Linguistic Minimal Pairs for English. Trans. Assoc. Comput. Linguistics 8: 377-392 (2020) - [j4]Alex Warstadt, Alicia Parrish, Haokun Liu, Anhad Mohananey, Wei Peng, Sheng-Fu Wang, Samuel R. Bowman:
Erratum: "BLiMP: The Benchmark of Linguistic Minimal Pairs for English". Trans. Assoc. Comput. Linguistics 8: 867-868 (2020) - [c53]Katharina Kann, Samuel R. Bowman, Kyunghyun Cho:
Learning to Learn Morphological Inflection for Resource-Poor Languages. AAAI 2020: 8058-8065 - [c52]Yada Pruksachatkun, Philip Yeres, Haokun Liu, Jason Phang, Phu Mon Htut, Alex Wang, Ian Tenney, Samuel R. Bowman:
jiant: A Software Toolkit for Research on General-Purpose Text Understanding Models. ACL (demo) 2020: 109-117 - [c51]Yada Pruksachatkun, Jason Phang, Haokun Liu, Phu Mon Htut, Xiaoyi Zhang, Richard Yuanzhe Pang, Clara Vania, Katharina Kann, Samuel R. Bowman:
Intermediate-Task Transfer Learning with Pretrained Language Models: When and Why Does It Work? ACL 2020: 5231-5247 - [c50]William Huang, Haokun Liu, Samuel R. Bowman:
Counterfactually-Augmented SNLI Training Data Does Not Yield Better Generalization Than Unaugmented Data. Insights 2020: 82-87 - [c49]Alex Warstadt, Samuel R. Bowman:
Can neural networks acquire a structural bias from raw linguistic data? CogSci 2020 - [c48]Alex Warstadt, Yian Zhang, Xiaocheng Li, Haokun Liu, Samuel R. Bowman:
Learning Which Features Matter: RoBERTa Acquires a Preference for Linguistic Generalizations (Eventually). EMNLP (1) 2020: 217-235 - [c47]Nikita Nangia, Clara Vania, Rasika Bhalerao, Samuel R. Bowman:
CrowS-Pairs: A Challenge Dataset for Measuring Social Biases in Masked Language Models. EMNLP (1) 2020: 1953-1967 - [c46]Samuel R. Bowman, Jennimaria Palomaki, Livio Baldini Soares, Emily Pitler:
New Protocols and Negative Results for Textual Entailment Data Collection. EMNLP (1) 2020: 8203-8214 - [c45]Haokun Liu, William Huang, Dhara A. Mungra, Samuel R. Bowman:
Precise Task Formalization Matters in Winograd Schema Evaluations. EMNLP (1) 2020: 8275-8280 - [c44]Jason Phang, Iacer Calixto, Phu Mon Htut, Yada Pruksachatkun, Haokun Liu, Clara Vania, Katharina Kann, Samuel R. Bowman:
English Intermediate-Task Training Improves Zero-Shot Cross-Lingual Transfer Too. AACL/IJCNLP 2020: 557-575 - [c43]Clara Vania, Ruijie Chen, Samuel R. Bowman:
Asking Crowdworkers to Write Entailment Examples: The Best of Bad Options. AACL/IJCNLP 2020: 672-686 - [c42]Anhad Mohananey, Katharina Kann, Samuel R. Bowman:
Self-Training for Unsupervised Parsing with PRPN. IWPT 2020 2020: 105-110 - [i52]Yada Pruksachatkun, Philip Yeres, Haokun Liu, Jason Phang, Phu Mon Htut, Alex Wang, Ian Tenney, Samuel R. Bowman:
jiant: A Software Toolkit for Research on General-Purpose Text Understanding Models. CoRR abs/2003.02249 (2020) - [i51]Samuel R. Bowman, Jennimaria Palomaki, Livio Baldini Soares, Emily Pitler:
Collecting Entailment Data for Pretraining: New Protocols and Negative Results. CoRR abs/2004.11997 (2020) - [i50]Katharina Kann, Samuel R. Bowman, Kyunghyun Cho:
Learning to Learn Morphological Inflection for Resource-Poor Languages. CoRR abs/2004.13304 (2020) - [i49]Yada Pruksachatkun, Jason Phang, Haokun Liu, Phu Mon Htut, Xiaoyi Zhang, Richard Yuanzhe Pang, Clara Vania, Katharina Kann, Samuel R. Bowman:
Intermediate-Task Transfer Learning with Pretrained Models for Natural Language Understanding: When and Why Does It Work? CoRR abs/2005.00628 (2020) - [i48]Jason Phang, Phu Mon Htut, Yada Pruksachatkun, Haokun Liu, Clara Vania, Katharina Kann, Iacer Calixto, Samuel R. Bowman:
English Intermediate-Task Training Improves Zero-Shot Cross-Lingual Transfer Too. CoRR abs/2005.13013 (2020) - [i47]Anhad Mohananey, Katharina Kann, Samuel R. Bowman:
Self-Training for Unsupervised Parsing with PRPN. CoRR abs/2005.13455 (2020) - [i46]Alex Warstadt, Samuel R. Bowman:
Can neural networks acquire a structural bias from raw linguistic data? CoRR abs/2007.06761 (2020) - [i45]Nikita Nangia, Clara Vania, Rasika Bhalerao, Samuel R. Bowman:
CrowS-Pairs: A Challenge Dataset for Measuring Social Biases in Masked Language Models. CoRR abs/2010.00133 (2020) - [i44]Haokun Liu, William Huang, Dhara A. Mungra, Samuel R. Bowman:
Precise Task Formalization Matters in Winograd Schema Evaluations. CoRR abs/2010.04043 (2020) - [i43]William Huang, Haokun Liu, Samuel R. Bowman:
Counterfactually-Augmented SNLI Training Data Does Not Yield Better Generalization Than Unaugmented Data. CoRR abs/2010.04762 (2020) - [i42]Alex Warstadt, Yian Zhang, Haau-Sing Li, Haokun Liu, Samuel R. Bowman:
Learning Which Features Matter: RoBERTa Acquires a Preference for Linguistic Generalizations (Eventually). CoRR abs/2010.05358 (2020) - [i41]Clara Vania, Ruijie Chen, Samuel R. Bowman:
Asking Crowdworkers to Write Entailment Examples: The Best of Bad Options. CoRR abs/2010.06122 (2020) - [i40]Yian Zhang, Alex Warstadt, Haau-Sing Li, Samuel R. Bowman:
When Do You Need Billions of Words of Pretraining Data? CoRR abs/2011.04946 (2020)
2010 – 2019
- 2019
- [j3]Alex Warstadt, Amanpreet Singh, Samuel R. Bowman:
Neural Network Acceptability Judgments. Trans. Assoc. Comput. Linguistics 7: 625-641 (2019) - [c41]Alex Wang, Jan Hula, Patrick Xia, Raghavendra Pappagari, R. Thomas McCoy, Roma Patel, Najoung Kim, Ian Tenney, Yinghui Huang, Katherin Yu, Shuning Jin, Berlin Chen, Benjamin Van Durme, Edouard Grave, Ellie Pavlick, Samuel R. Bowman:
Can You Tell Me How to Get Past Sesame Street? Sentence-Level Pretraining Beyond Language Modeling. ACL (1) 2019: 4465-4476 - [c40]Nikita Nangia, Samuel R. Bowman:
Human vs. Muppet: A Conservative Estimate of Human Performance on the GLUE Benchmark. ACL (1) 2019: 4566-4575 - [c39]Katharina Kann, Anhad Mohananey, Samuel R. Bowman, Kyunghyun Cho:
Neural Unsupervised Parsing Beyond English. DeepLo@EMNLP-IJCNLP 2019: 209-218 - [c38]Alex Warstadt, Yu Cao, Ioana Grosu, Wei Peng, Hagen Blix, Yining Nie, Anna Alsop, Shikha Bordia, Haokun Liu, Alicia Parrish, Sheng-Fu Wang, Jason Phang, Anhad Mohananey, Phu Mon Htut, Paloma Jeretic, Samuel R. Bowman:
Investigating BERT's Knowledge of Language: Five Analysis Methods with NPIs. EMNLP/IJCNLP (1) 2019: 2877-2887 - [c37]Katharina Kann, Kyunghyun Cho, Samuel R. Bowman:
Towards Realistic Practices In Low-Resource Natural Language Processing: The Development Set. EMNLP/IJCNLP (1) 2019: 3340-3347 - [c36]Ian Tenney, Patrick Xia, Berlin Chen, Alex Wang, Adam Poliak, R. Thomas McCoy, Najoung Kim, Benjamin Van Durme, Samuel R. Bowman, Dipanjan Das, Ellie Pavlick:
What do you learn from context? Probing for sentence structure in contextualized word representations. ICLR (Poster) 2019 - [c35]Alex Wang, Amanpreet Singh, Julian Michael, Felix Hill, Omer Levy, Samuel R. Bowman:
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding. ICLR (Poster) 2019 - [c34]Samuel R. Bowman, Xiaodan Zhu:
Deep Learning for Natural Language Inference. NAACL-HLT (Tutorial Abstracts) 2019: 6-8 - [c33]Shikha Bordia, Samuel R. Bowman:
Identifying and Reducing Gender Bias in Word-Level Language Models. NAACL-HLT (Student Research Workshop) 2019: 7-15 - [c32]Chandler May, Alex Wang, Shikha Bordia, Samuel R. Bowman, Rachel Rudinger:
On Measuring Social Biases in Sentence Encoders. NAACL-HLT (1) 2019: 622-628 - [c31]Alex Wang, Yada Pruksachatkun, Nikita Nangia, Amanpreet Singh, Julian Michael, Felix Hill, Omer Levy, Samuel R. Bowman:
SuperGLUE: A Stickier Benchmark for General-Purpose Language Understanding Systems. NeurIPS 2019: 3261-3275 - [c30]Nishant Subramani, Samuel R. Bowman, Kyunghyun Cho:
Can Unconditional Language Models Recover Arbitrary Sentences? NeurIPS 2019: 15232-15242 - [c29]Najoung Kim, Roma Patel, Adam Poliak, Patrick Xia, Alex Wang, Tom McCoy, Ian Tenney, Alexis Ross, Tal Linzen, Benjamin Van Durme, Samuel R. Bowman, Ellie Pavlick:
Probing What Different NLP Tasks Teach Machines about Function Word Comprehension. *SEM@NAACL-HLT 2019: 235-249 - [i39]Alex Warstadt, Samuel R. Bowman:
Grammatical Analysis of Pretrained Sentence Encoders with Acceptability Judgments. CoRR abs/1901.03438 (2019) - [i38]Chandler May, Alex Wang, Shikha Bordia, Samuel R. Bowman, Rachel Rudinger:
On Measuring Social Biases in Sentence Encoders. CoRR abs/1903.10561 (2019) - [i37]Shikha Bordia, Samuel R. Bowman:
Identifying and Reducing Gender Bias in Word-Level Language Models. CoRR abs/1904.03035 (2019) - [i36]Najoung Kim, Roma Patel, Adam Poliak, Alex Wang, Patrick Xia, R. Thomas McCoy, Ian Tenney, Alexis Ross, Tal Linzen, Benjamin Van Durme, Samuel R. Bowman, Ellie Pavlick:
Probing What Different NLP Tasks Teach Machines about Function Word Comprehension. CoRR abs/1904.11544 (2019) - [i35]Alex Wang, Yada Pruksachatkun, Nikita Nangia, Amanpreet Singh, Julian Michael, Felix Hill, Omer Levy, Samuel R. Bowman:
SuperGLUE: A Stickier Benchmark for General-Purpose Language Understanding Systems. CoRR abs/1905.00537 (2019) - [i34]Ian Tenney, Patrick Xia, Berlin Chen, Alex Wang, Adam Poliak, R. Thomas McCoy, Najoung Kim, Benjamin Van Durme, Samuel R. Bowman, Dipanjan Das, Ellie Pavlick:
What do you learn from context? Probing for sentence structure in contextualized word representations. CoRR abs/1905.06316 (2019) - [i33]Nikita Nangia, Samuel R. Bowman:
Human vs. Muppet: A Conservative Estimate of Human Performance on the GLUE Benchmark. CoRR abs/1905.10425 (2019) - [i32]Nishant Subramani, Samuel R. Bowman, Kyunghyun Cho:
Can Unconditional Language Models Recover Arbitrary Sentences? CoRR abs/1907.04944 (2019) - [i31]Katharina Kann, Kyunghyun Cho, Samuel R. Bowman:
Towards Realistic Practices In Low-Resource Natural Language Processing: The Development Set. CoRR abs/1909.01522 (2019) - [i30]Alex Warstadt, Yu Cao, Ioana Grosu, Wei Peng, Hagen Blix, Yining Nie, Anna Alsop, Shikha Bordia, Haokun Liu, Alicia Parrish, Sheng-Fu Wang, Jason Phang, Anhad Mohananey, Phu Mon Htut, Paloma Jeretic, Samuel R. Bowman:
Investigating BERT's Knowledge of Language: Five Analysis Methods with NPIs. CoRR abs/1909.02597 (2019) - [i29]Phu Mon Htut, Kyunghyun Cho, Samuel R. Bowman:
Inducing Constituency Trees through Neural Machine Translation. CoRR abs/1909.10056 (2019) - [i28]Phu Mon Htut, Jason Phang, Shikha Bordia, Samuel R. Bowman:
Do Attention Heads in BERT Track Syntactic Dependencies? CoRR abs/1911.12246 (2019) - [i27]Alex Warstadt, Alicia Parrish, Haokun Liu, Anhad Mohananey, Wei Peng, Sheng-Fu Wang, Samuel R. Bowman:
BLiMP: A Benchmark of Linguistic Minimal Pairs for English. CoRR abs/1912.00582 (2019) - 2018
- [j2]Adina Williams, Andrew Drozdov, Samuel R. Bowman:
Do latent tree learning models identify meaningful structure in sentences? Trans. Assoc. Comput. Linguistics 6: 253-267 (2018) - [c28]Yichen Gong, Samuel R. Bowman:
Ruminating Reader: Reasoning with Gated Multi-hop Attention. QA@ACL 2018: 1-11 - [c27]Woojin Chung, Sheng-Fu Wang, Samuel R. Bowman:
The Lifted Matrix-Space Model for Semantic Composition. CoNLL 2018: 508-518 - [c26]Alex Wang, Amanpreet Singh, Julian Michael, Felix Hill, Omer Levy, Samuel R. Bowman:
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding. BlackboxNLP@EMNLP 2018: 353-355 - [c25]Kelly W. Zhang, Samuel R. Bowman:
Language Modeling Teaches You More than Translation Does: Lessons Learned Through Auxiliary Syntactic Task Analysis. BlackboxNLP@EMNLP 2018: 359-361 - [c24]Phu Mon Htut, Kyunghyun Cho, Samuel R. Bowman:
Grammar Induction with Neural Language Models: An Unusual Replication. BlackboxNLP@EMNLP 2018: 371-373 - [c23]Yun Chen, Victor O. K. Li, Kyunghyun Cho, Samuel R. Bowman:
A Stable and Effective Learning Strategy for Trainable Greedy Decoding. EMNLP 2018: 380-390 - [c22]Alexis Conneau, Ruty Rinott, Guillaume Lample, Adina Williams, Samuel R. Bowman, Holger Schwenk, Veselin Stoyanov:
XNLI: Evaluating Cross-lingual Sentence Representations. EMNLP 2018: 2475-2485 - [c21]Phu Mon Htut, Kyunghyun Cho, Samuel R. Bowman:
Grammar Induction with Neural Language Models: An Unusual Replication. EMNLP 2018: 4998-5003 - [c20]Yun Chen, Kyunghyun Cho, Samuel R. Bowman, Victor O. K. Li:
Stable and Effective Trainable Greedy Decoding for Sequence to Sequence Learning. ICLR (Workshop) 2018 - [c19]Nikita Nangia, Samuel R. Bowman:
ListOps: A Diagnostic Dataset for Latent Tree Learning. NAACL-HLT (Student Research Workshop) 2018: 92-99 - [c18]Suchin Gururangan, Swabha Swayamdipta, Omer Levy, Roy Schwartz, Samuel R. Bowman, Noah A. Smith:
Annotation Artifacts in Natural Language Inference Data. NAACL-HLT (2) 2018: 107-112 - [c17]Phu Mon Htut, Samuel R. Bowman, Kyunghyun Cho:
Training a Ranking Function for Open-Domain Question Answering. NAACL-HLT (Student Research Workshop) 2018: 120-127 - [c16]Adina Williams, Nikita Nangia, Samuel R. Bowman:
A Broad-Coverage Challenge Corpus for Sentence Understanding through Inference. NAACL-HLT 2018: 1112-1122 - [i26]Suchin Gururangan, Swabha Swayamdipta, Omer Levy, Roy Schwartz, Samuel R. Bowman, Noah A. Smith:
Annotation Artifacts in Natural Language Inference Data. CoRR abs/1803.02324 (2018) - [i25]Phu Mon Htut, Samuel R. Bowman, Kyunghyun Cho:
Training a Ranking Function for Open-Domain Question Answering. CoRR abs/1804.04264 (2018) - [i24]Nikita Nangia, Samuel R. Bowman:
ListOps: A Diagnostic Dataset for Latent Tree Learning. CoRR abs/1804.06028 (2018) - [i23]Alex Wang, Amanpreet Singh, Julian Michael, Felix Hill, Omer Levy, Samuel R. Bowman:
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding. CoRR abs/1804.07461 (2018) - [i22]Yun Chen, Victor O. K. Li, Kyunghyun Cho, Samuel R. Bowman:
A Stable and Effective Learning Strategy for Trainable Greedy Decoding. CoRR abs/1804.07915 (2018) - [i21]Alex Warstadt, Amanpreet Singh, Samuel R. Bowman:
Neural Network Acceptability Judgments. CoRR abs/1805.12471 (2018) - [i20]Phu Mon Htut, Kyunghyun Cho, Samuel R. Bowman:
Grammar Induction with Neural Language Models: An Unusual Replication. CoRR abs/1808.10000 (2018) - [i19]Alexis Conneau, Guillaume Lample, Ruty Rinott, Adina Williams, Samuel R. Bowman, Holger Schwenk, Veselin Stoyanov:
XNLI: Evaluating Cross-lingual Sentence Representations. CoRR abs/1809.05053 (2018) - [i18]Kelly W. Zhang, Samuel R. Bowman:
Language Modeling Teaches You More Syntax than Translation Does: Lessons Learned Through Auxiliary Task Analysis. CoRR abs/1809.10040 (2018) - [i17]Jason Phang, Thibault Févry, Samuel R. Bowman:
Sentence Encoders on STILTs: Supplementary Training on Intermediate Labeled-data Tasks. CoRR abs/1811.01088 (2018) - [i16]Katharina Kann, Alex Warstadt, Adina Williams, Samuel R. Bowman:
Verb Argument Structure Alternations in Word and Sentence Embeddings. CoRR abs/1811.10773 (2018) - [i15]Samuel R. Bowman, Ellie Pavlick, Edouard Grave, Benjamin Van Durme, Alex Wang, Jan Hula, Patrick Xia, Raghavendra Pappagari, R. Thomas McCoy, Roma Patel, Najoung Kim, Ian Tenney, Yinghui Huang, Katherin Yu, Shuning Jin, Berlin Chen:
Looking for ELMo's friends: Sentence-Level Pretraining Beyond Language Modeling. CoRR abs/1812.10860 (2018) - 2017
- [j1]Vasant Dhar, Sam Bowman:
A Perspective on Natural Language Understanding Capability: An Interview with Sam Bowman. Big Data 5(1): 5-11 (2017) - [c15]Rohan Kshirsagar, Robert R. Morris, Sam Bowman:
Detecting and Explaining Crisis. CLPsych@ACL 2017: 66-73 - [c14]Sebastian Brarda, Philip Yeres, Samuel R. Bowman:
Sequential Attention: A Context-Aware Alignment Function for Machine Reading. Rep4NLP@ACL 2017: 75-80 - [c13]Nikita Nangia, Adina Williams, Angeliki Lazaridou, Samuel R. Bowman:
The RepEval 2017 Shared Task: Multi-Genre Natural Language Inference with Sentence Representations. RepEval@EMNLP 2017: 1-10 - [e1]Samuel R. Bowman, Yoav Goldberg, Felix Hill, Angeliki Lazaridou, Omer Levy, Roi Reichart, Anders Søgaard:
Proceedings of the 2nd Workshop on Evaluating Vector Space Representations for NLP, RepEval@EMNLP 2017, Copenhagen, Denmark, September 8, 2017. Association for Computational Linguistics 2017, ISBN 978-1-945626-90-6 [contents] - [i14]Adina Williams, Nikita Nangia, Samuel R. Bowman:
A Broad-Coverage Challenge Corpus for Sentence Understanding through Inference. CoRR abs/1704.05426 (2017) - [i13]Yichen Gong, Samuel R. Bowman:
Ruminating Reader: Reasoning with Gated Multi-Hop Attention. CoRR abs/1704.07415 (2017) - [i12]Yacine Jernite, Samuel R. Bowman, David A. Sontag:
Discourse-Based Objectives for Fast Unsupervised Sentence Representation Learning. CoRR abs/1705.00557 (2017) - [i11]Sebastian Brarda, Philip Yeres, Samuel R. Bowman:
Sequential Attention. CoRR abs/1705.02269 (2017) - [i10]Rohan Kshirsagar, Robert R. Morris, Sam Bowman:
Detecting and Explaining Crisis. CoRR abs/1705.09585 (2017) - [i9]Nikita Nangia, Adina Williams, Angeliki Lazaridou, Samuel R. Bowman:
The RepEval 2017 Shared Task: Multi-Genre Natural Language Inference with Sentence Representations. CoRR abs/1707.08172 (2017) - [i8]Adina Williams, Andrew Drozdov, Samuel R. Bowman:
Learning to parse from a semantic objective: It works. Is it syntax? CoRR abs/1709.01121 (2017) - [i7]Woojin Chung, Samuel R. Bowman:
The Lifted Matrix-Space Model for Semantic Composition. CoRR abs/1711.03602 (2017) - 2016
- [c12]Samuel R. Bowman, Jon Gauthier, Abhinav Rastogi, Raghav Gupta, Christopher D. Manning, Christopher Potts:
A Fast Unified Model for Parsing and Sentence Understanding. ACL (1) 2016 - [c11]Samuel R. Bowman, Luke Vilnis, Oriol Vinyals, Andrew M. Dai, Rafal Józefowicz, Samy Bengio:
Generating Sentences from a Continuous Space. CoNLL 2016: 10-21 - [i6]Samuel R. Bowman, Jon Gauthier, Abhinav Rastogi, Raghav Gupta, Christopher D. Manning, Christopher Potts:
A Fast Unified Model for Parsing and Sentence Understanding. CoRR abs/1603.06021 (2016) - 2015
- [c10]Samuel R. Bowman, Christopher Potts, Christopher D. Manning:
Learning Distributed Word Representations for Natural Logic Reasoning. AAAI Spring Symposia 2015 - [c9]Samuel R. Bowman, Christopher Potts, Christopher D. Manning:
Recursive Neural Networks Can Learn Logical Semantics. CVSC 2015: 12-21 - [c8]Samuel R. Bowman, Gabor Angeli, Christopher Potts, Christopher D. Manning:
A large annotated corpus for learning natural language inference. EMNLP 2015: 632-642 - [c7]Samuel R. Bowman, Christopher D. Manning, Christopher Potts:
Tree-Structured Composition in Neural Networks without Tree-Structured Architectures. CoCo@NIPS 2015 - [i5]Samuel R. Bowman, Christopher D. Manning, Christopher Potts:
Tree-structured composition in neural networks without tree-structured architectures. CoRR abs/1506.04834 (2015) - [i4]Samuel R. Bowman, Gabor Angeli, Christopher Potts, Christopher D. Manning:
A large annotated corpus for learning natural language inference. CoRR abs/1508.05326 (2015) - [i3]Samuel R. Bowman, Luke Vilnis, Oriol Vinyals, Andrew M. Dai, Rafal Józefowicz, Samy Bengio:
Generating Sentences from a Continuous Space. CoRR abs/1511.06349 (2015) - 2014
- [c6]Natalia Silveira, Timothy Dozat, Marie-Catherine de Marneffe, Samuel R. Bowman, Miriam Connor, John Bauer, Christopher D. Manning:
A Gold Standard Dependency Corpus for English. LREC 2014: 2897-2904 - [c5]Samuel R. Bowman:
Can recursive neural tensor networks learn logical reasoning? ICLR (Workshop Poster) 2014 - [i2]Samuel R. Bowman, Christopher Potts, Christopher D. Manning:
Recursive Neural Networks for Learning Logical Semantics. CoRR abs/1406.1827 (2014) - [i1]Samuel R. Bowman, Christopher Potts, Christopher D. Manning:
Learning Distributed Word Representations for Natural Logic Reasoning. CoRR abs/1410.4176 (2014) - 2013
- [c4]Marie-Catherine de Marneffe, Miriam Connor, Natalia Silveira, Samuel R. Bowman, Timothy Dozat, Christopher D. Manning:
More Constructions, More Genres: Extending Stanford Dependencies. DepLing 2013: 187-196 - 2012
- [c3]Samuel R. Bowman, Harshit Chopra:
Automatic Animacy Classification. HLT-NAACL 2012: 7-10 - 2011
- [c2]Geoffrey Zweig, Patrick Nguyen, Dirk Van Compernolle, Kris Demuynck, Les E. Atlas, Pascal Clark, Gregory Sell, Meihong Wang, Fei Sha, Hynek Hermansky, Damianos G. Karakos, Aren Jansen, Samuel Thomas, Sivaram G. S. V. S., Samuel R. Bowman, Justine T. Kao:
Speech recognition with segmental conditional random fields: A summary of the JHU CLSP 2010 Summer Workshop. ICASSP 2011: 5044-5047 - 2010
- [c1]Samuel R. Bowman, Karen Livescu:
Modeling pronunciation variation with context-dependent articulatory feature decision trees. INTERSPEECH 2010: 326-329
last updated on 2025-03-04 22:14 CET by the dblp team
all metadata released as open data under CC0 1.0 license