default search action
Luca Soldaini
Person information
- affiliation: Amazon Alexa, CA, USA
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
Journal Articles
- 2024
- [j6]Kyle Lo, Joseph Chee Chang, Andrew Head, Jonathan Bragg, Amy X. Zhang, Cassidy Trier, Chloe Anastasiades, Tal August, Russell Authur, Danielle Bragg, Erin Bransom, Isabel Cachola, Stefan Candra, Yoganand Chandrasekhar, Yen-Sung Chen, Evie Yu-Yen Cheng, Yvonne Chou, Doug Downey, Rob Evans, Raymond Fok, Fangzhou Hu, Regan Huff, Dongyeop Kang, Tae Soo Kim, Rodney Kinney, Aniket Kittur, Hyeonsu B. Kang, Egor Klevak, Bailey Kuehl, Michael Langan, Matt Latzke, Jaron Lochner, Kelsey MacMillan, Eric Marsh, Tyler Murray, Aakanksha Naik, Ngoc-Uyen Nguyen, Srishti Palani, Soya Park, Caroline Paulic, Napol Rachatasumrit, Smita Rao, Paul Sayre, Zejiang Shen, Pao Siangliulue, Luca Soldaini, Huy Tran, Madeleine van Zuylen, Lucy Lu Wang, Chris Wilhelm, Caroline Wu, Jiangjiang Yang, Angele Zamarron, Marti A. Hearst, Daniel S. Weld:
The Semantic Reader Project. Commun. ACM 67(10): 50-61 (2024) - 2023
- [j5]Suzan Verberne, Hussein Suleman, Luca Soldaini, Avijit Ghosh:
Report on the SIGIR 2023 Session on Diversity, Equity and Inclusivity. SIGIR Forum 57(2): 11:1-11:2 (2023) - 2019
- [j4]Sean MacAvaney, Andrew Yates, Arman Cohan, Luca Soldaini, Kai Hui, Nazli Goharian, Ophir Frieder:
Overcoming low-utility facets for complex answer retrieval. Inf. Retr. J. 22(3-4): 395-418 (2019) - 2018
- [j3]Luca Soldaini:
The Knowledge and Language Gap in Medical Information Seeking. SIGIR Forum 52(2): 178-179 (2018) - 2017
- [j2]Luca Soldaini, Andrew Yates, Nazli Goharian:
Learning to reformulate long queries for clinical decision support. J. Assoc. Inf. Sci. Technol. 68(11): 2602-2619 (2017) - 2016
- [j1]Luca Soldaini, Andrew Yates, Elad Yom-Tov, Ophir Frieder, Nazli Goharian:
Enhancing web search in the medical domain via query clarification. Inf. Retr. J. 19(1-2): 149-173 (2016)
Conference and Workshop Papers
- 2024
- [c44]Li Lucy, Suchin Gururangan, Luca Soldaini, Emma Strubell, David Bamman, Lauren Klein, Jesse Dodge:
AboutMe: Using Self-Descriptions in Webpages to Document the Effects of English Pretraining Data Filters. ACL (1) 2024: 7393-7420 - [c43]Fangyuan Xu, Kyle Lo, Luca Soldaini, Bailey Kuehl, Eunsol Choi, David Wadden:
KIWI: A Dataset of Knowledge-Intensive Writing Instructions for Answering Research Questions. ACL (Findings) 2024: 12969-12990 - [c42]Luca Soldaini, Rodney Kinney, Akshita Bhagia, Dustin Schwenk, David Atkinson, Russell Authur, Ben Bogin, Khyathi Raghavi Chandu, Jennifer Dumas, Yanai Elazar, Valentin Hofmann, Ananya Harsh Jha, Sachin Kumar, Li Lucy, Xinxi Lyu, Nathan Lambert, Ian Magnusson, Jacob Morrison, Niklas Muennighoff, Aakanksha Naik, Crystal Nam, Matthew E. Peters, Abhilasha Ravichander, Kyle Richardson, Zejiang Shen, Emma Strubell, Nishant Subramani, Oyvind Tafjord, Pete Walsh, Luke Zettlemoyer, Noah A. Smith, Hannaneh Hajishirzi, Iz Beltagy, Dirk Groeneveld, Jesse Dodge, Kyle Lo:
Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research. ACL (1) 2024: 15725-15788 - [c41]Dirk Groeneveld, Iz Beltagy, Evan Pete Walsh, Akshita Bhagia, Rodney Kinney, Oyvind Tafjord, Ananya Harsh Jha, Hamish Ivison, Ian Magnusson, Yizhong Wang, Shane Arora, David Atkinson, Russell Authur, Khyathi Raghavi Chandu, Arman Cohan, Jennifer Dumas, Yanai Elazar, Yuling Gu, Jack Hessel, Tushar Khot, William Merrill, Jacob Morrison, Niklas Muennighoff, Aakanksha Naik, Crystal Nam, Matthew E. Peters, Valentina Pyatkin, Abhilasha Ravichander, Dustin Schwenk, Saurabh Shah, Will Smith, Emma Strubell, Nishant Subramani, Mitchell Wortsman, Pradeep Dasigi, Nathan Lambert, Kyle Richardson, Luke Zettlemoyer, Jesse Dodge, Kyle Lo, Luca Soldaini, Noah A. Smith, Hannaneh Hajishirzi:
OLMo: Accelerating the Science of Language Models. ACL (1) 2024: 15789-15809 - [c40]Orion Weller, Kyle Lo, David Wadden, Dawn J. Lawrie, Benjamin Van Durme, Arman Cohan, Luca Soldaini:
When do Generative Query and Document Expansions Fail? A Comprehensive Study Across Methods, Retrievers, and Datasets. EACL (Findings) 2024: 1987-2003 - [c39]Li Lucy, Tal August, Rose E. Wang, Luca Soldaini, Courtney Allison, Kyle Lo:
MathFish: Evaluating Language Model Math Reasoning via Grounding in Educational Curricula. EMNLP (Findings) 2024: 5644-5673 - [c38]Yanai Elazar, Akshita Bhagia, Ian Magnusson, Abhilasha Ravichander, Dustin Schwenk, Alane Suhr, Evan Pete Walsh, Dirk Groeneveld, Luca Soldaini, Sameer Singh, Hannaneh Hajishirzi, Noah A. Smith, Jesse Dodge:
What's In My Big Data? ICLR 2024 - [c37]James Mayfield, Eugene Yang, Dawn J. Lawrie, Sean MacAvaney, Paul McNamee, Douglas W. Oard, Luca Soldaini, Ian Soboroff, Orion Weller, Efsun Selin Kayi, Kate Sanders, Marc Mason, Noah Hibbler:
On the Evaluation of Machine-Generated Reports. SIGIR 2024: 1904-1915 - 2023
- [c36]Nathan Dennler, Anaelia Ovalle, Ashwin Singh, Luca Soldaini, Arjun Subramonian, Huy Tu, William Agnew, Avijit Ghosh, Kyra Yee, Irene Font Peradejordi, Zeerak Talat, Mayra Russo, Jessica de Jesus de Pinho Pinhal:
Bound by the Bounty: Collaboratively Shaping Evaluation Processes for Queer AI Harms. AIES 2023: 375-386 - [c35]Jon Saad-Falcon, Amanpreet Singh, Luca Soldaini, Mike D'Arcy, Arman Cohan, Doug Downey:
Embedding Recycling for Language Models. EACL (Findings) 2023: 1888-1908 - [c34]Kyle Lo, Zejiang Shen, Benjamin Newman, Joseph Chee Chang, Russell Authur, Erin Bransom, Stefan Candra, Yoganand Chandrasekhar, Regan Huff, Bailey Kuehl, Amanpreet Singh, Chris Wilhelm, Angele Zamarron, Marti A. Hearst, Daniel S. Weld, Doug Downey, Luca Soldaini:
PaperMage: A Unified Toolkit for Processing, Representing, and Manipulating Visually-Rich Scientific Documents. EMNLP (Demos) 2023: 495-507 - [c33]Benjamin Newman, Luca Soldaini, Raymond Fok, Arman Cohan, Kyle Lo:
A Question Answering Framework for Decontextualizing User-facing Snippets from Scientific Documents. EMNLP 2023: 3194-3212 - [c32]John M. Giorgi, Luca Soldaini, Bo Wang, Gary D. Bader, Kyle Lo, Lucy Lu Wang, Arman Cohan:
Open Domain Multi-document Summarization: A Comprehensive Study of Model Brittleness under Retrieval. EMNLP (Findings) 2023: 8177-8199 - [c31]Organizers Of QueerInAI, Anaelia Ovalle, Arjun Subramonian, Ashwin Singh, Claas Voelcker, Danica J. Sutherland, Davide Locatelli, Eva Breznik, Filip Klubicka, Hang Yuan, Hetvi Jethwani, Huan Zhang, Jaidev Shriram, Kruno Lehman, Luca Soldaini, Maarten Sap, Marc Peter Deisenroth, Maria Leonor Pacheco, Maria Ryskina, Martin Mundt, Milind Agarwal, Nyx McLean, Pan Xu, Pranav A, Raj Korpan, Ruchira Ray, Sarah Mathew, Sarthak Arora, St John, Tanvi Anand, Vishakha Agrawal, William Agnew, Yanan Long, Zijie J. Wang, Zeerak Talat, Avijit Ghosh, Nathaniel Dennler, Michael Noseworthy, Sharvani Jha, Emi Baylor, Aditya Joshi, Natalia Y. Bilenko, Andrew McNamara, Raphael Gontijo Lopes, Alex Markham, Evyn Dong, Jackie Kay, Manu Saraswat, Nikhil Vytla, Luke Stark:
Queer In AI: A Case Study in Community-Led Participatory AI. FAccT 2023: 1882-1895 - [c30]Raymond Fok, Hita Kambhamettu, Luca Soldaini, Jonathan Bragg, Kyle Lo, Marti A. Hearst, Andrew Head, Daniel S. Weld:
Scim: Intelligent Skimming Support for Scientific Papers. IUI 2023: 476-490 - [c29]Sean MacAvaney, Luca Soldaini:
One-Shot Labeling for Automatic Relevance Estimation. SIGIR 2023: 2230-2235 - [c28]Dawn J. Lawrie, Sean MacAvaney, James Mayfield, Paul McNamee, Douglas W. Oard, Luca Soldaini, Eugene Yang:
Overview of the TREC 2023 NeuCLIR Track. TREC 2023 - 2022
- [c27]Yoshitomo Matsubara, Luca Soldaini, Eric Lind, Alessandro Moschitti:
Ensemble Transformer for Efficient and Accurate Ranking Tasks: an Application to Question Answering Systems. EMNLP (Findings) 2022: 7259-7272 - [c26]Matteo Gabburo, Rik Koncel-Kedziorski, Siddhant Garg, Luca Soldaini, Alessandro Moschitti:
Knowledge Transfer from Answer Ranking to Answer Generation. EMNLP 2022: 9481-9495 - [c25]Luca Di Liello, Siddhant Garg, Luca Soldaini, Alessandro Moschitti:
Pre-training Transformer Models with Sentence-Level Objectives for Answer Sentence Selection. EMNLP 2022: 11806-11816 - [c24]Benjamin Muller, Luca Soldaini, Rik Koncel-Kedziorski, Eric Lind, Alessandro Moschitti:
Cross-Lingual Open-Domain Question Answering with Answer Sentence Generation. AACL/IJCNLP (1) 2022: 337-353 - [c23]Luca Di Liello, Siddhant Garg, Luca Soldaini, Alessandro Moschitti:
Paragraph-based Transformer Pre-training for Multi-Sentence Inference. NAACL-HLT 2022: 2521-2531 - [c22]Dawn J. Lawrie, Sean MacAvaney, James Mayfield, Paul McNamee, Douglas W. Oard, Luca Soldaini, Eugene Yang:
Overview of the TREC 2022 NeuCLIR Track. TREC 2022 - 2021
- [c21]Chao-Chun Hsu, Eric Lind, Luca Soldaini, Alessandro Moschitti:
Answer Generation for Retrieval-based Question Answering Systems. ACL/IJCNLP (Findings) 2021: 4276-4282 - [c20]Rujun Han, Luca Soldaini, Alessandro Moschitti:
Modeling Context in Answer Sentence Selection Systems on a Latency Budget. EACL 2021: 3005-3010 - 2020
- [c19]Luca Soldaini, Alessandro Moschitti:
The Cascade Transformer: an Application for Efficient Answer Sentence Selection. ACL 2020: 5697-5708 - [c18]Mingda Li, Xinyue Liu, Weitong Ruan, Luca Soldaini, Wael Hamza, Chengwei Su:
Multi-task Learning of Spoken Language Understanding by Integrating N-Best Hypotheses with Hierarchical Attention. COLING (Industry) 2020: 113-123 - [c17]Sean MacAvaney, Luca Soldaini, Nazli Goharian:
Teaching a New Dog Old Tricks: Resurrecting Multilingual Retrieval Using Zero-Shot Learning. ECIR (2) 2020: 246-254 - [c16]Subendhu Rongali, Luca Soldaini, Emilio Monti, Wael Hamza:
Don't Parse, Generate! A Sequence to Sequence Architecture for Task-Oriented Semantic Parsing. WWW 2020: 2962-2968 - 2018
- [c15]Sean MacAvaney, Bart Desmet, Arman Cohan, Luca Soldaini, Andrew Yates, Ayah Zirikly, Nazli Goharian:
RSDD-Time: Temporal Annotation of Self-Reported Mental Health Diagnoses. CLPsych@NAACL-HTL 2018: 168-173 - [c14]Luca Soldaini, Timothy Walsh, Arman Cohan, Julien Han, Nazli Goharian:
Helping or Hurting? Predicting Changes in Users' Risk of Self-Harm Through Online Community Interactions. CLPsych@NAACL-HTL 2018: 194-203 - [c13]Ziling Fan, Luca Soldaini, Arman Cohan, Nazli Goharian:
Relation Extraction for Protein-protein Interactions Affected by Mutations. BCB 2018: 506-507 - [c12]Arman Cohan, Bart Desmet, Andrew Yates, Luca Soldaini, Sean MacAvaney, Nazli Goharian:
SMHD: a Large-Scale Resource for Exploring Online Language Usage for Multiple Mental Health Conditions. COLING 2018: 1485-1497 - [c11]Sean MacAvaney, Luca Soldaini, Arman Cohan, Nazli Goharian:
GU IRLAB at SemEval-2018 Task 7: Tree-LSTMs for Scientific Relation Classification. SemEval@NAACL-HLT 2018: 831-835 - [c10]Sean MacAvaney, Andrew Yates, Arman Cohan, Luca Soldaini, Kai Hui, Nazli Goharian, Ophir Frieder:
Overcoming Low-Utility Facets for Complex Answer Retrieval. ProfS/KG4IR/Data:Search@SIGIR 2018: 46-47 - [c9]Sean MacAvaney, Andrew Yates, Arman Cohan, Luca Soldaini, Kai Hui, Nazli Goharian, Ophir Frieder:
Characterizing Question Facets for Complex Answer Retrieval. SIGIR 2018: 1205-1208 - 2017
- [c8]Luca Soldaini, Andrew Yates, Nazli Goharian:
Denoising Clinical Notes for Medical Literature Retrieval with Convolutional Neural Model. CIKM 2017: 2307-2310 - [c7]Luca Soldaini, Nazli Goharian:
Learning to Rank for Consumer Health Search: A Semantic Approach. ECIR 2017: 640-646 - [c6]Luca Soldaini, Elad Yom-Tov:
Inferring Individual Attributes from Search Engine Queries and Auxiliary Information. WWW 2017: 293-301 - 2016
- [c5]Luca Soldaini, Will Edman, Nazli Goharian:
Team GU-IRLAB at CLEF eHealth 2016: Task 3. CLEF (Working Notes) 2016: 143-146 - 2015
- [c4]Luca Soldaini, Arman Cohan, Andrew Yates, Nazli Goharian, Ophir Frieder:
Retrieving Medical Literature for Clinical Decision Support. ECIR 2015: 538-549 - [c3]Arman Cohan, Luca Soldaini, Nazli Goharian:
Matching Citation Text and Cited Spans in Biomedical Literature: a Search-Oriented Approach. HLT-NAACL 2015: 1042-1048 - 2014
- [c2]Arman Cohan, Luca Soldaini, Andrew Yates, Nazli Goharian, Ophir Frieder:
On clinical decision support. BCB 2014: 651-652 - [c1]Luca Soldaini, Arman Cohan, Andrew Yates, Nazli Goharian, Ophir Frieder:
Query Reformulation for Clinical Decision Support Search. TREC 2014
Editorship
- 2023
- [e1]Danilo Croce, Luca Soldaini:
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics. EACL 2023 - System Demonstrations, Dubrovnik, Croatia, May 2-4, 2023. Association for Computational Linguistics 2023, ISBN 978-1-959429-45-6 [contents]
Informal and Other Publications
- 2024
- [i47]Li Lucy, Suchin Gururangan, Luca Soldaini, Emma Strubell, David Bamman, Lauren Klein, Jesse Dodge:
AboutMe: Using Self-Descriptions in Webpages to Document the Effects of English Pretraining Data Filters. CoRR abs/2401.06408 (2024) - [i46]Luca Soldaini, Rodney Kinney, Akshita Bhagia, Dustin Schwenk, David Atkinson, Russell Authur, Ben Bogin, Khyathi Raghavi Chandu, Jennifer Dumas, Yanai Elazar, Valentin Hofmann, Ananya Harsh Jha, Sachin Kumar, Li Lucy, Xinxi Lyu, Nathan Lambert, Ian Magnusson, Jacob Morrison, Niklas Muennighoff, Aakanksha Naik, Crystal Nam, Matthew E. Peters, Abhilasha Ravichander, Kyle Richardson, Zejiang Shen, Emma Strubell, Nishant Subramani, Oyvind Tafjord, Pete Walsh, Luke Zettlemoyer, Noah A. Smith, Hannaneh Hajishirzi, Iz Beltagy, Dirk Groeneveld, Jesse Dodge, Kyle Lo:
Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research. CoRR abs/2402.00159 (2024) - [i45]Dirk Groeneveld, Iz Beltagy, Pete Walsh, Akshita Bhagia, Rodney Kinney, Oyvind Tafjord, Ananya Harsh Jha, Hamish Ivison, Ian Magnusson, Yizhong Wang, Shane Arora, David Atkinson, Russell Authur, Khyathi Raghavi Chandu, Arman Cohan, Jennifer Dumas, Yanai Elazar, Yuling Gu, Jack Hessel, Tushar Khot, William Merrill, Jacob Morrison, Niklas Muennighoff, Aakanksha Naik, Crystal Nam, Matthew E. Peters, Valentina Pyatkin, Abhilasha Ravichander, Dustin Schwenk, Saurabh Shah, Will Smith, Emma Strubell, Nishant Subramani, Mitchell Wortsman, Pradeep Dasigi, Nathan Lambert, Kyle Richardson, Luke Zettlemoyer, Jesse Dodge, Kyle Lo, Luca Soldaini, Noah A. Smith, Hannaneh Hajishirzi:
OLMo: Accelerating the Science of Language Models. CoRR abs/2402.00838 (2024) - [i44]Fangyuan Xu, Kyle Lo, Luca Soldaini, Bailey Kuehl, Eunsol Choi, David Wadden:
KIWI: A Dataset of Knowledge-Intensive Writing Instructions for Answering Research Questions. CoRR abs/2403.03866 (2024) - [i43]Orion Weller, Benjamin Chang, Sean MacAvaney, Kyle Lo, Arman Cohan, Benjamin Van Durme, Dawn J. Lawrie, Luca Soldaini:
FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions. CoRR abs/2403.15246 (2024) - [i42]Dawn J. Lawrie, Sean MacAvaney, James Mayfield, Paul McNamee, Douglas W. Oard, Luca Soldaini, Eugene Yang:
Overview of the TREC 2023 NeuCLIR Track. CoRR abs/2404.08071 (2024) - [i41]James Mayfield, Eugene Yang, Dawn J. Lawrie, Sean MacAvaney, Paul McNamee, Douglas W. Oard, Luca Soldaini, Ian Soboroff, Orion Weller, Efsun Selin Kayi, Kate Sanders, Marc Mason, Noah Hibbler:
On the Evaluation of Machine-Generated Reports. CoRR abs/2405.00982 (2024) - [i40]David Wadden, Kejian Shi, Jacob Morrison, Aakanksha Naik, Shruti Singh, Nitzan Barzilay, Kyle Lo, Tom Hope, Luca Soldaini, Shannon Zejiang Shen, Doug Downey, Hannaneh Hajishirzi, Arman Cohan:
SciRIFF: A Resource to Enhance Language Model Instruction-Following over Scientific Literature. CoRR abs/2406.07835 (2024) - [i39]Jeffrey Li, Alex Fang, Georgios Smyrnis, Maor Ivgi, Matt Jordan, Samir Yitzhak Gadre, Hritik Bansal, Etash Kumar Guha, Sedrick Keh, Kushal Arora, Saurabh Garg, Rui Xin, Niklas Muennighoff, Reinhard Heckel, Jean Mercat, Mayee Chen, Suchin Gururangan, Mitchell Wortsman, Alon Albalak, Yonatan Bitton, Marianna Nezhurina, Amro Abbas, Cheng-Yu Hsieh, Dhruba Ghosh, Josh Gardner, Maciej Kilian, Hanlin Zhang, Rulin Shao, Sarah M. Pratt, Sunny Sanyal, Gabriel Ilharco, Giannis Daras, Kalyani Marathe, Aaron Gokaslan, Jieyu Zhang, Khyathi Raghavi Chandu, Thao Nguyen, Igor Vasiljevic, Sham M. Kakade, Shuran Song, Sujay Sanghavi, Fartash Faghri, Sewoong Oh, Luke Zettlemoyer, Kyle Lo, Alaaeldin El-Nouby, Hadi Pouransari, Alexander Toshev, Stephanie Wang, Dirk Groeneveld, Luca Soldaini, Pang Wei Koh, Jenia Jitsev, Thomas Kollar, Alexandros G. Dimakis, Yair Carmon, Achal Dave, Ludwig Schmidt, Vaishaal Shankar:
DataComp-LM: In search of the next generation of training sets for language models. CoRR abs/2406.11794 (2024) - [i38]Shayne Longpre, Stella Biderman, Alon Albalak, Hailey Schoelkopf, Daniel McDuff, Sayash Kapoor, Kevin Klyman, Kyle Lo, Gabriel Ilharco, Nay San, Maribeth Rauh, Aviya Skowron, Bertie Vidgen, Laura Weidinger, Arvind Narayanan, Victor Sanh, David Ifeoluwa Adelani, Percy Liang, Rishi Bommasani, Peter Henderson, Sasha Luccioni, Yacine Jernite, Luca Soldaini:
The Responsible Foundation Model Development Cheatsheet: A Review of Tools & Resources. CoRR abs/2406.16746 (2024) - [i37]Nathan Lambert, Hailey Schoelkopf, Aaron Gokaslan, Luca Soldaini, Valentina Pyatkin, Louis Castricato:
Self-Directed Synthetic Dialogues and Revisions Technical Report. CoRR abs/2407.18421 (2024) - [i36]Li Lucy, Tal August, Rose E. Wang, Luca Soldaini, Courtney Allison, Kyle Lo:
Evaluating Language Model Math Reasoning via Grounding in Educational Curricula. CoRR abs/2408.04226 (2024) - [i35]Niklas Muennighoff, Luca Soldaini, Dirk Groeneveld, Kyle Lo, Jacob Morrison, Sewon Min, Weijia Shi, Pete Walsh, Oyvind Tafjord, Nathan Lambert, Yuling Gu, Shane Arora, Akshita Bhagia, Dustin Schwenk, David Wadden, Alexander Wettig, Binyuan Hui, Tim Dettmers, Douwe Kiela, Ali Farhadi, Noah A. Smith, Pang Wei Koh, Amanpreet Singh, Hannaneh Hajishirzi:
OLMoE: Open Mixture-of-Experts Language Models. CoRR abs/2409.02060 (2024) - [i34]Hyunji Lee, Luca Soldaini, Arman Cohan, Minjoon Seo, Kyle Lo:
RouterRetriever: Exploring the Benefits of Routing over Multiple Expert Embedding Models. CoRR abs/2409.02685 (2024) - [i33]Matt Deitke, Christopher Clark, Sangho Lee, Rohun Tripathi, Yue Yang, Jae Sung Park, Mohammadreza Salehi, Niklas Muennighoff, Kyle Lo, Luca Soldaini, Jiasen Lu, Taira Anderson, Erin Bransom, Kiana Ehsani, Huong Ngo, Yen-Sung Chen, Ajay Patel, Mark Yatskar, Chris Callison-Burch, Andrew Head, Rose Hendrix, Favyen Bastani, Eli VanderBilt, Nathan Lambert, Yvonne Chou, Arnavi Chheda, Jenna Sparks, Sam Skjonsberg, Michael Schmitz, Aaron Sarnat, Byron Bischoff, Pete Walsh, Chris Newell, Piper Wolters, Tanmay Gupta, Kuo-Hao Zeng, Jon Borchardt, Dirk Groeneveld, Jen Dumas, Crystal Nam, Sophie Lebrecht, Caitlin Wittlif, Carissa Schoenick, Oscar Michel, Ranjay Krishna, Luca Weihs, Noah A. Smith, Hannaneh Hajishirzi, Ross B. Girshick, Ali Farhadi, Aniruddha Kembhavi:
Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models. CoRR abs/2409.17146 (2024) - 2023
- [i32]Rodney Kinney, Chloe Anastasiades, Russell Authur, Iz Beltagy, Jonathan Bragg, Alexandra Buraczynski, Isabel Cachola, Stefan Candra, Yoganand Chandrasekhar, Arman Cohan, Miles Crawford, Doug Downey, Jason Dunkelberger, Oren Etzioni, Rob Evans, Sergey Feldman, Joseph Gorney, David Graham, Fangzhou Hu, Regan Huff, Daniel King, Sebastian Kohlmeier, Bailey Kuehl, Michael Langan, Daniel Lin, Haokun Liu, Kyle Lo, Jaron Lochner, Kelsey MacMillan, Tyler Murray, Chris Newell, Smita Rao, Shaurya Rohatgi, Paul Sayre, Zejiang Shen, Amanpreet Singh, Luca Soldaini, Shivashankar Subramanian, Amber Tanaka, Alex D. Wade, Linda Wagner, Lucy Lu Wang, Chris Wilhelm, Caroline Wu, Jiangjiang Yang, Angele Zamarron, Madeleine van Zuylen, Daniel S. Weld:
The Semantic Scholar Open Data Platform. CoRR abs/2301.10140 (2023) - [i31]Sean MacAvaney, Luca Soldaini:
One-Shot Labeling for Automatic Relevance Estimation. CoRR abs/2302.11266 (2023) - [i30]Kyle Lo, Joseph Chee Chang, Andrew Head, Jonathan Bragg, Amy X. Zhang, Cassidy Trier, Chloe Anastasiades, Tal August, Russell Authur, Danielle Bragg, Erin Bransom, Isabel Cachola, Stefan Candra, Yoganand Chandrasekhar, Yen-Sung Chen, Evie Yu-Yen Cheng, Yvonne Chou, Doug Downey, Rob Evans, Raymond Fok, Fangzhou Hu, Regan Huff, Dongyeop Kang, Tae Soo Kim, Rodney Kinney, Aniket Kittur, Hyeonsu B. Kang, Egor Klevak, Bailey Kuehl, Michael Langan, Matt Latzke, Jaron Lochner, Kelsey MacMillan, Eric Marsh, Tyler Murray, Aakanksha Naik, Ngoc-Uyen Nguyen, Srishti Palani, Soya Park, Caroline Paulic, Napol Rachatasumrit, Smita Rao, Paul Sayre, Zejiang Shen, Pao Siangliulue, Luca Soldaini, Huy Tran, Madeleine van Zuylen, Lucy Lu Wang, Chris Wilhelm, Caroline Wu, Jiangjiang Yang, Angele Zamarron, Marti A. Hearst, Daniel S. Weld:
The Semantic Reader Project: Augmenting Scholarly Documents through AI-Powered Interactive Reading Interfaces. CoRR abs/2303.14334 (2023) - [i29]Anaelia Ovalle, Arjun Subramonian, Ashwin Singh, Claas Voelcker, Danica J. Sutherland, Davide Locatelli, Eva Breznik, Filip Klubicka, Hang Yuan, Hetvi Jethwani, Huan Zhang, Jaidev Shriram, Kruno Lehman, Luca Soldaini, Maarten Sap, Marc Peter Deisenroth, Maria Leonor Pacheco, Maria Ryskina, Martin Mundt, Milind Agarwal, Nyx McLean, Pan Xu, Pranav A, Raj Korpan, Ruchira Ray, Sarah Mathew, Sarthak Arora, St John, Tanvi Anand, Vishakha Agrawal, William Agnew, Yanan Long, Zijie J. Wang, Zeerak Talat, Avijit Ghosh, Nathaniel Dennler, Michael Noseworthy, Sharvani Jha, Emi Baylor, Aditya Joshi, Natalia Y. Bilenko, Andrew McNamara, Raphael Gontijo Lopes, Alex Markham, Evyn Dong, Jackie Kay, Manu Saraswat, Nikhil Vytla, Luke Stark:
Queer In AI: A Case Study in Community-Led Participatory AI. CoRR abs/2303.16972 (2023) - [i28]Dawn J. Lawrie, Sean MacAvaney, James Mayfield, Paul McNamee, Douglas W. Oard, Luca Soldaini, Eugene Yang:
Overview of the TREC 2022 NeuCLIR Track. CoRR abs/2304.12367 (2023) - [i27]Benjamin Newman, Luca Soldaini, Raymond Fok, Arman Cohan, Kyle Lo:
A Controllable QA-based Framework for Decontextualization. CoRR abs/2305.14772 (2023) - [i26]Organizers Of QueerInAI, Nathaniel Dennler, Anaelia Ovalle, Ashwin Singh, Luca Soldaini, Arjun Subramonian, Huy Tu, William Agnew, Avijit Ghosh, Kyra Yee, Irene Font Peradejordi, Zeerak Talat, Mayra Russo, Jess de Jesus de Pinho Pinhal:
Bound by the Bounty: Collaboratively Shaping Evaluation Processes for Queer AI Harms. CoRR abs/2307.10223 (2023) - [i25]Orion Weller, Kyle Lo, David Wadden, Dawn J. Lawrie, Benjamin Van Durme, Arman Cohan, Luca Soldaini:
When do Generative Query and Document Expansions Fail? A Comprehensive Study Across Methods, Retrievers, and Datasets. CoRR abs/2309.08541 (2023) - [i24]Pratyusha Ria Kalluri, William Agnew, Myra Cheng, Kentrell Owens, Luca Soldaini, Abeba Birhane:
The Surveillance AI Pipeline. CoRR abs/2309.15084 (2023) - [i23]Yanai Elazar, Akshita Bhagia, Ian Magnusson, Abhilasha Ravichander, Dustin Schwenk, Alane Suhr, Pete Walsh, Dirk Groeneveld, Luca Soldaini, Sameer Singh, Hanna Hajishirzi, Noah A. Smith, Jesse Dodge:
What's In My Big Data? CoRR abs/2310.20707 (2023) - [i22]Hyunji Lee, Luca Soldaini, Arman Cohan, Minjoon Seo, Kyle Lo:
Back to Basics: A Simple Recipe for Improving Out-of-Domain Retrieval in Dense Encoders. CoRR abs/2311.09765 (2023) - [i21]Ian Magnusson, Akshita Bhagia, Valentin Hofmann, Luca Soldaini, Ananya Harsh Jha, Oyvind Tafjord, Dustin Schwenk, Evan Pete Walsh, Yanai Elazar, Kyle Lo, Dirk Groeneveld, Iz Beltagy, Hannaneh Hajishirzi, Noah A. Smith, Kyle Richardson, Jesse Dodge:
Paloma: A Benchmark for Evaluating Language Model Fit. CoRR abs/2312.10523 (2023) - 2022
- [i20]Yoshitomo Matsubara, Luca Soldaini, Eric Lind, Alessandro Moschitti:
Ensemble Transformer for Efficient and Accurate Ranking Tasks: an Application to Question Answering Systems. CoRR abs/2201.05767 (2022) - [i19]Luca Di Liello, Siddhant Garg, Luca Soldaini, Alessandro Moschitti:
Paragraph-based Transformer Pre-training for Multi-Sentence Inference. CoRR abs/2205.01228 (2022) - [i18]Luca Di Liello, Siddhant Garg, Luca Soldaini, Alessandro Moschitti:
Pre-training Transformer Models with Sentence-Level Objectives for Answer Sentence Selection. CoRR abs/2205.10455 (2022) - [i17]Jon Saad-Falcon, Amanpreet Singh, Luca Soldaini, Mike D'Arcy, Arman Cohan, Doug Downey:
Embedding Recycling for Language Models. CoRR abs/2207.04993 (2022) - [i16]Matteo Gabburo, Rik Koncel-Kedziorski, Siddhant Garg, Luca Soldaini, Alessandro Moschitti:
Knowledge Transfer from Answer Ranking to Answer Generation. CoRR abs/2210.12865 (2022) - [i15]John M. Giorgi, Luca Soldaini, Bo Wang, Gary D. Bader, Kyle Lo, Lucy Lu Wang, Arman Cohan:
Exploring the Challenges of Open Domain Multi-Document Summarization. CoRR abs/2212.10526 (2022) - 2021
- [i14]Rujun Han, Luca Soldaini, Alessandro Moschitti:
Modeling Context in Answer Sentence Selection Systems on a Latency Budget. CoRR abs/2101.12093 (2021) - [i13]Chao-Chun Hsu, Eric Lind, Luca Soldaini, Alessandro Moschitti:
Answer Generation for Retrieval-based Question Answering Systems. CoRR abs/2106.00955 (2021) - [i12]Benjamin Muller, Luca Soldaini, Rik Koncel-Kedziorski, Eric Lind, Alessandro Moschitti:
Cross-Lingual GenQA: A Language-Agnostic Generative Question Answering Approach for Open-Domain Question Answering. CoRR abs/2110.07150 (2021) - 2020
- [i11]Mingda Li, Weitong Ruan, Xinyue Liu, Luca Soldaini, Wael Hamza, Chengwei Su:
Improving Spoken Language Understanding By Exploiting ASR N-best Hypotheses. CoRR abs/2001.05284 (2020) - [i10]Subendhu Rongali, Luca Soldaini, Emilio Monti, Wael Hamza:
Don't Parse, Generate! A Sequence to Sequence Architecture for Task-Oriented Semantic Parsing. CoRR abs/2001.11458 (2020) - [i9]Luca Soldaini, Alessandro Moschitti:
The Cascade Transformer: an Application for Efficient Answer Sentence Selection. CoRR abs/2005.02534 (2020) - 2019
- [i8]Sean MacAvaney, Luca Soldaini, Nazli Goharian:
Teaching a New Dog Old Tricks: Resurrecting Multilingual Retrieval Using Zero-shot Learning. CoRR abs/1912.13080 (2019) - 2018
- [i7]Sean MacAvaney, Luca Soldaini, Arman Cohan, Nazli Goharian:
GU IRLAB at SemEval-2018 Task 7: Tree-LSTMs for Scientific Relation Classification. CoRR abs/1804.05408 (2018) - [i6]Luca Soldaini, Timothy Walsh, Arman Cohan, Julien Han, Nazli Goharian:
Helping or Hurting? Predicting Changes in Users' Risk of Self-Harm Through Online Community Interactions. CoRR abs/1804.07253 (2018) - [i5]Sean MacAvaney, Andrew Yates, Arman Cohan, Luca Soldaini, Kai Hui, Nazli Goharian, Ophir Frieder:
Characterizing Question Facets for Complex Answer Retrieval. CoRR abs/1805.00791 (2018) - [i4]Arman Cohan, Bart Desmet, Andrew Yates, Luca Soldaini, Sean MacAvaney, Nazli Goharian:
SMHD: A Large-Scale Resource for Exploring Online Language Usage for Multiple Mental Health Conditions. CoRR abs/1806.05258 (2018) - [i3]Sean MacAvaney, Bart Desmet, Arman Cohan, Luca Soldaini, Andrew Yates, Ayah Zirikly, Nazli Goharian:
RSDD-Time: Temporal Annotation of Self-Reported Mental Health Diagnoses. CoRR abs/1806.07916 (2018) - [i2]Sean MacAvaney, Andrew Yates, Arman Cohan, Luca Soldaini, Kai Hui, Nazli Goharian, Ophir Frieder:
Overcoming low-utility facets for complex answer retrieval. CoRR abs/1811.08772 (2018) - 2016
- [i1]Luca Soldaini, Elad Yom-Tov:
Inferring individual attributes from search engine queries and auxiliary information. CoRR abs/1610.08442 (2016)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-12-26 01:53 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint