default search action
Jimmy Lin
Person information
- affiliation: University of Waterloo, David R. Cheriton School of Computer Science
- affiliation: Twitter Inc., San Francisco, USA
- affiliation: University of Maryland, College Park, Institute for Advanced Computer Studies (UMIACS)
- affiliation: Massachusetts Institute of Technology (MIT), Artificial Intelligence Laboratory
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j66]Xinyu Zhang, Kelechi Ogueji, Xueguang Ma, Jimmy Lin:
Toward Best Practices for Training Multilingual Dense Retrieval Models. ACM Trans. Inf. Syst. 42(2): 39:1-39:33 (2024) - [c369]Jimmy Lin, Junkai Li, Jiasi Gao, Weizhi Ma, Yang Liu:
Jointly Modeling Spatio-Temporal Features of Tactile Signals for Action Classification. AAAI 2024: 13817-13825 - [c368]Mofetoluwa Adeyemi, Akintunde Oladipo, Ronak Pradeep, Jimmy Lin:
Zero-Shot Cross-Lingual Reranking with Large Language Models for Low-Resource Languages. ACL (Short Papers) 2024: 650-656 - [c367]Mohammad Dehghan, Mohammad Ali Alomrani, Sunyam Bagga, David Alfonso-Hermelo, Khalil Bibi, Abbas Ghaddar, Yingxue Zhang, Xiaoguang Li, Jianye Hao, Qun Liu, Jimmy Lin, Boxing Chen, Prasanna Parthasarathi, Mahdi Biparva, Mehdi Rezagholizadeh:
EWEK-QA : Enhanced Web and Efficient Knowledge Graph Retrieval for Citation-based Question Answering Systems. ACL (1) 2024: 14169-14187 - [c366]Ronak Pradeep, Jimmy Lin:
Towards Automated End-to-End Health Misinformation Free Search with a Large Language Model. ECIR (4) 2024: 78-86 - [c365]Ronak Pradeep, Daniel Lee, Ali Mousavi, Jeffrey Pound, Yisi Sang, Jimmy Lin, Ihab F. Ilyas, Saloni Potdar, Mostafa Arefiyan, Yunyao Li:
ConvKGYarn: Spinning Configurable and Scalable Conversational Knowledge Graph QA Datasets with Large Language Models. EMNLP (Industry Track) 2024: 1176-1206 - [c364]Shengyao Zhuang, Xueguang Ma, Bevan Koopman, Jimmy Lin, Guido Zuccon:
PromptReps: Prompting Large Language Models to Generate Dense and Sparse Representations for Zero-Shot Document Retrieval. EMNLP 2024: 4375-4391 - [c363]Raphael Tang, Crystina Zhang, Lixinyu Xu, Yao Lu, Wenyan Li, Pontus Stenetorp, Jimmy Lin, Ferhan Ture:
Words Worth a Thousand Pictures: Measuring and Understanding Perceptual Variability in Text-to-Image Generation. EMNLP 2024: 5441-5454 - [c362]Xueguang Ma, Sheng-Chieh Lin, Minghan Li, Wenhu Chen, Jimmy Lin:
Unifying Multimodal Retrieval via Document Screenshot Embedding. EMNLP 2024: 6492-6505 - [c361]Nandan Thakur, Luiz Bonifacio, Crystina Zhang, Odunayo Ogundepo, Ehsan Kamalloo, David Alfonso-Hermelo, Xiaoguang Li, Qun Liu, Boxing Chen, Mehdi Rezagholizadeh, Jimmy Lin:
"Knowing When You Don't Know": A Multilingual Relevance Assessment Dataset for Robust Retrieval-Augmented Generation. EMNLP (Findings) 2024: 12508-12526 - [c360]Jheng-Hong Yang, Jimmy Lin:
Toward Automatic Relevance Judgment using Vision-Language Models for Image-Text Retrieval Evaluation. LLM4Eval@SIGIR 2024: 113-123 - [c359]Crystina Zhang, Minghan Li, Jimmy Lin:
CELI: Simple yet Effective Approach to Enhance Out-of-Domain Generalization of Cross-Encoders. NAACL (Short Papers) 2024: 188-196 - [c358]Raphael Tang, Xinyu Crystina Zhang, Xueguang Ma, Jimmy Lin, Ferhan Ture:
Found in the Middle: Permutation Self-Consistency Improves Listwise Ranking in Large Language Models. NAACL-HLT 2024: 2327-2340 - [c357]Nandan Thakur, Jianmo Ni, Gustavo Hernández Ábrego, John Wieting, Jimmy Lin, Daniel Cer:
Leveraging LLMs for Synthesizing Training Data Across Many Languages in Multilingual Dense Retrieval. NAACL-HLT 2024: 7699-7724 - [c356]Mofetoluwa Adeyemi, Akintunde Oladipo, Xinyu Zhang, David Alfonso-Hermelo, Mehdi Rezagholizadeh, Boxing Chen, Abdul-Hakeem Omotayo, Idris Abdulmumin, Naome A. Etori, Toyib Babatunde Musa, Samuel Fanijo, Oluwabusayo Olufunke Awoyomi, Saheed Abdullahi Salahudeen, Labaran Adamu Mohammed, Daud Olamide Abolade, Falalu Ibrahim Lawan, Maryam Sabo Abubakar, Ruqayya Nasir Iro, Amina Abubakar Imam, Shafie Abdi Mohamed, Hanad Mohamud Mohamed, Tunde Oluwaseyi Ajayi, Jimmy Lin:
CIRAL: A Test Collection for CLIR Evaluations in African Languages. SIGIR 2024: 293-302 - [c355]Nandan Thakur, Luiz Bonifacio, Maik Fröbe, Alexander Bondarenko, Ehsan Kamalloo, Martin Potthast, Matthias Hagen, Jimmy Lin:
Systematic Evaluation of Neural Retrieval Models on the Touché 2020 Argument Retrieval Subset of BEIR. SIGIR 2024: 1420-1430 - [c354]Ehsan Kamalloo, Nandan Thakur, Carlos Lassance, Xueguang Ma, Jheng-Hong Yang, Jimmy Lin:
Resources for Brewing BEIR: Reproducible Reference Models and Statistical Analyses. SIGIR 2024: 1431-1440 - [c353]Minghan Li, Honglei Zhuang, Kai Hui, Zhen Qin, Jimmy Lin, Rolf Jagerman, Xuanhui Wang, Michael Bendersky:
Can Query Expansion Improve Generalization of Strong Cross-Encoder Rankers? SIGIR 2024: 2321-2326 - [c352]Xueguang Ma, Liang Wang, Nan Yang, Furu Wei, Jimmy Lin:
Fine-Tuning LLaMA for Multi-Stage Text Retrieval. SIGIR 2024: 2421-2425 - [c351]Akintunde Oladipo, Mofetoluwa Adeyemi, Jimmy Lin:
On Backbones and Training Regimes for Dense Retrieval in African Languages. SIGIR 2024: 2564-2568 - [c350]Ehsan Kamalloo, Shivani Upadhyay, Jimmy Lin:
Towards Robust QA Evaluation via Open LLMs. SIGIR 2024: 2811-2816 - [c349]Shi Zong, Santosh Kolagati, Amit Chaudhary, Josh Seltzer, Jimmy Lin:
Reflections on the Coding Ability of LLMs for Analyzing Market Research Surveys. SIGIR 2024: 2900-2904 - [c348]Jasper Xian, Tommaso Teofili, Ronak Pradeep, Jimmy Lin:
Vector Search with OpenAI Embeddings: Lucene Is All You Need. WSDM 2024: 1090-1093 - [i180]Jimmy Lin, Junkai Li, Jiasi Gao, Weizhi Ma, Yang Liu:
Jointly Modeling Spatio-Temporal Features of Tactile Signals for Action Classification. CoRR abs/2404.15279 (2024) - [i179]Shengyao Zhuang, Xueguang Ma, Bevan Koopman, Jimmy Lin, Guido Zuccon:
PromptReps: Prompting Large Language Models to Generate Dense and Sparse Representations for Zero-Shot Document Retrieval. CoRR abs/2404.18424 (2024) - [i178]Sheng-Chieh Lin, Luyu Gao, Barlas Oguz, Wenhan Xiong, Jimmy Lin, Wen-tau Yih, Xilun Chen:
FLAME: Factuality-Aware Alignment for Large Language Models. CoRR abs/2405.01525 (2024) - [i177]Shivani Upadhyay, Ehsan Kamalloo, Jimmy Lin:
LLMs Can Patch Up Missing Relevance Judgments in Evaluation. CoRR abs/2405.04727 (2024) - [i176]Sahel Sharifymoghaddam, Shivani Upadhyay, Wenhu Chen, Jimmy Lin:
UniRAG: Universal Retrieval Augmentation for Multi-Modal Large Language Models. CoRR abs/2405.10311 (2024) - [i175]Minghan Li, Xilun Chen, Ari Holtzman, Beidi Chen, Jimmy Lin, Wen-tau Yih, Xi Victoria Lin:
Nearest Neighbor Speculative Decoding for LLM Generation and Attribution. CoRR abs/2405.19325 (2024) - [i174]Shivani Upadhyay, Ronak Pradeep, Nandan Thakur, Nick Craswell, Jimmy Lin:
UMBRELA: UMbrela is the (Open-Source Reproduction of the) Bing RELevance Assessor. CoRR abs/2406.06519 (2024) - [i173]Raphael Tang, Xinyu Zhang, Lixinyu Xu, Yao Lu, Wenyan Li, Pontus Stenetorp, Jimmy Lin, Ferhan Ture:
Words Worth a Thousand Pictures: Measuring and Understanding Perceptual Variability in Text-to-Image Generation. CoRR abs/2406.08482 (2024) - [i172]Manveer Singh Tamber, Jasper Xian, Jimmy Lin:
Can't Hide Behind the API: Stealing Black-Box Commercial Embedding Models. CoRR abs/2406.09355 (2024) - [i171]Mohammad Dehghan, Mohammad Ali Alomrani, Sunyam Bagga, David Alfonso-Hermelo, Khalil Bibi, Abbas Ghaddar, Yingxue Zhang, Xiaoguang Li, Jianye Hao, Qun Liu, Jimmy Lin, Boxing Chen, Prasanna Parthasarathi, Mahdi Biparva, Mehdi Rezagholizadeh:
EWEK-QA: Enhanced Web and Efficient Knowledge Graph Retrieval for Citation-based Question Answering Systems. CoRR abs/2406.10393 (2024) - [i170]Xueguang Ma, Sheng-Chieh Lin, Minghan Li, Wenhu Chen, Jimmy Lin:
Unifying Multimodal Retrieval via Document Screenshot Embedding. CoRR abs/2406.11251 (2024) - [i169]Ronak Pradeep, Nandan Thakur, Sahel Sharifymoghaddam, Eric Zhang, Ryan Nguyen, Daniel Campos, Nick Craswell, Jimmy Lin:
Ragnarök: A Reusable RAG Framework and Baselines for TREC 2024 Retrieval-Augmented Generation Track. CoRR abs/2406.16828 (2024) - [i168]Shi Zong, Jimmy Lin:
Categorical Syllogisms Revisited: A Review of the Logical Reasoning Abilities of LLMs for Analyzing Categorical Syllogism. CoRR abs/2406.18762 (2024) - [i167]Nandan Thakur, Luiz Bonifacio, Maik Fröbe, Alexander Bondarenko, Ehsan Kamalloo, Martin Potthast, Matthias Hagen, Jimmy Lin:
Systematic Evaluation of Neural Retrieval Models on the Touché 2020 Argument Retrieval Subset of BEIR. CoRR abs/2407.07790 (2024) - [i166]Jheng-Hong Yang, Jimmy Lin:
Toward Automatic Relevance Judgment using Vision-Language Models for Image-Text Retrieval Evaluation. CoRR abs/2408.01363 (2024) - [i165]Ronak Pradeep, Daniel Lee, Ali Mousavi, Jeff Pound, Yisi Sang, Jimmy Lin, Ihab F. Ilyas, Saloni Potdar, Mostafa Arefiyan, Yunyao Li:
ConvKGYarn: Spinning Configurable and Scalable Conversational Knowledge Graph QA datasets with Large Language Models. CoRR abs/2408.05948 (2024) - [i164]Jimmy Lin:
Operational Advice for Dense and Sparse Retrievers: HNSW, Flat, or Inverted Indexes? CoRR abs/2409.06464 (2024) - 2023
- [j65]Sheng-Chieh Lin, Minghan Li, Jimmy Lin:
Aggretriever: A Simple Approach to Aggregate Textual Representations for Robust Dense Passage Retrieval. Trans. Assoc. Comput. Linguistics 11: 436-452 (2023) - [j64]Xinyu Zhang, Nandan Thakur, Odunayo Ogundepo, Ehsan Kamalloo, David Alfonso-Hermelo, Xiaoguang Li, Qun Liu, Mehdi Rezagholizadeh, Jimmy Lin:
MIRACL: A Multilingual Retrieval Dataset Covering 18 Diverse Languages. Trans. Assoc. Comput. Linguistics 11: 1114-1131 (2023) - [j63]Joel Mackenzie, Andrew Trotman, Jimmy Lin:
Efficient Document-at-a-time and Score-at-a-time Query Evaluation for Learned Sparse Representations. ACM Trans. Inf. Syst. 41(4): 96:1-96:28 (2023) - [j62]Sheng-Chieh Lin, Jimmy Lin:
A Dense Representation Framework for Lexical and Semantic Matching. ACM Trans. Inf. Syst. 41(4): 110:1-110:29 (2023) - [c347]Ehsan Kamalloo, Xinyu Zhang, Odunayo Ogundepo, Nandan Thakur, David Alfonso-Hermelo, Mehdi Rezagholizadeh, Jimmy Lin:
Evaluating Embedding APIs for Information Retrieval. ACL (industry) 2023: 518-526 - [c346]Aleksandra Piktus, Odunayo Ogundepo, Christopher Akiki, Akintunde Oladipo, Xinyu Zhang, Hailey Schoelkopf, Stella Biderman, Martin Potthast, Jimmy Lin:
GAIA Search: Hugging Face and Pyserini Interoperability for NLP Training Data Exploration. ACL (demo) 2023: 588-598 - [c345]Luyu Gao, Xueguang Ma, Jimmy Lin, Jamie Callan:
Precise Zero-Shot Dense Retrieval without Relevance Labels. ACL (1) 2023: 1762-1777 - [c344]Ji Xin, Raphael Tang, Zhiying Jiang, Yaoliang Yu, Jimmy Lin:
Operator Selection and Ordering in a Pipeline Approach to Efficiency Optimizations for Transformers. ACL (Findings) 2023: 2870-2882 - [c343]Raphael Tang, Linqing Liu, Akshat Pandey, Zhiying Jiang, Gefei Yang, Karun Kumar, Pontus Stenetorp, Jimmy Lin, Ferhan Ture:
What the DAAM: Interpreting Stable Diffusion Using Cross Attention. ACL (1) 2023: 5644-5659 - [c342]Zhiying Jiang, Matthew Y. R. Yang, Mikhail Tsirlin, Raphael Tang, Yiqin Dai, Jimmy Lin:
"Low-Resource" Text Classification: A Parameter-Free Classification Method with Compressors. ACL (Findings) 2023: 6810-6828 - [c341]Minghan Li, Sheng-Chieh Lin, Barlas Oguz, Asish Ghoshal, Jimmy Lin, Yashar Mehdad, Wen-tau Yih, Xilun Chen:
CITADEL: Conditional Token Interaction via Dynamic Lexical Routing for Efficient and Effective Multi-Vector Retrieval. ACL (1) 2023: 11891-11907 - [c340]Xueguang Ma, Tommaso Teofili, Jimmy Lin:
Anserini Gets Dense Retrieval: Integration of Lucene's HNSW Indexes. CIKM 2023: 5366-5370 - [c339]Wei Zhong, Yuqing Xie, Jimmy Lin:
Answer Retrieval for Math Questions Using Structural and Dense Retrieval. CLEF 2023: 209-223 - [c338]Ronak Pradeep, Haonan Chen, Lingwei Gu, Manveer Singh Tamber, Jimmy Lin:
PyGaggle: A Gaggle of Resources for Open-Domain Question Answering. ECIR (3) 2023: 148-162 - [c337]Manveer Singh Tamber, Ronak Pradeep, Jimmy Lin:
Pre-processing Matters! Improved Wikipedia Corpora for Open-Domain Question Answering. ECIR (3) 2023: 163-176 - [c336]Christopher Akiki, Odunayo Ogundepo, Aleksandra Piktus, Xinyu Zhang, Akintunde Oladipo, Jimmy Lin, Martin Potthast:
Spacerini: Plug-and-play Search Engines with Pyserini and Hugging Face. EMNLP (Demos) 2023: 140-148 - [c335]Akintunde Oladipo, Mofetoluwa Adeyemi, Orevaoghene Ahia, Abraham Toluwase Owodunni, Odunayo Ogundepo, David Ifeoluwa Adelani, Jimmy Lin:
Better Quality Pre-training Data and T5 Models for African Languages. EMNLP 2023: 158-168 - [c334]Ronak Pradeep, Kai Hui, Jai Gupta, Ádám D. Lelkes, Honglei Zhuang, Jimmy Lin, Donald Metzler, Vinh Q. Tran:
How Does Generative Retrieval Scale to Millions of Passages? EMNLP 2023: 1305-1321 - [c333]Sheng-Chieh Lin, Akari Asai, Minghan Li, Barlas Oguz, Jimmy Lin, Yashar Mehdad, Wen-tau Yih, Xilun Chen:
How to Train Your Dragon: Diverse Augmentation Towards Generalizable Dense Retrieval. EMNLP (Findings) 2023: 6385-6400 - [c332]Sheng-Chieh Lin, Amin Ahmad, Jimmy Lin:
mAggretriever: A Simple yet Effective Approach to Zero-Shot Multilingual Dense Retrieval. EMNLP 2023: 11688-11696 - [c331]Mofetoluwa Adeyemi, Akintunde Oladipo, Xinyu Zhang, David Alfonso-Hermelo, Mehdi Rezagholizadeh, Boxing Chen, Jimmy Lin:
CIRAL at FIRE 2023: Cross-Lingual Information Retrieval for African Languages. FIRE 2023: 4-6 - [c330]Mofetoluwa Adeyemi, Akintunde Oladipo, Xinyu Crystina Zhang, David Alfonso-Hermelo, Mehdi Rezagholizadeh, Boxing Chen, Jimmy Lin:
Overview of the CIRAL Track at FIRE 2023: Cross-lingual Information Retrieval for African Languages. FIRE (Working Notes) 2023: 118-136 - [c329]Wei Zhong, Sheng-Chieh Lin, Jheng-Hong Yang, Jimmy Lin:
One Blade for One Purpose: Advancing Math Information Retrieval using Hybrid Search. SIGIR 2023: 141-151 - [c328]Minghan Li, Sheng-Chieh Lin, Xueguang Ma, Jimmy Lin:
SLIM: Sparsified Late Interaction for Multi-Vector Retrieval with Inverted Indexes. SIGIR 2023: 1954-1959 - [c327]Chris Kamphuis, Aileen Lin, Siwen Yang, Jimmy Lin, Arjen P. de Vries, Faegheh Hasibi:
MMEAD: MS MARCO Entity Annotations and Disambiguations. SIGIR 2023: 2817-2825 - [c326]Nandan Thakur, Kexin Wang, Iryna Gurevych, Jimmy Lin:
SPRINT: A Unified Toolkit for Evaluating and Demystifying Zero-shot Neural Sparse Retrieval. SIGIR 2023: 2964-2974 - [c325]Jheng-Hong Yang, Carlos Lassance, Rafael Sampaio de Rezende, Krishna Srinivasan, Miriam Redi, Stéphane Clinchant, Jimmy Lin:
AToMiC: An Image/Text Retrieval Test Collection to Support Multimedia Content Creation. SIGIR 2023: 2975-2984 - [c324]Luyu Gao, Xueguang Ma, Jimmy Lin, Jamie Callan:
Tevatron: An Efficient and Flexible Toolkit for Neural Retrieval. SIGIR 2023: 3120-3124 - [c323]Xueguang Ma, Hengxin Fun, Xusen Yin, Antonio Mallia, Jimmy Lin:
Enhancing Sparse Retrieval via Unsupervised Learning. SIGIR-AP 2023: 150-157 - [i163]Shi Zong, Josh Seltzer, Jiahua Pan, Kathy Cheng, Jimmy Lin:
Which Model Shall I Choose? Cost/Quality Trade-offs for Text Classification Tasks. CoRR abs/2301.07006 (2023) - [i162]Minghan Li, Sheng-Chieh Lin, Xueguang Ma, Jimmy Lin:
SLIM: Sparsified Late Interaction for Multi-Vector Retrieval with Inverted Indexes. CoRR abs/2302.06587 (2023) - [i161]Xinyu Zhang, Minghan Li, Jimmy Lin:
Improving Out-of-Distribution Generalization of Neural Rerankers with Contextualized Late Interaction. CoRR abs/2302.06589 (2023) - [i160]Sheng-Chieh Lin, Akari Asai, Minghan Li, Barlas Oguz, Jimmy Lin, Yashar Mehdad, Wen-tau Yih, Xilun Chen:
How to Train Your DRAGON: Diverse Augmentation Towards Generalizable Dense Retrieval. CoRR abs/2302.07452 (2023) - [i159]Christopher Akiki, Odunayo Ogundepo, Aleksandra Piktus, Xinyu Zhang, Akintunde Oladipo, Jimmy Lin, Martin Potthast:
Spacerini: Plug-and-play Search Engines with Pyserini and Hugging Face. CoRR abs/2302.14534 (2023) - [i158]Jimmy Lin, David Alfonso-Hermelo, Vitor Jeronymo, Ehsan Kamalloo, Carlos Lassance, Rodrigo Frassetto Nogueira, Odunayo Ogundepo, Mehdi Rezagholizadeh, Nandan Thakur, Jheng-Hong Yang, Xinyu Zhang:
Simple Yet Effective Neural Ranking and Reranking Baselines for Cross-Lingual Information Retrieval. CoRR abs/2304.01019 (2023) - [i157]Jheng-Hong Yang, Carlos Lassance, Rafael Sampaio de Rezende, Krishna Srinivasan, Miriam Redi, Stéphane Clinchant, Jimmy Lin:
AToMiC: An Image/Text Retrieval Test Collection to Support Multimedia Content Creation. CoRR abs/2304.01961 (2023) - [i156]Xueguang Ma, Tommaso Teofili, Jimmy Lin:
Anserini Gets Dense Retrieval: Integration of Lucene's HNSW Indexes. CoRR abs/2304.12139 (2023) - [i155]Xueguang Ma, Xinyu Zhang, Ronak Pradeep, Jimmy Lin:
Zero-Shot Listwise Document Reranking with a Large Language Model. CoRR abs/2305.02156 (2023) - [i154]Ehsan Kamalloo, Xinyu Zhang, Odunayo Ogundepo, Nandan Thakur, David Alfonso-Hermelo, Mehdi Rezagholizadeh, Jimmy Lin:
Evaluating Embedding APIs for Information Retrieval. CoRR abs/2305.06300 (2023) - [i153]Josh Seltzer, Jiahua Pan, Kathy Cheng, Yuxiao Sun, Santosh Kolagati, Jimmy Lin, Shi Zong:
SmartProbe: A Virtual Moderator for Market Research Surveys. CoRR abs/2305.08271 (2023) - [i152]Ronak Pradeep, Kai Hui, Jai Gupta, Ádám Dániel Lelkes, Honglei Zhuang, Jimmy Lin, Donald Metzler, Vinh Q. Tran:
How Does Generative Retrieval Scale to Millions of Passages? CoRR abs/2305.11841 (2023) - [i151]Vanessa Liao, Syed Shariyar Murtaza, Yifan Nie, Jimmy Lin:
Regex-augmented Domain Transfer Topic Classification based on a Pre-trained Language Model: An application in Financial Domain. CoRR abs/2305.18324 (2023) - [i150]Aleksandra Piktus, Odunayo Ogundepo, Christopher Akiki, Akintunde Oladipo, Xinyu Zhang, Hailey Schoelkopf, Stella Biderman, Martin Potthast, Jimmy Lin:
GAIA Search: Hugging Face and Pyserini Interoperability for NLP Training Data Exploration. CoRR abs/2306.01481 (2023) - [i149]Ehsan Kamalloo, Nandan Thakur, Carlos Lassance, Xueguang Ma, Jheng-Hong Yang, Jimmy Lin:
Resources for Brewing BEIR: Reproducible Reference Models and an Official Leaderboard. CoRR abs/2306.07471 (2023) - [i148]Nandan Thakur, Kexin Wang, Iryna Gurevych, Jimmy Lin:
SPRINT: A Unified Toolkit for Evaluating and Demystifying Zero-shot Neural Sparse Retrieval. CoRR abs/2307.10488 (2023) - [i147]Ehsan Kamalloo, Aref Jafari, Xinyu Zhang, Nandan Thakur, Jimmy Lin:
HAGRID: A Human-LLM Collaborative Dataset for Generative Information-Seeking with Attribution. CoRR abs/2307.16883 (2023) - [i146]Cynthia Huang, Yuqing Xie, Zhiying Jiang, Jimmy Lin, Ming Li:
Approximating Human-Like Few-shot Learning with GPT-based Compression. CoRR abs/2308.06942 (2023) - [i145]Jimmy Lin, Ronak Pradeep, Tommaso Teofili, Jasper Xian:
Vector Search with OpenAI Embeddings: Lucene Is All You Need. CoRR abs/2308.14963 (2023) - [i144]Zijun Wu, Anup Anand Deshmukh, Yongkang Wu, Jimmy Lin, Lili Mou:
Unsupervised Chunking with Hierarchical RNN. CoRR abs/2309.04919 (2023) - [i143]Chris Kamphuis, Aileen Lin, Siwen Yang, Jimmy Lin, Arjen P. de Vries, Faegheh Hasibi:
MMEAD: MS MARCO Entity Annotations and Disambiguations. CoRR abs/2309.07574 (2023) - [i142]Ronak Pradeep, Sahel Sharifymoghaddam, Jimmy Lin:
RankVicuna: Zero-Shot Listwise Document Reranking with Open-Source Large Language Models. CoRR abs/2309.15088 (2023) - [i141]Raphael Tang, Xinyu Zhang, Xueguang Ma, Jimmy Lin, Ferhan Ture:
Found in the Middle: Permutation Self-Consistency Improves Listwise Ranking in Large Language Models. CoRR abs/2310.07712 (2023) - [i140]Xueguang Ma, Liang Wang, Nan Yang, Furu Wei, Jimmy Lin:
Fine-Tuning LLaMA for Multi-Stage Text Retrieval. CoRR abs/2310.08319 (2023) - [i139]Nandan Thakur, Jianmo Ni, Gustavo Hernández Ábrego, John Wieting, Jimmy Lin, Daniel Cer:
Leveraging LLMs for Synthesizing Training Data Across Many Languages in Multilingual Dense Retrieval. CoRR abs/2311.05800 (2023) - [i138]Minghan Li, Honglei Zhuang, Kai Hui, Zhen Qin, Jimmy Lin, Rolf Jagerman, Xuanhui Wang, Michael Bendersky:
Generate, Filter, and Fuse: Query Expansion via Multi-Step Keyword Generation for Zero-Shot Neural Rankers. CoRR abs/2311.09175 (2023) - [i137]Haonan Chen, Carlos Lassance, Jimmy Lin:
End-to-End Retrieval with Learned Dense and Sparse Representations Using Lucene. CoRR abs/2311.18503 (2023) - [i136]Raphael Tang, Xinyu Zhang, Jimmy Lin, Ferhan Ture:
What Do Llamas Really Think? Revealing Preference Biases in Language Model Representations. CoRR abs/2311.18812 (2023) - [i135]Jimmy Lin, Tommaso Teofili:
Searching Dense Representations with Inverted Indexes. CoRR abs/2312.01556 (2023) - [i134]Ronak Pradeep, Sahel Sharifymoghaddam, Jimmy Lin:
RankZephyr: Effective and Robust Zero-Shot Listwise Reranking is a Breeze! CoRR abs/2312.02724 (2023) - [i133]Xinyu Zhang, Sebastian Hofstätter, Patrick S. H. Lewis, Raphael Tang, Jimmy Lin:
Rank-without-GPT: Building GPT-Independent Listwise Rerankers on Open-Source Large Language Models. CoRR abs/2312.02969 (2023) - [i132]Nandan Thakur, Luiz Bonifacio, Xinyu Zhang, Odunayo Ogundepo, Ehsan Kamalloo, David Alfonso-Hermelo, Xiaoguang Li, Qun Liu, Boxing Chen, Mehdi Rezagholizadeh, Jimmy Lin:
NoMIRACL: Knowing When You Don't Know for Robust Multilingual Retrieval-Augmented Generation. CoRR abs/2312.11361 (2023) - [i131]Manveer Singh Tamber, Ronak Pradeep, Jimmy Lin:
Scaling Down, LiTting Up: Efficient Zero-Shot Listwise Reranking with Seq2seq Encoder-Decoder Models. CoRR abs/2312.16098 (2023) - [i130]Mofetoluwa Adeyemi, Akintunde Oladipo, Ronak Pradeep, Jimmy Lin:
Zero-Shot Cross-Lingual Reranking with Large Language Models for Low-Resource Languages. CoRR abs/2312.16159 (2023) - 2022
- [c322]Sankeerth Durvasula, Raymond Kiguru, Samarth Mathur, Jenny Xu, Jimmy Lin, Nandita Vijaykumar:
VoxelCache: Accelerating Online Mapping in Robotics and 3D Reconstruction Tasks. PACT 2022: 239-251 - [c321]Hang Li, Shengyao Zhuang, Xueguang Ma, Jimmy Lin, Guido Zuccon:
Pseudo-Relevance Feedback with Dense Retrievers in Pyserini. ADCS 2022: 1:1-1:6 - [c320]Wei Zhong, Yuqing Xie, Jimmy Lin:
Applying Structural and Dense Semantic Matching for the ARQMath Lab 2022, CLEF. CLEF (Working Notes) 2022: 147-170 - [c319]Chris Kamphuis, Faegheh Hasibi, Jimmy Lin, Arjen P. de Vries:
REBL: Entity Linking at Scale (prototype). DESIRES 2022: 68-75 - [c318]Hang Li, Shengyao Zhuang, Ahmed Mourad, Xueguang Ma, Jimmy Lin, Guido Zuccon:
Improving Query Representations for Dense Retrieval with Pseudo Relevance Feedback: A Reproducibility Study. ECIR (1) 2022: 599-612 - [c317]Xueguang Ma, Kai Sun, Ronak Pradeep, Minghan Li, Jimmy Lin:
Another Look at DPR: Reproduction of Training and Replication of Retrieval. ECIR (1) 2022: 613-626 - [c316]Ronak Pradeep, Yuqi Liu, Xinyu Zhang, Yilin Li, Andrew Yates, Jimmy Lin:
Squeezing Water from a Stone: A Bag of Tricks for Further Improving Cross-Encoder Effectiveness for Reranking. ECIR (1) 2022: 655-670 - [c315]Raphael Tang, Karun Kumar, Gefei Yang, Akshat Pandey, Yajie Mao, Vladislav Belyaev, Madhuri Emmadi, G. Craig Murray, Ferhan Ture, Jimmy Lin:
SpeechNet: Weakly Supervised, End-to-End Speech Recognition at Industrial Scale. EMNLP (Industry Track) 2022: 285-293 - [c314]Minghan Li, Xinyu Zhang, Ji Xin, Hongyang Zhang, Jimmy Lin:
Certified Error Control of Candidate Set Pruning for Two-Stage Relevance Ranking. EMNLP 2022: 333-345 - [c313]Yizhen Zhong, Jiajie Xiao, Thomas Vetterli, Mahan Matin, Ellen Loo, Jimmy Lin, Richard Bourgon, Ofer Shapira:
Improving Precancerous Case Characterization via Transformer-based Ensemble Learning. EMNLP (Industry Track) 2022: 379-389 - [c312]Wei Zhong, Jheng-Hong Yang, Yuqing Xie, Jimmy Lin:
Evaluating Token-Level and Passage-Level Dense Retrieval Models for Math Information Retrieval. EMNLP (Findings) 2022: 1092-1102 - [c311]Peng Shi, Rui Zhang, He Bai, Jimmy Lin:
XRICL: Cross-lingual Retrieval-Augmented In-Context Learning for Cross-lingual Text-to-SQL Semantic Parsing. EMNLP (Findings) 2022: 5248-5259 - [c310]Peng Shi, Linfeng Song, Lifeng Jin, Haitao Mi, He Bai, Jimmy Lin, Dong Yu:
Cross-lingual Text-to-SQL Semantic Parsing with Representation Mixup. EMNLP (Findings) 2022: 5296-5306 - [c309]Odunayo Ogundepo, Xinyu Zhang, Shuo Sun, Kevin Duh, Jimmy Lin:
AfriCLIRMatrix: Enabling Cross-Lingual Information Retrieval for African Languages. EMNLP 2022: 8721-8728 - [c308]Raphael Tang, Karun Kumar, Ji Xin, Piyush Vyas, Wenyan Li, Gefei Yang, Yajie Mao, G. Craig Murray, Jimmy Lin:
Temporal Early Exiting for Streaming Speech Commands Recognition. ICASSP 2022: 7567-7571 - [c307]Matthew Y. R. Yang, Siwen Yang, Jimmy Lin:
Integration of text and geospatial search for hydrographic datasets using the lucene search library. JCDL 2022: 36 - [c306]Zhiying Jiang, Yiqin Dai, Ji Xin, Ming Li, Jimmy Lin:
Few-Shot Non-Parametric Learning with Deep Latent Variable Model. NeurIPS 2022 - [c305]Ronak Pradeep, Yilin Li, Yuetong Wang, Jimmy Lin:
Neural Query Synthesis and Domain-Specific Ranking Templates for Multi-Stage Clinical Trial Matching. SIGIR 2022: 2325-2330 - [c304]Hang Li, Shuai Wang, Shengyao Zhuang, Ahmed Mourad, Xueguang Ma, Jimmy Lin, Guido Zuccon:
To Interpolate or not to Interpolate: PRF, Dense and Sparse Retrievers. SIGIR 2022: 2495-2500 - [c303]Yuqi Liu, Chengcheng Hu, Jimmy Lin:
Another Look at Information Retrieval as Statistical Translation. SIGIR 2022: 2749-2754 - [c302]Jimmy Lin, Daniel Campos, Nick Craswell, Bhaskar Mitra, Emine Yilmaz:
Fostering Coopetition While Plugging Leaks: The Design and Implementation of the MS MARCO Leaderboards. SIGIR 2022: 2939-2948 - [c301]Ellen M. Voorhees, Nick Craswell, Jimmy Lin:
Too Many Relevants: Whither Cranfield Test Collections? SIGIR 2022: 2970-2980 - [c300]Xueguang Ma, Ronak Pradeep, Rodrigo Frassetto Nogueira, Jimmy Lin:
Document Expansion Baselines and Learned Sparse Lexical Representations for MS MARCO V1 and V2. SIGIR 2022: 3187-3197 - [c299]Andrew Trotman, Joel Mackenzie, Pradeesh Parameswaran, Jimmy Lin:
A Common Framework for Exploring Document-at-a-Time and Score-at-a-Time Retrieval Methods. SIGIR 2022: 3229-3234 - [c298]Josh Seltzer, Kathy Cheng, Shi Zong, Jimmy Lin:
Flipping the Script: Inverse Information Seeking Dialogues for Market Research. SIGIR 2022: 3380-3383 - [c297]Nick Craswell, Bhaskar Mitra, Emine Yilmaz, Daniel Campos, Jimmy Lin, Ellen M. Voorhees, Ian Soboroff:
Overview of the TREC 2022 Deep Learning Track. TREC 2022 - [c296]Jimmy Lin, David Alfonso-Hermelo, Vitor Jeronymo, Ehsan Kamalloo, Carlos Lassance, Rodrigo Frassetto Nogueira, Odunayo Ogundepo, Mehdi Rezagholizadeh, Nandan Thakur, Jheng-Hong Yang, Xinyu Zhang:
Simple Yet Effective Neural Ranking and Reranking Baselines for Cross-Lingual Information Retrieval. TREC 2022 - [c295]Josh Devins, Julie Tibshirani, Jimmy Lin:
Aligning the Research and Practice of Building Search Applications: Elasticsearch and Pyserini. WSDM 2022: 1573-1576 - [i129]Ellen M. Voorhees, Ian Soboroff, Jimmy Lin:
Can Old TREC Collections Reliably Evaluate Modern Neural Retrieval Models? CoRR abs/2201.11086 (2022) - [i128]Luyu Gao, Xueguang Ma, Jimmy Lin, Jamie Callan:
Tevatron: An Efficient and Flexible Toolkit for Dense Retrieval. CoRR abs/2203.05765 (2022) - [i127]Wei Zhong, Jheng-Hong Yang, Jimmy Lin:
Evaluating Token-Level and Passage-Level Dense Retrieval Models for Math Information Retrieval. CoRR abs/2203.11163 (2022) - [i126]Xinyu Zhang, Kelechi Ogueji, Xueguang Ma, Jimmy Lin:
Towards Best Practices for Training Multilingual Dense Retrieval Models. CoRR abs/2204.02363 (2022) - [i125]Hang Li, Shuai Wang, Shengyao Zhuang, Ahmed Mourad, Xueguang Ma, Jimmy Lin, Guido Zuccon:
To Interpolate or not to Interpolate: PRF, Dense and Sparse Retrievers. CoRR abs/2205.00235 (2022) - [i124]Minghan Li, Xinyu Zhang, Ji Xin, Hongyang Zhang, Jimmy Lin:
Certified Error Control of Candidate Set Pruning for Two-Stage Relevance Ranking. CoRR abs/2205.09638 (2022) - [i123]Nandan Thakur, Nils Reimers, Jimmy Lin:
Domain Adaptation for Memory-Efficient Dense Retrieval. CoRR abs/2205.11498 (2022) - [i122]Sheng-Chieh Lin, Jimmy Lin:
A Dense Representation Framework for Lexical and Semantic Matching. CoRR abs/2206.09912 (2022) - [i121]Zhiying Jiang, Yiqin Dai, Ji Xin, Ming Li, Jimmy Lin:
Few-Shot Non-Parametric Learning with Deep Latent Variable Model. CoRR abs/2206.11573 (2022) - [i120]Ji Xin, Raphael Tang, Zhiying Jiang, Yaoliang Yu, Jimmy Lin:
Building an Efficiency Pipeline: Commutativity and Cumulativeness of Efficiency Operators for Transformers. CoRR abs/2208.00483 (2022) - [i119]Sheng-Chieh Lin, Minghan Li, Jimmy Lin:
Aggretriever: A Simple Approach to Aggregate Textual Representation for Robust Dense Passage Retrieval. CoRR abs/2208.00511 (2022) - [i118]Raphael Tang, Akshat Pandey, Zhiying Jiang, Gefei Yang, Karun Kumar, Jimmy Lin, Ferhan Ture:
What the DAAM: Interpreting Stable Diffusion Using Cross Attention. CoRR abs/2210.04885 (2022) - [i117]Odunayo Ogundepo, Xinyu Zhang, Jimmy Lin:
Better Than Whitespace: Information Retrieval for Languages without Custom Tokenizers. CoRR abs/2210.05481 (2022) - [i116]Linqing Liu, Minghan Li, Jimmy Lin, Sebastian Riedel, Pontus Stenetorp:
Query Expansion Using Contextual Clue Sampling with Language Models. CoRR abs/2210.07093 (2022) - [i115]Sankeerth Durvasula, Raymond Kiguru, Samarth Mathur, Jenny Xu, Jimmy Lin, Nandita Vijaykumar:
VoxelCache: Accelerating Online Mapping in Robotics and 3D Reconstruction Tasks. CoRR abs/2210.08729 (2022) - [i114]Xinyu Zhang, Nandan Thakur, Odunayo Ogundepo, Ehsan Kamalloo, David Alfonso-Hermelo, Xiaoguang Li, Qun Liu, Mehdi Rezagholizadeh, Jimmy Lin:
Making a MIRACL: Multilingual Information Retrieval Across a Continuum of Languages. CoRR abs/2210.09984 (2022) - [i113]Peng Shi, Rui Zhang, He Bai, Jimmy Lin:
XRICL: Cross-lingual Retrieval-Augmented In-Context Learning for Cross-lingual Text-to-SQL Semantic Parsing. CoRR abs/2210.13693 (2022) - [i112]Jimmy Lin:
On the Interaction Between Differential Privacy and Gradient Compression in Deep Learning. CoRR abs/2211.00734 (2022) - [i111]Minghan Li, Sheng-Chieh Lin, Barlas Oguz, Asish Ghoshal, Jimmy Lin, Yashar Mehdad, Wen-tau Yih, Xilun Chen:
CITADEL: Conditional Token Interaction via Dynamic Lexical Routing for Efficient and Effective Multi-Vector Retrieval. CoRR abs/2211.10411 (2022) - [i110]Raphael Tang, Karun Kumar, Gefei Yang, Akshat Pandey, Yajie Mao, Vladislav Belyaev, Madhuri Emmadi, G. Craig Murray, Ferhan Ture, Jimmy Lin:
SpeechNet: Weakly Supervised, End-to-End Speech Recognition at Industrial Scale. CoRR abs/2211.11740 (2022) - [i109]Yizhen Zhong, Jiajie Xiao, Thomas Vetterli, Mahan Matin, Ellen Loo, Jimmy Lin, Richard Bourgon, Ofer Shapira:
Improving Precancerous Case Characterization via Transformer-based Ensemble Learning. CoRR abs/2212.05150 (2022) - [i108]Zhiying Jiang, Matthew Y. R. Yang, Mikhail Tsirlin, Raphael Tang, Jimmy Lin:
Less is More: Parameter-Free Text Classification with Gzip. CoRR abs/2212.09410 (2022) - [i107]Luyu Gao, Xueguang Ma, Jimmy Lin, Jamie Callan:
Precise Zero-Shot Dense Retrieval without Relevance Labels. CoRR abs/2212.10496 (2022) - [i106]Jimmy Lin:
Building a Culture of Reproducibility in Academic Research. CoRR abs/2212.13534 (2022) - 2021
- [b3]Jimmy Lin, Rodrigo Frassetto Nogueira, Andrew Yates:
Pretrained Transformers for Text Ranking: BERT and Beyond. Synthesis Lectures on Human Language Technologies, Morgan & Claypool Publishers 2021, ISBN 978-3-031-01053-8, pp. 1-325 - [j61]Samantha Fritz, Ian Milligan, Nick Ruest, Jimmy Lin:
Fostering Community Engagement through Datathon Events: The Archives Unleashed Experience. Digit. Humanit. Q. 15(1) (2021) - [j60]Martin Gauch, Juliane Mai, Jimmy Lin:
The proper care and feeding of CAMELS: How limited training data affects streamflow prediction. Environ. Model. Softw. 135: 104926 (2021) - [j59]Jimmy Lin:
A proposed conceptual framework for a representational approach to information retrieval. SIGIR Forum 55(2): 4:1-4:29 (2021) - [j58]Sheng-Chieh Lin, Jheng-Hong Yang, Rodrigo Frassetto Nogueira, Ming-Feng Tsai, Chuan-Ju Wang, Jimmy Lin:
Multi-Stage Conversational Passage Retrieval: An Approach to Fusing Term Importance Estimation and Neural Query Rewriting. ACM Trans. Inf. Syst. 39(4): 48:1-48:29 (2021) - [c294]He Bai, Peng Shi, Jimmy Lin, Yuqing Xie, Luchen Tan, Kun Xiong, Wen Gao, Ming Li:
Segatron: Segment-Aware Transformer for Language Modeling and Understanding. AAAI 2021: 12526-12534 - [c293]He Bai, Peng Shi, Jimmy Lin, Luchen Tan, Kun Xiong, Wen Gao, Jie Liu, Ming Li:
Semantics of the Unwritten: The Effect of End of Paragraph and Sequence Tokens on Text Generation with GPT2. ACL (student) 2021: 148-162 - [c292]Kelvin Jiang, Ronak Pradeep, Jimmy Lin:
Exploring Listwise Evidence Reasoning with T5 for Fact Verification. ACL/IJCNLP (2) 2021: 402-410 - [c291]Ji Xin, Raphael Tang, Yaoliang Yu, Jimmy Lin:
The Art of Abstention: Selective Prediction and Error Regularization for Natural Language Processing. ACL/IJCNLP (1) 2021: 1040-1051 - [c290]Ronak Pradeep, Xueguang Ma, Rodrigo Frassetto Nogueira, Jimmy Lin:
Scientific Claim Verification with VerT5erini. LOUHI@EACL 2021: 94-103 - [c289]Zhiying Jiang, Raphael Tang, Ji Xin, Jimmy Lin:
How Does BERT Rerank Passages? An Attribution Analysis with Information Bottlenecks. BlackboxNLP@EMNLP 2021: 496-509 - [c288]Wei Zhong, Xinyu Zhang, Ji Xin, Richard Zanibbi, Jimmy Lin:
Approach Zero and Anserini at the CLEF-2021 ARQMath Track: Applying Substructure Search and BM25 on Operator Tree Path Tokens. CLEF (Working Notes) 2021: 133-156 - [c287]Mayank Anand, Jiarui Zhang, Shane Ding, Ji Xin, Jimmy Lin:
Serverless BM25 Search and BERT Reranking. DESIRES 2021: 3-9 - [c286]Jimmy Lin, Xueguang Ma, Joel Mackenzie, Antonio Mallia:
On the Separation of Logical and Physical Ranking Models for Text Retrieval Applications. DESIRES 2021: 176-178 - [c285]Ogundepo Odunayo, Naveela N. Sookoo, Gautam Bathla, Anthony Cavallin, Bhaleka D. Persaud, Kathy Szigeti, Philippe Van Cappellen, Jimmy Lin:
Rescuing historical climate observations to support hydrological research: a case study of solar radiation data. DocEng 2021: 19:1-19:4 - [c284]Ji Xin, Raphael Tang, Yaoliang Yu, Jimmy Lin:
BERxiT: Early Exiting for BERT with Better Fine-Tuning and Extension to Regression. EACL 2021: 91-104 - [c283]Mohan Zhang, Luchen Tan, Zihang Fu, Kun Xiong, Jimmy Lin, Ming Li, Zhengkai Tu:
Don't Change Me! User-Controllable Selective Paraphrase Generation. EACL 2021: 3522-3527 - [c282]Xinyu Zhang, Andrew Yates, Jimmy Lin:
Comparing Score Aggregation Approaches for Document Retrieval with Pretrained Transformers. ECIR (2) 2021: 150-163 - [c281]Yue Zhang, Chengcheng Hu, Yuqi Liu, Hui Fang, Jimmy Lin:
Learning to Rank in the Age of Muppets: Effectiveness-Efficiency Tradeoffs in Multi-Stage Ranking. SustaiNLP@EMNLP 2021: 64-73 - [c280]Minghan Li, Ming Li, Kun Xiong, Jimmy Lin:
Multi-Task Dense Retrieval via Model Uncertainty Fusion for Open-Domain Question Answering. EMNLP (Findings) 2021: 274-287 - [c279]Raphael Tang, Karun Kumar, Kendra Chalkley, Ji Xin, Liming Zhang, Wenyan Li, Gefei Yang, Yajie Mao, Junho Shin, Geoffrey Craig Murray, Jimmy Lin:
Voice Query Auto Completion. EMNLP (1) 2021: 900-906 - [c278]Sheng-Chieh Lin, Jheng-Hong Yang, Jimmy Lin:
Contextualized Query Embeddings for Conversational Search. EMNLP (1) 2021: 1004-1015 - [c277]Xueguang Ma, Minghan Li, Kai Sun, Ji Xin, Jimmy Lin:
Simple and Effective Unsupervised Redundancy Elimination to Compress Dense Vectors for Passage Retrieval. EMNLP (1) 2021: 2854-2859 - [c276]Anup Anand Deshmukh, Qianqiu Zhang, Ming Li, Jimmy Lin, Lili Mou:
Unsupervised Chunking as Syntactic Structure Induction with a Knowledge-Transfer Approach. EMNLP (Findings) 2021: 3626-3634 - [c275]Xiao Han, Yuqi Liu, Jimmy Lin:
The Simplest Thing That Can Possibly Work: (Pseudo-)Relevance Feedback via Text Classification. ICTIR 2021: 123-129 - [c274]Sheng-Chieh Lin, Jheng-Hong Yang, Jimmy Lin:
In-Batch Negatives for Knowledge Distillation with Tightly-Coupled Teachers for Dense Retrieval. RepL4NLP@ACL-IJCNLP 2021: 163-173 - [c273]Sebastian Hofstätter, Sheng-Chieh Lin, Jheng-Hong Yang, Jimmy Lin, Allan Hanbury:
Efficiently Teaching an Effective Dense Retriever with Balanced Topic Aware Sampling. SIGIR 2021: 113-122 - [c272]Nick Craswell, Bhaskar Mitra, Emine Yilmaz, Daniel Campos, Jimmy Lin:
MS MARCO: Benchmarking Ranking Models in the Large-Data Regime. SIGIR 2021: 1566-1576 - [c271]Ronak Pradeep, Xueguang Ma, Rodrigo Frassetto Nogueira, Jimmy Lin:
Vera: Prediction Techniques for Reducing Harmful Misinformation in Consumer Health Search. SIGIR 2021: 2066-2070 - [c270]Jimmy Lin, Daniel Campos, Nick Craswell, Bhaskar Mitra, Emine Yilmaz:
Significant Improvements over the State of the Art? A Case Study of the MS MARCO Document Ranking Leaderboard. SIGIR 2021: 2283-2287 - [c269]Jimmy Lin, Xueguang Ma, Sheng-Chieh Lin, Jheng-Hong Yang, Ronak Pradeep, Rodrigo Frassetto Nogueira:
Pyserini: A Python Toolkit for Reproducible Information Retrieval Research with Sparse and Dense Representations. SIGIR 2021: 2356-2362 - [c268]Edwin Zhang, Sheng-Chieh Lin, Jheng-Hong Yang, Ronak Pradeep, Rodrigo Frassetto Nogueira, Jimmy Lin:
Chatty Goose: A Python Framework for Conversational Search. SIGIR 2021: 2521-2525 - [c267]Wei Zhong, Jimmy Lin:
PYA0: A Python Toolkit for Accessible Math-Aware Search. SIGIR 2021: 2541-2545 - [c266]Andrew Yates, Rodrigo Frassetto Nogueira, Jimmy Lin:
Pretrained Transformers for Text Ranking: BERT and Beyond. SIGIR 2021: 2666-2668 - [c265]Nick Craswell, Bhaskar Mitra, Emine Yilmaz, Daniel Campos, Jimmy Lin:
Overview of the TREC 2021 Deep Learning Track. TREC 2021 - [c264]Andrew Yates, Rodrigo Frassetto Nogueira, Jimmy Lin:
Pretrained Transformers for Text Ranking: BERT and Beyond. WSDM 2021: 1154-1156 - [i105]Ronak Pradeep, Rodrigo Frassetto Nogueira, Jimmy Lin:
The Expando-Mono-Duo Design Pattern for Text Ranking with Pretrained Sequence-to-Sequence Models. CoRR abs/2101.05667 (2021) - [i104]Jimmy Lin, Xueguang Ma, Sheng-Chieh Lin, Jheng-Hong Yang, Ronak Pradeep, Rodrigo Frassetto Nogueira:
Pyserini: An Easy-to-Use Python Toolkit to Support Replicable IR Research with Sparse and Dense Representations. CoRR abs/2102.10073 (2021) - [i103]Jimmy Lin, Daniel Campos, Nick Craswell, Bhaskar Mitra, Emine Yilmaz:
Significant Improvements over the State of the Art? A Case Study of the MS MARCO Document Ranking Leaderboard. CoRR abs/2102.12887 (2021) - [i102]Rodrigo Frassetto Nogueira, Zhiying Jiang, Jimmy Lin:
Investigating the Limitations of the Transformers with Simple Arithmetic Tasks. CoRR abs/2102.13019 (2021) - [i101]Xueguang Ma, Kai Sun, Ronak Pradeep, Jimmy Lin:
A Replication Study of Dense Passage Retriever. CoRR abs/2104.05740 (2021) - [i100]Sebastian Hofstätter, Sheng-Chieh Lin, Jheng-Hong Yang, Jimmy Lin, Allan Hanbury:
Efficiently Teaching an Effective Dense Retriever with Balanced Topic Aware Sampling. CoRR abs/2104.06967 (2021) - [i99]Sheng-Chieh Lin, Jheng-Hong Yang, Jimmy Lin:
Contextualized Query Embeddings for Conversational Search. CoRR abs/2104.08707 (2021) - [i98]Nick Craswell, Bhaskar Mitra, Emine Yilmaz, Daniel Campos, Jimmy Lin:
MS MARCO: Benchmarking Ranking Models in the Large-Data Regime. CoRR abs/2105.04021 (2021) - [i97]Jimmy Lin, Xueguang Ma:
A Few Brief Notes on DeepImpact, COIL, and a Conceptual Framework for Information Retrieval Techniques. CoRR abs/2106.14807 (2021) - [i96]Xinyu Zhang, Xueguang Ma, Peng Shi, Jimmy Lin:
Mr. TyDi: A Multi-lingual Benchmark for Dense Retrieval. CoRR abs/2108.08787 (2021) - [i95]Peng Shi, Rui Zhang, He Bai, Jimmy Lin:
Cross-Lingual Training with Dense Retrieval for Document Retrieval. CoRR abs/2109.01628 (2021) - [i94]Jimmy Lin:
A Proposed Conceptual Framework for a Representational Approach to Information Retrieval. CoRR abs/2110.01529 (2021) - [i93]Minghan Li, Jimmy Lin:
Encoder Adaptation of Dense Passage Retrieval for Open-Domain Question Answering. CoRR abs/2110.01599 (2021) - [i92]Joel Mackenzie, Andrew Trotman, Jimmy Lin:
Wacky Weights in Learned Sparse Representations and the Revenge of Score-at-a-Time Query Evaluation. CoRR abs/2110.11540 (2021) - [i91]Sheng-Chieh Lin, Jimmy Lin:
Densifying Sparse Representations for Passage Retrieval by Representational Slicing. CoRR abs/2112.04666 (2021) - [i90]Hang Li, Shengyao Zhuang, Ahmed Mourad, Xueguang Ma, Jimmy Lin, Guido Zuccon:
Improving Query Representations for Dense Retrieval with Pseudo Relevance Feedback: A Reproducibility Study. CoRR abs/2112.06400 (2021) - [i89]Jheng-Hong Yang, Xueguang Ma, Jimmy Lin:
Sparsifying Sparse Representations for Passage Retrieval by Top-k Masking. CoRR abs/2112.09628 (2021) - 2020
- [j57]Samantha Fritz, Ian Milligan, Nick Ruest, Jimmy Lin:
Building community at distance: a datathon during COVID-19. Digit. Libr. Perspect. 36(4): 415-428 (2020) - [j56]Rodrigo Frassetto Nogueira, Zhiying Jiang, Kyunghyun Cho, Jimmy Lin:
Navigation-based candidate expansion and pretrained language models for citation recommendation. Scientometrics 125(3): 3001-3016 (2020) - [j55]Siddhartha Sahu, Amine Mhedhbi, Semih Salihoglu, Jimmy Lin, M. Tamer Özsu:
The ubiquity of large graphs and surprising challenges of graph processing: extended survey. VLDB J. 29(2-3): 595-618 (2020) - [c263]Ji Xin, Raphael Tang, Jaejun Lee, Yaoliang Yu, Jimmy Lin:
DeeBERT: Dynamic Early Exiting for Accelerating BERT Inference. ACL 2020: 2246-2251 - [c262]Raphael Tang, Jaejun Lee, Ji Xin, Xinyu Liu, Yaoliang Yu, Jimmy Lin:
Showing Your Work Doesn't Always Work. ACL 2020: 2766-2772 - [c261]Hamidreza Shahidi, Ming Li, Jimmy Lin:
Two Birds, One Stone: A Simple, Unified Model for Text Generation from Structured and Unstructured Data. ACL 2020: 3864-3870 - [c260]Rodrigo Frassetto Nogueira, Zhiying Jiang, Kyunghyun Cho, Jimmy Lin:
Evaluating Pretrained Transformer Models for Citation Recommendation. BIR@ECIR 2020: 89-100 - [c259]Jimmy Lin, Ian Milligan, Douglas W. Oard, Nick Ruest, Katie Shilton:
We Could, but Should We?: Ethical Considerations for Providing Access to GeoCities and Other Historical Digital Collections. CHIIR 2020: 135-144 - [c258]Royal Sequiera, Luchen Tan, Yinan Zhang, Jimmy Lin:
Update Delivery Mechanisms for Prospective Information Needs: A Reproducibility Study. CHIIR 2020: 308-312 - [c257]Andrew Yates, Kevin Martin Jose, Xinyu Zhang, Jimmy Lin:
Flexible IR Pipelines with Capreolus. CIKM 2020: 3181-3188 - [c256]Jheng-Hong Yang, Sheng-Chieh Lin, Rodrigo Frassetto Nogueira, Ming-Feng Tsai, Chuan-Ju Wang, Jimmy Lin:
Designing Templates for Eliciting Commonsense Knowledge from Pretrained Sequence-to-Sequence Models. COLING 2020: 3449-3453 - [c255]Adrien Grand, Robert Muir, Jim Ferenczi, Jimmy Lin:
From MAXSCORE to Block-Max Wand: The Story of How Lucene Significantly Improved Query Evaluation Performance. ECIR (2) 2020: 20-27 - [c254]Chris Kamphuis, Arjen P. de Vries, Leonid Boytsov, Jimmy Lin:
Which BM25 Do You Mean? A Large-Scale Reproducibility Study of Scoring Variants. ECIR (2) 2020: 28-34 - [c253]Jimmy Lin, Qian Zhang:
Reproducibility is a Process, Not an Achievement: The Replicability of IR Reproducibility Experiments. ECIR (2) 2020: 43-49 - [c252]Edwin Zhang, Nikhil Gupta, Raphael Tang, Xiao Han, Ronak Pradeep, Kuang Lu, Yue Zhang, Rodrigo Frassetto Nogueira, Kyunghyun Cho, Hui Fang, Jimmy Lin:
Covidex: Neural Ranking Models and Keyword Search Infrastructure for the COVID-19 Open Research Dataset. SDP@EMNLP 2020: 31-41 - [c251]Ji Xin, Rodrigo Frassetto Nogueira, Yaoliang Yu, Jimmy Lin:
Early Exiting BERT for Efficient Document Ranking. SustaiNLP@EMNLP 2020: 83-88 - [c250]Xinyu Zhang, Andrew Yates, Jimmy Lin:
A Little Bit Is Worse Than None: Ranking with Limited Training Data. SustaiNLP@EMNLP 2020: 107-112 - [c249]Shane Ding, Edwin Zhang, Jimmy Lin:
Cydex: Neural Search Infrastructure for the Scholarly Literature. SDP@EMNLP 2020: 168-173 - [c248]Rodrigo Frassetto Nogueira, Zhiying Jiang, Ronak Pradeep, Jimmy Lin:
Document Ranking with a Pretrained Sequence-to-Sequence Model. EMNLP (Findings) 2020: 708-718 - [c247]Peng Shi, He Bai, Jimmy Lin:
Cross-Lingual Training of Neural Models for Document Ranking. EMNLP (Findings) 2020: 2768-2773 - [c246]Zhiying Jiang, Raphael Tang, Ji Xin, Jimmy Lin:
Inserting Information Bottleneck for Attribution in Transformers. EMNLP (Findings) 2020: 3850-3857 - [c245]Jimmy Lin, Chudi Zhong, Diane Hu, Cynthia Rudin, Margo I. Seltzer:
Generalized and Scalable Optimal Sparse Decision Trees. ICML 2020: 6150-6160 - [c244]Zhengkai Tu, Wei Yang, Zihang Fu, Yuqing Xie, Luchen Tan, Kun Xiong, Ming Li, Jimmy Lin:
Approximate Nearest Neighbor Search and Lightweight Dense Vector Reranking in Multi-Stage Retrieval Architectures. ICTIR 2020: 97-100 - [c243]Nick Ruest, Jimmy Lin, Ian Milligan, Samantha Fritz:
The Archives Unleashed Project: Technology, Process, and Community to Improve Scholarly Access to Web Archives. JCDL 2020: 157-166 - [c242]Tobi Adewoye, Xiao Han, Nick Ruest, Ian Milligan, Samantha Fritz, Jimmy Lin:
Content-Based Exploration of Archival Images Using Neural Networks. JCDL 2020: 489-490 - [c241]Martin Gauch, James Bai, Juliane Mai, Jimmy Lin:
An Open-Source Interface to the Canadian Surface Prediction Archive. JCDL 2020: 529-530 - [c240]Venu Satuluri, Yao Wu, Xun Zheng, Yilei Qian, Brian Wichers, Qieyun Dai, Gui Ming Tang, Jerry Jiang, Jimmy Lin:
SimClusters: Community-Based Representations for Heterogeneous Recommendations at Twitter. KDD 2020: 3183-3193 - [c239]Ashutosh Adhikari, Achyudh Ram, Raphael Tang, William L. Hamilton, Jimmy Lin:
Exploring the Limits of Simple Learners in Knowledge Distillation for Document Classification with DocBERT. RepL4NLP@ACL 2020: 72-77 - [c238]Zeynep Akkalyoncu Yilmaz, Charles L. A. Clarke, Jimmy Lin:
A Lightweight Environment for Learning Experimental IR Research Practices. SIGIR 2020: 2113-2116 - [c237]Jimmy Lin, Joel M. Mackenzie, Chris Kamphuis, Craig Macdonald, Antonio Mallia, Michal Siedlaczek, Andrew Trotman, Arjen P. de Vries:
Supporting Interoperability Between Open-Source Search Engines with the Common Index File Format. SIGIR 2020: 2149-2152 - [c236]Sheng-Chieh Lin, Jheng-Hong Yang, Jimmy Lin:
TREC 2020 Notebook: CAsT Track. TREC 2020 - [c235]Ronak Pradeep, Xueguang Ma, Xinyu Zhang, Hang Cui, Ruizhou Xu, Rodrigo Frassetto Nogueira, Jimmy Lin:
H2oloo at TREC 2020: When all you got is a hammer... Deep Learning, Health Misinformation, and Precision Medicine. TREC 2020 - [c234]Andrew Yates, Siddhant Arora, Xinyu Zhang, Wei Yang, Kevin Martin Jose, Jimmy Lin:
Capreolus: A Toolkit for End-to-End Neural Ad Hoc Retrieval. WSDM 2020: 861-864 - [c233]Yuqing Xie, Wei Yang, Luchen Tan, Kun Xiong, Nicholas Jing Yuan, Baoxing Huai, Ming Li, Jimmy Lin:
Distant Supervision for Multi-Stage Fine-Tuning in Retrieval-Based Question Answering. WWW 2020: 2934-2940 - [i88]Nick Ruest, Jimmy Lin, Ian Milligan, Samantha Fritz:
The Archives Unleashed Project: Technology, Process, and Community to Improve Scholarly Access to Web Archives. CoRR abs/2001.05399 (2020) - [i87]Rodrigo Frassetto Nogueira, Zhiying Jiang, Kyunghyun Cho, Jimmy Lin:
Navigation-Based Candidate Expansion and Pretrained Language Models for Citation Recommendation. CoRR abs/2001.08687 (2020) - [i86]Jimmy Lin:
A Prototype of Serverless Lucene. CoRR abs/2002.01447 (2020) - [i85]Ruixue Zhang, Wei Yang, Luyun Lin, Zhengkai Tu, Yuqing Xie, Zihang Fu, Yuhao Xie, Luchen Tan, Kun Xiong, Jimmy Lin:
Rapid Adaptation of BERT for Information Extraction on Domain-Specific Business Documents. CoRR abs/2002.01861 (2020) - [i84]Rodrigo Frassetto Nogueira, Zhiying Jiang, Jimmy Lin:
Document Ranking with a Pretrained Sequence-to-Sequence Model. CoRR abs/2003.06713 (2020) - [i83]Jimmy Lin, Joel M. Mackenzie, Chris Kamphuis, Craig Macdonald, Antonio Mallia, Michal Siedlaczek, Andrew Trotman, Arjen P. de Vries:
Supporting Interoperability Between Open-Source Search Engines with the Common Index File Format. CoRR abs/2003.08276 (2020) - [i82]Sheng-Chieh Lin, Jheng-Hong Yang, Rodrigo Frassetto Nogueira, Ming-Feng Tsai, Chuan-Ju Wang, Jimmy Lin:
TTTTTackling WinoGrande Schemas. CoRR abs/2003.08380 (2020) - [i81]Sheng-Chieh Lin, Jheng-Hong Yang, Rodrigo Frassetto Nogueira, Ming-Feng Tsai, Chuan-Ju Wang, Jimmy Lin:
Conversational Question Reformulation via Sequence-to-Sequence Architectures and Pretrained Language Models. CoRR abs/2004.01909 (2020) - [i80]He Bai, Peng Shi, Jimmy Lin, Luchen Tan, Kun Xiong, Wen Gao, Jie Liu, Ming Li:
Semantics of the Unwritten. CoRR abs/2004.02251 (2020) - [i79]Edwin Zhang, Nikhil Gupta, Rodrigo Frassetto Nogueira, Kyunghyun Cho, Jimmy Lin:
Rapidly Deploying a Neural Search Engine for the COVID-19 Open Research Dataset: Preliminary Thoughts and Lessons Learned. CoRR abs/2004.05125 (2020) - [i78]Raphael Tang, Rodrigo Frassetto Nogueira, Edwin Zhang, Nikhil Gupta, Phuong Cam, Kyunghyun Cho, Jimmy Lin:
Rapidly Bootstrapping a Question Answering Dataset for COVID-19. CoRR abs/2004.11339 (2020) - [i77]Ji Xin, Raphael Tang, Jaejun Lee, Yaoliang Yu, Jimmy Lin:
DeeBERT: Dynamic Early Exiting for Accelerating BERT Inference. CoRR abs/2004.12993 (2020) - [i76]Raphael Tang, Jaejun Lee, Ji Xin, Xinyu Liu, Yaoliang Yu, Jimmy Lin:
Showing Your Work Doesn't Always Work. CoRR abs/2004.13705 (2020) - [i75]He Bai, Peng Shi, Jimmy Lin, Luchen Tan, Kun Xiong, Wen Gao, Ming Li:
SegaBERT: Pre-training of Segment-aware BERT for Language Understanding. CoRR abs/2004.14996 (2020) - [i74]Sheng-Chieh Lin, Jheng-Hong Yang, Rodrigo Frassetto Nogueira, Ming-Feng Tsai, Chuan-Ju Wang, Jimmy Lin:
Query Reformulation using Query History for Passage Retrieval in Conversational Search. CoRR abs/2005.02230 (2020) - [i73]Jimmy Lin, Chudi Zhong, Diane Hu, Cynthia Rudin, Margo I. Seltzer:
Generalized Optimal Sparse Decision Trees. CoRR abs/2006.08690 (2020) - [i72]Martin Gauch, Jimmy Lin:
A Data Scientist's Guide to Streamflow Prediction. CoRR abs/2006.12975 (2020) - [i71]Edwin Zhang, Nikhil Gupta, Raphael Tang, Xiao Han, Ronak Pradeep, Kuang Lu, Yue Zhang, Rodrigo Frassetto Nogueira, Kyunghyun Cho, Hui Fang, Jimmy Lin:
Covidex: Neural Ranking Models and Keyword Search Infrastructure for the COVID-19 Open Research Dataset. CoRR abs/2007.07846 (2020) - [i70]Mohan Zhang, Luchen Tan, Zhengkai Tu, Zihang Fu, Kun Xiong, Ming Li, Jimmy Lin:
To Paraphrase or Not To Paraphrase: User-Controllable Selective Paraphrase Generation. CoRR abs/2008.09290 (2020) - [i69]Raphael Tang, Jaejun Lee, Afsaneh Razi, Julia Cambre, Ian Bicking, Jofish Kaye, Jimmy Lin:
Howl: A Deployed, Open-Source Wake Word Detection System. CoRR abs/2008.09606 (2020) - [i68]Jimmy Lin, Rodrigo Frassetto Nogueira, Andrew Yates:
Pretrained Transformers for Text Ranking: BERT and Beyond. CoRR abs/2010.06467 (2020) - [i67]Martin Gauch, Frederik Kratzert, Daniel Klotz, Grey Nearing, Jimmy Lin, Sepp Hochreiter:
Rainfall-Runoff Prediction at Multiple Timescales with a Single Long Short-Term Memory Network. CoRR abs/2010.07921 (2020) - [i66]Minghan Li, He Bai, Luchen Tan, Kun Xiong, Ming Li, Jimmy Lin:
Latte-Mix: Measuring Sentence Semantic Similarity with Latent Categorical Mixtures. CoRR abs/2010.11351 (2020) - [i65]Sheng-Chieh Lin, Jheng-Hong Yang, Jimmy Lin:
Distilling Dense Representations for Ranking using Tightly-Coupled Teachers. CoRR abs/2010.11386 (2020) - [i64]Ronak Pradeep, Xueguang Ma, Rodrigo Frassetto Nogueira, Jimmy Lin:
Scientific Claim Verification with VERT5ERINI. CoRR abs/2010.11930 (2020) - [i63]Zhiying Jiang, Raphael Tang, Ji Xin, Jimmy Lin:
Inserting Information Bottlenecks for Attribution in Transformers. CoRR abs/2012.13838 (2020)
2010 – 2019
- 2019
- [j54]Jimmy Lin:
The neural hype, justified!: a recantation. SIGIR Forum 53(2): 88-93 (2019) - [c232]Jinfeng Rao, Wei Yang, Yuhao Zhang, Ferhan Türe, Jimmy Lin:
Multi-Perspective Relevance Matching with Hierarchical ConvNets for Social Media Search. AAAI 2019: 232-240 - [c231]Raphael Tang, Yao Lu, Jimmy Lin:
Natural Language Generation for Effective Knowledge Distillation. DeepLo@EMNLP-IJCNLP 2019: 202-208 - [c230]Allison B. McCoy, Dean F. Sittig, Jimmy Lin, Adam Wright:
Identification and Ranking of Biomedical Informatics Researcher Citation Statistics through a Google Scholar Scraper. AMIA 2019 - [c229]Peilin Yang, Jimmy Lin:
Reproducing and Generalizing Semantic Term Matching in Axiomatic Information Retrieval. ECIR (1) 2019: 369-381 - [c228]Ruifan Yu, Yuhao Xie, Jimmy Lin:
Simple Techniques for Cross-Collection Relevance Feedback. ECIR (1) 2019: 397-409 - [c227]Zeynep Akkalyoncu Yilmaz, Shengjin Wang, Wei Yang, Haotian Zhang, Jimmy Lin:
Applying BERT to Document Retrieval with Birch. EMNLP/IJCNLP (3) 2019: 19-24 - [c226]Jaejun Lee, Raphael Tang, Jimmy Lin:
Honkling: In-Browser Personalization for Ubiquitous Keyword Spotting. EMNLP/IJCNLP (3) 2019: 91-96 - [c225]Linqing Liu, Wei Yang, Jinfeng Rao, Raphael Tang, Jimmy Lin:
Incorporating Contextual and Syntactic Structures Improves Semantic Similarity Modeling. EMNLP/IJCNLP (1) 2019: 1204-1209 - [c224]Zeynep Akkalyoncu Yilmaz, Wei Yang, Haotian Zhang, Jimmy Lin:
Cross-Domain Modeling of Sentence-Level Evidence for Document Retrieval. EMNLP/IJCNLP (1) 2019: 3488-3494 - [c223]Hsiu-Wei Yang, Yanyan Zou, Peng Shi, Wei Lu, Jimmy Lin, Xu Sun:
Aligning Cross-Lingual Entities with Multi-Aspect Information. EMNLP/IJCNLP (1) 2019: 4430-4440 - [c222]Jinfeng Rao, Linqing Liu, Yi Tay, Hsiu-Wei Yang, Peng Shi, Jimmy Lin:
Bridging the Gap between Relevance Matching and Semantic Matching for Short Text Similarity Modeling. EMNLP/IJCNLP (1) 2019: 5369-5380 - [c221]Ji Xin, Jimmy Lin, Yaoliang Yu:
What Part of the Neural Network Does This? Understanding LSTMs by Measuring and Dissecting Neurons. EMNLP/IJCNLP (1) 2019: 5822-5829 - [c220]Jaejun Lee, Raphael Tang, Jimmy Lin:
Universal voice-enabled user interfaces using JavaScript. IUI Companion 2019: 81-82 - [c219]Ryan Deschamps, Samantha Fritz, Jimmy Lin, Ian Milligan, Nick Ruest:
The Cost of a WARC: Analyzing Web Archives in the Cloud. JCDL 2019: 261-264 - [c218]Ian Milligan, Nathalie Casemajor, Samantha Fritz, Jimmy Lin, Nick Ruest, Matthew S. Weber, Nicholas Worby:
Building Community and Tools for Analyzing Web Archives Through Datathons. JCDL 2019: 265-268 - [c217]Ryan Deschamps, Nick Ruest, Jimmy Lin, Samantha Fritz, Ian Milligan:
The Archives Unleashed Notebook: Madlibs for Jumpstarting Scholarly Exploration of Web Archives. JCDL 2019: 337-338 - [c216]Hsiu-Wei Yang, Linqing Liu, Ian Milligan, Nick Ruest, Jimmy Lin:
Scalable Content-Based Analysis of Images in Web Archives with TensorFlow and the Archives Unleashed Toolkit. JCDL 2019: 436-437 - [c215]Nick Ruest, Ian Milligan, Jimmy Lin:
Warclight: A Rails Engine for Web Archive Discovery. JCDL 2019: 442-443 - [c214]Wei Yang, Luchen Tan, Chunwei Lu, Anqi Cui, Han Li, Xi Chen, Kun Xiong, Muzi Wang, Ming Li, Jian Pei, Jimmy Lin:
Detecting Customer Complaint Escalation with Recurrent Neural Networks and Manually-Engineered Features. NAACL-HLT (2) 2019: 56-63 - [c213]Wei Yang, Yuqing Xie, Aileen Lin, Xingyu Li, Luchen Tan, Kun Xiong, Ming Li, Jimmy Lin:
End-to-End Open-Domain Question Answering with BERTserini. NAACL-HLT (Demonstrations) 2019: 72-77 - [c212]Peng Shi, Jinfeng Rao, Jimmy Lin:
Simple Attention-Based Representation Learning for Ranking Short Social Media Posts. NAACL-HLT (1) 2019: 2212-2217 - [c211]Ashutosh Adhikari, Achyudh Ram, Raphael Tang, Jimmy Lin:
Rethinking Complex Neural Network Architectures for Document Classification. NAACL-HLT (1) 2019: 4046-4051 - [c210]Ryan Clancy, Nicola Ferro, Claudia Hauff, Jimmy Lin, Tetsuya Sakai, Ze Zhong Wu:
Overview of the 2019 Open-Source IR Replicability Challenge (OSIRRC 2019). OSIRRC@SIGIR 2019: 1-7 - [c209]Ryan Clancy, Zeynep Akkalyoncu Yilmaz, Ze Zhong Wu, Jimmy Lin:
University of Waterloo Docker Images for OSIRRC at SIGIR 2019. OSIRRC@SIGIR 2019: 36 - [c208]Raphael Tang, Ferhan Türe, Jimmy Lin:
Yelling at Your TV: An Analysis of Speech Recognition Errors and Subsequent User Behavior on Entertainment Systems. SIGIR 2019: 853-856 - [c207]Jimmy Lin, Peilin Yang:
The Impact of Score Ties on Repeatability in Document Ranking. SIGIR 2019: 1125-1128 - [c206]Wei Yang, Kuang Lu, Peilin Yang, Jimmy Lin:
Critically Examining the "Neural Hype": Weak Baselines and the Additivity of Effectiveness Gains from Neural Ranking Models. SIGIR 2019: 1129-1132 - [c205]Ryan Clancy, Toke Eskildsen, Nick Ruest, Jimmy Lin:
Solr Integration in the Anserini Information Retrieval Toolkit. SIGIR 2019: 1285-1288 - [c204]Ryan Clancy, Jaejun Lee, Zeynep Akkalyoncu Yilmaz, Jimmy Lin:
Information Retrieval Meets Scalable Text Analytics: Solr Integration with Spark. SIGIR 2019: 1313-1316 - [c203]Ferhan Türe, Jinfeng Rao, Raphael Tang, Jimmy Lin:
Challenges and Opportunities in Understanding Spoken Queries Directed at Modern Entertainment Platforms. SIGIR 2019: 1375-1376 - [c202]Ryan Clancy, Nicola Ferro, Claudia Hauff, Jimmy Lin, Tetsuya Sakai, Ze Zhong Wu:
The SIGIR 2019 Open-Source IR Replicability Challenge (OSIRRC 2019). SIGIR 2019: 1432-1434 - [c201]Jheng-Hong Yang, Sheng-Chieh Lin, Chuan-Ju Wang, Jimmy Lin, Ming-Feng Tsai:
Query and Answer Expansion from Conversation History. TREC 2019 - [e3]Ryan Clancy, Nicola Ferro, Claudia Hauff, Jimmy Lin, Tetsuya Sakai, Ze Zhong Wu:
Proceedings of the Open-Source IR Replicability Challenge co-located with 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, OSIRRC@SIGIR 2019, Paris, France, July 25, 2019. CEUR Workshop Proceedings 2409, CEUR-WS.org 2019 [contents] - [i62]Wei Yang, Yuqing Xie, Aileen Lin, Xingyu Li, Luchen Tan, Kun Xiong, Ming Li, Jimmy Lin:
End-to-End Open-Domain Question Answering with BERTserini. CoRR abs/1902.01718 (2019) - [i61]Michael Azmy, Peng Shi, Jimmy Lin, Ihab F. Ilyas:
Matching Entities Across Different Knowledge Graphs with Graph Embeddings. CoRR abs/1903.06607 (2019) - [i60]Wei Yang, Haotian Zhang, Jimmy Lin:
Simple Applications of BERT for Ad Hoc Document Retrieval. CoRR abs/1903.10972 (2019) - [i59]Raphael Tang, Yao Lu, Linqing Liu, Lili Mou, Olga Vechtomova, Jimmy Lin:
Distilling Task-Specific Knowledge from BERT into Simple Neural Networks. CoRR abs/1903.12136 (2019) - [i58]Peng Shi, Jimmy Lin:
Simple BERT Models for Relation Extraction and Semantic Role Labeling. CoRR abs/1904.05255 (2019) - [i57]Wei Yang, Yuqing Xie, Luchen Tan, Kun Xiong, Ming Li, Jimmy Lin:
Data Augmentation for BERT Fine-Tuning in Open-Domain Question Answering. CoRR abs/1904.06652 (2019) - [i56]Rodrigo Frassetto Nogueira, Wei Yang, Jimmy Lin, Kyunghyun Cho:
Document Expansion by Query Prediction. CoRR abs/1904.08375 (2019) - [i55]Ashutosh Adhikari, Achyudh Ram, Raphael Tang, Jimmy Lin:
DocBERT: BERT for Document Classification. CoRR abs/1904.08398 (2019) - [i54]Jimmy Lin:
The Simplest Thing That Can Possibly Work: Pseudo-Relevance Feedback Using Text Classification. CoRR abs/1904.08861 (2019) - [i53]Wei Yang, Kuang Lu, Peilin Yang, Jimmy Lin:
Critically Examining the "Neural Hype": Weak Baselines and the Additivity of Effectiveness Gains from Neural Ranking Models. CoRR abs/1904.09171 (2019) - [i52]Hamidreza Shahidi, Ming Li, Jimmy Lin:
Two Birds, One Stone: A Simple, Unified Model for Text Generation from Structured and Unstructured Data. CoRR abs/1909.10158 (2019) - [i51]Hsiu-Wei Yang, Yanyan Zou, Peng Shi, Wei Lu, Jimmy Lin, Xu Sun:
Aligning Cross-Lingual Entities with Multi-Aspect Information. CoRR abs/1910.06575 (2019) - [i50]Tommaso Teofili, Jimmy Lin:
Lucene for Approximate Nearest-Neighbors Search on Arbitrary Dense Vectors. CoRR abs/1910.10208 (2019) - [i49]Jimmy Lin, Lori Paniak, Gordon Boerke:
The Performance Envelope of Inverted Indexing on Modern Hardware. CoRR abs/1910.11028 (2019) - [i48]Rodrigo Frassetto Nogueira, Wei Yang, Kyunghyun Cho, Jimmy Lin:
Multi-Stage Document Ranking with BERT. CoRR abs/1910.14424 (2019) - [i47]Yinan Zhang, Raphael Tang, Jimmy Lin:
Explicit Pairwise Word Interaction Modeling Improves Pretrained Transformers for English Semantic Similarity Tasks. CoRR abs/1911.02847 (2019) - [i46]Peng Shi, Jimmy Lin:
Cross-Lingual Relevance Transfer for Document Retrieval. CoRR abs/1911.02989 (2019) - [i45]Jaejun Lee, Raphael Tang, Jimmy Lin:
What Would Elsa Do? Freezing Layers During Transformer Fine-Tuning. CoRR abs/1911.03090 (2019) - [i44]Linqing Liu, Huan Wang, Jimmy Lin, Richard Socher, Caiming Xiong:
Attentive Student Meets Multi-Task Teacher: Improved Knowledge Distillation for Pretrained Models. CoRR abs/1911.03588 (2019) - [i43]Martin Gauch, Juliane Mai, Jimmy Lin:
The Proper Care and Feeding of CAMELS: How Limited Training Data Affects Streamflow Prediction. CoRR abs/1911.07249 (2019) - [i42]Achyudh Ram, Ji Xin, Meiyappan Nagappan, Yaoliang Yu, Rocío Cabrera Lozoya, Antonino Sabetta, Jimmy Lin:
Exploiting Token and Path-based Representations of Code for Identifying Security-Relevant Commits. CoRR abs/1911.07620 (2019) - 2018
- [j53]Jimmy Lin:
Scale Up or Scale Out for Graph Processing? IEEE Internet Comput. 22(3): 72-78 (2018) - [j52]Frank Hopfgartner, Allan Hanbury, Henning Müller, Ivan Eggel, Krisztian Balog, Torben Brodt, Gordon V. Cormack, Jimmy Lin, Jayashree Kalpathy-Cramer, Noriko Kando, Makoto P. Kato, Anastasia Krithara, Tim Gollub, Martin Potthast, Evelyne Viegas, Simon Mercer:
Evaluation-as-a-Service for the Computational Sciences: Overview and Outlook. ACM J. Data Inf. Qual. 10(4): 15:1-15:32 (2018) - [j51]Peilin Yang, Hui Fang, Jimmy Lin:
Anserini: Reproducible Ranking Baselines Using Lucene. ACM J. Data Inf. Qual. 10(4): 16:1-16:20 (2018) - [j50]Jimmy Lin:
The Neural Hype and Comparisons Against Weak Baselines. SIGIR Forum 52(2): 40-51 (2018) - [c200]Youngbin Kim, Jimmy Lin:
Serverless Data Analytics with Flint. IEEE CLOUD 2018: 451-455 - [c199]Michael Azmy, Peng Shi, Jimmy Lin, Ihab F. Ilyas:
Farewell Freebase: Migrating the SimpleQuestions Dataset to DBpedia. COLING 2018: 2093-2103 - [c198]Jimmy Lin:
Computing without Servers, V8, Rocket Ships, and Other Batsh*t Crazy Ideas in Data Systems. DESIRES 2018: 3-6 - [c197]Ajeet Grewal, Jerry Jiang, Gary Lam, Tristan Jung, Lohith Vuddemarri, Quannan Li, Aaditya Landge, Jimmy Lin:
RecService: Distributed Real-Time Graph Processing at Twitter. HotCloud 2018 - [c196]Raphael Tang, Weijie Wang, Zhucheng Tu, Jimmy Lin:
An Experimental Analysis of the Power Consumption of Convolutional Neural Networks for Keyword Spotting. ICASSP 2018: 5479-5483 - [c195]Raphael Tang, Jimmy Lin:
Deep Residual Learning for Small-Footprint Keyword Spotting. ICASSP 2018: 5484-5488 - [c194]Jinfeng Rao, Ferhan Türe, Jimmy Lin:
Multi-Task Learning with Neural Networks for Voice Query Understanding on an Entertainment Platform. KDD 2018: 636-645 - [c193]Zhucheng Tu, Mengping Li, Jimmy Lin:
Pay-Per-Request Deployment of Neural Network Models Using Serverless Architectures. NAACL-HLT (Demonstrations) 2018: 6-10 - [c192]Yiyun Liang, Zhucheng Tu, Laetitia Huang, Jimmy Lin:
CNNs for NLP in the Browser: Client-Side Deployment and Visualization Opportunities. NAACL-HLT (Demonstrations) 2018: 61-65 - [c191]Salman Mohammed, Peng Shi, Jimmy Lin:
Strong Baselines for Simple Question Answering over Knowledge Graphs with and without Neural Networks. NAACL-HLT (2) 2018: 291-296 - [c190]Jimmy Lin, Salman Mohammed, Royal Sequiera, Luchen Tan:
Update Delivery Mechanisms for Prospective Information Needs: An Analysis of Attention in Mobile Users. SIGIR 2018: 785-794 - [c189]Jinfeng Rao, Ferhan Türe, Jimmy Lin:
What Do Viewers Say to Their TVs?: An Analysis of Voice Queries to Entertainment Systems. SIGIR 2018: 1213-1216 - [c188]Ajeet Grewal, Jimmy Lin:
The Evolution of Content Analysis for Personalized Recommendations at Twitter. SIGIR 2018: 1355-1356 - [c187]Peilin Yang, Srikanth Thiagarajan, Jimmy Lin:
Robust, Scalable, Real-Time Event Time Series Aggregation at Twitter. SIGMOD Conference 2018: 595-599 - [c186]Royal Sequiera, Luchen Tan, Jimmy Lin:
Overview of the TREC 2018 Real-Time Summarization Track. TREC 2018 - [c185]Ruifan Yu, Yuhao Xie, Jimmy Lin:
H2oloo at TREC 2018: Cross-Collection Relevance Transfer for the Common Core Track. TREC 2018 - [c184]Joel M. Mackenzie, J. Shane Culpepper, Roi Blanco, Matt Crane, Charles L. A. Clarke, Jimmy Lin:
Query Driven Algorithm Selection in Early Stage Retrieval. WSDM 2018: 396-404 - [r2]Jimmy Lin:
Summarization. Encyclopedia of Database Systems (2nd ed.) 2018 - [i41]Youngbin Kim, Jimmy Lin:
Serverless Data Analytics with Flint. CoRR abs/1803.06354 (2018) - [i40]Kareem El Gebaly, Jimmy Lin:
In-Browser Split-Execution Support for Interactive Analytics in the Cloud. CoRR abs/1804.08822 (2018) - [i39]Jinfeng Rao, Wei Yang, Yuhao Zhang, Ferhan Türe, Jimmy Lin:
Multi-Perspective Relevance Matching with Hierarchical ConvNets for Social Media Search. CoRR abs/1805.08159 (2018) - [i38]Ahmed El-Roby, Khalid Ammar, Ashraf Aboulnaga, Jimmy Lin:
Sapphire: Querying RDF Data Made Simple. CoRR abs/1805.11728 (2018) - [i37]Jimmy Lin, Peilin Yang:
Repeatability Corner Cases in Document Ranking: The Impact of Score Ties. CoRR abs/1807.05798 (2018) - [i36]Raphael Tang, Jimmy Lin:
Adaptive Pruning of Neural Language Models for Mobile Devices. CoRR abs/1809.10282 (2018) - [i35]Jaejun Lee, Raphael Tang, Jimmy Lin:
JavaScript Convolutional Neural Networks for Keyword Spotting in the Browser: An Experimental Analysis. CoRR abs/1810.12859 (2018) - [i34]Raphael Tang, Jimmy Lin:
Progress and Tradeoffs in Neural Language Models. CoRR abs/1811.00942 (2018) - [i33]Peng Shi, Jinfeng Rao, Jimmy Lin:
Simple Attention-Based Representation Learning for Ranking Short Social Media Posts. CoRR abs/1811.01013 (2018) - [i32]Raphael Tang, Ashutosh Adhikari, Jimmy Lin:
FLOPs as a Direct Optimization Objective for Learning Sparse Neural Networks. CoRR abs/1811.03060 (2018) - [i31]Raphael Tang, Gefei Yang, Hong Wei, Yajie Mao, Ferhan Türe, Jimmy Lin:
Streaming Voice Query Recognition using Causal Convolutional Recurrent Neural Networks. CoRR abs/1812.07754 (2018) - 2017
- [j49]Jimmy Lin:
In Defense of MapReduce. IEEE Internet Comput. 21(3): 94-98 (2017) - [j48]Jimmy Lin:
The Lambda and the Kappa. IEEE Internet Comput. 21(5): 60-66 (2017) - [j47]Jimmy Lin, Andrew Trotman:
The role of index compression in score-at-a-time query evaluation. Inf. Retr. J. 20(3): 199-220 (2017) - [j46]Jimmy Lin, Ian Milligan, Jeremy Wiebe, Alice Zhou:
Warcbase: Scalable Analytics Infrastructure for Exploring Web Archives. ACM Journal on Computing and Cultural Heritage 10(4): 22:1-22:30 (2017) - [j45]Siddhartha Sahu, Amine Mhedhbi, Semih Salihoglu, Jimmy Lin, M. Tamer Özsu:
The Ubiquity of Large Graphs and Surprising Challenges of Graph Processing. Proc. VLDB Endow. 11(4): 420-431 (2017) - [c183]Gaurav Baruah, Richard McCreadie, Jimmy Lin:
A Comparison of Nuggets and Clusters for Evaluating Timeline Summaries. CIKM 2017: 67-76 - [c182]Jinfeng Rao, Ferhan Türe, Hua He, Oliver Jojic, Jimmy Lin:
Talking to Your TV: Context-Aware Voice Search with Hierarchical Recurrent Neural Networks. CIKM 2017: 557-566 - [c181]Hua He, Kris Ganjam, Navendu Jain, Jessica Lundin, Ryen White, Jimmy Lin:
An Insight Extraction System on BioMedical Literature with Deep Neural Networks. EMNLP 2017: 2691-2701 - [c180]Anil Pacaci, Alice Zhou, Jimmy Lin, M. Tamer Özsu:
Do We Need Specialized Graph Databases?: Benchmarking Real-Time Social Networking Applications. GRADES@SIGMOD/PODS 2017: 12:1-12:7 - [c179]Jinfeng Rao, Ferhan Türe, Xing Niu, Jimmy Lin:
Mining the Temporal Statistics of Query Terms for Searching Social Media Posts. ICTIR 2017: 133-140 - [c178]Matt Crane, Jimmy Lin:
An Exploration of Serverless Architectures for Information Retrieval. ICTIR 2017: 241-244 - [c177]Gaurav Baruah, Jimmy Lin:
The Pareto Frontier of Utility Models as a Framework for Evaluating Push Notification Systems. ICTIR 2017: 253-256 - [c176]Salman Mohammed, Matt Crane, Jimmy Lin:
Quantization in Append-Only Collections. ICTIR 2017: 265-268 - [c175]Babak Ehteshami Bejnordi, Jimmy Lin, Ben Glass, Maeve Mullooly, Gretchen L. Gierach, Mark E. Sherman, Nico Karssemeijer, Jeroen van der Laak, Andrew H. Beck:
Deep learning-based assessment of tumor-associated stroma for diagnosing breast cancer in histopathology images. ISBI 2017: 929-932 - [c174]Adam Roegiest, Luchen Tan, Jimmy Lin:
Online In-Situ Interleaved Evaluation of Real-Time Push Notification Systems. SIGIR 2017: 415-424 - [c173]Luchen Tan, Gaurav Baruah, Jimmy Lin:
On the Reusability of "Living Labs" Test Collections: : A Case Study of Real-Time Summarization. SIGIR 2017: 793-796 - [c172]Haotian Zhang, Jinfeng Rao, Jimmy Lin, Mark D. Smucker:
Automatically Extracting High-Quality Negative Examples for Answer Selection in Question Answering. SIGIR 2017: 797-800 - [c171]Jinfeng Rao, Hua He, Jimmy Lin:
Experiments with Convolutional Neural Network Models for Answer Selection. SIGIR 2017: 1217-1220 - [c170]Royal Sequiera, Jimmy Lin:
Finally, a Downloadable Test Collection of Tweets. SIGIR 2017: 1225-1228 - [c169]Peilin Yang, Hui Fang, Jimmy Lin:
Anserini: Enabling the Use of Lucene for Information Retrieval Research. SIGIR 2017: 1253-1256 - [c168]Nimesh Ghelani, Salman Mohammed, Shine Wang, Jimmy Lin:
Event Detection on Curated Tweet Streams. SIGIR 2017: 1325-1328 - [c167]Leif Azzopardi, Matt Crane, Hui Fang, Grant Ingersoll, Jimmy Lin, Yashar Moshfeghi, Harrisen Scells, Peilin Yang, Guido Zuccon:
The Lucene for Information Access and Retrieval Research (LIARR) Workshop at SIGIR 2017. SIGIR 2017: 1429-1430 - [c166]Kareem El Gebaly, Jimmy Lin:
In-Browser Interactive SQL Analytics with Afterburner. SIGMOD Conference 2017: 1623-1626 - [c165]Jimmy Lin, Salman Mohammed, Royal Sequiera, Luchen Tan, Nimesh Ghelani, Mustafa Abualsaud, Richard McCreadie, Dmitrijs Milajevs, Ellen M. Voorhees:
Overview of the TREC 2017 Real-Time Summarization Track. TREC 2017 - [c164]Ziquan Wang, Borui Lin, Ian Milligan, Jimmy Lin:
Topic Shifts Between Two US Presidential Administrations. WADL 2017: 13 - [c163]Matt Crane, J. Shane Culpepper, Jimmy Lin, Joel M. Mackenzie, Andrew Trotman:
A Comparison of Document-at-a-Time and Score-at-a-Time Query Evaluation. WSDM 2017: 201-210 - [c162]Yulu Wang, Jimmy Lin:
Partitioning and Segment Organization Strategies for Real-Time Selective Search on Document Streams. WSDM 2017: 221-230 - [c161]Charles L. A. Clarke, Gordon V. Cormack, Jimmy Lin, Adam Roegiest:
Ten Blue Links on Mars. WWW 2017: 273-281 - [p1]Carol Shen, Tony Shen, Jimmy Lin:
Comparative Assessment of Alignment Algorithms for NGS Data: Features, Considerations, Implementations, and Future. Algorithms for Next-Generation Sequencing Data 2017: 187-202 - [i30]Babak Ehteshami Bejnordi, Jimmy Lin, Ben Glass, Maeve Mullooly, Gretchen L. Gierach, Mark E. Sherman, Nico Karssemeijer, Jeroen van der Laak, Andrew H. Beck:
Deep learning-based assessment of tumor-associated stroma for diagnosing breast cancer in histopathology images. CoRR abs/1702.05803 (2017) - [i29]Joel M. Mackenzie, J. Shane Culpepper, Roi Blanco, Matt Crane, Charles L. A. Clarke, Jimmy Lin:
Efficient and Effective Tail Latency Minimization in Multi-Stage Retrieval Systems. CoRR abs/1704.03970 (2017) - [i28]Salman Mohammed, Nimesh Ghelani, Jimmy Lin:
Distant Supervision for Topic Classification of Tweets in Curated Streams. CoRR abs/1704.06726 (2017) - [i27]Jinfeng Rao, Ferhan Türe, Hua He, Oliver Jojic, Jimmy Lin:
Talking to Your TV: Context-Aware Voice Search with Hierarchical Recurrent Neural Networks. CoRR abs/1705.04892 (2017) - [i26]Jinfeng Rao, Hua He, Haotian Zhang, Ferhan Türe, Royal Sequiera, Salman Mohammed, Jimmy Lin:
Integrating Lexical and Temporal Signals in Neural Ranking Models for Searching Social Media Streams. CoRR abs/1707.07792 (2017) - [i25]Royal Sequiera, Gaurav Baruah, Zhucheng Tu, Salman Mohammed, Jinfeng Rao, Haotian Zhang, Jimmy Lin:
Exploring the Effectiveness of Convolutional Neural Networks for Answer Selection in End-to-End Question Answering. CoRR abs/1707.07804 (2017) - [i24]Zhucheng Tu, Matt Crane, Royal Sequiera, Junchen Zhang, Jimmy Lin:
An Exploration of Approaches to Integrating Neural Reranking Models in Multi-Stage Ranking Architectures. CoRR abs/1707.08275 (2017) - [i23]Siddhartha Sahu, Amine Mhedhbi, Semih Salihoglu, Jimmy Lin, M. Tamer Özsu:
The Ubiquity of Large Graphs and Surprising Challenges of Graph Processing: A User Survey. CoRR abs/1709.03188 (2017) - [i22]Raphael Tang, Jimmy Lin:
Honk: A PyTorch Reimplementation of Convolutional Neural Networks for Keyword Spotting. CoRR abs/1710.06554 (2017) - [i21]Raphael Tang, Jimmy Lin:
Deep Residual Learning for Small-Footprint Keyword Spotting. CoRR abs/1710.10361 (2017) - [i20]Raphael Tang, Weijie Wang, Zhucheng Tu, Jimmy Lin:
An Experimental Analysis of the Power Consumption of Convolutional Neural Networks for Keyword Spotting. CoRR abs/1711.00333 (2017) - [i19]Salman Mohammed, Peng Shi, Jimmy Lin:
Strong Baselines for Simple Question Answering over Knowledge Graphs with and without Neural Networks. CoRR abs/1712.01969 (2017) - 2016
- [j44]Jimmy Lin, Charles L. A. Clarke, Gaurav Baruah:
Searching from Mars. IEEE Internet Comput. 20(1): 78-82 (2016) - [j43]Jimmy Lin, Kareem El Gebaly:
The Future of Big Data Is ... JavaScript? IEEE Internet Comput. 20(5): 82-88 (2016) - [j42]Aneesh Sharma, Jerry Jiang, Praveen Bommannavar, Brian Larson, Jimmy Lin:
GraphJet: Real-Time Content Recommendations at Twitter. Proc. VLDB Endow. 9(13): 1281-1292 (2016) - [j41]Ahmed El-Roby, Khaled Ammar, Ashraf Aboulnaga, Jimmy Lin:
Sapphire: Querying RDF Data Made Simple. Proc. VLDB Endow. 9(13): 1481-1484 (2016) - [j40]Abdul Quamar, Amol Deshpande, Jimmy Lin:
NScale: neighborhood-centric large-scale graph analytics in the cloud. VLDB J. 25(2): 125-150 (2016) - [c160]Andrew Trotman, Jimmy Lin:
In Vacuo and In Situ Evaluation of SIMD Codecs. ADCS 2016: 1-8 - [c159]J. Shane Culpepper, Charles L. A. Clarke, Jimmy Lin:
Dynamic Cutoff Prediction in Multi-Stage Retrieval Systems. ADCS 2016: 17-24 - [c158]Cody Buntain, Jimmy Lin, Jennifer Golbeck:
Discovering key moments in social media streams. CCNC 2016: 366-374 - [c157]Jinfeng Rao, Hua He, Jimmy Lin:
Noise-Contrastive Estimation for Answer Selection with Deep Neural Networks. CIKM 2016: 1913-1916 - [c156]Gaurav Baruah, Haotian Zhang, Rakesh Guttikonda, Jimmy Lin, Mark D. Smucker, Olga Vechtomova:
Optimizing Nugget Annotations with Active Learning. CIKM 2016: 2359-2364 - [c155]Ian Milligan, Jimmy Lin, Jeremy Wiebe, Alice Zhou:
Exploring and Discovering Archive-It Collections with Warcbase. DH 2016: 285-288 - [c154]Jimmy Lin, Matt Crane, Andrew Trotman, Jamie Callan, Ishan Chattopadhyaya, John Foley, Grant Ingersoll, Craig MacDonald, Sebastiano Vigna:
Toward Reproducible Baselines: The Open-Source IR Reproducibility Challenge. ECIR 2016: 408-420 - [c153]Jinfeng Rao, Xing Niu, Jimmy Lin:
Compressing and Decoding Term Statistics Time Series. ECIR 2016: 675-681 - [c152]Charles L. A. Clarke, Gordon V. Cormack, Jimmy Lin, Adam Roegiest:
Total Recall: Blue Sky on Mars. ICTIR 2016: 45-48 - [c151]Jiaul H. Paik, Jimmy Lin:
Retrievability in API-Based "Evaluation as a Service". ICTIR 2016: 91-94 - [c150]Ahmed Elbagoury, Matt Crane, Jimmy Lin:
Rank-at-a-Time Query Processing. ICTIR 2016: 229-232 - [c149]Jinfeng Rao, Jimmy Lin:
Temporal Query Expansion Using a Continuous Hidden Markov Model. ICTIR 2016: 295-298 - [c148]Andrew Jackson, Jimmy Lin, Ian Milligan, Nick Ruest:
Desiderata for Exploratory Search Interfaces to Web Archives in Support of Scholarly Activities. JCDL 2016: 103-106 - [c147]Ian Milligan, Nick Ruest, Jimmy Lin:
Content Selection and Curation for Web Archiving: The Gatekeepers vs. the Masses. JCDL 2016: 107-110 - [c146]Jimmy Lin, Zhucheng Tu, Michael Rose, Patrick White:
Prizm: A Wireless Access Point for Proxy-Based Web Lifelogging. LTA@MM 2016: 19-25 - [c145]Hua He, Jimmy Lin:
Pairwise Word Interaction Modeling with Deep Neural Networks for Semantic Similarity Measurement. HLT-NAACL 2016: 937-948 - [c144]Douglas W. Oard, Katie Shilton, Jimmy Lin:
Evaluating Search Among Secrets. EVIA@NTCIR 2016 - [c143]Praveen Bommannavar, Jimmy Lin, Anand Rajaraman:
Estimating topical volume in social media streams. SAC 2016: 1096-1101 - [c142]Hua He, John Wieting, Kevin Gimpel, Jinfeng Rao, Jimmy Lin:
UMD-TTIC-UW at SemEval-2016 Task 1: Attention-Based Multi-Perspective Convolutional Neural Networks for Textual Similarity Measurement. SemEval@NAACL-HLT 2016: 1103-1108 - [c141]Xin Qian, Jimmy Lin, Adam Roegiest:
Interleaved Evaluation for Retrospective Summarization and Prospective Notification on Document Streams. SIGIR 2016: 175-184 - [c140]Luchen Tan, Adam Roegiest, Jimmy Lin, Charles L. A. Clarke:
An Exploration of Evaluation Metrics for Mobile Push Notifications. SIGIR 2016: 741-744 - [c139]Cody Buntain, Jimmy Lin:
Burst Detection in Social Media Streams for Tracking Interest Profiles in Real Time. SIGIR 2016: 777-780 - [c138]Haotian Zhang, Jimmy Lin, Gordon V. Cormack, Mark D. Smucker:
Sampling Strategies and Active Learning for Volume Estimation. SIGIR 2016: 981-984 - [c137]Luchen Tan, Adam Roegiest, Charles L. A. Clarke, Jimmy Lin:
Simple Dynamic Emission Strategies for Microblog Filtering. SIGIR 2016: 1009-1012 - [c136]Adam Roegiest, Luchen Tan, Jimmy Lin, Charles L. A. Clarke:
A Platform for Streaming Push Notifications to Mobile Assessors. SIGIR 2016: 1077-1080 - [c135]Jimmy Lin, Adam Roegiest, Luchen Tan, Richard McCreadie, Ellen M. Voorhees, Fernando Diaz:
Overview of the TREC 2016 Real-Time Summarization Track. TREC 2016 - [i18]Kareem El Gebaly, Jimmy Lin:
Afterburner: The Case for In-Browser Analytics. CoRR abs/1605.04035 (2016) - [i17]Luchen Tan, Jimmy Lin, Adam Roegiest, Charles L. A. Clarke:
The Effects of Latency Penalties in Evaluating Push Notification Systems. CoRR abs/1606.03066 (2016) - [i16]J. Shane Culpepper, Charles L. A. Clarke, Jimmy Lin:
Dynamic Trade-Off Prediction in Multi-Stage Retrieval Systems. CoRR abs/1610.02502 (2016) - [i15]Charles L. A. Clarke, Gordon V. Cormack, Jimmy Lin, Adam Roegiest:
Ten Blue Links on Mars. CoRR abs/1610.06468 (2016) - 2015
- [j39]Jimmy Lin:
Is Big Data a Transient Problem? IEEE Internet Comput. 19(5): 86-90 (2015) - [j38]Frank Hopfgartner, Allan Hanbury, Henning Müller, Noriko Kando, Simon Mercer, Jayashree Kalpathy-Cramer, Martin Potthast, Tim Gollub, Anastasia Krithara, Jimmy Lin, Krisztian Balog, Ivan Eggel:
Report on the Evaluation-as-a-Service (EaaS) Expert Workshop. SIGIR Forum 49(1): 57-65 (2015) - [j37]Jaime Arguello, Matt Crane, Fernando Diaz, Jimmy Lin, Andrew Trotman:
Report on the SIGIR 2015 Workshop on Reproducibility, Inexplicability, and Generalizability of Results (RIGOR). SIGIR Forum 49(2): 107-116 (2015) - [j36]Hua He, Jimmy Lin, Adam Lopez:
Gappy Pattern Matching on GPUs for On-Demand Extraction of Hierarchical Translation Grammars. Trans. Assoc. Comput. Linguistics 3: 87-100 (2015) - [c134]Jinfeng Rao, Jimmy Lin, Miles Efron:
Reproducible Experiments on Lexical and Temporal Feedback for Tweet Search. ECIR 2015: 755-767 - [c133]Hua He, Kevin Gimpel, Jimmy Lin:
Multi-Perspective Sentence Similarity Modeling with Convolutional Neural Networks. EMNLP 2015: 1576-1586 - [c132]Jimmy Lin, Andrew Trotman:
Anytime Ranking for Impact-Ordered Indexes. ICTIR 2015: 301-304 - [c131]Jimmy Lin:
Building a Self-Contained Search Engine in the Browser. ICTIR 2015: 309-312 - [c130]Yulu Wang, Jimmy Lin:
The Feasibility of Brute Force Scans for Real-Time Tweet Search. ICTIR 2015: 321-324 - [c129]Sarah Weissman, Samet Ayhan, Joshua Bradley, Jimmy Lin:
Identifying Duplicate and Contradictory Information in Wikipedia. JCDL 2015: 57-60 - [c128]Jimmy Lin:
The Sum of All Human Knowledge in Your Pocket: Full-Text Searchable Wikipedia on a Raspberry Pi. JCDL 2015: 85-86 - [c127]Dean F. Sittig, Allison B. McCoy, Adam Wright, Jimmy Lin:
Developing an Open-Source Bibliometric Ranking Website Using Google Scholar Citation Profiles for Researchers in the Field of Biomedical Informatics. MedInfo 2015: 1004 - [c126]Yulu Wang, Garrick Sherman, Jimmy Lin, Miles Efron:
Assessor Differences and User Preferences in Tweet Timeline Generation. SIGIR 2015: 615-624 - [c125]Jaime Arguello, Fernando Diaz, Jimmy Lin, Andrew Trotman:
SIGIR 2015 Workshop on Reproducibility, Inexplicability, and Generalizability of Results (RIGOR). SIGIR 2015: 1147-1148 - [c124]Cody Buntain, Jimmy Lin:
Burst Detection in Social Media Streams for Tracking Interest Profiles in Real Time. TREC 2015 - [c123]Jimmy Lin, Miles Efron, Garrick Sherman, Yulu Wang, Ellen M. Voorhees:
Overview of the TREC-2015 Microblog Track. TREC 2015 - [c122]Jimmy Lin:
Scaling Down Distributed Infrastructure on Wimpy Machines for Personal Web Archiving. WWW (Companion Volume) 2015: 1351-1355 - [i14]Cody Buntain, Jimmy Lin, Jennifer Golbeck:
Learning to Discover Key Moments in Social Media Streams. CoRR abs/1508.00488 (2015) - [i13]Allan Hanbury, Henning Müller, Krisztian Balog, Torben Brodt, Gordon V. Cormack, Ivan Eggel, Tim Gollub, Frank Hopfgartner, Jayashree Kalpathy-Cramer, Noriko Kando, Anastasia Krithara, Jimmy Lin, Simon Mercer, Martin Potthast:
Evaluation-as-a-Service: Overview and Outlook. CoRR abs/1512.07454 (2015) - 2014
- [j35]Pankaj Gupta, Venu Satuluri, Ajeet Grewal, Siva Gurumurthy, Volodymyr Zhabiuk, Quannan Li, Jimmy Lin:
Real-Time Twitter Recommendation: Online Motif Detection in Large Dynamic Graphs. Proc. VLDB Endow. 7(13): 1379-1380 (2014) - [j34]P. Oscar Boykin, Sam Ritchie, Ian O'Connell, Jimmy Lin:
Summingbird: A Framework for Integrating Batch and Online MapReduce Computations. Proc. VLDB Endow. 7(13): 1441-1451 (2014) - [j33]Abdul Quamar, Amol Deshpande, Jimmy Lin:
NScale: Neighborhood-centric Analytics on Large Graphs. Proc. VLDB Endow. 7(13): 1673-1676 (2014) - [j32]Nima Asadi, Jimmy Lin, Arjen P. de Vries:
Runtime Optimizations for Tree-Based Machine Learning Models. IEEE Trans. Knowl. Data Eng. 26(9): 2281-2292 (2014) - [j31]Ferhan Türe, Jimmy Lin:
Exploiting Representations from Statistical Machine Translation for Cross-Language Information Retrieval. ACM Trans. Inf. Syst. 32(4): 19:1-19:32 (2014) - [c121]Alan Said, Alejandro Bellogín, Jimmy Lin, Arjen P. de Vries:
Do recommendations matter?: news recommendation in real life. CSCW Companion 2014: 237-240 - [c120]Gebrekirstos G. Gebremeskel, Jiyin He, Arjen P. de Vries, Jimmy Lin:
Cumulative Citation Recommendation: A Feature-Aware Comparison of Approaches. DEXA Workshops 2014: 193-197 - [c119]Jimmy Lin, Kari Kraus, Ricardo L. Punzalan:
Supporting "Distant Reading" for Web Archives. DH 2014 - [c118]Yulu Wang, Jimmy Lin:
The Impact of Future Term Statistics in Real-Time Tweet Search. ECIR 2014: 567-572 - [c117]Hannes Mühleisen, Thaer Samar, Jimmy Lin, Arjen P. de Vries:
Column Stores as an IR Prototyping Tool. ECIR 2014: 789-792 - [c116]K. Ashwin Kumar, Jonathan Gluck, Amol Deshpande, Jimmy Lin:
Optimization Techniques for "Scaling Down" Hadoop on Multi-Core, Shared-Memory Systems. EDBT 2014: 13-24 - [c115]Jinfeng Rao, Jimmy Lin, Hanan Samet:
Partitioning strategies for spatio-textual similarity join. BigSpatial@SIGSPATIAL 2014: 40-49 - [c114]Krist Wongsuphasawat, Jimmy Lin:
Using visualizations to monitor changes and harvest insights from a global-scale logging infrastructure at Twitter. IEEE VAST 2014: 113-122 - [c113]Zhengzheng Xu, Dan Goldwasser, Benjamin B. Bederson, Jimmy Lin:
Visual analytics of MOOCs at maryland. L@S 2014: 195-196 - [c112]Miles Efron, Jimmy Lin, Jiyin He, Arjen P. de Vries:
Temporal feedback for tweet search with non-parametric density estimation. SIGIR 2014: 33-42 - [c111]Hannes Mühleisen, Thaer Samar, Jimmy Lin, Arjen P. de Vries:
Old dogs are great at new tricks: column stores for ir prototyping. SIGIR 2014: 863-866 - [c110]Ellen M. Voorhees, Jimmy Lin, Miles Efron:
On run diversity in Evaluation as a Service. SIGIR 2014: 959-962 - [c109]Jimmy Lin, Yulu Wang, Miles Efron, Garrick Sherman:
Overview of the TREC-2014 Microblog Track. TREC 2014 - [c108]Jimmy Lin, Miles Efron:
Infrastructure support for evaluation as a service. WWW (Companion Volume) 2014: 79-82 - [c107]Lidan Wang, Jimmy Lin, Donald Metzler, Jiawei Han:
Learning to efficiently rank on big data. WWW (Companion Volume) 2014: 209-210 - [c106]Seth A. Myers, Aneesh Sharma, Pankaj Gupta, Jimmy Lin:
Information network or social network?: the structure of the twitter follow graph. WWW (Companion Volume) 2014: 493-498 - [c105]Jimmy Lin, Milad Gholami, Jinfeng Rao:
Infrastructure for supporting exploration and discovery in web archives. WWW (Companion Volume) 2014: 851-856 - [e2]Jimmy Lin, Jian Pei, Xiaohua Hu, Wo Chang, Raghunath Nambiar, Charu C. Aggarwal, Nick Cercone, Vasant G. Honavar, Jun Huan, Bamshad Mobasher, Saumyadipta Pyne:
2014 IEEE International Conference on Big Data (IEEE BigData 2014), Washington, DC, USA, October 27-30, 2014. IEEE Computer Society 2014, ISBN 978-1-4799-5665-4 [contents] - [i12]Abdul Quamar, Amol Deshpande, Jimmy Lin:
NScale: Neighborhood-centric Large-Scale Graph Analytics in the Cloud. CoRR abs/1405.1499 (2014) - [i11]Sarah Weissman, Samet Ayhan, Joshua Bradley, Jimmy Lin:
Identifying Duplicate and Contradictory Information in Wikipedia. CoRR abs/1406.1143 (2014) - [i10]Jimmy Lin:
On the Feasibility and Implications of Self-Contained Search Engines in the Browser. CoRR abs/1410.4500 (2014) - 2013
- [j30]Jimmy Lin:
Mapreduce is Good Enough?If All You Have is a Hammer, Throw Away Everything That's Not a Nail! Big Data 1(1): 28-37 (2013) - [j29]Nima Asadi, Jimmy Lin:
Document vector representations for feature extraction in multi-stage document ranking. Inf. Retr. 16(6): 747-768 (2013) - [j28]K. Ashwin Kumar, Jonathan Gluck, Amol Deshpande, Jimmy Lin:
Hone: "Scaling Down" Hadoop on Shared-Memory Systems. Proc. VLDB Endow. 6(12): 1354-1357 (2013) - [j27]Jimmy Lin, Miles Efron:
Evaluation as a service for information retrieval. SIGIR Forum 47(2): 8-14 (2013) - [j26]Nima Asadi, Jimmy Lin:
Fast candidate generation for real-time tweet search with bloom filter chains. ACM Trans. Inf. Syst. 31(3): 13 (2013) - [c104]Vladimir Eidelman, Ke Wu, Ferhan Türe, Philip Resnik, Jimmy Lin:
Mr. MIRA: Open-Source Large-Margin Structured Learning on MapReduce. ACL (Conference System Demonstrations) 2013: 199-204 - [c103]Alan Said, Jimmy Lin, Alejandro Bellogín, Arjen P. de Vries:
A month in the life of a production news recommender system. LivingLab@CIKM 2013: 7-10 - [c102]Nima Asadi, Jimmy Lin:
Training Efficient Tree-Based Models for Document Ranking. ECIR 2013: 146-157 - [c101]Miguel Rios, Jimmy Lin:
Visualizing the "Pulse" of World Cities on Twitter. ICWSM 2013 - [c100]Nima Asadi, Jimmy Lin, Michael Busch:
Dynamic memory allocation policies for postings in real-time Twitter search. KDD 2013: 1186-1194 - [c99]Hua He, Jimmy Lin, Adam Lopez:
Massively Parallel Suffix Array Queries and On-Demand Phrase Extraction for Statistical Machine Translation Using GPUs. HLT-NAACL 2013: 325-334 - [c98]Ferhan Türe, Jimmy Lin:
Flat vs. hierarchical phrase-based translation models for cross-language information retrieval. SIGIR 2013: 813-816 - [c97]Nima Asadi, Jimmy Lin:
Effectiveness/efficiency tradeoffs for candidate generation in multi-stage retrieval architectures. SIGIR 2013: 997-1000 - [c96]Gilad Mishne, Jeff Dalton, Zhenghua Li, Aneesh Sharma, Jimmy Lin:
Fast data in the era of big data: Twitter's real-time related query suggestion architecture. SIGMOD Conference 2013: 1147-1158 - [c95]Alejandro Bellogín, Gebrekirstos G. Gebremeskel, Jiyin He, Alan Said, Thaer Samar, Arjen P. de Vries, Jimmy Lin, Jeroen B. P. Vuurens:
CWI and TU Delft Notebook TREC 2013: Contextual Suggestion, Federated Web Search, KBA, and Web Tracks. TREC 2013 - [c94]Jimmy Lin, Miles Efron:
Overview of the TREC-2013 Microblog Track. TREC 2013 - [c93]Vladimir Eidelman, Ke Wu, Ferhan Türe, Philip Resnik, Jimmy Lin:
Towards Efficient Large-Scale Feature-Rich Statistical Machine Translation. WMT@ACL 2013: 128-133 - [c92]Pankaj Gupta, Ashish Goel, Jimmy Lin, Aneesh Sharma, Dong Wang, Reza Zadeh:
WTF: the who to follow service at Twitter. WWW 2013: 505-514 - [i9]Nima Asadi, Jimmy Lin, Michael Busch:
Dynamic Memory Allocation Policies for Postings in Real-Time Twitter Search. CoRR abs/1302.5302 (2013) - [i8]Jimmy Lin:
Monoidify! Monoids as a Design Principle for Efficient MapReduce Algorithms. CoRR abs/1304.7544 (2013) - [i7]Nima Asadi, Jimmy Lin:
Fast, Incremental Inverted Indexing in Main Memory for Web-Scale Collections. CoRR abs/1305.0699 (2013) - 2012
- [j25]George Lee, Jimmy Lin, Chuang Liu, Andrew Lorek, Dmitriy V. Ryaboy:
The Unified Logging Infrastructure for Data Analytics at Twitter. Proc. VLDB Endow. 5(12): 1771-1780 (2012) - [j24]Jimmy Lin, Dmitriy V. Ryaboy:
Scaling big data mining infrastructure: the twitter experience. SIGKDD Explor. 14(2): 6-19 (2012) - [c91]Nima Asadi, Jimmy Lin:
Fast candidate generation for two-phase document ranking: postings list intersection with bloom filters. CIKM 2012: 2419-2422 - [c90]Ferhan Türe, Jimmy Lin, Douglas W. Oard:
Combining Statistical Translation Techniques for Cross-Language Information Retrieval. COLING 2012: 2685-2702 - [c89]Michael Busch, Krishna Gade, Brian Larson, Patrick Lok, Samuel Luckenbill, Jimmy Lin:
Earlybird: Real-Time Search at Twitter. ICDE 2012: 1360-1369 - [c88]Jimmy Lin, Gilad Mishne:
A Study of "Churn" in Tweets and Real-Time Search Queries. ICWSM 2012 - [c87]Dean McCullough, Jimmy Lin, Craig Macdonald, Iadh Ounis, Richard McCreadie:
Evaluating Real-Time Search over Tweets. ICWSM 2012 - [c86]Ferhan Türe, Jimmy Lin:
Why Not Grab a Free Lunch? Mining Large Corpora for Parallel Sentences to Improve Translation Modeling. HLT-NAACL 2012: 626-630 - [c85]Ferhan Türe, Jimmy Lin, Douglas W. Oard:
Looking inside the box: context-sensitive translation for cross-language information retrieval. SIGIR 2012: 1105-1106 - [c84]Richard McCreadie, Ian Soboroff, Jimmy Lin, Craig Macdonald, Iadh Ounis, Dean McCullough:
On building a reusable Twitter corpus. SIGIR 2012: 1113-1114 - [c83]Gilad Mishne, Jimmy Lin:
Twanchor text: a preliminary study of the value of tweets as anchor text. SIGIR 2012: 1159-1160 - [c82]Jimmy Lin, Alek Kolcz:
Large-scale machine learning at twitter. SIGMOD Conference 2012: 793-804 - [c81]Ian Soboroff, Iadh Ounis, Craig Macdonald, Jimmy Lin:
Overview of the TREC-2012 Microblog Track. TREC 2012 - [i6]Jimmy Lin, Gilad Mishne:
A Study of "Churn" in Tweets and Real-Time Search Queries (Extended Version). CoRR abs/1205.6855 (2012) - [i5]George Lee, Jimmy Lin, Chuang Liu, Andrew Lorek, Dmitriy V. Ryaboy:
The Unified Logging Infrastructure for Data Analytics at Twitter. CoRR abs/1208.4171 (2012) - [i4]Jimmy Lin:
MapReduce is Good Enough? If All You Have is a Hammer, Throw Away Everything That's Not a Nail! CoRR abs/1209.2191 (2012) - [i3]Gilad Mishne, Jeff Dalton, Zhenghua Li, Aneesh Sharma, Jimmy Lin:
Fast Data in the Era of Big Data: Twitter's Real-Time Related Query Suggestion Architecture. CoRR abs/1210.7350 (2012) - [i2]Nima Asadi, Jimmy Lin, Arjen P. de Vries:
Runtime Optimizations for Prediction with Tree-Based Models. CoRR abs/1212.2287 (2012) - 2011
- [j23]Gregory V. Chockler, Eliezer Dekel, Joseph F. JáJá, Jimmy Lin:
Special Issue on Cloud Computing. J. Parallel Distributed Comput. 71(6): 731 (2011) - [c80]Tamer Elsayed, Jimmy Lin, Donald Metzler:
When close enough is good enough: approximate positional indexes for efficient ranked retrieval. CIKM 2011: 1993-1996 - [c79]Florian Leibert, Jake Mannix, Jimmy Lin, Babak Hamadani:
Automatic management of partitioned, replicated search services. SoCC 2011: 27 - [c78]Earl J. Wagner, Jimmy Lin:
In-depth accounts and passing mentions in the news: connecting readers to the context of a news event. iConference 2011: 790-791 - [c77]Jimmy Lin, Rion Snow, William Morgan:
Smoothing techniques for adaptive online language models: topic tracking in tweet streams. KDD 2011: 422-429 - [c76]Lidan Wang, Jimmy Lin, Donald Metzler:
A cascade ranking model for efficient ranked retrieval. SIGIR 2011: 105-114 - [c75]Ferhan Türe, Tamer Elsayed, Jimmy Lin:
No free lunch: brute force vs. locality-sensitive hashing for cross-lingual pairwise similarity. SIGIR 2011: 943-952 - [c74]Nima Asadi, Donald Metzler, Tamer Elsayed, Jimmy Lin:
Pseudo test collections for learning web search ranking functions. SIGIR 2011: 1073-1082 - [c73]Nima Asadi, Donald Metzler, Jimmy Lin:
Cross-corpus relevance projection. SIGIR 2011: 1163-1164 - [c72]Iadh Ounis, Craig Macdonald, Jimmy Lin, Ian Soboroff:
Overview of the TREC 2011 Microblog Track. TREC 2011 - 2010
- [b2]Jimmy Lin, Chris Dyer:
Data-Intensive Text Processing with MapReduce. Synthesis Lectures on Human Language Technologies, Morgan & Claypool Publishers 2010, ISBN 978-3-031-01008-8 - [c71]Lidan Wang, Donald Metzler, Jimmy Lin:
Ranking under temporal constraints. CIKM 2010: 79-88 - [c70]Di-Wei Huang, Jimmy Lin:
Scaling Populations of a Genetic Algorithm for Job Shop Scheduling Problems Using MapReduce. CloudCom 2010: 780-785 - [c69]Jimmy Lin, Michael Schatz:
Design patterns for efficient graph algorithms in MapReduce. MLG@KDD 2010: 78-85 - [c68]Jimmy Lin, Chris Dyer:
Data-Intensive Text Processing with MapReduce. NAACL (Tutorial Abstracts) 2010: 1-2 - [c67]Jimmy Lin, Nitin Madnani, Bonnie J. Dorr:
Putting the User in the Loop: Interactive Maximal Marginal Relevance for Query-Focused Summarization. HLT-NAACL 2010: 305-308 - [c66]Lidan Wang, Jimmy Lin, Donald Metzler:
Learning to efficiently rank. SIGIR 2010: 138-145 - [c65]Tamer Elsayed, Nima Asadi, Lidan Wang, Jimmy Lin, Donald Metzler:
UMD and USC/ISI: TREC 2010 Web Track Experiments with Ivory. TREC 2010
2000 – 2009
- 2009
- [j22]Jimmy Lin:
Is searching full text more effective than searching abstracts? BMC Bioinform. 10 (2009) - [j21]Paul T. Jaeger, Jimmy Lin, Justin M. Grimes, Shannon N. Simmons:
Where is the Cloud? Geography, Economics, Environment, and Jurisdiction in Cloud Computing. First Monday 14(5) (2009) - [j20]Jimmy Lin, W. John Wilbur:
Modeling actions of PubMed users with n-gram language models. Inf. Retr. 12(4): 487-503 (2009) - [j19]Timothy Hawes, Jimmy Lin, Philip Resnik:
Elements of a computational model for multi-party discourse: The turn-taking behavior of Supreme Court justices. J. Assoc. Inf. Sci. Technol. 60(8): 1607-1615 (2009) - [j18]Gregory V. Chockler, Eliezer Dekel, Joseph F. JáJá, Jimmy Lin:
Special Issue of the Journal of Parallel and Distributed Computing: Cloud Computing. J. Parallel Distributed Comput. 69(9): 813 (2009) - [j17]Jimmy Lin, G. Craig Murray, Bonnie J. Dorr, Jan Hajic, Pavel Pecina:
A cost-effective lexical acquisition process for large-scale thesaurus translation. Lang. Resour. Evaluation 43(1): 27-40 (2009) - [j16]Judith L. Klavans, Carolyn Sheffield, Eileen G. Abels, Jimmy Lin, Rebecca J. Passonneau, Tandeep Sidhu, Dagobert Soergel:
Computational linguistics for metadata building (CLiMB): using text mining for the automatic identification, categorization, and disambiguation of subject terms for image metadata. Multim. Tools Appl. 42(1): 115-138 (2009) - [c64]Michael D. Lieberman, Jimmy Lin:
You Are Where You Edit: Locating Wikipedia Contributors through Edit Histories. ICWSM 2009 - [c63]G. Craig Murray, Jimmy Lin, W. John Wilbur, Zhiyong Lu:
Users' adjustments to unsuccessful queries in biomedical search. JCDL 2009: 433-434 - [c62]Jimmy Lin, Chris Dyer:
Data Intensive Text Processing with MapReduce. HLT-NAACL (Tutorial Abstracts) 2009: 1-2 - [c61]Jimmy Lin:
Brute force and indexed approaches to pairwise document similarity comparisons with MapReduce. SIGIR 2009: 155-162 - [c60]Jimmy Lin:
The Curse of Zipf and Limits to Parallelization: An Look at the Stragglers Problem in MapReduce. LSDS-IR@SIGIR 2009 - [c59]Jimmy Lin, Tamer Elsayed, Lidan Wang, Donald Metzler:
Of Ivory and Smurfs: Loxodontan MapReduce Experiments for Web Search. TREC 2009 - [e1]David Wai-Lok Cheung, Il-Yeol Song, Wesley W. Chu, Xiaohua Hu, Jimmy Lin:
Proceedings of the 18th ACM Conference on Information and Knowledge Management, CIKM 2009, Hong Kong, China, November 2-6, 2009. ACM 2009, ISBN 978-1-60558-512-3 [contents] - [r1]Jimmy Lin:
Summarization. Encyclopedia of Database Systems 2009: 2884-2889 - 2008
- [j15]Jimmy Lin:
PageRank without hyperlinks: Reranking with PubMed related article networks for biomedical text retrieval. BMC Bioinform. 9 (2008) - [j14]David M. Zajic, Bonnie J. Dorr, Jimmy Lin:
Single-document and multi-document summarization techniques for email threads using sentence compression. Inf. Process. Manag. 44(4): 1600-1610 (2008) - [j13]Jimmy Lin, Michael DiCuccio, Vahan Grigoryan, W. John Wilbur:
Navigating information spaces: A case study of related article search in PubMed. Inf. Process. Manag. 44(5): 1771-1783 (2008) - [j12]Jimmy Lin, Philip Fei Wu, Eileen G. Abels:
Toward automatic facet analysis and need negotiation: Lessons from mediated search. ACM Trans. Inf. Syst. 27(1): 6:1-6:42 (2008) - [c58]Tamer Elsayed, Jimmy Lin, Douglas W. Oard:
Pairwise Document Similarity in Large Collections with MapReduce. ACL (2) 2008: 265-268 - [c57]Jimmy Lin:
Scalable Language Processing Algorithms for the Masses: A Case Study in Computing Word Co-occurrence Matrices with MapReduce. EMNLP 2008: 419-428 - [c56]Judith L. Klavans, Carolyn Sheffield, Jimmy Lin, Tandeep Sidhu:
Computational linguistics for metadata building. JCDL 2008: 427 - [c55]Jimmy Lin, Mark D. Smucker:
How do users find things with PubMed?: towards automatic utility evaluation with user simulations. SIGIR 2008: 19-26 - [c54]Chris Dyer, Aaron Cordova, Alex Mont, Jimmy Lin:
Fast, Easy, and Cheap: Construction of Statistical Machine Translation Models with MapReduce. WMT@ACL 2008: 199-207 - [i1]Saif M. Mohammad, Bonnie J. Dorr, Melissa Egan, Nitin Madnani, David M. Zajic, Jimmy Lin:
Multiple Alternative Sentence Compressions and Word-Pair Antonymy for Automatic Text Summarization and Recognizing Textual Entailment. TAC 2008 - 2007
- [j11]Jimmy Lin, W. John Wilbur:
PubMed related articles: a probabilistic topic-based model for content similarity. BMC Bioinform. 8 (2007) - [j10]Dina Demner-Fushman, Jimmy Lin:
Answering Clinical Questions with Knowledge-Based and Statistical Techniques. Comput. Linguistics 33(1): 63-103 (2007) - [j9]Jimmy Lin:
User simulations for evaluating answers to question series. Inf. Process. Manag. 43(3): 717-729 (2007) - [j8]David M. Zajic, Bonnie J. Dorr, Jimmy Lin, Richard M. Schwartz:
Multi-candidate reduction: Sentence compression as a tool for document summarization tasks. Inf. Process. Manag. 43(6): 1549-1570 (2007) - [j7]Jimmy Lin, W. John Wilbur:
Syntactic sentence compression in the biomedical domain: facilitating access to related articles. Inf. Retr. 10(4-5): 393-414 (2007) - [j6]Paul B. Kantor, Jimmy Lin:
Presentation schemes for component analysis in IR experiments. SIGIR Forum 41(1): 34-39 (2007) - [j5]Diane Kelly, Jimmy Lin:
Overview of the TREC 2006 ciQA task. SIGIR Forum 41(1): 107-116 (2007) - [j4]Jimmy Lin:
An exploration of the principles underlying redundancy-based factoid question answering. ACM Trans. Inf. Syst. 25(2): 6 (2007) - [c53]Hoa Trang Dang, Jimmy Lin:
Different Structures for Evaluating Answers to Complex Questions: Pyramids Won't Topple, and Neither Will Human Assessors. ACL 2007 - [c52]Jimmy Lin, Dina Demner-Fushman:
Semantic Clustering of Answers to Clinical Questions. AMIA 2007 - [c51]Tandeep Sidhu, Judith Klavans, Jimmy Lin:
Concept Disambiguation for Improved Subject Access Using Multiple Knowledge Sources. LaTeCH@ACL 2007 2007: 25-32 - [c50]Jimmy Lin:
Is Question Answering Better than Information Retrieval? Towards a Task-Based Evaluation Framework for Question Series. HLT-NAACL 2007: 212-219 - [c49]Jimmy Lin, Pengyi Zhang:
Deconstructing nuggets: the stability and reliability of complex question answering evaluation. SIGIR 2007: 327-334 - [c48]Hoa Trang Dang, Diane Kelly, Jimmy Lin:
Overview of the TREC 2007 Question Answering Track. TREC 2007 - [c47]Nitin Madnani, Jimmy Lin, Bonnie J. Dorr:
TREC 2007 ciQA Task: University of Maryland. TREC 2007 - 2006
- [j3]Jimmy Lin, Dina Demner-Fushman:
Methods for automatically evaluating answers to complex questions. Inf. Retr. 9(5): 565-587 (2006) - [j2]Jimmy Lin, Boris Katz:
Building a reusable test collection for question answering. J. Assoc. Inf. Sci. Technol. 57(7): 851-861 (2006) - [c46]Dina Demner-Fushman, Jimmy Lin:
Answer Extraction, Semantic Clustering, and Extractive Summarization for Clinical Question Answering. ACL 2006 - [c45]Jimmy Lin:
The Role of Information Retrieval in Answering Complex Questions. ACL 2006 - [c44]G. Craig Murray, Bonnie J. Dorr, Jimmy Lin, Jan Hajic, Pavel Pecina:
Leveraging Reusability: Cost-Effective Lexical Acquisition for Large-Scale Ontology Translation. ACL 2006 - [c43]Xiaoli Huang, Jimmy Lin, Dina Demner-Fushman:
Evaluation of PICO as a Knowledge Representation for Clinical Questions. AMIA 2006 - [c42]G. Craig Murray, Jimmy Lin, Abdur Chowdhury:
Identification of user sessions with hierarchical agglomerative clustering. ASIST 2006: 1-9 - [c41]Jimmy Lin, Damianos G. Karakos, Dina Demner-Fushman, Sanjeev Khudanpur:
Generative Content Models for Structural Analysis of Medical Abstracts. BioNLP@NAACL-HLT 2006: 65-72 - [c40]G. Craig Murray, Bonnie J. Dorr, Jimmy Lin, Jan Hajic, Pavel Pecina:
Leveraging Recurrent Phrase Structure in Large-scale Ontology Translation. EAMT 2006 - [c39]Jimmy Lin, Dina Demner-Fushman:
Will Pyramids Built of Nuggets Topple Over?. HLT-NAACL 2006 - [c38]Jimmy Lin, Dina Demner-Fushman:
The role of knowledge in conceptual retrieval: a study in the domain of clinical medicine. SIGIR 2006: 99-106 - [c37]Jimmy Lin, Philip Fei Wu, Dina Demner-Fushman, Eileen G. Abels:
Exploring the limits of single-iteration clarification dialogs. SIGIR 2006: 469-476 - [c36]G. Craig Murray, Jimmy Lin, Abdur Chowdhury:
Action modeling: language models that predict query behavior. SIGIR 2006: 681-682 - [c35]Hoa Trang Dang, Jimmy Lin, Diane Kelly:
Overview of the TREC 2006 Question Answering Track 99. TREC 2006 - [c34]Douglas W. Oard, Tamer Elsayed, Jianqiang Wang, Yejun Wu, Pengyi Zhang, Eileen G. Abels, Jimmy Lin, Dagobert Soergel:
TREC 2006 at Maryland: Blog, Enterprise, Legal and QA Tracks. TREC 2006 - 2005
- [c33]Jimmy Lin, Dina Demner-Fushman:
Evaluating Summaries and Answers: Two Sides of the Same Coin? IEEvaluation@ACL 2005: 41-48 - [c32]Jimmy Lin, Dina Demner-Fushman:
"Bag of Words" is not enough for Strength of Evidence Classification. AMIA 2005 - [c31]Jimmy Lin, Dina Demner-Fushman:
Automatically Evaluating Answers to Definition Questions. HLT/EMNLP 2005: 931-938 - [c30]Jimmy Lin:
Evaluation of resources for question answering evaluation. SIGIR 2005: 392-399 - [c29]Jimmy Lin, G. Craig Murray:
Assessing the term independence assumption in blind relevance feedback. SIGIR 2005: 635-636 - [c28]Alan R. Aronson, Dina Demner-Fushman, Susanne M. Humphrey, Jimmy Lin, Patrick Ruch, Miguel E. Ruiz, Lawrence H. Smith, Lorraine K. Tanabe, W. John Wilbur, Hongfang Liu:
Fusion of Knowledge-Intensive and Statistical Approaches for Retrieving and Annotating Textual Genomics Documents. TREC 2005 - [c27]Jimmy Lin, Eileen G. Abels, Dina Demner-Fushman, Douglas W. Oard, Philip Fei Wu, Yejun Wu:
A Menagerie of Tracks at Maryland: HARD, Enterprise, QA, and Genomics, Oh My! TREC 2005 - 2004
- [b1]Jimmy Lin:
Event structure and the encoding of arguments: the syntax of the Mandarin and English verb phrase. Massachusetts Institute of Technology, Cambridge, MA, USA, 2004 - [c26]Wesley Hildebrandt, Boris Katz, Jimmy Lin:
Answering Definition Questions Using Multiple Knowledge Sources. HLT-NAACL 2004: 49-56 - [c25]Jimmy Lin:
A Computational Framework for Non-Lexicalist Semantics. HLT-NAACL (Student Research Workshop) 2004 - [c24]Boris Katz, Jimmy Lin, Chris Stauffer, W. Eric L. Grimson:
Answering Questions About Moving Objects in Videos. New Directions in Question Answering 2004: 113-128 - [c23]Boris Katz, Sue Felshin, Jimmy Lin, Gregory Marton:
Viewing the Web as a Virtual Database for Question Answering. New Directions in Question Answering 2004: 215-226 - [c22]Boris Katz, Matthew W. Bilotti, Sue Felshin, Aaron Fernandes, Wesley Hildebrandt, Roni Katzir, Jimmy Lin, Daniel Loreto, Gregory Marton, Federico Mora, Özlem Uzuner:
Answering Multiple Questions on a Topic From Heterogeneous Resources. TREC 2004 - 2003
- [c21]Ali Ibrahim, Boris Katz, Jimmy Lin:
Extracting Structural Paraphrases from Aligned Monolingual Corpora. IWP@ACL 2003: 57-64 - [c20]Jimmy Lin, Dennis Quan, Vineet Sinha, Karun Bakshi, David Huynh, Boris Katz, David R. Karger:
The role of context in question answering systems. CHI Extended Abstracts 2003: 1006-1007 - [c19]Jimmy Lin, Boris Katz:
Question answering from the web using knowledge annotation and knowledge mining techniques. CIKM 2003: 116-123 - [c18]Boris Katz, Roger Hurwitz, Jimmy Lin, Özlem Uzuner:
Better Public Policy Through Natural Language Information Access. DG.O 2003 - [c17]Boris Katz, Roger Hurwitz, Jimmy Lin, Özlem Uzuner:
START: A Framework for Facilitating E-Rulemaking. DG.O 2003 - [c16]Jimmy Lin, Dennis Quan, Vineet Sinha, Karun Bakshi, David Huynh, Boris Katz, David R. Karger:
What Makes a Good Answer? The Role of Context in Question Answering. INTERACT 2003 - [c15]David R. Karger, Boris Katz, Jimmy Lin, Dennis Quan:
Sticky notes for the semantic web. IUI 2003: 254-256 - [c14]Boris Katz, Jimmy Lin, Chris Stauffer, W. Eric L. Grimson:
Answering Questions about Moving Objects in Surveillance Videos. New Directions in Question Answering 2003: 145-152 - [c13]Stefanie Tellex, Boris Katz, Jimmy Lin, Aaron Fernandes, Gregory Marton:
Quantitative evaluation of passage retrieval algorithms for question answering. SIGIR 2003: 41-47 - [c12]Boris Katz, Jimmy Lin, Daniel Loreto, Wesley Hildebrandt, Matthew W. Bilotti, Sue Felshin, Aaron Fernandes, Gregory Marton, Federico Mora:
Integrating Web-based and Corpus-based Techniques for Question Answering. TREC 2003: 426-435 - 2002
- [c11]Boris Katz, Jimmy Lin:
Annotating the Semantic Web Using Natural Language. NLPXML@COLING 2002 - [c10]Boris Katz, Jimmy Lin, Dennis Quan:
Natural Language Annotations for the Semantic Web. OTM 2002: 1317-1331 - [c9]Jimmy Lin:
The Web as a Resource for Question Answering: Perspectives and Challenges. LREC 2002 - [c8]Boris Katz, Jimmy Lin, Sue Felshin:
The START Multimedia Information System: Current Technology and Future Directions. Multimedia Information Systems 2002: 117-123 - [c7]Boris Katz, Sue Felshin, Deniz Yuret, Ali Ibrahim, Jimmy Lin, Gregory Marton, Alton Jerome McFarland, Baris Temelkuran:
Omnibase: Uniform Access to Heterogeneous Data for Question Answering. NLDB 2002: 230-234 - [c6]Susan T. Dumais, Michele Banko, Eric Brill, Jimmy Lin, Andrew Y. Ng:
Web question answering: is more always better?. SIGIR 2002: 291-298 - [c5]Jimmy Lin, Aaron Fernandes, Boris Katz, Gregory Marton, Stefanie Tellex:
Extracting Answers from the Web Using Data Annotation and Knowledge Mining Techniques. TREC 2002 - 2001
- [j1]Joyce Y. Chai, Jimmy Lin, Wlodek Zadrozny, Yiming Ye, Margo Stys-Budzikowska, Veronika Horvath, Nanda Kambhatla, Catherine G. Wolf:
The Role of a Natural Language Conversational Interface in Online Sales: A Case Study. Int. J. Speech Technol. 4(3-4): 285-295 (2001) - [c4]Boris Katz, Jimmy Lin, Sue Felshin:
Gathering Knowledge for a Question Answering System from Heterogeneous Information Sources. HTLKM@ACL 2001 - [c3]Eric Brill, Jimmy Lin, Michele Banko, Susan T. Dumais, Andrew Y. Ng:
Data-Intensive Question Answering. TREC 2001 - 2000
- [c2]Joyce Yue Chai, Jimmy Lin, Wlodek Zadrozny, Yiming Ye, Malgorzata Budzikowska, Veronika Horvath, Nanda Kambhatla, Catherine G. Wolf:
Comparative Evaluation of a Natural Language Dialog Based System and a Menu Driven System for Information Access: a Case Study. RIAO 2000: 1590-1600
1990 – 1999
- 1999
- [c1]Boris Katz, Deniz Yuret, Jimmy Lin, Sue Felshin, Rebecca Schulman, Adnan Ilik, Ali Ibrahim, Philip Osafo-Kwaako:
Integrating Web Resources and Lexicons into a Natural Language Query System. ICMCS, Vol. 2 1999: 255-261
Coauthor Index
aka: Carlos Lassance
aka: Geoffrey Craig Murray
aka: Ogundepo Odunayo
aka: Ferhan Türe
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-11-22 20:42 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint