


default search action
Xueguang Ma
Person information
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2025
- [i29]Shengyao Zhuang, Ekaterina Khramtsova, Xueguang Ma, Bevan Koopman, Jimmy Lin, Guido Zuccon:
Document Screenshot Retrievers are Vulnerable to Pixel Poisoning Attacks. CoRR abs/2501.16902 (2025) - [i28]Zhiheng Lyu, Xueguang Ma, Wenhu Chen:
PixelWorld: Towards Perceiving Everything as Pixels. CoRR abs/2501.19339 (2025) - 2024
- [j5]Xinyu Zhang
, Kelechi Ogueji
, Xueguang Ma
, Jimmy Lin
:
Toward Best Practices for Training Multilingual Dense Retrieval Models. ACM Trans. Inf. Syst. 42(2): 39:1-39:33 (2024) - [c26]Yubo Wang, Xueguang Ma, Wenhu Chen:
Augmenting Black-box LLMs with Medical Textbooks for Biomedical Question Answering. EMNLP (Findings) 2024: 1754-1770 - [c25]Shengyao Zhuang, Xueguang Ma, Bevan Koopman, Jimmy Lin, Guido Zuccon:
PromptReps: Prompting Large Language Models to Generate Dense and Sparse Representations for Zero-Shot Document Retrieval. EMNLP 2024: 4375-4391 - [c24]Xueguang Ma, Sheng-Chieh Lin, Minghan Li, Wenhu Chen, Jimmy Lin:
Unifying Multimodal Retrieval via Document Screenshot Embedding. EMNLP 2024: 6492-6505 - [c23]Raphael Tang, Xinyu Crystina Zhang, Xueguang Ma, Jimmy Lin, Ferhan Ture:
Found in the Middle: Permutation Self-Consistency Improves Listwise Ranking in Large Language Models. NAACL-HLT 2024: 2327-2340 - [c22]Yubo Wang, Xueguang Ma, Ge Zhang, Yuansheng Ni, Abhranil Chandra, Shiguang Guo, Weiming Ren, Aaran Arulraj, Xuan He, Ziyan Jiang, Tianle Li, Max Ku, Kai Wang, Alex Zhuang, Rongqi Fan, Xiang Yue, Wenhu Chen:
MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark. NeurIPS 2024 - [c21]Ehsan Kamalloo
, Nandan Thakur
, Carlos Lassance
, Xueguang Ma
, Jheng-Hong Yang
, Jimmy Lin
:
Resources for Brewing BEIR: Reproducible Reference Models and Statistical Analyses. SIGIR 2024: 1431-1440 - [c20]Xueguang Ma
, Liang Wang
, Nan Yang
, Furu Wei
, Jimmy Lin
:
Fine-Tuning LLaMA for Multi-Stage Text Retrieval. SIGIR 2024: 2421-2425 - [i27]Shengyao Zhuang, Xueguang Ma, Bevan Koopman, Jimmy Lin, Guido Zuccon:
PromptReps: Prompting Large Language Models to Generate Dense and Sparse Representations for Zero-Shot Document Retrieval. CoRR abs/2404.18424 (2024) - [i26]Yubo Wang, Xueguang Ma, Ge Zhang, Yuansheng Ni, Abhranil Chandra, Shiguang Guo, Weiming Ren, Aaran Arulraj, Xuan He, Ziyan Jiang, Tianle Li, Max Ku, Kai Wang, Alex Zhuang, Rongqi Fan, Xiang Yue
, Wenhu Chen:
MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark. CoRR abs/2406.01574 (2024) - [i25]Xueguang Ma, Sheng-Chieh Lin, Minghan Li, Wenhu Chen, Jimmy Lin:
Unifying Multimodal Retrieval via Document Screenshot Embedding. CoRR abs/2406.11251 (2024) - [i24]Ziyan Jiang, Xueguang Ma, Wenhu Chen:
LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs. CoRR abs/2406.15319 (2024) - [i23]Xueguang Ma, Shengyao Zhuang, Bevan Koopman, Guido Zuccon, Wenhu Chen, Jimmy Lin:
VISA: Retrieval Augmented Generation with Visual Source Attribution. CoRR abs/2412.14457 (2024) - 2023
- [j4]Wenhu Chen, Xueguang Ma, Xinyi Wang, William W. Cohen:
Program of Thoughts Prompting: Disentangling Computation from Reasoning for Numerical Reasoning Tasks. Trans. Mach. Learn. Res. 2023 (2023) - [c19]Luyu Gao, Xueguang Ma, Jimmy Lin, Jamie Callan:
Precise Zero-Shot Dense Retrieval without Relevance Labels. ACL (1) 2023: 1762-1777 - [c18]Tianle Li, Xueguang Ma, Alex Zhuang, Yu Gu, Yu Su, Wenhu Chen:
Few-shot In-context Learning on Knowledge Base Question Answering. ACL (1) 2023: 6966-6980 - [c17]Xueguang Ma
, Tommaso Teofili
, Jimmy Lin
:
Anserini Gets Dense Retrieval: Integration of Lucene's HNSW Indexes. CIKM 2023: 5366-5370 - [c16]Wenhu Chen, Ming Yin, Max Ku, Pan Lu, Yixin Wan, Xueguang Ma, Jianyu Xu, Xinyi Wang, Tony Xia:
TheoremQA: A Theorem-driven Question Answering Dataset. EMNLP 2023: 7889-7901 - [c15]Minghan Li
, Sheng-Chieh Lin
, Xueguang Ma
, Jimmy Lin
:
SLIM: Sparsified Late Interaction for Multi-Vector Retrieval with Inverted Indexes. SIGIR 2023: 1954-1959 - [c14]Luyu Gao
, Xueguang Ma
, Jimmy Lin
, Jamie Callan
:
Tevatron: An Efficient and Flexible Toolkit for Neural Retrieval. SIGIR 2023: 3120-3124 - [c13]Xueguang Ma
, Hengxin Fun
, Xusen Yin
, Antonio Mallia
, Jimmy Lin
:
Enhancing Sparse Retrieval via Unsupervised Learning. SIGIR-AP 2023: 150-157 - [i22]Minghan Li, Sheng-Chieh Lin, Xueguang Ma, Jimmy Lin:
SLIM: Sparsified Late Interaction for Multi-Vector Retrieval with Inverted Indexes. CoRR abs/2302.06587 (2023) - [i21]Xueguang Ma, Tommaso Teofili, Jimmy Lin:
Anserini Gets Dense Retrieval: Integration of Lucene's HNSW Indexes. CoRR abs/2304.12139 (2023) - [i20]Tianle Li, Xueguang Ma, Alex Zhuang, Yu Gu, Yu Su, Wenhu Chen:
Few-shot In-context Learning for Knowledge Base Question Answering. CoRR abs/2305.01750 (2023) - [i19]Xueguang Ma, Xinyu Zhang, Ronak Pradeep, Jimmy Lin:
Zero-Shot Listwise Document Reranking with a Large Language Model. CoRR abs/2305.02156 (2023) - [i18]Wenhu Chen, Ming Yin, Max Ku, Pan Lu, Yixin Wan, Xueguang Ma, Jianyu Xu, Xinyi Wang, Tony Xia:
TheoremQA: A Theorem-driven Question Answering dataset. CoRR abs/2305.12524 (2023) - [i17]Ehsan Kamalloo, Nandan Thakur, Carlos Lassance, Xueguang Ma, Jheng-Hong Yang, Jimmy Lin:
Resources for Brewing BEIR: Reproducible Reference Models and an Official Leaderboard. CoRR abs/2306.07471 (2023) - [i16]Yubo Wang, Xueguang Ma, Wenhu Chen:
Augmenting Black-box LLMs with Medical Textbooks for Clinical Question Answering. CoRR abs/2309.02233 (2023) - [i15]Raphael Tang, Xinyu Zhang, Xueguang Ma, Jimmy Lin, Ferhan Ture:
Found in the Middle: Permutation Self-Consistency Improves Listwise Ranking in Large Language Models. CoRR abs/2310.07712 (2023) - [i14]Xueguang Ma, Liang Wang, Nan Yang, Furu Wei, Jimmy Lin:
Fine-Tuning LLaMA for Multi-Stage Text Retrieval. CoRR abs/2310.08319 (2023) - 2022
- [j3]Alexandre Parmentier, Robin Cohen, Xueguang Ma, Gaurav Sahu
, Queenie Chen:
Personalized multi-faceted trust modeling to determine trust links in social media and its potential for misinformation management. Int. J. Data Sci. Anal. 13(4): 399-425 (2022) - [c12]Hang Li
, Shengyao Zhuang
, Xueguang Ma
, Jimmy Lin
, Guido Zuccon
:
Pseudo-Relevance Feedback with Dense Retrievers in Pyserini. ADCS 2022: 1:1-1:6 - [c11]Hang Li
, Shengyao Zhuang
, Ahmed Mourad
, Xueguang Ma, Jimmy Lin
, Guido Zuccon
:
Improving Query Representations for Dense Retrieval with Pseudo Relevance Feedback: A Reproducibility Study. ECIR (1) 2022: 599-612 - [c10]Xueguang Ma, Kai Sun, Ronak Pradeep, Minghan Li
, Jimmy Lin:
Another Look at DPR: Reproduction of Training and Replication of Retrieval. ECIR (1) 2022: 613-626 - [c9]Hang Li, Shuai Wang
, Shengyao Zhuang, Ahmed Mourad, Xueguang Ma, Jimmy Lin, Guido Zuccon
:
To Interpolate or not to Interpolate: PRF, Dense and Sparse Retrievers. SIGIR 2022: 2495-2500 - [c8]Xueguang Ma, Ronak Pradeep, Rodrigo Nogueira, Jimmy Lin:
Document Expansion Baselines and Learned Sparse Lexical Representations for MS MARCO V1 and V2. SIGIR 2022: 3187-3197 - [i13]Luyu Gao, Xueguang Ma, Jimmy Lin, Jamie Callan:
Tevatron: An Efficient and Flexible Toolkit for Dense Retrieval. CoRR abs/2203.05765 (2022) - [i12]Xinyu Zhang, Kelechi Ogueji, Xueguang Ma, Jimmy Lin:
Towards Best Practices for Training Multilingual Dense Retrieval Models. CoRR abs/2204.02363 (2022) - [i11]Hang Li, Shuai Wang, Shengyao Zhuang, Ahmed Mourad, Xueguang Ma, Jimmy Lin, Guido Zuccon:
To Interpolate or not to Interpolate: PRF, Dense and Sparse Retrievers. CoRR abs/2205.00235 (2022) - [i10]Wenhu Chen, Xueguang Ma, Xinyi Wang, William W. Cohen:
Program of Thoughts Prompting: Disentangling Computation from Reasoning for Numerical Reasoning Tasks. CoRR abs/2211.12588 (2022) - [i9]Luyu Gao, Xueguang Ma, Jimmy Lin, Jamie Callan:
Precise Zero-Shot Dense Retrieval without Relevance Labels. CoRR abs/2212.10496 (2022) - 2021
- [c7]Ronak Pradeep, Xueguang Ma, Rodrigo Nogueira, Jimmy Lin:
Scientific Claim Verification with VerT5erini. LOUHI@EACL 2021: 94-103 - [c6]Jimmy Lin, Xueguang Ma, Joel Mackenzie, Antonio Mallia:
On the Separation of Logical and Physical Ranking Models for Text Retrieval Applications. DESIRES 2021: 176-178 - [c5]Xueguang Ma, Minghan Li
, Kai Sun, Ji Xin, Jimmy Lin:
Simple and Effective Unsupervised Redundancy Elimination to Compress Dense Vectors for Passage Retrieval. EMNLP (1) 2021: 2854-2859 - [c4]Amira Ghenai, Xueguang Ma, Robin Cohen, Karyn Moffatt, Andy Yang, Yipeng Ji:
e-Health for Older Adults: Navigating Misinformation. ICT4AWE 2021: 195-202 - [c3]Ronak Pradeep, Xueguang Ma, Rodrigo Nogueira, Jimmy Lin:
Vera: Prediction Techniques for Reducing Harmful Misinformation in Consumer Health Search. SIGIR 2021: 2066-2070 - [c2]Jimmy Lin, Xueguang Ma, Sheng-Chieh Lin, Jheng-Hong Yang, Ronak Pradeep, Rodrigo Nogueira:
Pyserini: A Python Toolkit for Reproducible Information Retrieval Research with Sparse and Dense Representations. SIGIR 2021: 2356-2362 - [i8]Jimmy Lin, Xueguang Ma, Sheng-Chieh Lin, Jheng-Hong Yang, Ronak Pradeep, Rodrigo Nogueira:
Pyserini: An Easy-to-Use Python Toolkit to Support Replicable IR Research with Sparse and Dense Representations. CoRR abs/2102.10073 (2021) - [i7]Xueguang Ma, Kai Sun, Ronak Pradeep, Jimmy Lin:
A Replication Study of Dense Passage Retriever. CoRR abs/2104.05740 (2021) - [i6]Jimmy Lin, Xueguang Ma:
A Few Brief Notes on DeepImpact, COIL, and a Conceptual Framework for Information Retrieval Techniques. CoRR abs/2106.14807 (2021) - [i5]Xinyu Zhang, Xueguang Ma, Peng Shi, Jimmy Lin:
Mr. TyDi: A Multi-lingual Benchmark for Dense Retrieval. CoRR abs/2108.08787 (2021) - [i4]Alexandre Parmentier, Robin Cohen, Xueguang Ma, Gaurav Sahu, Queenie Chen:
Personalized multi-faceted trust modeling to determine trust links in social media and its potential for misinformation management. CoRR abs/2111.06440 (2021) - [i3]Hang Li, Shengyao Zhuang, Ahmed Mourad, Xueguang Ma, Jimmy Lin, Guido Zuccon:
Improving Query Representations for Dense Retrieval with Pseudo Relevance Feedback: A Reproducibility Study. CoRR abs/2112.06400 (2021) - [i2]Jheng-Hong Yang, Xueguang Ma, Jimmy Lin:
Sparsifying Sparse Representations for Passage Retrieval by Top-k Masking. CoRR abs/2112.09628 (2021) - 2020
- [c1]Ronak Pradeep, Xueguang Ma, Xinyu Zhang, Hang Cui, Ruizhou Xu, Rodrigo Nogueira, Jimmy Lin:
H2oloo at TREC 2020: When all you got is a hammer... Deep Learning, Health Misinformation, and Precision Medicine. TREC 2020 - [i1]Ronak Pradeep, Xueguang Ma, Rodrigo Nogueira, Jimmy Lin:
Scientific Claim Verification with VERT5ERINI. CoRR abs/2010.11930 (2020)
2000 – 2009
- 2006
- [j2]Xueguang Ma, Roy Rada:
Web-Based Education Accountability System and Organizational Changes: An Actor-Network Approach. Int. J. Web Based Learn. Teach. Technol. 1(4): 1-14 (2006) - 2005
- [j1]Xueguang Ma, Roy Rada:
Building a Web-Based Accountability System in a Teacher Education Program. Interact. Learn. Environ. 13(1-2): 93-119 (2005)
Coauthor Index

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from ,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2025-03-04 22:22 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint