default search action
Daya Guo
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c24]Junjie Huang, Daya Guo, Chenglong Wang, Jiazhen Gu, Shuai Lu, Jeevana Priya Inala, Cong Yan, Jianfeng Gao, Nan Duan, Michael R. Lyu:
Contextualized Data-Wrangling Code Generation in Computational Notebooks. ASE 2024: 1282-1294 - [c23]Yanlin Wang, Yanxian Huang, Daya Guo, Hongyu Zhang, Zibin Zheng:
SparseCoder: Identifier-Aware Sparse Transformer for File- Level Code Summarization. SANER 2024: 614-625 - [i33]Xiao Bi, Deli Chen, Guanting Chen, Shanhuang Chen, Damai Dai, Chengqi Deng, Honghui Ding, Kai Dong, Qiushi Du, Zhe Fu, Huazuo Gao, Kaige Gao, Wenjun Gao, Ruiqi Ge, Kang Guan, Daya Guo, Jianzhong Guo, Guangbo Hao, Zhewen Hao, Ying He, Wenjie Hu, Panpan Huang, Erhang Li, Guowei Li, Jiashi Li, Yao Li, Y. K. Li, Wenfeng Liang, Fangyun Lin, Alex X. Liu, Bo Liu, Wen Liu, Xiaodong Liu, Xin Liu, Yiyuan Liu, Haoyu Lu, Shanghao Lu, Fuli Luo, Shirong Ma, Xiaotao Nie, Tian Pei, Yishi Piao, Junjie Qiu, Hui Qu, Tongzheng Ren, Zehui Ren, Chong Ruan, Zhangli Sha, Zhihong Shao, Junxiao Song, Xuecheng Su, Jingxiang Sun, Yaofeng Sun, Minghui Tang, Bingxuan Wang, Peiyi Wang, Shiyu Wang, Yaohui Wang, Yongji Wang, Tong Wu, Y. Wu, Xin Xie, Zhenda Xie, Ziwei Xie, Yiliang Xiong, Hanwei Xu, R. X. Xu, Yanhong Xu, Dejian Yang, Yuxiang You, Shuiping Yu, Xingkai Yu, B. Zhang, Haowei Zhang, Lecong Zhang, Liyue Zhang, Mingchuan Zhang, Minghua Zhang, Wentao Zhang, Yichao Zhang, Chenggang Zhao, Yao Zhao, Shangyan Zhou, Shunfeng Zhou, Qihao Zhu, Yuheng Zou:
DeepSeek LLM: Scaling Open-Source Language Models with Longtermism. CoRR abs/2401.02954 (2024) - [i32]Daya Guo, Qihao Zhu, Dejian Yang, Zhenda Xie, Kai Dong, Wentao Zhang, Guanting Chen, Xiao Bi, Y. Wu, Y. K. Li, Fuli Luo, Yingfei Xiong, Wenfeng Liang:
DeepSeek-Coder: When the Large Language Model Meets Programming - The Rise of Code Intelligence. CoRR abs/2401.14196 (2024) - [i31]Yanlin Wang, Yanxian Huang, Daya Guo, Hongyu Zhang, Zibin Zheng:
SparseCoder: Identifier-Aware Sparse Transformer for File-Level Code Summarization. CoRR abs/2401.14727 (2024) - [i30]Zhihong Shao, Peiyi Wang, Qihao Zhu, Runxin Xu, Junxiao Song, Mingchuan Zhang, Y. K. Li, Y. Wu, Daya Guo:
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models. CoRR abs/2402.03300 (2024) - [i29]DeepSeek-AI, Aixin Liu, Bei Feng, Bin Wang, Bingxuan Wang, Bo Liu, Chenggang Zhao, Chengqi Deng, Chong Ruan, Damai Dai, Daya Guo, Dejian Yang, Deli Chen, Dongjie Ji, Erhang Li, Fangyun Lin, Fuli Luo, Guangbo Hao, Guanting Chen, Guowei Li, Hao Zhang, Hanwei Xu, Hao Yang, Haowei Zhang, Honghui Ding, Huajian Xin, Huazuo Gao, Hui Li, Hui Qu, J. L. Cai, Jian Liang, Jianzhong Guo, Jiaqi Ni, Jiashi Li, Jin Chen, Jingyang Yuan, Junjie Qiu, Junxiao Song, Kai Dong, Kaige Gao, Kang Guan, Lean Wang, Lecong Zhang, Lei Xu, Leyi Xia, Liang Zhao, Liyue Zhang, Meng Li, Miaojun Wang, Mingchuan Zhang, Minghua Zhang, Minghui Tang, Mingming Li, Ning Tian, Panpan Huang, Peiyi Wang, Peng Zhang, Qihao Zhu, Qinyu Chen, Qiushi Du, R. J. Chen, R. L. Jin, Ruiqi Ge, Ruizhe Pan, Runxin Xu, Ruyi Chen, S. S. Li, Shanghao Lu, Shangyan Zhou, Shanhuang Chen, Shaoqing Wu, Shengfeng Ye, Shirong Ma, Shiyu Wang, Shuang Zhou, Shuiping Yu, Shunfeng Zhou, Size Zheng, Tao Wang, Tian Pei, Tian Yuan, Tianyu Sun, W. L. Xiao, Wangding Zeng, Wei An, Wen Liu, Wenfeng Liang, Wenjun Gao, Wentao Zhang, X. Q. Li, Xiangyue Jin, Xianzu Wang, Xiao Bi, Xiaodong Liu, Xiaohan Wang, Xiaojin Shen, Xiaokang Chen, Xiaosha Chen, Xiaotao Nie, Xiaowen Sun:
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model. CoRR abs/2405.04434 (2024) - [i28]Huajian Xin, Daya Guo, Zhihong Shao, Zhizhou Ren, Qihao Zhu, Bo Liu, Chong Ruan, Wenda Li, Xiaodan Liang:
DeepSeek-Prover: Advancing Theorem Proving in LLMs through Large-Scale Synthetic Data. CoRR abs/2405.14333 (2024) - [i27]DeepSeek-AI, Qihao Zhu, Daya Guo, Zhihong Shao, Dejian Yang, Peiyi Wang, Runxin Xu, Y. Wu, Yukun Li, Huazuo Gao, Shirong Ma, Wangding Zeng, Xiao Bi, Zihui Gu, Hanwei Xu, Damai Dai, Kai Dong, Liyue Zhang, Yishi Piao, Zhibin Gou, Zhenda Xie, Zhewen Hao, Bingxuan Wang, Junxiao Song, Deli Chen, Xin Xie, Kang Guan, Yuxiang You, Aixin Liu, Qiushi Du, Wenjun Gao, Xuan Lu, Qinyu Chen, Yaohui Wang, Chengqi Deng, Jiashi Li, Chenggang Zhao, Chong Ruan, Fuli Luo, Wenfeng Liang:
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence. CoRR abs/2406.11931 (2024) - [i26]Yanlin Wang, Daya Guo, Jiachi Chen, Ruikai Zhang, Yuchi Ma, Zibin Zheng:
RLCoder: Reinforcement Learning for Repository-Level Code Completion. CoRR abs/2407.19487 (2024) - [i25]Junjie Huang, Daya Guo, Chenglong Wang, Jiazhen Gu, Shuai Lu, Jeevana Priya Inala, Cong Yan, Jianfeng Gao, Nan Duan, Michael R. Lyu:
Contextualized Data-Wrangling Code Generation in Computational Notebooks. CoRR abs/2409.13551 (2024) - 2023
- [c22]Canwen Xu, Daya Guo, Nan Duan, Julian J. McAuley:
Baize: An Open-Source Chat Model with Parameter-Efficient Tuning on Self-Chat Data. EMNLP 2023: 6268-6278 - [c21]Hang Zhang, Yeyun Gong, Xingwei He, Dayiheng Liu, Daya Guo, Jiancheng Lv, Jian Guo:
Noisy Pair Corrector for Dense Retrieval. EMNLP (Findings) 2023: 11439-11451 - [c20]Daya Guo, Canwen Xu, Nan Duan, Jian Yin, Julian J. McAuley:
LongCoder: A Long-Range Pre-trained Language Model for Code Completion. ICML 2023: 12098-12107 - [i24]Canwen Xu, Daya Guo, Nan Duan, Julian J. McAuley:
Baize: An Open-Source Chat Model with Parameter-Efficient Tuning on Self-Chat Data. CoRR abs/2304.01196 (2023) - [i23]Daya Guo, Canwen Xu, Nan Duan, Jian Yin, Julian J. McAuley:
LongCoder: A Long-Range Pre-trained Language Model for Code Completion. CoRR abs/2306.14893 (2023) - [i22]Hang Zhang, Yeyun Gong, Xingwei He, Dayiheng Liu, Daya Guo, Jiancheng Lv, Jian Guo:
Noisy Pair Corrector for Dense Retrieval. CoRR abs/2311.03798 (2023) - 2022
- [c19]Canwen Xu, Daya Guo, Nan Duan, Julian J. McAuley:
LaPraDoR: Unsupervised Pretrained Dense Retriever for Zero-Shot Text Retrieval. ACL (Findings) 2022: 3557-3569 - [c18]Shuai Lu, Nan Duan, Hojae Han, Daya Guo, Seung-won Hwang, Alexey Svyatkovskiy:
ReACC: A Retrieval-Augmented Code Completion Framework. ACL (1) 2022: 6227-6240 - [c17]Daya Guo, Shuai Lu, Nan Duan, Yanlin Wang, Ming Zhou, Jian Yin:
UniXcoder: Unified Cross-Modal Pre-training for Code Representation. ACL (1) 2022: 7212-7225 - [c16]Xiaonan Li, Daya Guo, Yeyun Gong, Yun Lin, Yelong Shen, Xipeng Qiu, Daxin Jiang, Weizhu Chen, Nan Duan:
Soft-Labeled Contrastive Pre-Training for Function-Level Code Representation. EMNLP (Findings) 2022: 118-129 - [c15]Daya Guo, Alexey Svyatkovskiy, Jian Yin, Nan Duan, Marc Brockschmidt, Miltiadis Allamanis:
Learning to Complete Code with Sketches. ICLR 2022 - [c14]Wanjun Zhong, Siyuan Wang, Duyu Tang, Zenan Xu, Daya Guo, Yining Chen, Jiahai Wang, Jian Yin, Ming Zhou, Nan Duan:
Analytical Reasoning of Text. NAACL-HLT (Findings) 2022: 2306-2319 - [c13]Zhiyu Li, Shuai Lu, Daya Guo, Nan Duan, Shailesh Jannu, Grant Jenks, Deep Majumder, Jared Green, Alexey Svyatkovskiy, Shengyu Fu, Neel Sundaresan:
Automating code review activities by large-scale pre-training. ESEC/SIGSOFT FSE 2022: 1035-1047 - [i21]Daya Guo, Shuai Lu, Nan Duan, Yanlin Wang, Ming Zhou, Jian Yin:
UniXcoder: Unified Cross-Modal Pre-training for Code Representation. CoRR abs/2203.03850 (2022) - [i20]Canwen Xu, Daya Guo, Nan Duan, Julian J. McAuley:
LaPraDoR: Unsupervised Pretrained Dense Retriever for Zero-Shot Text Retrieval. CoRR abs/2203.06169 (2022) - [i19]Shuai Lu, Nan Duan, Hojae Han, Daya Guo, Seung-won Hwang, Alexey Svyatkovskiy:
ReACC: A Retrieval-Augmented Code Completion Framework. CoRR abs/2203.07722 (2022) - [i18]Zhiyu Li, Shuai Lu, Daya Guo, Nan Duan, Shailesh Jannu, Grant Jenks, Deep Majumder, Jared Green, Alexey Svyatkovskiy, Shengyu Fu, Neel Sundaresan:
CodeReviewer: Pre-Training for Automating Code Review Activities. CoRR abs/2203.09095 (2022) - [i17]Xiaonan Li, Daya Guo, Yeyun Gong, Yun Lin, Yelong Shen, Xipeng Qiu, Daxin Jiang, Weizhu Chen, Nan Duan:
Soft-Labeled Contrastive Pre-training for Function-level Code Representation. CoRR abs/2210.09597 (2022) - 2021
- [c12]Zenan Xu, Daya Guo, Duyu Tang, Qinliang Su, Linjun Shou, Ming Gong, Wanjun Zhong, Xiaojun Quan, Daxin Jiang, Nan Duan:
Syntax-Enhanced Pre-trained Model. ACL/IJCNLP (1) 2021: 5412-5422 - [c11]Daya Guo, Shuo Ren, Shuai Lu, Zhangyin Feng, Duyu Tang, Shujie Liu, Long Zhou, Nan Duan, Alexey Svyatkovskiy, Shengyu Fu, Michele Tufano, Shao Kun Deng, Colin B. Clement, Dawn Drain, Neel Sundaresan, Jian Yin, Daxin Jiang, Ming Zhou:
GraphCodeBERT: Pre-training Code Representations with Data Flow. ICLR 2021 - [c10]Daya Guo, Zhaoyang Zeng:
Multi-modal Representation Learning for Video Advertisement Content Structuring. ACM Multimedia 2021: 4770-4774 - [c9]Shuai Lu, Daya Guo, Shuo Ren, Junjie Huang, Alexey Svyatkovskiy, Ambrosio Blanco, Colin B. Clement, Dawn Drain, Daxin Jiang, Duyu Tang, Ge Li, Lidong Zhou, Linjun Shou, Long Zhou, Michele Tufano, Ming Gong, Ming Zhou, Nan Duan, Neel Sundaresan, Shao Kun Deng, Shengyu Fu, Shujie Liu:
CodeXGLUE: A Machine Learning Benchmark Dataset for Code Understanding and Generation. NeurIPS Datasets and Benchmarks 2021 - [i16]Shuai Lu, Daya Guo, Shuo Ren, Junjie Huang, Alexey Svyatkovskiy, Ambrosio Blanco, Colin B. Clement, Dawn Drain, Daxin Jiang, Duyu Tang, Ge Li, Lidong Zhou, Linjun Shou, Long Zhou, Michele Tufano, Ming Gong, Ming Zhou, Nan Duan, Neel Sundaresan, Shao Kun Deng, Shengyu Fu, Shujie Liu:
CodeXGLUE: A Machine Learning Benchmark Dataset for Code Understanding and Generation. CoRR abs/2102.04664 (2021) - [i15]Wanjun Zhong, Siyuan Wang, Duyu Tang, Zenan Xu, Daya Guo, Jiahai Wang, Jian Yin, Ming Zhou, Nan Duan:
AR-LSAT: Investigating Analytical Reasoning of Text. CoRR abs/2104.06598 (2021) - [i14]Daya Guo, Alexey Svyatkovskiy, Jian Yin, Nan Duan, Marc Brockschmidt, Miltiadis Allamanis:
Learning to Generate Code Sketches. CoRR abs/2106.10158 (2021) - [i13]Daya Guo, Zhaoyang Zeng:
Multi-modal Representation Learning for Video Advertisement Content Structuring. CoRR abs/2109.06637 (2021) - 2020
- [c8]Shangwen Lv, Daya Guo, Jingjing Xu, Duyu Tang, Nan Duan, Ming Gong, Linjun Shou, Daxin Jiang, Guihong Cao, Songlin Hu:
Graph-Based Reasoning over Heterogeneous External Knowledge for Commonsense Question Answering. AAAI 2020: 8449-8456 - [c7]Daya Guo, Duyu Tang, Nan Duan, Jian Yin, Daxin Jiang, Ming Zhou:
Evidence-Aware Inferential Text Generation with Vector Quantised Variational AutoEncoder. ACL 2020: 6118-6129 - [c6]Zhangyin Feng, Daya Guo, Duyu Tang, Nan Duan, Xiaocheng Feng, Ming Gong, Linjun Shou, Bing Qin, Ting Liu, Daxin Jiang, Ming Zhou:
CodeBERT: A Pre-Trained Model for Programming and Natural Languages. EMNLP (Findings) 2020: 1536-1547 - [i12]Zhangyin Feng, Daya Guo, Duyu Tang, Nan Duan, Xiaocheng Feng, Ming Gong, Linjun Shou, Bing Qin, Ting Liu, Daxin Jiang, Ming Zhou:
CodeBERT: A Pre-Trained Model for Programming and Natural Languages. CoRR abs/2002.08155 (2020) - [i11]Daya Guo, Akari Asai, Duyu Tang, Nan Duan, Ming Gong, Linjun Shou, Daxin Jiang, Jian Yin, Ming Zhou:
Inferential Text Generation with Multiple Knowledge Sources and Meta-Learning. CoRR abs/2004.03070 (2020) - [i10]Shangwen Lv, Yuechen Wang, Daya Guo, Duyu Tang, Nan Duan, Fuqing Zhu, Ming Gong, Linjun Shou, Ryan Ma, Daxin Jiang, Guihong Cao, Ming Zhou, Songlin Hu:
Pre-training Text Representations as Meta Learning. CoRR abs/2004.05568 (2020) - [i9]Daya Guo, Duyu Tang, Nan Duan, Jian Yin, Daxin Jiang, Ming Zhou:
Evidence-Aware Inferential Text Generation with Vector Quantised Variational AutoEncoder. CoRR abs/2006.08101 (2020) - [i8]Daya Guo, Shuo Ren, Shuai Lu, Zhangyin Feng, Duyu Tang, Shujie Liu, Long Zhou, Nan Duan, Alexey Svyatkovskiy, Shengyu Fu, Michele Tufano, Shao Kun Deng, Colin B. Clement, Dawn Drain, Neel Sundaresan, Jian Yin, Daxin Jiang, Ming Zhou:
GraphCodeBERT: Pre-training Code Representations with Data Flow. CoRR abs/2009.08366 (2020) - [i7]Shuo Ren, Daya Guo, Shuai Lu, Long Zhou, Shujie Liu, Duyu Tang, Neel Sundaresan, Ming Zhou, Ambrosio Blanco, Shuai Ma:
CodeBLEU: a Method for Automatic Evaluation of Code Synthesis. CoRR abs/2009.10297 (2020) - [i6]Zenan Xu, Daya Guo, Duyu Tang, Qinliang Su, Linjun Shou, Ming Gong, Wanjun Zhong, Xiaojun Quan, Nan Duan, Daxin Jiang:
Syntax-Enhanced Pre-trained Model. CoRR abs/2012.14116 (2020)
2010 – 2019
- 2019
- [c5]Daya Guo, Duyu Tang, Nan Duan, Ming Zhou, Jian Yin:
Coupling Retrieval and Meta-Learning for Context-Dependent Semantic Parsing. ACL (1) 2019: 855-866 - [c4]Tao Shen, Xiubo Geng, Tao Qin, Daya Guo, Duyu Tang, Nan Duan, Guodong Long, Daxin Jiang:
Multi-Task Learning for Conversational Question Answering over a Large-Scale Knowledge Base. EMNLP/IJCNLP (1) 2019: 2442-2451 - [c3]Daya Guo, Jiangshui Hong, Binli Luo, Qirui Yan, Zhangming Niu:
Multi-modal Representation Learning for Short Video Understanding and Recommendation. ICME Workshops 2019: 687-690 - [i5]Daya Guo, Duyu Tang, Nan Duan, Ming Zhou, Jian Yin:
Coupling Retrieval and Meta-Learning for Context-Dependent Semantic Parsing. CoRR abs/1906.07108 (2019) - [i4]Shangwen Lv, Daya Guo, Jingjing Xu, Duyu Tang, Nan Duan, Ming Gong, Linjun Shou, Daxin Jiang, Guihong Cao, Songlin Hu:
Graph-Based Reasoning over Heterogeneous External Knowledge for Commonsense Question Answering. CoRR abs/1909.05311 (2019) - [i3]Tao Shen, Xiubo Geng, Tao Qin, Daya Guo, Duyu Tang, Nan Duan, Guodong Long, Daxin Jiang:
Multi-Task Learning for Conversational Question Answering over a Large-Scale Knowledge Base. CoRR abs/1910.05069 (2019) - 2018
- [c2]Daya Guo, Yibo Sun, Duyu Tang, Nan Duan, Jian Yin, Hong Chi, James Cao, Peng Chen, Ming Zhou:
Question Generation from SQL Queries Improves Neural Semantic Parsing. EMNLP 2018: 1597-1607 - [c1]Daya Guo, Duyu Tang, Nan Duan, Ming Zhou, Jian Yin:
Dialog-to-Action: Conversational Question Answering Over a Large-Scale Knowledge Base. NeurIPS 2018: 2946-2955 - [i2]Daya Guo, Yibo Sun, Duyu Tang, Nan Duan, Jian Yin, Hong Chi, James Cao, Peng Chen, Ming Zhou:
Question Generation from SQL Queries Improves Neural Semantic Parsing. CoRR abs/1808.06304 (2018) - [i1]Yibo Sun, Daya Guo, Duyu Tang, Nan Duan, Zhao Yan, Xiaocheng Feng, Bing Qin:
Knowledge Based Machine Reading Comprehension. CoRR abs/1809.04267 (2018)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2025-01-21 00:21 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint