Rishabh Agarwal

2020 – today

2024
[j2]
  persistent URL:
  - https://dblp.org/rec/journals/nlpj/UpadhyayADSC24
Prashant Upadhyay, Rishabh Agarwal, Sumeet Dhiman, Abhinav Sarkar, Saumya Chaturvedi:
A comprehensive survey on answer generation methods using NLP. Nat. Lang. Process. J. 8: 100088 (2024)
[j1]
  persistent URL:
  - https://dblp.org/rec/journals/tmlr/SinghCAAPGLH0XP24
Avi Singh, John D. Co-Reyes, Rishabh Agarwal, Ankesh Anand, Piyush Patil, Xavier Garcia, Peter J. Liu, James Harrison, Jaehoon Lee, Kelvin Xu, Aaron T. Parisi, Abhishek Kumar, Alexander A. Alemi, Alex Rizkowsky, Azade Nova, Ben Adlam, Bernd Bohnet, Gamaleldin Fathy Elsayed, Hanie Sedghi, Igor Mordatch, Isabelle Simpson, Izzeddin Gur, Jasper Snoek, Jeffrey Pennington, Jiri Hron, Kathleen Kenealy, Kevin Swersky, Kshiteej Mahajan, Laura Culp, Lechao Xiao, Maxwell L. Bileschi, Noah Constant, Roman Novak, Rosanne Liu, Tris Warkentin, Yundi Qian, Yamini Bansal, Ethan Dyer, Behnam Neyshabur, Jascha Sohl-Dickstein, Noah Fiedel:
Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models. Trans. Mach. Learn. Res. 2024 (2024)
[c34]
  persistent URL:
  - https://dblp.org/rec/conf/iclr/AgarwalVZSGGB24
Rishabh Agarwal, Nino Vieillard, Yongchao Zhou, Piotr Stanczyk, Sabela Ramos Garea, Matthieu Geist, Olivier Bachem:
On-Policy Distillation of Language Models: Learning from Self-Generated Mistakes. ICLR 2024
[c33]
  persistent URL:
  - https://dblp.org/rec/conf/iclr/ZhouLRMRKKA24
Yongchao Zhou, Kaifeng Lyu, Ankit Singh Rawat, Aditya Krishna Menon, Afshin Rostamizadeh, Sanjiv Kumar, Jean-François Kagy, Rishabh Agarwal:
DistillSpec: Improving Speculative Decoding via Knowledge Distillation. ICLR 2024
[c32]
  persistent URL:
  - https://dblp.org/rec/conf/icml/FarebrotherOVTC24
Jesse Farebrother, Jordi Orbay, Quan Vuong, Adrien Ali Taïga, Yevgen Chebotar, Ted Xiao, Alex Irpan, Sergey Levine, Pablo Samuel Castro, Aleksandra Faust, Aviral Kumar, Rishabh Agarwal:
Stop Regressing: Training Value Functions via Classification for Scalable Deep RL. ICML 2024
[c31]
  persistent URL:
  - https://dblp.org/rec/conf/icml/WeissenbacherAK24
Matthias Weissenbacher, Rishabh Agarwal, Yoshinobu Kawahara:
SiT: Symmetry-invariant Transformers for Generalisation in Reinforcement Learning. ICML 2024
[i42]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-06457
Arian Hosseini, Xingdi Yuan, Nikolay Malkin, Aaron C. Courville, Alessandro Sordoni, Rishabh Agarwal:
V-STaR: Training Verifiers for Self-Taught Reasoners. CoRR abs/2402.06457 (2024)
[i41]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-09371
Yongchao Zhou, Uri Alon, Xinyun Chen, Xuezhi Wang, Rishabh Agarwal, Denny Zhou:
Transformers Can Achieve Length Generalization But Not Robustly. CoRR abs/2402.09371 (2024)
[i40]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-03950
Jesse Farebrother, Jordi Orbay, Quan Vuong, Adrien Ali Taïga, Yevgen Chebotar, Ted Xiao, Alex Irpan, Sergey Levine, Pablo Samuel Castro, Aleksandra Faust, Aviral Kumar, Rishabh Agarwal:
Stop Regressing: Training Value Functions via Classification for Scalable Deep RL. CoRR abs/2403.03950 (2024)
[i39]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-11018
Rishabh Agarwal, Avi Singh, Lei M. Zhang, Bernd Bohnet, Stephanie Chan, Ankesh Anand, Zaheer Abbas, Azade Nova, John D. Co-Reyes, Eric Chu, Feryal M. P. Behbahani, Aleksandra Faust, Hugo Larochelle:
Many-Shot In-Context Learning. CoRR abs/2404.11018 (2024)
[i38]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-15025
Matthias Weissenbacher, Rishabh Agarwal, Yoshinobu Kawahara:
SiT: Symmetry-Invariant Transformers for Generalisation in Reinforcement Learning. CoRR abs/2406.15025 (2024)
[i37]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-04622
Zachary Kenton, Noah Y. Siegel, János Kramár, Jonah Brown-Cohen, Samuel Albanie, Jannis Bulian, Rishabh Agarwal, David Lindner, Yunhao Tang, Noah D. Goodman, Rohin Shah:
On scalable oversight with weak LLMs judging strong LLMs. CoRR abs/2407.04622 (2024)
[i36]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-10456
Jun Wang, Eleftheria Briakou, Hamid Dadkhahi, Rishabh Agarwal, Colin Cherry, Trevor Cohn:
Don't Throw Away Data: Better Sequence Knowledge Distillation. CoRR abs/2407.10456 (2024)
[i35]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2408-15240
Lunjun Zhang, Arian Hosseini, Hritik Bansal, Mehran Kazemi, Aviral Kumar, Rishabh Agarwal:
Generative Verifiers: Reward Modeling as Next-Token Prediction. CoRR abs/2408.15240 (2024)
[i34]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2408-16737
Hritik Bansal, Arian Hosseini, Rishabh Agarwal, Vinh Q. Tran, Mehran Kazemi:
Smaller, Weaker, Yet Better: Training LLM Reasoners via Compute-Optimal Sampling. CoRR abs/2408.16737 (2024)
[i33]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-12917
Aviral Kumar, Vincent Zhuang, Rishabh Agarwal, Yi Su, John D. Co-Reyes, Avi Singh, Kate Baumli, Shariq Iqbal, Colton Bishop, Rebecca Roelofs, Lei M. Zhang, Kay McKinney, Disha Shrivastava, Cosmin Paduraru, George Tucker, Doina Precup, Feryal M. P. Behbahani, Aleksandra Faust:
Training Language Models to Self-Correct via Reinforcement Learning. CoRR abs/2409.12917 (2024)
[i32]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-01748
Arian Hosseini, Alessandro Sordoni, Daniel Toyama, Aaron C. Courville, Rishabh Agarwal:
Not All LLM Reasoners Are Created Equal. CoRR abs/2410.01748 (2024)
[i31]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-08146
Amrith Setlur, Chirag Nagpal, Adam Fisch, Xinyang Geng, Jacob Eisenstein, Rishabh Agarwal, Alekh Agarwal, Jonathan Berant, Aviral Kumar:
Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning. CoRR abs/2410.08146 (2024)
[i30]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-11325
Wenda Xu, Rujun Han, Zifeng Wang, Long T. Le, Dhruv Madeka, Lei Li, William Yang Wang, Rishabh Agarwal, Chen-Yu Lee, Tomas Pfister:
Speculative Knowledge Distillation: Bridging the Teacher-Student Gap Through Interleaved Sampling. CoRR abs/2410.11325 (2024)
[i29]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-18252
Michael Noukhovitch, Shengyi Huang, Sophie Xhonneux, Arian Hosseini, Rishabh Agarwal, Aaron C. Courville:
Asynchronous RLHF: Faster and More Efficient Off-Policy RL for Language Models. CoRR abs/2410.18252 (2024)
[i28]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2411-00062
Ziyu Ye, Rishabh Agarwal, Tianqi Liu, Rishabh Joshi, Sarmishta Velury, Quoc V. Le, Qijun Tan, Yuan Liu:
Evolving Alignment via Asymmetric Self-Play. CoRR abs/2411.00062 (2024)
[i27]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2412-15287
Yinlam Chow, Guy Tennenholtz, Izzeddin Gur, Vincent Zhuang, Bo Dai, Sridhar Thiagarajan, Craig Boutilier, Rishabh Agarwal, Aviral Kumar, Aleksandra Faust:
Inference-Aware Fine-Tuning for Best-of-N Sampling in Large Language Models. CoRR abs/2412.15287 (2024)
2023
[c30]
  persistent URL:
  - https://dblp.org/rec/conf/aistats/LanGFRPAB23
Charline Le Lan, Joshua Greaves, Jesse Farebrother, Mark Rowland, Fabian Pedregosa, Rishabh Agarwal, Marc G. Bellemare:
A Novel Stochastic Gradient Descent Algorithm for Learning Principal Subspaces. AISTATS 2023: 1703-1718
[c29]
  persistent URL:
  - https://dblp.org/rec/conf/iclr/FarebrotherGALG23
Jesse Farebrother, Joshua Greaves, Rishabh Agarwal, Charline Le Lan, Ross Goroshin, Pablo Samuel Castro, Marc G. Bellemare:
Proto-Value Networks: Scaling Representation Learning with Auxiliary Tasks. ICLR 2023
[c28]
  persistent URL:
  - https://dblp.org/rec/conf/iclr/KumarAGTL23
Aviral Kumar, Rishabh Agarwal, Xinyang Geng, George Tucker, Sergey Levine:
Offline Q-learning on Diverse Multi-Task Data Both Scales And Generalizes. ICLR 2023
[c27]
  persistent URL:
  - https://dblp.org/rec/conf/iclr/LanA23
Charline Le Lan, Rishabh Agarwal:
Revisiting Bisimulation: A Sampling-Based State Similarity Pseudo-metric. Tiny Papers @ ICLR 2023
[c26]
  persistent URL:
  - https://dblp.org/rec/conf/iclr/TaigaAFCB23
Adrien Ali Taïga, Rishabh Agarwal, Jesse Farebrother, Aaron C. Courville, Marc G. Bellemare:
Investigating Multi-task Pretraining and Generalization in Reinforcement Learning. ICLR 2023
[c25]
  persistent URL:
  - https://dblp.org/rec/conf/icml/LanTRHABD23
Charline Le Lan, Stephen Tu, Mark Rowland, Anna Harutyunyan, Rishabh Agarwal, Marc G. Bellemare, Will Dabney:
Bootstrapped Representations in Reinforcement Learning. ICML 2023: 18686-18713
[c24]
  persistent URL:
  - https://dblp.org/rec/conf/icml/SchwarzerOCBAC23
Max Schwarzer, Johan Samir Obando-Ceron, Aaron C. Courville, Marc G. Bellemare, Rishabh Agarwal, Pablo Samuel Castro:
Bigger, Better, Faster: Human-level Atari with human-level efficiency. ICML 2023: 30365-30380
[c23]
  persistent URL:
  - https://dblp.org/rec/conf/icml/SokarACE23
Ghada Sokar, Rishabh Agarwal, Pablo Samuel Castro, Utku Evci:
The Dormant Neuron Phenomenon in Deep Reinforcement Learning. ICML 2023: 32145-32168
[c22]
  persistent URL:
  - https://dblp.org/rec/conf/icml/ZitovskyMAK23
Joshua P. Zitovsky, Daniel de Marchi, Rishabh Agarwal, Michael Rene Kosorok:
Revisiting Bellman Errors for Offline Model Selection. ICML 2023: 43369-43406
[c21]
  persistent URL:
  - https://dblp.org/rec/conf/nips/GulinoFLTBLHPWC23
Cole Gulino, Justin Fu, Wenjie Luo, George Tucker, Eli Bronstein, Yiren Lu, Jean Harb, Xinlei Pan, Yan Wang, Xiangyu Chen, John D. Co-Reyes, Rishabh Agarwal, Rebecca Roelofs, Yao Lu, Nico Montali, Paul Mougin, Zoey Yang, Brandyn White, Aleksandra Faust, Rowan McAllister, Dragomir Anguelov, Benjamin Sapp:
Waymax: An Accelerated, Data-Driven Simulator for Large-Scale Autonomous Driving Research. NeurIPS 2023
[i26]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-00141
Joshua P. Zitovsky, Daniel de Marchi, Rishabh Agarwal, Michael R. Kosorok:
Revisiting Bellman Errors for Offline Model Selection. CoRR abs/2302.00141 (2023)
[i25]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-12902
Ghada Sokar, Rishabh Agarwal, Pablo Samuel Castro, Utku Evci:
The Dormant Neuron Phenomenon in Deep Reinforcement Learning. CoRR abs/2302.12902 (2023)
[i24]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2304-12567
Jesse Farebrother, Joshua Greaves, Rishabh Agarwal, Charline Le Lan, Ross Goroshin, Pablo Samuel Castro, Marc G. Bellemare:
Proto-Value Networks: Scaling Representation Learning with Auxiliary Tasks. CoRR abs/2304.12567 (2023)
[i23]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-19452
Max Schwarzer, Johan S. Obando-Ceron, Aaron C. Courville, Marc G. Bellemare, Rishabh Agarwal, Pablo Samuel Castro:
Bigger, Better, Faster: Human-level Atari with human-level efficiency. CoRR abs/2305.19452 (2023)
[i22]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-10171
Charline Le Lan, Stephen Tu, Mark Rowland, Anna Harutyunyan, Rishabh Agarwal, Marc G. Bellemare, Will Dabney:
Bootstrapped Representations in Reinforcement Learning. CoRR abs/2306.10171 (2023)
[i21]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-13649
Rishabh Agarwal, Nino Vieillard, Piotr Stanczyk, Sabela Ramos, Matthieu Geist, Olivier Bachem:
GKD: Generalized Knowledge Distillation for Auto-regressive Sequence Models. CoRR abs/2306.13649 (2023)
[i20]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-08461
Yongchao Zhou, Kaifeng Lyu, Ankit Singh Rawat, Aditya Krishna Menon, Afshin Rostamizadeh, Sanjiv Kumar, Jean-François Kagy, Rishabh Agarwal:
DistillSpec: Improving Speculative Decoding via Knowledge Distillation. CoRR abs/2310.08461 (2023)
[i19]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-08710
Cole Gulino, Justin Fu, Wenjie Luo, George Tucker, Eli Bronstein, Yiren Lu, Jean Harb, Xinlei Pan, Yan Wang, Xiangyu Chen, John D. Co-Reyes, Rishabh Agarwal, Rebecca Roelofs, Yao Lu, Nico Montali, Paul Mougin, Zoey Yang, Brandyn White, Aleksandra Faust, Rowan McAllister, Dragomir Anguelov, Benjamin Sapp:
Waymax: An Accelerated, Data-Driven Simulator for Large-Scale Autonomous Driving Research. CoRR abs/2310.08710 (2023)
[i18]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2311-17894
Max Schwarzer, Jesse Farebrother, Joshua Greaves, Ekin Dogus Cubuk, Rishabh Agarwal, Aaron C. Courville, Marc G. Bellemare, Sergei V. Kalinin, Igor Mordatch, Pablo Samuel Castro, Kevin M. Roccapriore:
Learning and Controlling Silicon Dopant Transitions in Graphene using Scanning Transmission Electron Microscopy. CoRR abs/2311.17894 (2023)
[i17]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-06585
Avi Singh, John D. Co-Reyes, Rishabh Agarwal, Ankesh Anand, Piyush Patil, Xavier Garcia, Peter J. Liu, James Harrison, Jaehoon Lee, Kelvin Xu, Aaron Parisi, Abhishek Kumar, Alex Alemi, Alex Rizkowsky, Azade Nova, Ben Adlam, Bernd Bohnet, Gamaleldin F. Elsayed, Hanie Sedghi, Igor Mordatch, Isabelle Simpson, Izzeddin Gur, Jasper Snoek, Jeffrey Pennington, Jiri Hron, Kathleen Kenealy, Kevin Swersky, Kshiteej Mahajan, Laura Culp, Lechao Xiao, Maxwell L. Bileschi, Noah Constant, Roman Novak, Rosanne Liu, Tris Warkentin, Yundi Qian, Yamini Bansal, Ethan Dyer, Behnam Neyshabur, Jascha Sohl-Dickstein, Noah Fiedel:
Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models. CoRR abs/2312.06585 (2023)
2022
[c20]
  persistent URL:
  - https://dblp.org/rec/conf/aaai/NikishinAAB22
Evgenii Nikishin, Romina Abachi, Rishabh Agarwal, Pierre-Luc Bacon:
Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation. AAAI 2022: 7886-7894
[c19]
  persistent URL:
  - https://dblp.org/rec/conf/aistats/LanTOAB22
Charline Le Lan, Stephen Tu, Adam Oberman, Rishabh Agarwal, Marc G. Bellemare:
On the Generalization of Representations in Reinforcement Learning. AISTATS 2022: 4132-4157
[c18]
  persistent URL:
  - https://dblp.org/rec/conf/iclr/KumarA0CTL22
Aviral Kumar, Rishabh Agarwal, Tengyu Ma, Aaron C. Courville, George Tucker, Sergey Levine:
DR3: Value-Based Deep Reinforcement Learning Requires Explicit Regularization. ICLR 2022
[c17]
  persistent URL:
  - https://dblp.org/rec/conf/igarss/MohiteSAPP22
Jayantrao Mohite, Suryakant A. Sawant, Rishabh Agarwal, Ankur Pandit, Srinivasu Pappula:
Detection Of Crop Water Stress In Maize Using Drone Based Hyperspectral Imaging. IGARSS 2022: 5957-5960
[c16]
  persistent URL:
  - https://dblp.org/rec/conf/nips/AgarwalSCCB22
Rishabh Agarwal, Max Schwarzer, Pablo Samuel Castro, Aaron C. Courville, Marc G. Bellemare:
Reincarnating Reinforcement Learning: Reusing Prior Computation to Accelerate Progress. NeurIPS 2022
[i16]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-00543
Charline Le Lan, Stephen Tu, Adam Oberman, Rishabh Agarwal, Marc G. Bellemare:
On the Generalization of Representations in Reinforcement Learning. CoRR abs/2203.00543 (2022)
[i15]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-01626
Rishabh Agarwal, Max Schwarzer, Pablo Samuel Castro, Aaron C. Courville, Marc G. Bellemare:
Beyond Tabula Rasa: Reincarnating Reinforcement Learning. CoRR abs/2206.01626 (2022)
[i14]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-15144
Aviral Kumar, Rishabh Agarwal, Xinyang Geng, George Tucker, Sergey Levine:
Offline Q-Learning on Diverse Multi-Task Data Both Scales And Generalizes. CoRR abs/2211.15144 (2022)
[i13]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2212-04025
Charline Le Lan, Joshua Greaves, Jesse Farebrother, Mark Rowland, Fabian Pedregosa, Rishabh Agarwal, Marc G. Bellemare:
A Novel Stochastic Gradient Descent Algorithm for Learning Principal Subspaces. CoRR abs/2212.04025 (2022)
2021
[c15]
  persistent URL:
  - https://dblp.org/rec/conf/agro-geoinformatics/SawantAMPP21
Suryakant A. Sawant, Rishabh Agarwal, Jayantrao Mohite, Ankur Pandit, Srinivasu Pappula:
Field Boundary Identification using Convolutional Neural Network and GIS on High Resolution Satellite Observations. Agro-Geoinformatics 2021: 1-6
[c14]
  persistent URL:
  - https://dblp.org/rec/conf/iclr/AgarwalMCB21
Rishabh Agarwal, Marlos C. Machado, Pablo Samuel Castro, Marc G. Bellemare:
Contrastive Behavioral Similarity Embeddings for Generalization in Reinforcement Learning. ICLR 2021
[c13]
  persistent URL:
  - https://dblp.org/rec/conf/iclr/KumarAGL21
Aviral Kumar, Rishabh Agarwal, Dibya Ghosh, Sergey Levine:
Implicit Under-Parameterization Inhibits Data-Efficient Deep Reinforcement Learning. ICLR 2021
[c12]
  persistent URL:
  - https://dblp.org/rec/conf/nips/AgarwalMFZLCH21
Rishabh Agarwal, Levi Melnick, Nicholas Frosst, Xuezhou Zhang, Benjamin J. Lengerich, Rich Caruana, Geoffrey E. Hinton:
Neural Additive Models: Interpretable Machine Learning with Neural Nets. NeurIPS 2021: 4699-4711
[c11]
  persistent URL:
  - https://dblp.org/rec/conf/nips/AgarwalSCCB21
Rishabh Agarwal, Max Schwarzer, Pablo Samuel Castro, Aaron C. Courville, Marc G. Bellemare:
Deep Reinforcement Learning at the Edge of the Statistical Precipice. NeurIPS 2021: 29304-29320
[i12]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2101-05265
Rishabh Agarwal, Marlos C. Machado, Pablo Samuel Castro, Marc G. Bellemare:
Contrastive Behavioral Similarity Embeddings for Generalization in Reinforcement Learning. CoRR abs/2101.05265 (2021)
[i11]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-03273
Evgenii Nikishin, Romina Abachi, Rishabh Agarwal, Pierre-Luc Bacon:
Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation. CoRR abs/2106.03273 (2021)
[i10]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2108-13264
Rishabh Agarwal, Max Schwarzer, Pablo Samuel Castro, Aaron C. Courville, Marc G. Bellemare:
Deep Reinforcement Learning at the Edge of the Statistical Precipice. CoRR abs/2108.13264 (2021)
[i9]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2112-04716
Aviral Kumar, Rishabh Agarwal, Tengyu Ma, Aaron C. Courville, George Tucker, Sergey Levine:
DR3: Value-Based Deep Reinforcement Learning Requires Explicit Regularization. CoRR abs/2112.04716 (2021)
2020
[c10]
  persistent URL:
  - https://dblp.org/rec/conf/icml/AgarwalS020
Rishabh Agarwal, Dale Schuurmans, Mohammad Norouzi:
An Optimistic Perspective on Offline Reinforcement Learning. ICML 2020: 104-114
[c9]
  persistent URL:
  - https://dblp.org/rec/conf/icml/FedusRABLRD20
William Fedus, Prajit Ramachandran, Rishabh Agarwal, Yoshua Bengio, Hugo Larochelle, Mark Rowland, Will Dabney:
Revisiting Fundamentals of Experience Replay. ICML 2020: 3061-3071
[c8]
  persistent URL:
  - https://dblp.org/rec/conf/nips/Gulcehre0NPCZAM20
Çaglar Gülçehre, Ziyu Wang, Alexander Novikov, Thomas Paine, Sergio Gómez Colmenarejo, Konrad Zolna, Rishabh Agarwal, Josh Merel, Daniel J. Mankowitz, Cosmin Paduraru, Gabriel Dulac-Arnold, Jerry Li, Mohammad Norouzi, Matthew Hoffman, Nicolas Heess, Nando de Freitas:
RL Unplugged: A Collection of Benchmarks for Offline Reinforcement Learning. NeurIPS 2020
[c7]
  persistent URL:
  - https://dblp.org/rec/conf/semeval/SinghalDAM20
Vipul Singhal, Sahil Dhull, Rishabh Agarwal, Ashutosh Modi:
IITK at SemEval-2020 Task 10: Transformers for Emphasis Selection. SemEval@COLING 2020: 1665-1670
[i8]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2004-13912
Rishabh Agarwal, Nicholas Frosst, Xuezhou Zhang, Rich Caruana, Geoffrey E. Hinton:
Neural Additive Models: Interpretable Machine Learning with Neural Nets. CoRR abs/2004.13912 (2020)
[i7]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2006-13888
Çaglar Gülçehre, Ziyu Wang, Alexander Novikov, Tom Le Paine, Sergio Gómez Colmenarejo, Konrad Zolna, Rishabh Agarwal, Josh Merel, Daniel J. Mankowitz, Cosmin Paduraru, Gabriel Dulac-Arnold, Jerry Li, Mohammad Norouzi, Matt Hoffman, Ofir Nachum, George Tucker, Nicolas Heess, Nando de Freitas:
RL Unplugged: Benchmarks for Offline Reinforcement Learning. CoRR abs/2006.13888 (2020)
[i6]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2007-06700
William Fedus, Prajit Ramachandran, Rishabh Agarwal, Yoshua Bengio, Hugo Larochelle, Mark Rowland, Will Dabney:
Revisiting Fundamentals of Experience Replay. CoRR abs/2007.06700 (2020)
[i5]
  dblp key:
  - journals/corr/abs-2007-10820
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2007-10820
Vipul Singhal, Sahil Dhull, Rishabh Agarwal, Ashutosh Modi:
IITK at SemEval-2020 Task 10: Transformers for Emphasis Selection. CoRR abs/2007.10820 (2020)
[i4]
  dblp key:
  - journals/corr/abs-2010-14498
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-14498
Aviral Kumar, Rishabh Agarwal, Dibya Ghosh, Sergey Levine:
Implicit Under-Parameterization Inhibits Data-Efficient Deep Reinforcement Learning. CoRR abs/2010.14498 (2020)

2010 – 2019

2019
[c6]
  dblp key:
  - conf/icml/AgarwalLS019
  persistent URL:
  - https://dblp.org/rec/conf/icml/AgarwalLS019
Rishabh Agarwal, Chen Liang, Dale Schuurmans, Mohammad Norouzi:
Learning to Generalize from Sparse and Underspecified Rewards. ICML 2019: 130-140
[c5]
  dblp key:
  - conf/robosoft/AgarwalB19
  persistent URL:
  - https://dblp.org/rec/conf/robosoft/AgarwalB19
Rishabh Agarwal, Sarah Bergbreiter:
Measurement of shear forces during gripping tasks with a low-cost tactile sensing system. RoboSoft 2019: 330-336
[i3]
  dblp key:
  - journals/corr/abs-1901-08728
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1901-08728
Rishabh Agarwal:
Evaluation Function Approximation for Scrabble. CoRR abs/1901.08728 (2019)
[i2]
  dblp key:
  - journals/corr/abs-1902-07198
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1902-07198
Rishabh Agarwal, Chen Liang, Dale Schuurmans, Mohammad Norouzi:
Learning to Generalize from Sparse and Underspecified Rewards. CoRR abs/1902.07198 (2019)
[i1]
  dblp key:
  - journals/corr/abs-1907-04543
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1907-04543
Rishabh Agarwal, Dale Schuurmans, Mohammad Norouzi:
Striving for Simplicity in Off-policy Deep Reinforcement Learning. CoRR abs/1907.04543 (2019)
2017
[c4]
  dblp key:
  - conf/air/ShuklaSASSC17
  persistent URL:
  - https://dblp.org/rec/conf/air/ShuklaSASSC17
Ayush Shukla, Rishabjit Singh, Rishabh Agarwal, Muhammad Suhail, Subir K. Saha, Santanu Chaudhury:
Development of a Low-Cost Education Platform: RoboMuse 4.0. AIR 2017: 38:1-38:6
[c3]
  dblp key:
  - conf/globecom/GuptaASGD17
  persistent URL:
  - https://dblp.org/rec/conf/globecom/GuptaASGD17
Prakhar Gupta, Rishabh Agarwal, Surbhi Saraswat, Hari Prabhat Gupta, Tanima Dutta:
S-Pencil: A Smart Pencil Grip Monitoring System for Kids Using Sensors. GLOBECOM 2017: 1-6
[c2]
  dblp key:
  - conf/isda/RautKA17
  persistent URL:
  - https://dblp.org/rec/conf/isda/RautKA17
Manoj K. Raut, Tushar V. Kokane, Rishabh Agarwal:
Computing Theory Prime Implicates in Modal Logic. ISDA 2017: 273-282
2016
[c1]
  dblp key:
  - conf/sii/AgarwalSSM16
  persistent URL:
  - https://dblp.org/rec/conf/sii/AgarwalSSM16
Rishabh Agarwal, Prayag Sharma, Subir K. Saha, Takafumi Matsumaru:
Touchless human-mobile robot interaction using a projectable interactive surface. SII 2016: 723-728
