default search action

combined dblp search
author search
venue search
publication search

ask others

Yash Chandak

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[c22]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/aistats/ShankarSCMF24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aistats/ShankarSCMF24
Shiv Shankar, Ritwik Sinha, Yash Chandak, Saayan Mitra, Madalina Fiterau:
A/B testing under Interference with Partial Network Information. AISTATS 2024: 19-27
[c21]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/emnlp/Flet-BerliacGSC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/Flet-BerliacGSC24
Yannis Flet-Berliac, Nathan Grinsztajn, Florian Strub, Eugene Choi, Bill Wu, Chris Cremer, Arash Ahmadian, Yash Chandak, Mohammad Gheshlaghi Azar, Olivier Pietquin, Matthieu Geist:
Contrastive Policy Gradient: Aligning LLMs on sequence-level scores in a supervised-friendly fashion. EMNLP 2024: 21353-21370
[c20]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/ChandakSSB24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/ChandakSSB24
Yash Chandak, Shiv Shankar, Vasilis Syrgkanis, Emma Brunskill:
Adaptive Instrument Design for Indirect Experiments. ICLR 2024
[c19]
- view
  authority control:
- export record
  dblp key:
  - conf/lak/LeonNCB24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/lak/LeonNCB24
Amelia Leon, Allen Nie, Yash Chandak, Emma Brunskill:
Estimating the Causal Treatment Effect of Unproductive Persistence. LAK 2024: 843-849
[i30]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-10547
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-10547
Shiv Shankar, Ritwik Sinha, Yash Chandak, Saayan Mitra, Madalina Fiterau:
A/B testing under Interference with Partial Network Information. CoRR abs/2404.10547 (2024)
[i29]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-17708
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-17708
Allen Nie, Yash Chandak, Christina J. Yuan, Anirudhan Badrinath, Yannis Flet-Berliac, Emma Brunskill:
OPERA: Automatic Offline Policy Evaluation with Re-weighted Aggregates of Multiple Estimators. CoRR abs/2405.17708 (2024)
[i28]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-19185
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-19185
Yannis Flet-Berliac, Nathan Grinsztajn, Florian Strub, Eugene Choi, Chris Cremer, Arash Ahmadian, Yash Chandak, Mohammad Gheshlaghi Azar, Olivier Pietquin, Matthieu Geist:
Contrastive Policy Gradient: Aligning LLMs on sequence-level scores in a supervised-friendly fashion. CoRR abs/2406.19185 (2024)
[i27]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-19188
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-19188
Nathan Grinsztajn, Yannis Flet-Berliac, Mohammad Gheshlaghi Azar, Florian Strub, Bill Wu, Eugene Choi, Chris Cremer, Arash Ahmadian, Yash Chandak, Olivier Pietquin, Matthieu Geist:
Averaging log-likelihoods in direct alignment. CoRR abs/2406.19188 (2024)
[i26]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-03674
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-03674
Hyunji Alex Nam, Yash Chandak, Emma Brunskill:
Short-Long Policy Evaluation with Novel Actions. CoRR abs/2407.03674 (2024)
[i25]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-09975
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-09975
Allen Nie, Yash Chandak, Miroslav Suzara, Ali Malik, Juliette Woodrow, Matt Peng, Mehran Sahami, Emma Brunskill, Chris Piech:
The GPT Surprise: Offering Large Language Model Chat in a Massive Coding Class Reduced Engagement but Increased Adopters Exam Performances. CoRR abs/2407.09975 (2024)
2023
[c18]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/aistats/LiuCTW23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aistats/LiuCTW23
Vincent Liu, Yash Chandak, Philip S. Thomas, Martha White:
Asymptotically Unbiased Off-Policy Policy Evaluation when Reusing Old Data in Nonstationary Environments. AISTATS 2023: 5474-5492
[c17]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/ChandakTGTMDB23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/ChandakTGTMDB23
Yash Chandak, Shantanu Thakoor, Zhaohan Daniel Guo, Yunhao Tang, Rémi Munos, Will Dabney, Diana L. Borsa:
Representations and Exploration for Deep Reinforcement Learning using Singular Value Decomposition. ICML 2023: 4009-4034
[c16]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/TangGRPCMRALL0T23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/TangGRPCMRALL0T23
Yunhao Tang, Zhaohan Daniel Guo, Pierre Harvey Richemond, Bernardo Ávila Pires, Yash Chandak, Rémi Munos, Mark Rowland, Mohammad Gheshlaghi Azar, Charline Le Lan, Clare Lyle, András György, Shantanu Thakoor, Will Dabney, Bilal Piot, Daniele Calandriello, Michal Valko:
Understanding Self-Predictive Learning for Reinforcement Learning. ICML 2023: 33632-33656
[c15]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/0002XPCFNB23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/0002XPCFNB23
Jonathan Lee, Annie Xie, Aldo Pacchiano, Yash Chandak, Chelsea Finn, Ofir Nachum, Emma Brunskill:
Supervised Pretraining Can Learn In-Context Reinforcement Learning. NeurIPS 2023
[c14]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/GuptaCJT023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/GuptaCJT023
Dhawal Gupta, Yash Chandak, Scott M. Jordan, Philip S. Thomas, Bruno C. da Silva:
Behavior Alignment via Reward Function Optimization. NeurIPS 2023
[i24]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2301-10330
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2301-10330
Yash Chandak, Shiv Shankar, Nathaniel D. Bastian, Bruno Castro da Silva, Emma Brunskill, Philip S. Thomas:
Off-Policy Evaluation for Action-Dependent Non-Stationary Environments. CoRR abs/2301.10330 (2023)
[i23]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2302-03161
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-03161
Yash Chandak, Shiv Shankar, Venkata Gandikota, Philip S. Thomas, Arya Mazumdar:
Optimization using Parallel Gradient Evaluations on Multiple Parameters. CoRR abs/2302.03161 (2023)
[i22]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2302-11725
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-11725
Vincent Liu, Yash Chandak, Philip S. Thomas, Martha White:
Asymptotically Unbiased Off-Policy Policy Evaluation when Reusing Old Data in Nonstationary Environments. CoRR abs/2302.11725 (2023)
[i21]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-00654
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-00654
Yash Chandak, Shantanu Thakoor, Zhaohan Daniel Guo, Yunhao Tang, Rémi Munos, Will Dabney, Diana L. Borsa:
Representations and Exploration for Deep Reinforcement Learning using Singular Value Decomposition. CoRR abs/2305.00654 (2023)
[i20]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-09838
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-09838
James E. Kostas, Scott M. Jordan, Yash Chandak, Georgios Theocharous, Dhawal Gupta, Martha White, Bruno Castro da Silva, Philip S. Thomas:
Coagent Networks: Generalized and Scaled. CoRR abs/2305.09838 (2023)
[i19]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-14892
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-14892
Jonathan N. Lee, Annie Xie, Aldo Pacchiano, Yash Chandak, Chelsea Finn, Ofir Nachum, Emma Brunskill:
Supervised Pretraining Can Learn In-Context Reinforcement Learning. CoRR abs/2306.14892 (2023)
[i18]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-19007
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-19007
Dhawal Gupta, Yash Chandak, Scott M. Jordan, Philip S. Thomas, Bruno Castro da Silva:
Behavior Alignment via Reward Function Optimization. CoRR abs/2310.19007 (2023)
[i17]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-02438
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-02438
Yash Chandak, Shiv Shankar, Vasilis Syrgkanis, Emma Brunskill:
Adaptive Instrument Design for Indirect Experiments. CoRR abs/2312.02438 (2023)
2022
[j1]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/fdata/VijayanCKPR22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/fdata/VijayanCKPR22
Priyesh Vijayan, Yash Chandak, Mitesh M. Khapra, Srinivasan Parthasarathy, Balaraman Ravindran:
Scaling Graph Propagation Kernels for Predictive Learning. Frontiers Big Data 5: 616617 (2022)
[c13]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/TanKPPCRRSHC22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/TanKPPCRRSHC22
Weihao Tan, David Koleczek, Siddhant Pradhan, Nicholas Perello, Vivek Chettiar, Vishal Rohra, Aaslesha Rajaram, Soundararajan Srinivasan, H. M. Sajjad Hossain, Yash Chandak:
On Optimizing Interventions in Shared Autonomy. AAAI 2022: 5341-5349
[c12]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/ChandakSB0BT22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/ChandakSB0BT22
Yash Chandak, Shiv Shankar, Nathaniel D. Bastian, Bruno C. da Silva, Emma Brunskill, Philip S. Thomas:
Off-Policy Evaluation for Action-Dependent Non-stationary Environments. NeurIPS 2022
[c11]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/MuCHB22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/MuCHB22
Tong Mu, Yash Chandak, Tatsunori B. Hashimoto, Emma Brunskill:
Factored DRO: Factored Distributionally Robust Policies for Contextual Bandits. NeurIPS 2022
[i16]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2212-03319
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2212-03319
Yunhao Tang, Zhaohan Daniel Guo, Pierre Harvey Richemond, Bernardo Ávila Pires, Yash Chandak, Rémi Munos, Mark Rowland, Mohammad Gheshlaghi Azar, Charline Le Lan, Clare Lyle, András György, Shantanu Thakoor, Will Dabney, Bilal Piot, Daniele Calandriello, Michal Valko:
Understanding Self-Predictive Learning for Reinforcement Learning. CoRR abs/2212.03319 (2022)
2021
[c10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/ChandakST21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/ChandakST21
Yash Chandak, Shiv Shankar, Philip S. Thomas:
High-Confidence Off-Policy (or Counterfactual) Variance Estimation. AAAI 2021: 6939-6947
[c9]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/KostasCJTT21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/KostasCJTT21
James E. Kostas, Yash Chandak, Scott M. Jordan, Georgios Theocharous, Philip S. Thomas:
High Confidence Generalization for Reinforcement Learning. ICML 2021: 5764-5773
[c8]
- view
  - electronic edition @ neurips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/YuanCGTN21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/YuanCGTN21
Christina J. Yuan, Yash Chandak, Stephen Giguere, Philip S. Thomas, Scott Niekum:
SOPE: Spectrum of Off-Policy Estimators. NeurIPS 2021: 18958-18969
[c7]
- view
  - electronic edition @ neurips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/ChandakNSLBT21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/ChandakNSLBT21
Yash Chandak, Scott Niekum, Bruno C. da Silva, Erik G. Learned-Miller, Emma Brunskill, Philip S. Thomas:
Universal Off-Policy Evaluation. NeurIPS 2021: 27475-27490
[i15]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2101-09847
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2101-09847
Yash Chandak, Shiv Shankar, Philip S. Thomas:
High-Confidence Off-Policy (or Counterfactual) Variance Estimation. CoRR abs/2101.09847 (2021)
[i14]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2104-12820
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2104-12820
Yash Chandak, Scott Niekum, Bruno Castro da Silva, Erik G. Learned-Miller, Emma Brunskill, Philip S. Thomas:
Universal Off-Policy Evaluation. CoRR abs/2104.12820 (2021)
[i13]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2111-03936
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2111-03936
Christina J. Yuan, Yash Chandak, Stephen Giguere, Philip S. Thomas, Scott Niekum:
SOPE: Spectrum of Off-Policy Estimators. CoRR abs/2111.03936 (2021)
[i12]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2112-09169
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2112-09169
Weihao Tan, David Koleczek, Siddhant Pradhan, Nicholas Perello, Vivek Chettiar, Vishal Rohra, Aaslesha Rajaram, Soundararajan Srinivasan, H. M. Sajjad Hossain, Yash Chandak:
On Optimizing Interventions in Shared Autonomy. CoRR abs/2112.09169 (2021)
2020
[c6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/ChandakTNT20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/ChandakTNT20
Yash Chandak, Georgios Theocharous, Chris Nota, Philip S. Thomas:
Lifelong Learning with a Changing Action Set. AAAI 2020: 3373-3380
[c5]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/ChandakTMT20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/ChandakTMT20
Yash Chandak, Georgios Theocharous, Blossom Metevier, Philip S. Thomas:
Reinforcement Learning When All Actions Are Not Always Available. AAAI 2020: 3381-3388
[c4]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/ChandakTSWMT20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/ChandakTSWMT20
Yash Chandak, Georgios Theocharous, Shiv Shankar, Martha White, Sridhar Mahadevan, Philip S. Thomas:
Optimizing for the Future in Non-Stationary MDPs. ICML 2020: 1414-1425
[c3]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/JordanCCZT20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/JordanCCZT20
Scott M. Jordan, Yash Chandak, Daniel Cohen, Mengxue Zhang, Philip S. Thomas:
Evaluating the Performance of Reinforcement Learning Algorithms. ICML 2020: 4962-4973
[c2]
- view
  - electronic edition @ neurips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/ChandakJTWT20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/ChandakJTWT20
Yash Chandak, Scott M. Jordan, Georgios Theocharous, Martha White, Philip S. Thomas:
Towards Safe Policy Improvement for Non-Stationary MDPs. NeurIPS 2020
[i11]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2005-08158
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2005-08158
Yash Chandak, Georgios Theocharous, Shiv Shankar, Martha White, Sridhar Mahadevan, Philip S. Thomas:
Optimizing for the Future in Non-Stationary MDPs. CoRR abs/2005.08158 (2020)
[i10]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2006-16958
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2006-16958
Scott M. Jordan, Yash Chandak, Daniel Cohen, Mengxue Zhang, Philip S. Thomas:
Evaluating the Performance of Reinforcement Learning Algorithms. CoRR abs/2006.16958 (2020)
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2009-07346
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2009-07346
Georgios Theocharous, Yash Chandak, Philip S. Thomas, Frits de Nijs:
Reinforcement Learning for Strategic Recommendations. CoRR abs/2009.07346 (2020)
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2010-12645
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-12645
Yash Chandak, Scott M. Jordan, Georgios Theocharous, Martha White, Philip S. Thomas:
Towards Safe Policy Improvement for Non-Stationary MDPs. CoRR abs/2010.12645 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[c1]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/ChandakTKJT19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/ChandakTKJT19
Yash Chandak, Georgios Theocharous, James E. Kostas, Scott M. Jordan, Philip S. Thomas:
Learning Action Representations for Reinforcement Learning. ICML 2019: 941-950
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1902-00183
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1902-00183
Yash Chandak, Georgios Theocharous, James E. Kostas, Scott M. Jordan, Philip S. Thomas:
Learning Action Representations for Reinforcement Learning. CoRR abs/1902.00183 (2019)
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1906-01770
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1906-01770
Yash Chandak, Georgios Theocharous, Chris Nota, Philip S. Thomas:
Lifelong Learning with a Changing Action Set. CoRR abs/1906.01770 (2019)
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1906-01772
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1906-01772
Yash Chandak, Georgios Theocharous, Blossom Metevier, Philip S. Thomas:
Reinforcement Learning When All Actions are Not Always Available. CoRR abs/1906.01772 (2019)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1906-03063
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1906-03063
Philip S. Thomas, Scott M. Jordan, Yash Chandak, Chris Nota, James E. Kostas:
Classical Policy Gradient: Preserving Bellman's Principle of Optimality. CoRR abs/1906.03063 (2019)
2018
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1805-12421
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1805-12421
Priyesh Vijayan, Yash Chandak, Mitesh M. Khapra, Balaraman Ravindran:
HOPF: Higher Order Propagation Framework for Deep Collective Classification. CoRR abs/1805.12421 (2018)
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1805-12528
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1805-12528
Priyesh Vijayan, Yash Chandak, Mitesh M. Khapra, Balaraman Ravindran:
Fusion Graph Convolutional Networks. CoRR abs/1805.12528 (2018)
2015
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/VeitWVBDAACCCDD15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/VeitWVBDAACCCDD15
Andreas Veit, Michael J. Wilber, Rajan Vaish, Serge J. Belongie, James Davis, Vishal Anand, Anshu Aviral, Prithvijit Chakrabarty, Yash Chandak, Sidharth Chaturvedi, Chinmaya Devaraj, Ankit Dhall, Utkarsh Dwivedi, Sanket Gupte, Sharath N. Sridhar, Karthik Paga, Anuj Pahuja, Aditya Raisinghani, Ayush Sharma, Shweta Sharma, Darpana Sinha, Nisarg Thakkar, K. Bala Vignesh, Utkarsh Verma, Kanniganti Abhishek, Amod Agrawal, Arya Aishwarya, Aurgho Bhattacharjee, Sarveshwaran Dhanasekar, Venkata Karthik Gullapalli, Shuchita Gupta, Chandana G, Kinjal Jain, Simran Kapur, Meghana Kasula, Shashi Kumar, Parth Kundaliya, Utkarsh Mathur, Alankrit Mishra, Aayush Mudgal, Aditya Nadimpalli, Munakala Sree Nihit, Akanksha Periwal, Ayush Sagar, Ayush Shah, Vikas Sharma, Yashovardhan Sharma, Faizal Siddiqui, Virender Singh, Abhinav S., Pradyumna Tambwekar, Rashida Taskin, Ankit Tripathi, Anurag D. Yadav:
On Optimizing Human-Machine Task Assignments. CoRR abs/1509.07543 (2015)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.