Scott M. Jordan

Scott Michael Jordan

Person information

affiliation: University of Alberta, Canada
affiliation (former): University of Massachusetts, MA, USA

2020 – today

2024
[j1]
Scott M. Jordan, Samuel Neumann, James E. Kostas, Adam White, Philip S. Thomas:
The Cliff of Overcommitment with Policy Gradient Step Sizes. RLJ 2: 864-883 (2024)
[c10]
Dhawal Gupta, Scott M. Jordan, Shreyas Chaudhari, Bo Liu, Philip S. Thomas, Bruno Castro da Silva:
From Past to Future: Rethinking Eligibility Traces. AAAI 2024: 12253-12260
[c9]
Scott M. Jordan, Adam White, Bruno Castro da Silva, Martha White, Philip S. Thomas:
Position: Benchmarking is Limited in Reinforcement Learning Research. ICML 2024
[i11]
Kevin Roice, Parham Mohammad Panahi, Scott M. Jordan, Adam White, Martha White:
A New View on Planning in Online Reinforcement Learning. CoRR abs/2406.01562 (2024)
[i10]
Scott M. Jordan, Adam White, Bruno Castro da Silva, Martha White, Philip S. Thomas:
Position: Benchmarking is Limited in Reinforcement Learning Research. CoRR abs/2406.16241 (2024)
2023
[c8]
Dhawal Gupta, Yash Chandak, Scott M. Jordan, Philip S. Thomas, Bruno C. da Silva:
Behavior Alignment via Reward Function Optimization. NeurIPS 2023
[i9]
Wenhao Yang, Han Wang, Tadashi Kozuno, Scott M. Jordan, Zhihua Zhang:
Avoiding Model Estimation in Robust Markov Decision Processes with a Generative Model. CoRR abs/2302.01248 (2023)
[i8]
James E. Kostas, Scott M. Jordan, Yash Chandak, Georgios Theocharous, Dhawal Gupta, Martha White, Bruno Castro da Silva, Philip S. Thomas:
Coagent Networks: Generalized and Scaled. CoRR abs/2305.09838 (2023)
[i7]
Dhawal Gupta, Yash Chandak, Scott M. Jordan, Philip S. Thomas, Bruno Castro da Silva:
Behavior Alignment via Reward Function Optimization. CoRR abs/2310.19007 (2023)
[i6]
Dhawal Gupta, Scott M. Jordan, Shreyas Chaudhari, Bo Liu, Philip S. Thomas, Bruno Castro da Silva:
From Past to Future: Rethinking Eligibility Traces. CoRR abs/2312.12972 (2023)
2021
[c7]
James E. Kostas, Yash Chandak, Scott M. Jordan, Georgios Theocharous, Philip S. Thomas:
High Confidence Generalization for Reinforcement Learning. ICML 2021: 5764-5773
2020
[c6]
Scott M. Jordan, Yash Chandak, Daniel Cohen, Mengxue Zhang, Philip S. Thomas:
Evaluating the Performance of Reinforcement Learning Algorithms. ICML 2020: 4962-4973
[c5]
Yash Chandak, Scott M. Jordan, Georgios Theocharous, Martha White, Philip S. Thomas:
Towards Safe Policy Improvement for Non-Stationary MDPs. NeurIPS 2020
[i5]
Scott M. Jordan, Yash Chandak, Daniel Cohen, Mengxue Zhang, Philip S. Thomas:
Evaluating the Performance of Reinforcement Learning Algorithms. CoRR abs/2006.16958 (2020)
[i4]
Yash Chandak, Scott M. Jordan, Georgios Theocharous, Martha White, Philip S. Thomas:
Towards Safe Policy Improvement for Non-Stationary MDPs. CoRR abs/2010.12645 (2020)

2010 – 2019

2019
[c4]
Yash Chandak, Georgios Theocharous, James E. Kostas, Scott M. Jordan, Philip S. Thomas:
Learning Action Representations for Reinforcement Learning. ICML 2019: 941-950
[c3]
Daniel Cohen, Scott M. Jordan, W. Bruce Croft:
Learning a Better Negative Sampling Policy with Deep Neural Networks for Search. ICTIR 2019: 19-26
[i3]
Yash Chandak, Georgios Theocharous, James E. Kostas, Scott M. Jordan, Philip S. Thomas:
Learning Action Representations for Reinforcement Learning. CoRR abs/1902.00183 (2019)
[i2]
Philip S. Thomas, Scott M. Jordan, Yash Chandak, Chris Nota, James E. Kostas:
Classical Policy Gradient: Preserving Bellman's Principle of Optimality. CoRR abs/1906.03063 (2019)
2018
[c2]
Li Yang Ku, Scott Michael Jordan, Julia Badger, Erik G. Learned-Miller, Rod Grupen:
Learning to Use a Ratchet by Modeling Spatial Relations in Demonstrations. ISER 2018: 398-410
[i1]
Daniel Cohen, Scott M. Jordan, W. Bruce Croft:
Distributed Evaluations: Ending Neural Point Metrics. CoRR abs/1806.03790 (2018)
2017
[c1]
Scott Michael Jordan, Dirk Ruiken, Tiffany Q. Liu, Takeshi Takahashi, Michael William Lanighan, Roderic A. Grupen:
Summary of Experiments in Belief-Space Planning at the Laboratory for Perceptual Robotics. AAAI Spring Symposia 2017
