Scott M. Jordan
Person information
- affiliation: University of Alberta, Canada
- affiliation (former): University of Massachusetts, MA, USA
2020 – today
- 2024
- [j1] Scott M. Jordan, Samuel Neumann, James E. Kostas, Adam White, Philip S. Thomas:
The Cliff of Overcommitment with Policy Gradient Step Sizes. RLJ 2: 864-883 (2024)
- [c10] Dhawal Gupta, Scott M. Jordan, Shreyas Chaudhari, Bo Liu, Philip S. Thomas, Bruno Castro da Silva:
From Past to Future: Rethinking Eligibility Traces. AAAI 2024: 12253-12260
- [c9] Scott M. Jordan, Adam White, Bruno Castro da Silva, Martha White, Philip S. Thomas:
Position: Benchmarking is Limited in Reinforcement Learning Research. ICML 2024
- [i11] Kevin Roice, Parham Mohammad Panahi, Scott M. Jordan, Adam White, Martha White:
A New View on Planning in Online Reinforcement Learning. CoRR abs/2406.01562 (2024)
- [i10] Scott M. Jordan, Adam White, Bruno Castro da Silva, Martha White, Philip S. Thomas:
Position: Benchmarking is Limited in Reinforcement Learning Research. CoRR abs/2406.16241 (2024)
- 2023
- [c8] Dhawal Gupta, Yash Chandak, Scott M. Jordan, Philip S. Thomas, Bruno C. da Silva:
Behavior Alignment via Reward Function Optimization. NeurIPS 2023
- [i9] Wenhao Yang, Han Wang, Tadashi Kozuno, Scott M. Jordan, Zhihua Zhang:
Avoiding Model Estimation in Robust Markov Decision Processes with a Generative Model. CoRR abs/2302.01248 (2023)
- [i8] James E. Kostas, Scott M. Jordan, Yash Chandak, Georgios Theocharous, Dhawal Gupta, Martha White, Bruno Castro da Silva, Philip S. Thomas:
Coagent Networks: Generalized and Scaled. CoRR abs/2305.09838 (2023)
- [i7] Dhawal Gupta, Yash Chandak, Scott M. Jordan, Philip S. Thomas, Bruno Castro da Silva:
Behavior Alignment via Reward Function Optimization. CoRR abs/2310.19007 (2023)
- [i6] Dhawal Gupta, Scott M. Jordan, Shreyas Chaudhari, Bo Liu, Philip S. Thomas, Bruno Castro da Silva:
From Past to Future: Rethinking Eligibility Traces. CoRR abs/2312.12972 (2023)
- 2021
- [c7] James E. Kostas, Yash Chandak, Scott M. Jordan, Georgios Theocharous, Philip S. Thomas:
High Confidence Generalization for Reinforcement Learning. ICML 2021: 5764-5773
- 2020
- [c6] Scott M. Jordan, Yash Chandak, Daniel Cohen, Mengxue Zhang, Philip S. Thomas:
Evaluating the Performance of Reinforcement Learning Algorithms. ICML 2020: 4962-4973
- [c5] Yash Chandak, Scott M. Jordan, Georgios Theocharous, Martha White, Philip S. Thomas:
Towards Safe Policy Improvement for Non-Stationary MDPs. NeurIPS 2020
- [i5] Scott M. Jordan, Yash Chandak, Daniel Cohen, Mengxue Zhang, Philip S. Thomas:
Evaluating the Performance of Reinforcement Learning Algorithms. CoRR abs/2006.16958 (2020)
- [i4] Yash Chandak, Scott M. Jordan, Georgios Theocharous, Martha White, Philip S. Thomas:
Towards Safe Policy Improvement for Non-Stationary MDPs. CoRR abs/2010.12645 (2020)
2010 – 2019
- 2019
- [c4] Yash Chandak, Georgios Theocharous, James E. Kostas, Scott M. Jordan, Philip S. Thomas:
Learning Action Representations for Reinforcement Learning. ICML 2019: 941-950
- [c3] Daniel Cohen, Scott M. Jordan, W. Bruce Croft:
Learning a Better Negative Sampling Policy with Deep Neural Networks for Search. ICTIR 2019: 19-26
- [i3] Yash Chandak, Georgios Theocharous, James E. Kostas, Scott M. Jordan, Philip S. Thomas:
Learning Action Representations for Reinforcement Learning. CoRR abs/1902.00183 (2019)
- [i2] Philip S. Thomas, Scott M. Jordan, Yash Chandak, Chris Nota, James E. Kostas:
Classical Policy Gradient: Preserving Bellman's Principle of Optimality. CoRR abs/1906.03063 (2019)
- 2018
- [c2] Li Yang Ku, Scott Michael Jordan, Julia Badger, Erik G. Learned-Miller, Rod Grupen:
Learning to Use a Ratchet by Modeling Spatial Relations in Demonstrations. ISER 2018: 398-410
- [i1] Daniel Cohen, Scott M. Jordan, W. Bruce Croft:
Distributed Evaluations: Ending Neural Point Metrics. CoRR abs/1806.03790 (2018)
- 2017
- [c1] Scott Michael Jordan, Dirk Ruiken, Tiffany Q. Liu, Takeshi Takahashi, Michael William Lanighan, Roderic A. Grupen:
Summary of Experiments in Belief-Space Planning at the Laboratory for Perceptual Robotics. AAAI Spring Symposia 2017
last updated on 2025-01-09 13:23 CET by the dblp team
all metadata released as open data under CC0 1.0 license