default search action

combined dblp search
author search
venue search
publication search

ask others

Sina Ghiassian

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[j2]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - journals/tmlr/FelicioniMGC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tmlr/FelicioniMGC24
Nicolò Felicioni, Lucas Maystre, Sina Ghiassian, Kamil Ciosek:
On the Importance of Uncertainty in Decision-Making with Large Language Models. Trans. Mach. Learn. Res. 2024 (2024)
[c5]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/DaiTG24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/DaiTG24
Zhenwen Dai, Federico Tomasi, Sina Ghiassian:
In-context Exploration-Exploitation for Reinforcement Learning. ICLR 2024
[i17]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2403-06826
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-06826
Zhenwen Dai, Federico Tomasi, Sina Ghiassian:
In-context Exploration-Exploitation for Reinforcement Learning. CoRR abs/2403.06826 (2024)
[i16]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-02649
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-02649
Nicolò Felicioni, Lucas Maystre, Sina Ghiassian, Kamil Ciosek:
On the Importance of Uncertainty in Decision-Making with Large Language Models. CoRR abs/2404.02649 (2024)
[i15]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-00747
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-00747
Arsalan Sharifnassab, Sina Ghiassian, Saber Salehkaleybar, Surya Kanoria, Dale Schuurmans:
Soft Preference Optimization: Aligning Language Models to Expert Distributions. CoRR abs/2405.00747 (2024)
[i14]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-06317
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-06317
Arash Tavakoli, Sina Ghiassian, Nemanja Rakicevic:
Learning in complex action spaces without policy gradients. CoRR abs/2410.06317 (2024)
2023
[j1]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/adb/RafieeAGKSLW23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/adb/RafieeAGKSLW23
Banafsheh Rafiee, Zaheer Abbas, Sina Ghiassian, Raksha Kumaraswamy, Richard S. Sutton, Elliot A. Ludvig, Adam White:
From eye-blinks to state construction: Diagnostic benchmarks for online representation learning. Adapt. Behav. 31(1): 3-19 (2023)
[c4]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/collas/RafieeG0S0023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/collas/RafieeG0S0023
Banafsheh Rafiee, Sina Ghiassian, Jun Jin, Richard S. Sutton, Jun Luo, Adam White:
Auxiliary task discovery through generate-and-test. CoLLAs 2023: 703-714
2022
[i13]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-10172
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-10172
Eric Graves, Sina Ghiassian:
Importance Sampling Placement in Off-Policy Temporal-Difference Methods. CoRR abs/2203.10172 (2022)
[i12]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-14361
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-14361
Banafsheh Rafiee, Sina Ghiassian, Jun Jin, Richard S. Sutton, Jun Luo, Adam White:
Auxiliary task discovery through generate-and-test. CoRR abs/2210.14361 (2022)
2021
[i11]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2102-07686
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2102-07686
Dylan R. Ashley, Sina Ghiassian, Richard S. Sutton:
Does Standard Backpropagation Forget Less Catastrophically Than Adam? CoRR abs/2102.07686 (2021)
[i10]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2104-13844
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2104-13844
Andrew Patterson, Adam White, Sina Ghiassian, Martha White:
A Generalized Projected Bellman Error for Off-policy Value Estimation in Reinforcement Learning. CoRR abs/2104.13844 (2021)
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2106-00922
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-00922
Sina Ghiassian, Richard S. Sutton:
An Empirical Comparison of Off-policy Prediction Learning Algorithms on the Collision Task. CoRR abs/2106.00922 (2021)
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2109-05110
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2109-05110
Sina Ghiassian, Richard S. Sutton:
An Empirical Comparison of Off-policy Prediction Learning Algorithms in the Four Rooms Environment. CoRR abs/2109.05110 (2021)
2020
[c3]
- view
  authority control:
- export record
  dblp key:
  - conf/atal/GhiassianRLW20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/atal/GhiassianRLW20
Sina Ghiassian, Banafsheh Rafiee, Yat Long Lo, Adam White:
Improving Performance in Reinforcement Learning by Breaking Generalization in Neural Networks. AAMAS 2020: 438-446
[c2]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/GhiassianP0GWW20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/GhiassianP0GWW20
Sina Ghiassian, Andrew Patterson, Shivam Garg, Dhawal Gupta, Adam White, Martha White:
Gradient Temporal-Difference Learning with Regularized Corrections. ICML 2020: 3524-3534
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2003-07417
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2003-07417
Sina Ghiassian, Banafsheh Rafiee, Yat Long Lo, Adam White:
Improving Performance in Reinforcement Learning by Breaking Generalization in Neural Networks. CoRR abs/2003.07417 (2020)
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2007-00611
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2007-00611
Sina Ghiassian, Andrew Patterson, Shivam Garg, Dhawal Gupta, Adam White, Martha White:
Gradient Temporal-Difference Learning with Regularized Corrections. CoRR abs/2007.00611 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[c1]
- view
  - electronic edition @ acm.org
  - details & citations
- export record
  dblp key:
  - conf/atal/RafieeGWS19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/atal/RafieeGWS19
Banafsheh Rafiee, Sina Ghiassian, Adam White, Richard S. Sutton:
Prediction in Intelligence: An Empirical Comparison of Off-policy Algorithms on Robots. AAMAS 2019: 332-340
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1903-00194
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1903-00194
Xiang Gu, Sina Ghiassian, Richard S. Sutton:
Should All Temporal Difference Learning Use Emphasis? CoRR abs/1903.00194 (2019)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1910-13213
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1910-13213
Yat Long Lo, Sina Ghiassian:
Overcoming Catastrophic Interference in Online Reinforcement Learning with Dynamic Self-Organizing Maps. CoRR abs/1910.13213 (2019)
2018
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1805-07476
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1805-07476
Sina Ghiassian, Huizhen Yu, Banafsheh Rafiee, Richard S. Sutton:
Two geometric input transformation methods for fast online reinforcement learning with neural nets. CoRR abs/1805.07476 (2018)
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1811-02597
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1811-02597
Sina Ghiassian, Andrew Patterson, Martha White, Richard S. Sutton, Adam White:
Online Off-policy Prediction. CoRR abs/1811.02597 (2018)
2017
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/GhiassianRS17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/GhiassianRS17
Sina Ghiassian, Banafsheh Rafiee, Richard S. Sutton:
A First Empirical Study of Emphatic Temporal Difference Learning. CoRR abs/1705.04185 (2017)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.