Alborz Geramifard
2020 – today
- 2024
  - [j9] Rohan Chitnis, Shentao Yang, Alborz Geramifard: Sequential Decision-Making for Inline Text Autocomplete. RLJ 2: 946-960 (2024)
  - [j8] Koichiro Yoshino, Yun-Nung Chen, Paul A. Crook, Satwik Kottur, Jinchao Li, Behnam Hedayatnia, Seungwhan Moon, Zhengcong Fei, Zekang Li, Jinchao Zhang, Yang Feng, Jie Zhou, Seokhwan Kim, Yang Liu, Di Jin, Alexandros Papangelis, Karthik Gopalakrishnan, Dilek Hakkani-Tur, Babak Damavandi, Alborz Geramifard, Chiori Hori, Ankit Shah, Chen Zhang, Haizhou Li, João Sedoc, Luis F. D'Haro, Rafael E. Banchs, Alexander Rudnicky: Overview of the Tenth Dialog System Technology Challenge: DSTC10. IEEE ACM Trans. Audio Speech Lang. Process. 32: 765-778 (2024)
  - [j7] R. Chulaka Gunasekara, Seokhwan Kim, Luis Fernando D'Haro, Abhinav Rastogi, Yun-Nung Chen, Mihail Eric, Behnam Hedayatnia, Karthik Gopalakrishnan, Yang Liu, Chao-Wei Huang, Dilek Hakkani-Tür, Jinchao Li, Qi Zhu, Lingxiao Luo, Lars Liden, Kaili Huang, Shahin Shayandeh, Runze Liang, Baolin Peng, Zheng Zhang, Swadheen Shukla, Minlie Huang, Jianfeng Gao, Shikib Mehri, Yulan Feng, Carla Gordon, Seyed Hossein Alavi, David R. Traum, Maxine Eskénazi, Ahmad Beirami, Eunjoon Cho, Paul A. Crook, Ankita De, Alborz Geramifard, Satwik Kottur, Seungwhan Moon, Shivani Poddar, Rajen Subba: Overview of the Ninth Dialog System Technology Challenge: DSTC9. IEEE ACM Trans. Audio Speech Lang. Process. 32: 4066-4076 (2024)
  - [c29] Prajjwal Bhargava, Rohan Chitnis, Alborz Geramifard, Shagun Sodhani, Amy Zhang: When should we prefer Decision Transformers for Offline Reinforcement Learning? ICLR 2024
  - [c28] Harshit Sikchi, Rohan Chitnis, Ahmed Touati, Alborz Geramifard, Amy Zhang, Scott Niekum: Score Models for Offline Goal-Conditioned Reinforcement Learning. ICLR 2024
  - [i22] Rohan Chitnis, Shentao Yang, Alborz Geramifard: Sequential Decision-Making for Inline Text Autocomplete. CoRR abs/2403.15502 (2024)
- 2023
  - [j6] Tianjian Huang, Shaunak Ashish Halbe, Chinnadhurai Sankar, Pooyan Amini, Satwik Kottur, Alborz Geramifard, Meisam Razaviyayn, Ahmad Beirami: Robustness through Data Augmentation Loss Consistency. Trans. Mach. Learn. Res. 2023 (2023)
  - [i21] Khyathi Raghavi Chandu, Alborz Geramifard: Curriculum Script Distillation for Multilingual Visual Question Answering. CoRR abs/2301.07227 (2023)
  - [i20] Prajjwal Bhargava, Rohan Chitnis, Alborz Geramifard, Shagun Sodhani, Amy Zhang: Sequence Modeling is a Robust Contender for Offline Reinforcement Learning. CoRR abs/2305.14550 (2023)
  - [i19] Harshit Sikchi, Rohan Chitnis, Ahmed Touati, Alborz Geramifard, Amy Zhang, Scott Niekum: Score Models for Offline Goal-Conditioned Reinforcement Learning. CoRR abs/2311.02013 (2023)
- 2022
  - [c27] Satwik Kottur, Seungwhan Moon, Alborz Geramifard, Babak Damavandi: Navigating Connected Memories with a Task-oriented Dialog System. EMNLP 2022: 2495-2507
  - [c26] Qingyang Wu, Zhenzhong Lan, Kun Qian, Jing Gu, Alborz Geramifard, Zhou Yu: Memformer: A Memory-Augmented Transformer for Sequence Modeling. AACL/IJCNLP (Findings) 2022: 308-318
  - [c25] Kun Qian, Satwik Kottur, Ahmad Beirami, Shahin Shayandeh, Paul A. Crook, Alborz Geramifard, Zhou Yu, Chinnadhurai Sankar: Database Search Results Disambiguation for Task-Oriented Dialog Systems. NAACL-HLT 2022: 1158-1173
  - [i18] Jorge A. Mendez, Alborz Geramifard, Mohammad Ghavamzadeh, Bing Liu: Reinforcement Learning of Multi-Domain Dialog Policies Via Action Embeddings. CoRR abs/2207.00468 (2022)
  - [i17] Khyathi Raghavi Chandu, Alborz Geramifard: Multilingual Multimodality: A Taxonomical Survey of Datasets, Techniques, Challenges and Opportunities. CoRR abs/2210.16960 (2022)
  - [i16] Satwik Kottur, Seungwhan Moon, Aram H. Markosyan, Hardik Shah, Babak Damavandi, Alborz Geramifard: Tell Your Story: Task-Oriented Dialogs for Interactive Content Creation. CoRR abs/2211.03940 (2022)
  - [i15] Seungwhan Moon, Satwik Kottur, Alborz Geramifard, Babak Damavandi: Navigating Connected Memories with a Task-oriented Dialog System. CoRR abs/2211.08462 (2022)
- 2021
  - [j5] Yuxi Li, Alborz Geramifard, Lihong Li, Csaba Szepesvári, Tao Wang: Guest editorial: special issue on reinforcement learning for real life. Mach. Learn. 110(9): 2291-2293 (2021)
  - [c24] Hung Le, Chinnadhurai Sankar, Seungwhan Moon, Ahmad Beirami, Alborz Geramifard, Satwik Kottur: DVD: A Diagnostic Dataset for Multi-step Reasoning in Video Grounded Dialogue. ACL/IJCNLP (1) 2021: 5651-5665
  - [c23] Satwik Kottur, Seungwhan Moon, Alborz Geramifard, Babak Damavandi: SIMMC 2.0: A Task-oriented Dialog Dataset for Immersive Multimodal Conversations. EMNLP (1) 2021: 4903-4912
  - [c22] Alborz Geramifard: Conversational AI Efforts within Facebook AI Applied Research. MuCAI @ ACM Multimedia 2021: 1
  - [c21] Satwik Kottur, Chinnadhurai Sankar, Zhou Yu, Alborz Geramifard: DialogStitch: Synthetic Deeper and Multi-Context Task-Oriented Dialogs. SIGDIAL 2021: 21-26
  - [c20] Satwik Kottur, Paul A. Crook, Seungwhan Moon, Ahmad Beirami, Eunjoon Cho, Rajen Subba, Alborz Geramifard: An Analysis of State-of-the-Art Models for Situated Interactive MultiModal Conversations (SIMMC). SIGDIAL 2021: 144-153
  - [c19] Kun Qian, Ahmad Beirami, Zhouhan Lin, Ankita De, Alborz Geramifard, Zhou Yu, Chinnadhurai Sankar: Annotation Inconsistency and Entity Bias in MultiWOZ. SIGDIAL 2021: 326-337
  - [i14] Hung Le, Chinnadhurai Sankar, Seungwhan Moon, Ahmad Beirami, Alborz Geramifard, Satwik Kottur: DVD: A Diagnostic Dataset for Multi-step Reasoning in Video Grounded Dialogue. CoRR abs/2101.00151 (2021)
  - [i13] Satwik Kottur, Seungwhan Moon, Alborz Geramifard, Babak Damavandi: SIMMC 2.0: A Task-oriented Dialog Dataset for Immersive Multimodal Conversations. CoRR abs/2104.08667 (2021)
  - [i12] Kun Qian, Ahmad Beirami, Zhouhan Lin, Ankita De, Alborz Geramifard, Zhou Yu, Chinnadhurai Sankar: Annotation Inconsistency and Entity Bias in MultiWOZ. CoRR abs/2105.14150 (2021)
  - [i11] Tianjian Huang, Shaunak Ashish Halbe, Chinnadhurai Sankar, Pooyan Amini, Satwik Kottur, Alborz Geramifard, Meisam Razaviyayn, Ahmad Beirami: DAIR: Data Augmented Invariant Regularization. CoRR abs/2110.11205 (2021)
  - [i10] Kun Qian, Ahmad Beirami, Satwik Kottur, Shahin Shayandeh, Paul A. Crook, Alborz Geramifard, Zhou Yu, Chinnadhurai Sankar: Database Search Results Disambiguation for Task-Oriented Dialog Systems. CoRR abs/2112.08351 (2021)
- 2020
  - [c18] Seungwhan Moon, Satwik Kottur, Paul A. Crook, Ankita De, Shivani Poddar, Theodore Levin, David Whitney, Daniel Difranco, Ahmad Beirami, Eunjoon Cho, Rajen Subba, Alborz Geramifard: Situated and Interactive Multimodal Conversations. COLING 2020: 1103-1121
  - [c17] Zhenpeng Zhou, Ahmad Beirami, Paul A. Crook, Pararth Shah, Rajen Subba, Alborz Geramifard: Resource Constrained Dialog Policy Learning Via Differentiable Inductive Logic Programming. COLING 2020: 6775-6787
  - [i9] Seungwhan Moon, Satwik Kottur, Paul A. Crook, Ankita De, Shivani Poddar, Theodore Levin, David Whitney, Daniel Difranco, Ahmad Beirami, Eunjoon Cho, Rajen Subba, Alborz Geramifard: Situated and Interactive Multimodal Conversations. CoRR abs/2006.01460 (2020)
  - [i8] Zhenpeng Zhou, Ahmad Beirami, Paul A. Crook, Pararth Shah, Rajen Subba, Alborz Geramifard: Resource Constrained Dialog Policy Learning via Differentiable Inductive Logic Programming. CoRR abs/2011.05457 (2020)
  - [i7] R. Chulaka Gunasekara, Seokhwan Kim, Luis Fernando D'Haro, Abhinav Rastogi, Yun-Nung Chen, Mihail Eric, Behnam Hedayatnia, Karthik Gopalakrishnan, Yang Liu, Chao-Wei Huang, Dilek Hakkani-Tür, Jinchao Li, Qi Zhu, Lingxiao Luo, Lars Liden, Kaili Huang, Shahin Shayandeh, Runze Liang, Baolin Peng, Zheng Zhang, Swadheen Shukla, Minlie Huang, Jianfeng Gao, Shikib Mehri, Yulan Feng, Carla Gordon, Seyed Hossein Alavi, David R. Traum, Maxine Eskénazi, Ahmad Beirami, Eunjoon Cho, Paul A. Crook, Ankita De, Alborz Geramifard, Satwik Kottur, Seungwhan Moon, Shivani Poddar, Rajen Subba: Overview of the Ninth Dialog System Technology Challenge: DSTC9. CoRR abs/2011.06486 (2020)
2010 – 2019
- 2019
  - [i6] Praveen Kumar Bodigutla, Longshaokan Wang, Kate Ridgeway, Joshua Levy, Swanand Joshi, Alborz Geramifard, Spyros Matsoukas: Domain-Independent turn-level Dialogue Quality Evaluation via User Satisfaction Estimation. CoRR abs/1908.07064 (2019)
  - [i5] Paul A. Crook, Shivani Poddar, Ankita De, Semir Shafi, David Whitney, Alborz Geramifard, Rajen Subba: SIMMC: Situated Interactive Multi-Modal Conversational Data Collection And Evaluation Platform. CoRR abs/1911.02690 (2019)
- 2017
  - [c16] Muthu Muthukrishnan, Andrew Tomkins, Larry P. Heck, Alborz Geramifard, Deepak Agarwal: The Future of Artificially Intelligent Assistants. KDD 2017: 33-34
  - [i4] Maryam Fazel-Zarandi, Shang-Wen Li, Jin Cao, Jared Casale, Peter Henderson, David Whitney, Alborz Geramifard: Learning Robust Dialog Policies in Noisy Environments. CoRR abs/1712.04034 (2017)
- 2015
  - [j4] Alborz Geramifard, Christoph Dann, Robert H. Klein, William Dabney, Jonathan P. How: RLPy: a value-function-based reinforcement learning framework for education and research. J. Mach. Learn. Res. 16: 1573-1578 (2015)
- 2013
  - [j3] Alborz Geramifard, Thomas J. Walsh, Stefanie Tellex, Girish Chowdhary, Nicholas Roy, Jonathan P. How: A Tutorial on Linear Function Approximators for Dynamic Programming and Reinforcement Learning. Found. Trends Mach. Learn. 6(4): 375-451 (2013)
  - [j2] Alborz Geramifard, Josh Redding, Jonathan P. How: Intelligent Cooperative Control Architecture: A Framework for Performance Improvement Using Safe Learning. J. Intell. Robotic Syst. 72(1): 83-103 (2013)
  - [c15] Christopher Amato, Girish Chowdhary, Alborz Geramifard, N. Kemal Ure, Mykel J. Kochenderfer: Decentralized control of partially observable Markov decision processes. CDC 2013: 2398-2405
  - [c14] Joshua Mason Joseph, Alborz Geramifard, John W. Roberts, Jonathan P. How, Nicholas Roy: Reinforcement learning with misspecified model classes. ICRA 2013: 939-946
  - [c13] Alborz Geramifard, Thomas J. Walsh, Nicholas Roy, Jonathan P. How: Batch-iFDD for Representation Expansion in Large MDPs. UAI 2013
  - [i3] Alborz Geramifard, Thomas J. Walsh, Nicholas Roy, Jonathan P. How: Batch-iFDD for Representation Expansion in Large MDPs. CoRR abs/1309.6831 (2013)
- 2012
  - [c12] Alborz Geramifard, Joshua D. Redding, Joshua Mason Joseph, Nicholas Roy, Jonathan P. How: Model estimation within planning and learning. ACC 2012: 793-799
  - [c11] N. Kemal Ure, Alborz Geramifard, Girish Chowdhary, Jonathan P. How: Adaptive Planning for Markov Decision Processes with Uncertain Transition Models via Incremental Feature Dependency Discovery. ECML/PKDD (2) 2012: 99-115
  - [i2] Richard S. Sutton, Csaba Szepesvári, Alborz Geramifard, Michael Bowling: Dyna-Style Planning with Linear Function Approximation and Prioritized Sweeping. CoRR abs/1206.3285 (2012)
- 2011
  - [c10] Alborz Geramifard, Joshua D. Redding, Nicholas Roy, Jonathan P. How: UAV cooperative control with stochastic risk models. ACC 2011: 3393-3398
  - [c9] Alborz Geramifard, Finale Doshi, Josh Redding, Nicholas Roy, Jonathan P. How: Online Discovery of Feature Dependencies. ICML 2011: 881-888
- 2010
  - [j1] Ruijie He, Abraham Bachrach, Michael Achtelik, Alborz Geramifard, Daniel Gurdan, Samuel Prentice, Jan Stumpf, Nicholas Roy: On the Design and Use of a Micro Air Vehicle to Track and Avoid Adversaries. Int. J. Robotics Res. 29(5): 529-546 (2010)
  - [c8] Josh Redding, Alborz Geramifard, Jonathan P. How: Actor-Critic Policy Learning in Cooperative Planning. AAAI Spring Symposium: Embedded Reasoning 2010
  - [c7] Joshua D. Redding, Alborz Geramifard, Aditya Undurti, Han-Lim Choi, Jonathan P. How: An intelligent Cooperative Control Architecture. ACC 2010: 57-62
2000 – 2009
- 2008
  - [c6] Michael H. Bowling, Alborz Geramifard, David Wingate: Sigma point policy iteration. AAMAS (1) 2008: 379-386
  - [c5] Abraham Bachrach, Alborz Geramifard, Daniel Gurdan, Ruijie He, Sam Prentice, Jan Stumpf, Nicholas Roy: Co-ordinated Tracking and Planning Using Air and Ground Vehicles. ISER 2008: 137-146
  - [c4] Richard S. Sutton, Csaba Szepesvári, Alborz Geramifard, Michael H. Bowling: Dyna-Style Planning with Linear Function Approximation and Prioritized Sweeping. UAI 2008: 528-536
- 2006
  - [c3] Alborz Geramifard, Michael H. Bowling, Richard S. Sutton: Incremental Least-Squares Temporal Difference Learning. AAAI 2006: 356-361
  - [c2] Alborz Geramifard, Pirooz Chubak, Vadim Bulitko: Biased Cost Pathfinding. AIIDE 2006: 112-114
  - [c1] Alborz Geramifard, Michael H. Bowling, Martin Zinkevich, Richard S. Sutton: iLSTD: Eligibility Traces and Convergence Analysis. NIPS 2006: 441-448
  - [i1] Alborz Geramifard, Peyman Nayeri, Reza Zamani-Nasab, Jafar Habibi: A Hybrid Three Layer Architecture for Fire Agent Management in Rescue Simulation Environment. CoRR abs/cs/0601055 (2006)
last updated on 2024-12-10 21:44 CET by the dblp team
all metadata released as open data under CC0 1.0 license