default search action
Irfan A. Essa
Person information
- affiliation: Georgia Institute of Technology, Atlanta GA, USA
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j26]Harish Haresamudram, Irfan Essa, Thomas Plötz:
Towards Learning Discrete Representations via Self-Supervision for Wearables-Based Human Activity Recognition. Sensors 24(4): 1238 (2024) - [c161]Harish Haresamudram, Irfan Essa, Thomas Plötz:
A Washing Machine is All You Need? On the Feasibility of Machine Data for Self-Supervised Human Activity Recognition. ABC 2024: 1-10 - [c160]Vincent Cartillier, Grant Schindler, Irfan Essa:
SLAIM: Robust Dense Neural SLAM for Online Tracking and Mapping. CVPR Workshops 2024: 2862-2871 - [c159]Xingqian Xu, Jiayi Guo, Zhangyang Wang, Gao Huang, Irfan Essa, Humphrey Shi:
Prompt-Free Diffusion: Taking "Text" Out of Text-to-Image Diffusion Models. CVPR 2024: 8682-8692 - [c158]Agrim Gupta, Lijun Yu, Kihyuk Sohn, Xiuye Gu, Meera Hahn, Fei-Fei Li, Irfan Essa, Lu Jiang, José Lezama:
Photorealistic Video Generation with Diffusion Models. ECCV (79) 2024: 393-411 - [c157]Seung Hyun Lee, Yinxiao Li, Junjie Ke, Innfarn Yoo, Han Zhang, Jiahui Yu, Qifei Wang, Fei Deng, Glenn Entis, Junfeng He, Gang Li, Sangpil Kim, Irfan Essa, Feng Yang:
Parrot: Pareto-Optimal Multi-reward Reinforcement Learning Framework for Text-to-Image Generation. ECCV (38) 2024: 462-478 - [c156]Lijun Yu, José Lezama, Nitesh Bharadwaj Gundavarapu, Luca Versari, Kihyuk Sohn, David Minnen, Yong Cheng, Agrim Gupta, Xiuye Gu, Alexander G. Hauptmann, Boqing Gong, Ming-Hsuan Yang, Irfan Essa, David A. Ross, Lu Jiang:
Language Model Beats Diffusion - Tokenizer is key to visual generation. ICLR 2024 - [c155]Dan Kondratyuk, Lijun Yu, Xiuye Gu, José Lezama, Jonathan Huang, Grant Schindler, Rachel Hornung, Vighnesh Birodkar, Jimmy Yan, Ming-Chang Chiu, Krishna Somandepalli, Hassan Akbari, Yair Alon, Yong Cheng, Joshua V. Dillon, Agrim Gupta, Meera Hahn, Anja Hauth, David Hendon, Alonso Martinez, David Minnen, Mikhail Sirotenko, Kihyuk Sohn, Xuan Yang, Hartwig Adam, Ming-Hsuan Yang, Irfan Essa, Huisheng Wang, David A. Ross, Bryan Seybold, Lu Jiang:
VideoPoet: A Large Language Model for Zero-Shot Video Generation. ICML 2024 - [i83]Seung Hyun Lee, Yinxiao Li, Junjie Ke, Innfarn Yoo, Han Zhang, Jiahui Yu, Qifei Wang, Fei Deng, Glenn Entis, Junfeng He, Gang Li, Sangpil Kim, Irfan Essa, Feng Yang:
Parrot: Pareto-optimal Multi-Reward Reinforcement Learning Framework for Text-to-Image Generation. CoRR abs/2401.05675 (2024) - [i82]Apoorva Beedu, Karan Samel, Irfan Essa:
On the Efficacy of Text-Based Input Modalities for Action Anticipation. CoRR abs/2401.12972 (2024) - [i81]Vincent Cartillier, Neha Jain, Irfan Essa:
3D Semantic MapNet: Building Maps for Multi-Object Re-Identification in 3D. CoRR abs/2403.13190 (2024) - [i80]Vincent Cartillier, Grant Schindler, Irfan Essa:
SLAIM: Robust Dense Neural SLAM for Online Tracking and Mapping. CoRR abs/2404.11419 (2024) - [i79]Andrew Marmon, Grant Schindler, José Lezama, Dan Kondratyuk, Bryan Seybold, Irfan Essa:
CamViG: Camera Aware Image-to-Video Generation with Multimodal Transformers. CoRR abs/2405.13195 (2024) - [i78]Seung Hyun Lee, Junjie Ke, Yinxiao Li, Junfeng He, Steven Hickson, Katie Datsenko, Sangpil Kim, Ming-Hsuan Yang, Irfan Essa, Feng Yang:
Cropper: Vision-Language Model for Image Cropping through In-Context Learning. CoRR abs/2408.07790 (2024) - [i77]Harish Haresamudram, Apoorva Beedu, Mashfiqui Rabbi, Sankalita Saha, Irfan Essa, Thomas Ploetz:
Limitations in Employing Natural Language Supervision for Sensor-Based Human Activity Recognition - And Ways to Overcome Them. CoRR abs/2408.12023 (2024) - [i76]Zhikang Dong, Apoorva Beedu, Jason Sheinkopf, Irfan Essa:
Mamba Fusion: Learning Actions Through Questioning. CoRR abs/2409.11513 (2024) - [i75]Karan Samel, Apoorva Beedu, Nitish Sontakke, Irfan Essa:
Exploring Efficient Foundational Multi-modal Models for Video Summarization. CoRR abs/2410.07405 (2024) - 2023
- [j25]Erik Wijmans, Manolis Savva, Irfan Essa, Stefan Lee, Ari S. Morcos, Dhruv Batra:
Emergence of Maps in the Memories of Blind Navigation Agents. AI Matters 9(2): 8-14 (2023) - [j24]K. Niranjan Kumar, Irfan Essa, Sehoon Ha:
Cascaded Compositional Residual Learning for Complex Interactive Behaviors. IEEE Robotics Autom. Lett. 8(8): 4601-4608 (2023) - [c154]Karan Samel, Jun Ma, Zhengyang Wang, Tong Zhao, Irfan Essa:
Integrating Noisy Knowledge into Language Representations for E-Commerce Applications. IEEE Big Data 2023: 548-553 - [c153]Vighnesh Birodkar, Jonathan Huang, Meera Hahn, Irfan Essa, Nikolai Warner:
Text and Click inputs for unambiguous open vocabulary instance segmentation. BMVC 2023: 815-819 - [c152]Yi-Hao Peng, Peggy Chi, Anjuli Kannan, Meredith Ringel Morris, Irfan Essa:
Slide Gestalt: Automatic Structure Extraction in Slide Decks for Non-Visual Access. CHI 2023: 829:1-829:14 - [c151]Dina Bashkirova, José Lezama, Kihyuk Sohn, Kate Saenko, Irfan Essa:
MaskSketch: Unpaired Structure-guided Masked Image Generation. CVPR 2023: 1879-1889 - [c150]Lijun Yu, Yong Cheng, Kihyuk Sohn, José Lezama, Han Zhang, Huiwen Chang, Alexander G. Hauptmann, Ming-Hsuan Yang, Yuan Hao, Irfan Essa, Lu Jiang:
MAGVIT: Masked Generative Video Transformer. CVPR 2023: 10459-10469 - [c149]Kihyuk Sohn, Huiwen Chang, José Lezama, Luisa Polania, Han Zhang, Yuan Hao, Irfan Essa, Lu Jiang:
Visual Prompt Tuning for Generative Transfer Learning. CVPR 2023: 19840-19851 - [c148]José Lezama, Tim Salimans, Lu Jiang, Huiwen Chang, Jonathan Ho, Irfan Essa:
Discrete Predictor-Corrector Diffusion Models for Image Synthesis. ICLR 2023 - [c147]Erik Wijmans, Manolis Savva, Irfan Essa, Stefan Lee, Ari S. Morcos, Dhruv Batra:
Emergence of Maps in the Memories of Blind Navigation Agents. ICLR 2023 - [c146]Kihyuk Sohn, Lu Jiang, Jarred Barber, Kimin Lee, Nataniel Ruiz, Dilip Krishnan, Huiwen Chang, Yuanzhen Li, Irfan Essa, Michael Rubinstein, Yuan Hao, Glenn Entis, Irina Blok, Daniel Castro Chin:
StyleDrop: Text-to-Image Synthesis of Any Style. NeurIPS 2023 - [c145]Lijun Yu, Yong Cheng, Zhiruo Wang, Vivek Kumar, Wolfgang Macherey, Yanping Huang, David A. Ross, Irfan Essa, Yonatan Bisk, Ming-Hsuan Yang, Kevin P. Murphy, Alexander G. Hauptmann, Lu Jiang:
SPAE: Semantic Pyramid AutoEncoder for Multimodal Generation with Frozen LLMs. NeurIPS 2023 - [c144]Harish Haresamudram, Irfan Essa, Thomas Plötz:
Investigating Enhancements to Contrastive Predictive Coding for Human Activity Recognition. PERCOM 2023: 232-241 - [i74]Erik Wijmans, Manolis Savva, Irfan Essa, Stefan Lee, Ari S. Morcos, Dhruv Batra:
Emergence of Maps in the Memories of Blind Navigation Agents. CoRR abs/2301.13261 (2023) - [i73]Dina Bashkirova, José Lezama, Kihyuk Sohn, Kate Saenko, Irfan Essa:
MaskSketch: Unpaired Structure-guided Masked Image Generation. CoRR abs/2302.05496 (2023) - [i72]Daniel Nkemelu, Harshil Shah, Irfan Essa, Michael L. Best:
Tackling Hate Speech in Low-resource Languages with Context Experts. CoRR abs/2303.16828 (2023) - [i71]Xingqian Xu, Jiayi Guo, Zhangyang Wang, Gao Huang, Irfan Essa, Humphrey Shi:
Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models. CoRR abs/2305.16223 (2023) - [i70]Kihyuk Sohn, Albert E. Shaw, Yuan Hao, Han Zhang, Luisa Polania, Huiwen Chang, Lu Jiang, Irfan Essa:
Learning Disentangled Prompts for Compositional Image Synthesis. CoRR abs/2306.00763 (2023) - [i69]Kihyuk Sohn, Nataniel Ruiz, Kimin Lee, Daniel Castro Chin, Irina Blok, Huiwen Chang, Jarred Barber, Lu Jiang, Glenn Entis, Yuanzhen Li, Yuan Hao, Irfan Essa, Michael Rubinstein, Dilip Krishnan:
StyleDrop: Text-to-Image Generation in Any Style. CoRR abs/2306.00983 (2023) - [i68]Harish Haresamudram, Irfan Essa, Thomas Ploetz:
Towards Learning Discrete Representations via Self-Supervision for Wearables-Based Human Activity Recognition. CoRR abs/2306.01108 (2023) - [i67]Lijun Yu, Yong Cheng, Zhiruo Wang, Vivek Kumar, Wolfgang Macherey, Yanping Huang, David A. Ross, Irfan Essa, Yonatan Bisk, Ming-Hsuan Yang, Kevin Murphy, Alexander G. Hauptmann, Lu Jiang:
SPAE: Semantic Pyramid AutoEncoder for Multimodal Generation with Frozen LLMs. CoRR abs/2306.17842 (2023) - [i66]Hyeongju Choi, Apoorva Beedu, Irfan Essa:
Multimodal Contrastive Learning with Hard Negative Sampling for Human Activity Recognition. CoRR abs/2309.01262 (2023) - [i65]Daniel Nkemelu, Peggy Chi, Daniel Castro Chin, Krishna Srinivasan, Irfan Essa:
Automatic Multi-Path Web Story Creation from a Structural Article. CoRR abs/2310.02383 (2023) - [i64]Lijun Yu, José Lezama, Nitesh Bharadwaj Gundavarapu, Luca Versari, Kihyuk Sohn, David Minnen, Yong Cheng, Agrim Gupta, Xiuye Gu, Alexander G. Hauptmann, Boqing Gong, Ming-Hsuan Yang, Irfan Essa, David A. Ross, Lu Jiang:
Language Model Beats Diffusion - Tokenizer is Key to Visual Generation. CoRR abs/2310.05737 (2023) - [i63]K. Niranjan Kumar, Irfan Essa, Sehoon Ha:
Words into Action: Learning Diverse Humanoid Robot Behaviors using Language Guided Iterative Motion Refinement. CoRR abs/2310.06226 (2023) - [i62]Tianle Huang, Nitish Sontakke, K. Niranjan Kumar, Irfan Essa, Stefanos Nikolaidis, Dennis W. Hong, Sehoon Ha:
BayRnTune: Adaptive Bayesian Domain Randomization via Strategic Fine-tuning. CoRR abs/2310.10606 (2023) - [i61]Nikolai Warner, Meera Hahn, Jonathan Huang, Irfan Essa, Vighnesh Birodkar:
Text and Click inputs for unambiguous open vocabulary instance segmentation. CoRR abs/2311.14822 (2023) - [i60]Agrim Gupta, Lijun Yu, Kihyuk Sohn, Xiuye Gu, Meera Hahn, Li Fei-Fei, Irfan Essa, Lu Jiang, José Lezama:
Photorealistic Video Generation with Diffusion Models. CoRR abs/2312.06662 (2023) - [i59]Dan Kondratyuk, Lijun Yu, Xiuye Gu, José Lezama, Jonathan Huang, Rachel Hornung, Hartwig Adam, Hassan Akbari, Yair Alon, Vighnesh Birodkar, Yong Cheng, Ming-Chang Chiu, Joshua V. Dillon, Irfan Essa, Agrim Gupta, Meera Hahn, Anja Hauth, David Hendon, Alonso Martinez, David Minnen, David A. Ross, Grant Schindler, Mikhail Sirotenko, Kihyuk Sohn, Krishna Somandepalli, Huisheng Wang, Jimmy Yan, Ming-Hsuan Yang, Xuan Yang, Bryan Seybold, Lu Jiang:
VideoPoet: A Large Language Model for Zero-Shot Video Generation. CoRR abs/2312.14125 (2023) - 2022
- [j23]Harish Haresamudram, Irfan Essa, Thomas Plötz:
Assessing the State of Self-Supervised Human Activity Recognition Using Wearables. Proc. ACM Interact. Mob. Wearable Ubiquitous Technol. 6(3): 116:1-116:47 (2022) - [c143]Erik Wijmans, Irfan Essa, Dhruv Batra:
How to Train PointGoal Navigation Agents on a (Sample and Compute) Budget. AAMAS 2022: 1762-1764 - [c142]José Lezama, Huiwen Chang, Lu Jiang, Irfan Essa:
Improved Masked Image Generation with Token-Critic. ECCV (23) 2022: 70-86 - [c141]Xiang Kong, Lu Jiang, Huiwen Chang, Han Zhang, Yuan Hao, Haifeng Gong, Irfan Essa:
BLT: Bidirectional Layout Transformer for Controllable Layout Generation. ECCV (17) 2022: 474-490 - [c140]Chengzhi Mao, Lu Jiang, Mostafa Dehghani, Carl Vondrick, Rahul Sukthankar, Irfan Essa:
Discrete Representations Strengthen Vision Transformer Robustness. ICLR 2022 - [c139]K. Niranjan Kumar, Irfan Essa, Sehoon Ha:
Graph-based Cluttered Scene Generation and Interactive Exploration using Deep Reinforcement Learning. ICRA 2022: 7521-7527 - [c138]Erik Wijmans, Irfan Essa, Dhruv Batra:
VER: Scaling On-Policy RL Leads to the Emergence of Navigation in Embodied Rearrangement. NeurIPS 2022 - [c137]Peggy Chi, Tao Dong, Christian Früh, Brian Colonna, Vivek Kwatra, Irfan Essa:
Synthesis-Assisted Video Prototyping From a Document. UIST 2022: 16:1-16:10 - [c136]Steven Hickson, Karthik Raveendran, Irfan A. Essa:
Sharing Decoders: Network Fission for Multi-task Pixel Prediction. WACV 2022: 3655-3664 - [i58]Karan Samel, Zelin Zhao, Binghong Chen, Shuang Li, Dharmashankar Subramanian, Irfan Essa, Le Song:
Learning Temporal Rules from Noisy Timeseries Data. CoRR abs/2202.05403 (2022) - [i57]Harish Haresamudram, Irfan Essa, Thomas Plötz:
Assessing the State of Self-Supervised Human Activity Recognition using Wearables. CoRR abs/2202.12938 (2022) - [i56]José Lezama, Huiwen Chang, Lu Jiang, Irfan Essa:
Improved Masked Image Generation with Token-Critic. CoRR abs/2209.04439 (2022) - [i55]Kihyuk Sohn, Yuan Hao, José Lezama, Luisa Polania, Huiwen Chang, Han Zhang, Irfan Essa, Lu Jiang:
Visual Prompt Tuning for Generative Transfer Learning. CoRR abs/2210.00990 (2022) - [i54]Erik Wijmans, Irfan Essa, Dhruv Batra:
VER: Scaling On-Policy RL Leads to the Emergence of Navigation in Embodied Rearrangement. CoRR abs/2210.05064 (2022) - [i53]Daniel Scarafoni, Irfan Essa, Thomas Ploetz:
Finding Islands of Predictability in Action Forecasting. CoRR abs/2210.07354 (2022) - [i52]Apoorva Beedu, Huda AlAmri, Irfan Essa:
Video based Object 6D Pose Estimation using Transformers. CoRR abs/2210.13540 (2022) - [i51]Huda AlAmri, Anthony Bilic, Michael Hu, Apoorva Beedu, Irfan Essa:
End-to-End Multimodal Representation Learning for Video Dialog. CoRR abs/2210.14512 (2022) - [i50]Hyeongju Choi, Apoorva Beedu, Harish Haresamudram, Irfan Essa:
Multi-Stage Based Feature Fusion of Multi-Modal Data for Human Activity Recognition. CoRR abs/2211.04331 (2022) - [i49]Harish Haresamudram, Irfan Essa, Thomas Ploetz:
Investigating Enhancements to Contrastive Predictive Coding for Human Activity Recognition. CoRR abs/2211.06173 (2022) - [i48]Lijun Yu, Yong Cheng, Kihyuk Sohn, José Lezama, Han Zhang, Huiwen Chang, Alexander G. Hauptmann, Ming-Hsuan Yang, Yuan Hao, Irfan Essa, Lu Jiang:
MAGVIT: Masked Generative Video Transformer. CoRR abs/2212.05199 (2022) - [i47]K. Niranjan Kumar, Irfan Essa, Sehoon Ha:
Cascaded Compositional Residual Learning for Complex Interactive Behaviors. CoRR abs/2212.08954 (2022) - 2021
- [j22]Harish Haresamudram, Irfan A. Essa, Thomas Plötz:
Contrastive Predictive Coding for Human Activity Recognition. Proc. ACM Interact. Mob. Wearable Ubiquitous Technol. 5(2): 65:1-65:26 (2021) - [c135]Vincent Cartillier, Zhile Ren, Neha Jain, Stefan Lee, Irfan Essa, Dhruv Batra:
Semantic MapNet: Building Allocentric Semantic Maps and Representations from Egocentric Views. AAAI 2021: 964-972 - [c134]A. J. Piergiovanni, Anelia Angelova, Michael S. Ryoo, Irfan Essa:
Unsupervised Discovery of Actions in Instructional Videos. BMVC 2021: 283 - [c133]Anh Truong, Peggy Chi, David Salesin, Irfan Essa, Maneesh Agrawala:
Automatic Generation of Two-Level Hierarchical Tutorials from Instructional Makeup Videos. CHI 2021: 108:1-108:16 - [c132]Tianhao Zhang, Hung-Yu Tseng, Lu Jiang, Weilong Yang, Honglak Lee, Irfan Essa:
Text as Neural Operator: Image Manipulation by Text Instruction. ACM Multimedia 2021: 1893-1902 - [c131]Peggy Chi, Nathan Frey, Katrina Panovich, Irfan Essa:
Automatic Instructional Video Creation from a Markdown-Formatted Tutorial. UIST 2021: 677-690 - [i46]Dan Scarafoni, Irfan Essa, Thomas Ploetz:
PLAN-B: Predicting Likely Alternative Next Best Sequences for Action Prediction. CoRR abs/2103.15987 (2021) - [i45]Nathan Frey, Peggy Chi, Weilong Yang, Irfan Essa:
Automatic Non-Linear Video Editing Transfer. CoRR abs/2105.06988 (2021) - [i44]A. J. Piergiovanni, Anelia Angelova, Michael S. Ryoo, Irfan A. Essa:
Unsupervised Action Segmentation for Instructional Videos. CoRR abs/2106.03738 (2021) - [i43]A. J. Piergiovanni, Anelia Angelova, Michael S. Ryoo, Irfan A. Essa:
Unsupervised Discovery of Actions in Instructional Videos. CoRR abs/2106.14733 (2021) - [i42]K. Niranjan Kumar, Irfan Essa, Sehoon Ha:
Graph-based Cluttered Scene Generation and Interactive Exploration using Deep Reinforcement Learning. CoRR abs/2109.10460 (2021) - [i41]Chengzhi Mao, Lu Jiang, Mostafa Dehghani, Carl Vondrick, Rahul Sukthankar, Irfan Essa:
Discrete Representations Strengthen Vision Transformer Robustness. CoRR abs/2111.10493 (2021) - [i40]Apoorva Beedu, Zhile Ren, Varun Agrawal, Irfan Essa:
VideoPose: Estimating 6D object pose from videos. CoRR abs/2111.10677 (2021) - [i39]Xiang Kong, Lu Jiang, Huiwen Chang, Han Zhang, Yuan Hao, Haifeng Gong, Irfan Essa:
BLT: Bidirectional Layout Transformer for Controllable Layout Generation. CoRR abs/2112.05112 (2021) - 2020
- [c130]Hsin-Ying Lee, Lu Jiang, Irfan Essa, Phuong B. Le, Haifeng Gong, Ming-Hsuan Yang, Weilong Yang:
Neural Design Network: Graphic Layout Generation with Constraints. ECCV (3) 2020: 491-506 - [c129]Erik Wijmans, Abhishek Kadian, Ari Morcos, Stefan Lee, Irfan Essa, Devi Parikh, Manolis Savva, Dhruv Batra:
DD-PPO: Learning Near-Perfect PointGoal Navigators from 2.5 Billion Frames. ICLR 2020 - [c128]Harish Haresamudram, Apoorva Beedu, Varun Agrawal, Patrick L. Grady, Irfan A. Essa, Judy Hoffman, Thomas Plötz:
Masked reconstruction based self-supervision for human activity recognition. ISWC 2020: 45-49 - [c127]Peggy Chi, Zheng Sun, Katrina Panovich, Irfan Essa:
Automatic Video Creation From a Web Page. UIST 2020: 279-292 - [i38]Erik Wijmans, Julian Straub, Dhruv Batra, Irfan Essa, Judy Hoffman, Ari Morcos:
Analyzing Visual Representations in Embodied Navigation Tasks. CoRR abs/2003.05993 (2020) - [i37]Tianhao Zhang, Hung-Yu Tseng, Lu Jiang, Honglak Lee, Irfan Essa, Weilong Yang:
Text as Neural Operator: Image Manipulation by Text Instruction. CoRR abs/2008.04556 (2020) - [i36]Vincent Cartillier, Zhile Ren, Neha Jain, Stefan Lee, Irfan Essa, Dhruv Batra:
Semantic MapNet: Building Allocentric SemanticMaps and Representations from Egocentric Views. CoRR abs/2010.01191 (2020) - [i35]Harish Haresamudram, Irfan A. Essa, Thomas Ploetz:
Contrastive Predictive Coding for Human Activity Recognition. CoRR abs/2012.05333 (2020) - [i34]Erik Wijmans, Irfan Essa, Dhruv Batra:
How to Train PointGoal Navigation Agents on a (Sample and Compute) Budget. CoRR abs/2012.06117 (2020)
2010 – 2019
- 2019
- [j21]Aneeq Zia, Liheng Guo, Linlin Zhou, Irfan A. Essa, Anthony M. Jarc:
Novel evaluation of surgical activity recognition models using task-based efficiency metrics. Int. J. Comput. Assist. Radiol. Surg. 14(12): 2155-2163 (2019) - [c126]Erik Wijmans, Samyak Datta, Oleksandr Maksymets, Abhishek Das, Georgia Gkioxari, Stefan Lee, Irfan Essa, Devi Parikh, Dhruv Batra:
Embodied Question Answering in Photorealistic Environments With Point Cloud Perception. CVPR 2019: 6659-6668 - [c125]Huda AlAmri, Vincent Cartillier, Abhishek Das, Jue Wang, Anoop Cherian, Irfan Essa, Dhruv Batra, Tim K. Marks, Chiori Hori, Peter Anderson, Stefan Lee, Devi Parikh:
Audio Visual Scene-Aware Dialog. CVPR 2019: 7558-7567 - [c124]Chiori Hori, Huda AlAmri, Jue Wang, Gordon Wichern, Takaaki Hori, Anoop Cherian, Tim K. Marks, Vincent Cartillier, Raphael Gontijo Lopes, Abhishek Das, Irfan Essa, Dhruv Batra, Devi Parikh:
End-to-end Audio Visual Scene-aware Dialog Using Multimodal Attention-based Video Features. ICASSP 2019: 2352-2356 - [c123]Steven Hickson, Karthik Raveendran, Alireza Fathi, Kevin Murphy, Irfan A. Essa:
Floors are Flat: Leveraging Semantics for Real-Time Surface Normal Prediction. ICCV Workshops 2019: 4065-4074 - [c122]Luke Drnach, Jessica L. Allen, Irfan Essa, Lena H. Ting:
A Data-Driven Predictive Model of Individual-Specific Effects of FES on Human Gait Dynamics. ICRA 2019: 5090-5096 - [c121]Unaiza Ahsan, Rishi Madhok, Irfan A. Essa:
Video Jigsaw: Unsupervised Learning of Spatiotemporal Context for Video Action Recognition. WACV 2019: 179-189 - [c120]Steven Hickson, Nick Dufour, Avneesh Sud, Vivek Kwatra, Irfan A. Essa:
Eyemotion: Classifying Facial Expressions in VR Using Eye-Tracking Cameras. WACV 2019: 1626-1635 - [i33]Huda AlAmri, Vincent Cartillier, Abhishek Das, Jue Wang, Stefan Lee, Peter Anderson, Irfan Essa, Devi Parikh, Dhruv Batra, Anoop Cherian, Tim K. Marks, Chiori Hori:
Audio-Visual Scene-Aware Dialog. CoRR abs/1901.09107 (2019) - [i32]Erik Wijmans, Samyak Datta, Oleksandr Maksymets, Abhishek Das, Georgia Gkioxari, Stefan Lee, Irfan Essa, Devi Parikh, Dhruv Batra:
Embodied Question Answering in Photorealistic Environments with Point Cloud Perception. CoRR abs/1904.03461 (2019) - [i31]Steven Hickson, Karthik Raveendran, Alireza Fathi, Kevin Murphy, Irfan A. Essa:
Floors are Flat: Leveraging Semantics for Real-Time Surface Normal Prediction. CoRR abs/1906.06792 (2019) - [i30]Aneeq Zia, Liheng Guo, Linlin Zhou, Irfan A. Essa, Anthony M. Jarc:
Novel evaluation of surgical activity recognition models using task-based efficiency metrics. CoRR abs/1907.02060 (2019) - [i29]Niranjan Kumar Kannabiran, Irfan Essa, C. Karen Liu:
Estimating Mass Distribution of Articulated Objects through Physical Interaction. CoRR abs/1907.03964 (2019) - [i28]Erik Wijmans, Abhishek Kadian, Ari Morcos, Stefan Lee, Irfan Essa, Devi Parikh, Manolis Savva, Dhruv Batra:
Decentralized Distributed PPO: Solving PointGoal Navigation. CoRR abs/1911.00357 (2019) - [i27]Hsin-Ying Lee, Weilong Yang, Lu Jiang, Madison Le, Irfan Essa, Haifeng Gong, Ming-Hsuan Yang:
Neural Design Network: Graphic Layout Generation with Constraints. CoRR abs/1912.09421 (2019) - 2018
- [j20]Aneeq Zia, Yachna Sharma, Vinay Bettadapura, Eric L. Sarin, Irfan A. Essa:
Video and accelerometer-based motion analysis for automated surgical skills assessment. Int. J. Comput. Assist. Radiol. Surg. 13(3): 443-455 (2018) - [j19]Aneeq Zia, Irfan A. Essa:
Automated surgical skill assessment in RMIS training. Int. J. Comput. Assist. Radiol. Surg. 13(5): 731-739 (2018) - [c119]Luke Drnach, Irfan Essa, Lena H. Ting:
Identifying Gait Phases from Joint Kinematics during Walking with Switched Linear Dynamical Systems. BioRob 2018: 1181-1186 - [c118]Aneeq Zia, Andrew Hung, Irfan A. Essa, Anthony M. Jarc:
Surgical Activity Recognition in Robot-Assisted Radical Prostatectomy Using Deep Learning. MICCAI (4) 2018: 273-280 - [c117]Erkam Uzun, Simon Pak Ho Chung, Irfan Essa, Wenke Lee:
rtCaptcha: A Real-Time CAPTCHA Based Liveness Detection System. NDSS 2018 - [i26]Unaiza Ahsan, Chen Sun, Irfan A. Essa:
DiscrimNet: Semi-Supervised Action Recognition from Videos using Generative Adversarial Networks. CoRR abs/1801.07230 (2018) - [i25]Daniel Castro, Steven Hickson, Patsorn Sangkloy, Bhavishya Mittal, Sean Dai, James Hays, Irfan A. Essa:
Let's Dance: Learning From Online Dance Videos. CoRR abs/1801.07388 (2018) - [i24]Steven Hickson, Stan Birchfield, Irfan A. Essa, Henrik I. Christensen:
Efficient Hierarchical Graph-Based Segmentation of RGBD Videos. CoRR abs/1801.08981 (2018) - [i23]Steven Hickson, Anelia Angelova, Irfan A. Essa, Rahul Sukthankar:
Object category learning and retrieval with weak supervision. CoRR abs/1801.08985 (2018) - [i22]Aneeq Zia, Andrew Hung, Irfan A. Essa, Anthony M. Jarc:
Surgical Activity Recognition in Robot-Assisted Radical Prostatectomy using Deep Learning. CoRR abs/1806.00466 (2018) - [i21]Huda AlAmri, Vincent Cartillier, Raphael Gontijo Lopes, Abhishek Das, Jue Wang, Irfan Essa, Dhruv Batra, Devi Parikh, Anoop Cherian, Tim K. Marks, Chiori Hori:
Audio Visual Scene-Aware Dialog (AVSD) Challenge at DSTC7. CoRR abs/1806.00525 (2018) - [i20]Chiori Hori, Huda AlAmri, Jue Wang, Gordon Wichern, Takaaki Hori, Anoop Cherian, Tim K. Marks, Vincent Cartillier, Raphael Gontijo Lopes, Abhishek Das, Irfan Essa, Dhruv Batra, Devi Parikh:
End-to-End Audio Visual Scene-Aware Dialog using Multimodal Attention-Based Video Features. CoRR abs/1806.08409 (2018) - [i19]Unaiza Ahsan, Rishi Madhok, Irfan A. Essa:
Video Jigsaw: Unsupervised Learning of Spatiotemporal Context for Video Action Recognition. CoRR abs/1808.07507 (2018) - [i18]Jonathan C. Balloch, Varun Agrawal, Irfan Essa, Sonia Chernova:
Unbiasing Semantic Segmentation For Robot Perception using Synthetic Data Feature Transfer. CoRR abs/1809.03676 (2018) - 2017
- [j18]Thomas B. Moeslund, Graham A. Thomas, Adrian Hilton, Peter Carr, Irfan Essa:
Computer Vision in Sports. Comput. Vis. Image Underst. 159: 1-2 (2017) - [c116]Amirreza Shaban, Shray Bansal, Zhen Liu, Irfan Essa, Byron Boots:
One-Shot Learning for Semantic Segmentation. BMVC 2017 - [c115]Julia Deeb-Swihart, Christopher Polack, Eric Gilbert, Irfan A. Essa:
Selfie-Presentation in Everyday Life: A Large-Scale Characterization of Selfie Contexts on Instagram. ICWSM 2017: 42-51 - [c114]Unaiza Ahsan, Munmun De Choudhury, Irfan A. Essa:
Towards using visual attributes to infer image sentiment of social events. IJCNN 2017: 1372-1379 - [c113]Edison Thomaz, Abdelkareem Bedri, Temiloluwa Prioleau, Irfan A. Essa, Gregory D. Abowd:
Exploring Symmetric and Asymmetric Bimanual Eating Detection with Inertial Sensors on the Wrist. DigitalBioMarker@MobiSys 2017: 21-26 - [c112]Unaiza Ahsan, Chen Sun, James Hays, Irfan A. Essa:
Complex Event Recognition from Images with Few Training Examples. WACV 2017: 669-678 - [p1]Edison Thomaz, Irfan A. Essa, Gregory D. Abowd:
Challenges and Opportunities in Automated Detection of Eating Activity. Mobile Health - Sensors, Analytic Methods, and Applications 2017: 151-174 - [i17]Unaiza Ahsan, Chen Sun, James Hays, Irfan A. Essa:
Complex Event Recognition from Images with Few Training Examples. CoRR abs/1701.04769 (2017) - [i16]Aneeq Zia, Yachna Sharma, Vinay Bettadapura, Eric L. Sarin, Irfan A. Essa:
Video and Accelerometer-Based Motion Analysis for Automated Surgical Skills Assessment. CoRR abs/1702.07772 (2017) - [i15]Steven Hickson, Nick Dufour, Avneesh Sud, Vivek Kwatra, Irfan A. Essa:
Eyemotion: Classifying facial expressions in VR using eye-tracking cameras. CoRR abs/1707.07204 (2017) - [i14]Steven Hickson, Irfan A. Essa, Henrik I. Christensen:
Semantic Instance Labeling Leveraging Hierarchical Segmentation. CoRR abs/1708.00946 (2017) - [i13]Amirreza Shaban, Shray Bansal, Zhen Liu, Irfan Essa, Byron Boots:
One-Shot Learning for Semantic Segmentation. CoRR abs/1709.03410 (2017) - [i12]Aneeq Zia, Irfan A. Essa:
Automated Surgical Skill Assessment in RMIS Training. CoRR abs/1712.08604 (2017) - 2016
- [j17]Aneeq Zia, Yachna Sharma, Vinay Bettadapura, Eric L. Sarin, Thomas Ploetz, Mark A. Clements, Irfan A. Essa:
Automated video-based assessment of surgical skills for training and evaluation in medical schools. Int. J. Comput. Assist. Radiol. Surg. 11(9): 1623-1636 (2016) - [c111]Vinay Bettadapura, Caroline Pantofaru, Irfan A. Essa:
Leveraging Contextual Cues for Generating Basketball Highlights. ACM Multimedia 2016: 908-917 - [c110]Vinay Bettadapura, Daniel Castro, Irfan A. Essa:
Discovering picturesque highlights from egocentric vacation videos. WACV 2016: 1-9 - [i11]Vinay Bettadapura, Daniel Castro, Irfan A. Essa:
Discovering Picturesque Highlights from Egocentric Vacation Videos. CoRR abs/1601.04406 (2016) - [i10]Vinay Bettadapura, Caroline Pantofaru, Irfan A. Essa:
Leveraging Contextual Cues for Generating Basketball Highlights. CoRR abs/1606.08955 (2016) - 2015
- [c109]Edison Thomaz, Irfan A. Essa, Gregory D. Abowd:
A practical approach for recognizing eating moments with wrist-mounted inertial sensing. UbiComp 2015: 1029-1040 - [c108]Daniel Castro, Steven Hickson, Vinay Bettadapura, Edison Thomaz, Gregory D. Abowd, Henrik I. Christensen, Irfan A. Essa:
Predicting daily activities from egocentric images using deep learning. ISWC 2015: 75-82 - [c107]Edison Thomaz, Cheng Zhang, Irfan A. Essa, Gregory D. Abowd:
Inferring Meal Eating Activities in Real World Settings from Ambient Sounds: A Feasibility Study. IUI 2015: 427-431 - [c106]Aneeq Zia, Yachna Sharma, Vinay Bettadapura, Eric L. Sarin, Mark A. Clements, Irfan A. Essa:
Automated Assessment of Surgical Skills Using Frequency Analysis. MICCAI (1) 2015: 430-438 - [c105]Vinay Bettadapura, Edison Thomaz, Aman Parnami, Gregory D. Abowd, Irfan A. Essa:
Leveraging Context to Support Automated Food Recognition in Restaurants. WACV 2015: 580-587 - [c104]Vinay Bettadapura, Irfan A. Essa, Caroline Pantofaru:
Egocentric Field-of-View Localization Using First-Person Point-of-View Devices. WACV 2015: 626-633 - [c103]S. Hussain Raza, Ahmad Humayun, Irfan A. Essa, Matthias Grundmann, David Anderson:
Finding Temporally Consistent Occlusion Boundaries in Videos Using Geometric Context. WACV 2015: 1022-1029 - [c102]Steven Hickson, Irfan A. Essa, Henrik I. Christensen:
Semantic Instance Labeling Leveraging Hierarchical Segmentation. WACV 2015: 1068-1075 - [i9]Daniel Castro, Steven Hickson, Vinay Bettadapura, Edison Thomaz, Gregory D. Abowd, Henrik I. Christensen, Irfan A. Essa:
Predicting Daily Activities From Egocentric Images Using Deep Learning. CoRR abs/1510.01576 (2015) - [i8]Vinay Bettadapura, Grant Schindler, Thomas Plötz, Irfan A. Essa:
Augmenting Bag-of-Words: Data-Driven Discovery of Temporal and Structural Information for Activity Recognition. CoRR abs/1510.02071 (2015) - [i7]Vinay Bettadapura, Irfan A. Essa, Caroline Pantofaru:
Egocentric Field-of-View Localization Using First-Person Point-of-View Devices. CoRR abs/1510.02073 (2015) - [i6]Vinay Bettadapura, Edison Thomaz, Aman Parnami, Gregory D. Abowd, Irfan A. Essa:
Leveraging Context to Support Automated Food Recognition in Restaurants. CoRR abs/1510.02078 (2015) - [i5]S. Hussain Raza, Omar Javed, Aveek Das, Harpreet S. Sawhney, Hui Cheng, Irfan A. Essa:
Depth Extraction from Videos Using Geometric Context and Occlusion Boundaries. CoRR abs/1510.07317 (2015) - [i4]S. Hussain Raza, Matthias Grundmann, Irfan A. Essa:
Geometric Context from Videos. CoRR abs/1510.07320 (2015) - [i3]S. Hussain Raza, Ahmad Humayun, Matthias Grundmann, David Anderson, Irfan A. Essa:
Finding Temporally Consistent Occlusion Boundaries in Videos using Geometric Context. CoRR abs/1510.07323 (2015) - 2014
- [j16]Raffay Hamid, Ramkrishan K. Kumar, Jessica K. Hodgins, Irfan A. Essa:
A visualization framework for team sports captured using multiple static cameras. Comput. Vis. Image Underst. 118: 171-183 (2014) - [c101]Syed Raza, Omar Javed, Aveek Das, Harpreet S. Sawhney, Hui Cheng, Irfan A. Essa:
Depth Extraction from Videos Using Geometric Context and Occlusion Boundaries. BMVC 2014 - [c100]Steven Hickson, Stan Birchfield, Irfan A. Essa, Henrik I. Christensen:
Efficient Hierarchical Graph-Based Segmentation of RGBD Videos. CVPR 2014: 344-351 - [c99]Unaiza Ahsan, Irfan A. Essa:
Clustering Social Event Images Using Kernel Canonical Correlation Analysis. CVPR Workshops 2014: 814-819 - [c98]Jonathan Bidwell, Irfan A. Essa, Agata Rozga, Gregory D. Abowd:
Measuring Child Visual Attention using Markerless Head Tracking from Color and Depth Sensing Cameras. ICMI 2014: 447-454 - [c97]Yachna Sharma, Thomas Plötz, Nils Y. Hammerla, Sebastian Mellor, Roisin McNaney, Patrick Olivier, Sandeep Deshmukh, Andrew McCaskie, Irfan A. Essa:
Automated surgical OSATS prediction from videos. ISBI 2014: 461-464 - 2013
- [c96]Seungyeon Kim, Fuxin Li, Guy Lebanon, Irfan A. Essa:
Beyond Sentiment: The Manifold of Human Emotions. AISTATS 2013: 360-369 - [c95]Vinay Bettadapura, Grant Schindler, Thomas Ploetz, Irfan A. Essa:
Augmenting Bag-of-Words: Data-Driven Discovery of Temporal and Structural Information for Activity Recognition. CVPR 2013: 2619-2626 - [c94]S. Hussain Raza, Matthias Grundmann, Irfan A. Essa:
Geometric Context from Videos. CVPR 2013: 3081-3088 - [c93]James M. Rehg, Gregory D. Abowd, Agata Rozga, Mario Romero, Mark A. Clements, Stan Sclaroff, Irfan A. Essa, Opal Y. Ousley, Yin Li, Chanho Kim, Hrishikesh Rao, Jonathan C. Kim, Liliana Lo Presti, Jianming Zhang, Denis Lantsman, Jonathan Bidwell, Zhefan Ye:
Decoding Children's Social Behavior. CVPR 2013: 3414-3421 - [c92]Edison Thomaz, Aman Parnami, Jonathan Bidwell, Irfan A. Essa, Gregory D. Abowd:
Technological approaches for addressing privacy concerns when recognizing eating behaviors with wearable cameras. UbiComp 2013: 739-748 - [c91]Matthias Grundmann, Chris McClanahan, Sing Bing Kang, Irfan A. Essa:
Post-processing approach for radiometric self-calibration of video. ICCP 2013: 1-9 - [c90]Ted E. Senator, Henry G. Goldberg, Alex Memory, William T. Young, Brad Rees, Robert Pierce, Daniel Huang, Matthew Reardon, David A. Bader, Edmond Chow, Irfan A. Essa, Joshua Jones, Vinay Bettadapura, Duen Horng Chau, Oded Green, Oguz Kaya, Anita Zakrzewska, Erica Briscoe, Rudolph L. Mappus IV, Robert McColl, Lora Weiss, Thomas G. Dietterich, Alan Fern, Weng-Keen Wong, Shubhomoy Das, Andrew Emmott, Jed Irvine, Jay Yoon Lee, Danai Koutra, Christos Faloutsos, Daniel D. Corkill, Lisa Friedland, Amanda Gentzel, David D. Jensen:
Detecting insider threats in a real corporate database of computer usage activity. KDD 2013: 1393-1401 - [c89]Edison Thomaz, Aman Parnami, Irfan A. Essa, Gregory D. Abowd:
Feasibility of identifying eating moments from first-person images leveraging human computation. SenseCam 2013: 26-33 - [c88]Seungyeon Kim, Fuxin Li, Guy Lebanon, Irfan A. Essa:
The Manifold of Human Emotions. ICLR (Workshop Poster) 2013 - 2012
- [c87]Kihwan Kim, Dongryeol Lee, Irfan A. Essa:
Detecting regions of interest in dynamic scenes with camera motions. CVPR 2012: 1258-1265 - [c86]Glenn Hartmann, Matthias Grundmann, Judy Hoffman, David Tsai, Vivek Kwatra, Omid Madani, Sudheendra Vijayanarasimhan, Irfan A. Essa, James M. Rehg, Rahul Sukthankar:
Weakly Supervised Learning of Object Segmentations from Web-Scale Video. ECCV Workshops (1) 2012: 198-208 - [c85]Edison Thomaz, Vinay Bettadapura, Gabriel Reyes, Megha Sandesh, Grant Schindler, Thomas Plötz, Gregory D. Abowd, Irfan A. Essa:
Recognizing water-based activities in the home through infrastructure-mediated sensing. UbiComp 2012: 85-94 - [c84]Jing Wang, Grant Schindler, Irfan A. Essa:
Orientation-aware scene understanding for mobile cameras. UbiComp 2012: 260-269 - [c83]Matthias Grundmann, Vivek Kwatra, Daniel Castro, Irfan A. Essa:
Calibration-free rolling shutter removal. ICCP 2012: 1-8 - [c82]Neil Dantam, Irfan A. Essa, Mike Stilman:
Linguistic transfer of human assembly tasks to robots. IROS 2012: 237-242 - [i2]Seungyeon Kim, Fuxin Li, Guy Lebanon, Irfan A. Essa:
Beyond Sentiment: The Manifold of Human Emotions. CoRR abs/1202.1568 (2012) - [i1]Rafay Hammid, Siddhartha Maddi, Amos Y. Johnson, Aaron F. Bobick, Irfan A. Essa, Charles Lee Isbell Jr.:
Unsupervised Activity Discovery and Characterization From Event-Streams. CoRR abs/1207.1381 (2012) - 2011
- [j15]Pei Yin, Antonio Criminisi, John M. Winn, Irfan A. Essa:
Bilayer Segmentation of Webcam Videos Using Tree-Based Classifiers. IEEE Trans. Pattern Anal. Mach. Intell. 33(1): 30-42 (2011) - [j14]Irfan A. Essa, Sing Bing Kang, Marc Pollefeys:
Guest Editors' Introduction to the Special Section on Award-Winning Papers from the IEEE Conference on Computer Vision and Pattern Recognition 2009 (CVPR 2009). IEEE Trans. Pattern Anal. Mach. Intell. 33(12): 2339-2340 (2011) - [j13]Kihwan Kim, Sangmin Oh, Jeonggyu Lee, Irfan A. Essa:
Augmenting aerial earth maps with dynamic information from videos. Virtual Real. 15(2-3): 185-200 (2011) - [c81]Matthias Grundmann, Vivek Kwatra, Irfan A. Essa:
Auto-directed video stabilization with robust L1 optimal camera paths. CVPR 2011: 225-232 - [c80]Kihwan Kim, Dongryeol Lee, Irfan A. Essa:
Gaussian process regression flow for analysis of motion trajectories. ICCV 2011: 1164-1171 - 2010
- [j12]Nipun Kwatra, Christopher Wojtan, Mark Carlson, Irfan A. Essa, Peter J. Mucha, Greg Turk:
Fluid Simulation with Articulated Bodies. IEEE Trans. Vis. Comput. Graph. 16(1): 70-80 (2010) - [c79]Matthias Grundmann, Vivek Kwatra, Mei Han, Irfan A. Essa:
Discontinuous seam-carving for video retargeting. CVPR 2010: 569-576 - [c78]Raffay Hamid, Ramkrishan K. Kumar, Matthias Grundmann, Kihwan Kim, Irfan A. Essa, Jessica K. Hodgins:
Player localization using multiple static cameras for sports visualization. CVPR 2010: 731-738 - [c77]Kihwan Kim, Matthias Grundmann, Ariel Shamir, Iain A. Matthews, Jessica K. Hodgins, Irfan A. Essa:
Motion fields to predict play evolution in dynamic sport scenes. CVPR 2010: 840-847 - [c76]Matthias Grundmann, Vivek Kwatra, Mei Han, Irfan A. Essa:
Efficient hierarchical graph-based video segmentation. CVPR 2010: 2141-2148 - [c75]Nicholas Diakopoulos, Irfan A. Essa:
Modulating video credibility via visualization of quality evaluations. WICOW 2010: 75-82
2000 – 2009
- 2009
- [j11]Raffay Hamid, Siddhartha Maddi, Amos Y. Johnson, Aaron F. Bobick, Irfan A. Essa, Charles Lee Isbell Jr.:
A novel sequence representation for unsupervised analysis of human activities. Artif. Intell. 173(14): 1221-1244 (2009) - [j10]Radu Bogdan Rusu, Jan Bandouch, Franziska Meier, Irfan A. Essa, Michael Beetz:
Human Action Recognition Using Global Point Feature Histograms and Action Shapes. Adv. Robotics 23(14): 1873-1908 (2009) - [c74]Nicholas Diakopoulos, Sergio Goldenberg, Irfan A. Essa:
Videolyzer: quality analysis of online informational video for bloggers and journalists. CHI 2009: 799-808 - [c73]Pei Yin, Thad Starner, Harley Hamilton, Irfan A. Essa, James M. Rehg:
Learning the basic units in American Sign Language using discriminative segmental feature selection. ICASSP 2009: 4757-4760 - [c72]Kihwan Kim, Sangmin Oh, Jeonggyu Lee, Irfan A. Essa:
Augmenting Aerial Earth Maps with dynamic information. ISMAR 2009: 35-38 - [c71]Matthew Flagg, Atsushi Nakazawa, Qiushuang Zhang, Sing Bing Kang, Young Kee Ryu, Irfan A. Essa, James M. Rehg:
Human video textures. SI3D 2009: 199-206 - 2008
- [c70]Irfan A. Essa:
Computational photography and video: interacting and creating with videos and images. AVI 2008: 4 - [c69]Pei Yin, Irfan A. Essa, Thad Starner, James M. Rehg:
Discriminative feature selection for hidden Markov models using Segmental Boosting. ICASSP 2008: 2001-2004 - [c68]Matthias Grundmann, Franziska Meier, Irfan A. Essa:
3D Shape Context and Distance Transform for action recognition. ICPR 2008: 1-4 - [c67]Nicholas Diakopoulos, Irfan A. Essa:
An annotation model for making sense of information quality in online video. ICPW 2008: 31-34 - [c66]Kihwan Kim, Jay Summet, Thad Starner, Daniel Ashbrook, Mrunal Kapade, Irfan A. Essa:
Localization and 3D reconstruction of urban scenes using GPS. ISWC 2008: 11-14 - [c65]Nicholas Diakopoulos, Kurt Luther, Irfan A. Essa:
Audio Puzzler: piecing together time-stamped speech transcripts with a puzzle game. ACM Multimedia 2008: 865-868 - 2007
- [c64]David Minnen, Charles Lee Isbell Jr., Irfan A. Essa, Thad Starner:
Discovering Multivariate Motifs using Subsequence Density Estimation and Greedy Mixture Learning. AAAI 2007: 615-620 - [c63]Pei Yin, Antonio Criminisi, John M. Winn, Irfan A. Essa:
Tree-based Classifiers for Bilayer Video Segmentation. CVPR 2007 - [c62]Nicholas Diakopoulos, Kurt Luther, Yevgeniy Eugene Medynskiy, Irfan A. Essa:
The evolution of authorship in a remix society. Hypertext 2007: 133-136 - [c61]R. Mitchell Parry, Irfan A. Essa:
Phase-Aware Non-negative Spectrogram Factorization. ICA 2007: 536-543 - [c60]R. Mitchell Parry, Irfan A. Essa:
Incorporating Phase Information for Source Separation via Spectrogram Factorization. ICASSP (2) 2007: 661-664 - [c59]Raffay Hamid, Siddhartha Maddi, Aaron F. Bobick, Irfan A. Essa:
Structure from Statistics - Unsupervised Activity Analysis using Suffix Trees. ICCV 2007: 1-8 - [c58]David Minnen, Charles L. Isbell Jr., Irfan A. Essa, Thad Starner:
Detecting Subdimensional Motifs: An Efficient Algorithm for Generalized Multivariate Pattern Discovery. ICDM 2007: 601-606 - [c57]David Minnen, Thad Starner, Irfan A. Essa, Charles Lee Isbell Jr.:
Improving Activity Discovery with Automatic Neighborhood Estimation. IJCAI 2007: 2814-2819 - [c56]Nicolas Padoy, Tobias Blum, Irfan A. Essa, Hubertus Feußner, Marie-Odile Berger, Nassir Navab:
A Boosted Segmentation Method for Surgical Workflow Analysis. MICCAI (1) 2007: 102-109 - [c55]Irfan A. Essa:
Data-driven and Procedural Analysis and Synthesis of Multimedia. WIAMIS 2007 - 2006
- [c54]Yifan Shi, Aaron F. Bobick, Irfan A. Essa:
Learning Temporal Sequence Model from Partially Labeled Data. CVPR (2) 2006: 1631-1638 - [c53]Jaeil Choi, Andrzej Szymczak, Greg Turk, Irfan A. Essa:
Element-Free Elastic Models for Volume Fitting and Capture. CVPR (2) 2006: 2245-2252 - [c52]R. Mitchell Parry, Irfan A. Essa:
Estimating the Spatial Position of Spectral Components in Audio. ICA 2006: 666-673 - [c51]R. Mitchell Parry, Irfan A. Essa:
Source Detection Using Repetitive Structure. ICASSP (4) 2006: 1093-1096 - [c50]David Minnen, Thad Starner, Irfan A. Essa, Charles Lee Isbell Jr.:
Discovering Characteristic Actions from On-Body Sensor Data. ISWC 2006: 11-18 - [c49]Raffay Hamid, Siddhartha Maddi, Aaron F. Bobick, Irfan A. Essa:
Unsupervised analysis of activity sequences using event-motifs. VSSN@MM 2006: 71-78 - [c48]Kihwan Kim, Irfan A. Essa, Gregory D. Abowd:
Interactive mosaic generation for video navigation. ACM Multimedia 2006: 655-658 - [c47]Nicholas Diakopoulos, Irfan A. Essa:
Videotater: an approach for pen-based digital video segmentation and tagging. UIST 2006: 221-224 - 2005
- [j9]Yavor Angelov, Umakishore Ramachandran, Kenneth M. Mackenzie, James M. Rehg, Irfan A. Essa:
Experiences with optimizing two stream-based applications for cluster execution. J. Parallel Distributed Comput. 65(6): 678-691 (2005) - [j8]Vivek Kwatra, Irfan A. Essa, Aaron F. Bobick, Nipun Kwatra:
Texture optimization for example-based synthesis. ACM Trans. Graph. 24(3): 795-802 (2005) - [c46]ByungMoon Kim, Irfan A. Essa:
Video-based nonphotorealistic and expressive illustration of motion. Computer Graphics International 2005: 32-35 - [c45]Yan Huang, Irfan A. Essa:
Tracking Multiple Objects through Occlusions. CVPR (2) 2005: 1051-1058 - [c44]Yan Huang, Irfan A. Essa:
Tracking Multiple Objects through Occlusions. CVPR (2) 2005: 1182 - [c43]Rafay Hammid, Siddhartha Maddi, Amos Y. Johnson, Aaron F. Bobick, Irfan A. Essa, Charles Lee Isbell Jr.:
Unsupervised Activity Discovery and Characterization From Event-Streams. UAI 2005: 251-258 - [c42]Nicholas Diakopoulos, Irfan A. Essa:
Mediating photo collage authoring. UIST 2005: 183-186 - 2004
- [c41]Nicholas Diakopoulos, Irfan A. Essa, Ramesh C. Jain:
Content Based Image Synthesis. CIVR 2004: 299-307 - [c40]Pei Yin, Irfan A. Essa, James M. Rehg:
Asymmetrically Boosted HMM for Speech Reading. CVPR (2) 2004: 755-761 - [c39]Yifan Shi, Yan Huang, David Minnen, Aaron F. Bobick, Irfan A. Essa:
Propagation Networks for Recognition of Partially Ordered Sequential Action. CVPR (2) 2004: 862-869 - [c38]Gabriel J. Brostow, Irfan A. Essa, Drew Steedly, Vivek Kwatra:
Novel Skeletal Representation for Articulated Creatures. ECCV (3) 2004: 66-78 - [c37]Michael J. Covington, Mustaque Ahamad, Irfan A. Essa, H. Venkateswaran:
Parameterized Authentication. ESORICS 2004: 276-292 - [c36]R. Mitchell, Irfan A. Essa:
Feature Weighting for Segmentation. ISMIR 2004 - [c35]James Hays, Irfan A. Essa:
Image and video based painterly animation. NPAR 2004: 113-120 - 2003
- [j7]Katherine E. Sukel, Richard Catrambone, Irfan A. Essa, Gabriel J. Brostow:
Presenting Movement in a Computer-Based Dance Tutor. Int. J. Hum. Comput. Interact. 15(3): 433-452 (2003) - [j6]Vivek Kwatra, Arno Schödl, Irfan A. Essa, Greg Turk, Aaron F. Bobick:
Graphcut textures: image and video synthesis using graph cuts. ACM Trans. Graph. 22(3): 277-286 (2003) - [c34]Pei Yin, Irfan A. Essa, James M. Rehg:
Boosted Audio-Visual HMM for Speech Reading. AMFG 2003: 68-73 - [c33]Raffay Hamid, Yan Huang, Irfan A. Essa:
ARGMode - Activity Recognition using Graphical Models. CVPR Workshops 2003: 38 - [c32]David Minnen, Irfan A. Essa, Thad Starner:
Expectation Grammars: Leveraging High-Level Expectations for Activity Recognition. CVPR (2) 2003: 626-632 - [c31]Jun Xu, Richard J. Lipton, Irfan A. Essa, Minho Sung, Yong Zhu:
Mandatory human participation: a new authentication scheme for building secure systems. ICCCN 2003: 547-552 - [c30]Drew Steedly, Irfan A. Essa, Frank Dellaert:
Spectral Partitioning for Structure from Motion. ICCV 2003: 996-1003 - [c29]Ravikrishna Ruddarraju, Antonio Haro, Kris Nagel, Quan T. Tran, Irfan A. Essa, Gregory D. Abowd, Elizabeth D. Mynatt:
Perceptual user interfaces using vision-based eye tracking. ICMI 2003: 227-233 - [c28]R. Mitchell Parry, Irfan A. Essa:
Rhythmic similarity through elaboration. ISMIR 2003 - [c27]Antonio Haro, Irfan A. Essa:
Exemplar-Based Surface Texture. VMV 2003: 95-101 - 2002
- [c26]Darnell J. Moore, Irfan A. Essa:
Recognizing Multitasked Activities from Video Using Stochastic Context-Free Grammar. AAAI/IAAI 2002: 770-776 - [c25]Antonio Haro, Irfan A. Essa:
Learning Video Processing by Example. ICPR (1) 2002: 487-491 - [c24]Arno Schödl, Irfan A. Essa:
Controlled animation of video sprites. Symposium on Computer Animation 2002: 121-127 - 2001
- [c23]Arno Schödl, Irfan A. Essa:
Depth Layers from Occlusions. CVPR (1) 2001: 639-644 - [c22]Drew Steedly, Irfan A. Essa:
Propagation of Innovative Information in Non-Linear Least-Squares Structure from Motion. ICCV 2001: 223-229 - [c21]Scott Stillman, Irfan Essa:
Towards reliable multimodal sensing in aware environments. PUI 2001: 1:1-1:6 - [c20]Antonio Haro, Irfan A. Essa, Brian K. Guenter:
Real-time Photo-Realistic Physically Based Rendering of Fine Scale Human Skin Structure. Rendering Techniques 2001: 53-62 - [c19]Gabriel J. Brostow, Irfan A. Essa:
Image-based motion blur for stop motion animation. SIGGRAPH 2001: 561-566 - 2000
- [j5]Irfan A. Essa:
Ubiquitous sensing for smart and aware environments. IEEE Wirel. Commun. 7(5): 47-49 (2000) - [c18]Antonio Haro, Irfan A. Essa, Myron Flickner:
A non-invasive computer vision system for reliable eye tracking. CHI Extended Abstracts 2000: 167-168 - [c17]Gregory D. Abowd, Christopher G. Atkeson, Aaron F. Bobick, Irfan A. Essa, Blair MacIntyre, Elizabeth D. Mynatt, Thad E. Starner:
Living laboratories: the future computing environments group at the Georgia Institute of Technology. CHI Extended Abstracts 2000: 215-216 - [c16]Elizabeth D. Mynatt, Irfan A. Essa, Wendy A. Rogers:
Increasing the opportunities for aging in place. CUU 2000: 65-71 - [c15]Antonio Haro, Myron Flickner, Irfan A. Essa:
Detecting and Tracking Eyes by Using Their Physiological Properties, Dynamics, and Appearance. CVPR 2000: 1163-1168 - [c14]Arno Schödl, Irfan A. Essa:
Machine Learning for Video-Based Rendering. NIPS 2000: 1002-1008 - [c13]Arno Schödl, Richard Szeliski, David Salesin, Irfan A. Essa:
Video textures. SIGGRAPH 2000: 489-498
1990 – 1999
- 1999
- [j4]Irfan A. Essa:
Computers Seeing People. AI Mag. 20(2): 69-82 (1999) - [c12]Cory D. Kidd, Robert J. Orr, Gregory D. Abowd, Christopher G. Atkeson, Irfan A. Essa, Blair MacIntyre, Elizabeth D. Mynatt, Thad Starner, Wendy Newstetter:
The Aware Home: A Living Laboratory for Ubiquitous Computing Research. CoBuild 1999: 191-198 - [c11]Gabriel J. Brostow, Irfan A. Essa:
Motion based Decompositing of Video. ICCV 1999: 8-13 - [c10]Darnell J. Moore, Irfan A. Essa, Monson H. Hayes:
Exploiting Human Actions and Object Context for Recognition Tasks. ICCV 1999: 80-86 - [c9]Arno Schödl, Karsten Schwan, Irfan A. Essa:
Adaptive Parallelization of Model-Based Head Tracking. PDPTA 1999: 1571-1577 - 1997
- [j3]Irfan A. Essa, Alex Pentland:
Coding, Analysis, Interpretation, and Recognition of Facial Expressions. IEEE Trans. Pattern Anal. Mach. Intell. 19(7): 757-763 (1997) - 1996
- [j2]Trevor Darrell, Irfan A. Essa, Alex Pentland:
Task-Specific Gesture Analysis in Real-Time Using Interpolated Views. IEEE Trans. Pattern Anal. Mach. Intell. 18(12): 1236-1242 (1996) - [c8]Irfan A. Essa, Sumit Basu, Trevor Darrell, Alex Pentland:
Modeling, Tracking and Interactive Animation of Faces and Heads Using Input from Video. CA 1996: 68-79 - [c7]Irfan A. Essa:
Vision-Based HCI - What's Next and What are the Difficult Problems? FG 1996 - [c6]Sumit Basu, Irfan A. Essa, Alex Pentland:
Motion regularization for model-based head tracking. ICPR 1996: 611-616 - 1995
- [c5]Irfan A. Essa, Alex Pentland:
Facial Expression Recognition Using a Dynamic Model and Motion Energy. ICCV 1995: 360-367 - 1994
- [c4]Alex Pentland, Trevor Darrell, Irfan A. Essa, Ali Azarbayejani, Stan Sclaroff:
Visually guided animation. CA 1994: 112-121 - [c3]Irfan A. Essa, Alex Pentland:
A vision system for observing and extracting facial action parameters. CVPR 1994: 76-83 - [c2]Trevor Darrell, Irfan A. Essa, Alex Pentland:
Correlation and Interpolation Networks for Real-time Expression Analysis/Synthesis. NIPS 1994: 909-916 - 1992
- [j1]Irfan A. Essa, Stan Sclaroff, Alex Pentland:
A Unified Approach for Physical and Geometric Modeling for Graphics and Animation. Comput. Graph. Forum 11(3): 129-138 (1992) - 1990
- [c1]Stanley E. Scharoff, Alex Pentland, Irfan A. Essa, Martin Friedmann, Bradley Horowitz:
The ThingWorld modeling system: virtual sculpting by modal forces. I3D 1990: 143-144
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-12-02 22:32 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint