Karan Sikka
2020 – today
- 2024
- [c33]Anirudh Som, Karan Sikka, Helen Gent, Ajay Divakaran, Andreas Kathol, Dimitra Vergyri:
Demonstrations Are All You Need: Advancing Offensive Content Paraphrasing using In-Context Learning. ACL (Findings) 2024: 12612-12627
- [c32]Yangyi Chen, Karan Sikka, Michael Cogswell, Heng Ji, Ajay Divakaran:
DRESS: Instructing Large Vision-Language Models to Align and Interact with Humans via Natural Language Feedback. CVPR 2024: 14239-14250
- [c31]Pritish Sahu, Karan Sikka, Ajay Divakaran:
Pelican: Correcting Hallucination in Vision-LLMs via Claim Decomposition and Program of Thought Verification. EMNLP 2024: 8228-8248
- [c30]Abhinav Rajvanshi, Karan Sikka, Xiao Lin, Bhoram Lee, Han-Pang Chiu, Alvaro Velasquez:
SayNav: Grounding Large Language Models for Dynamic Planning to Navigation in New Environments. ICAPS 2024: 464-474
- [c29]Yangyi Chen, Karan Sikka, Michael Cogswell, Heng Ji, Ajay Divakaran:
Measuring and Improving Chain-of-Thought Reasoning in Vision-Language Models. NAACL-HLT 2024: 192-210
- [i31]Pritish Sahu, Karan Sikka, Ajay Divakaran:
Pelican: Correcting Hallucination in Vision-LLMs via Claim Decomposition and Program of Thought Verification. CoRR abs/2407.02352 (2024)
- 2023
- [c28]Meng Ye, Karan Sikka, Katherine Atwell, Sabit Hassan, Ajay Divakaran, Malihe Alikhani:
Multilingual Content Moderation: A Case Study on Reddit. EACL 2023: 3810-3826
- [c27]Karan Sikka, Indranil Sur, Anirban Roy, Ajay Divakaran, Susmit Jha:
Detecting Trojaned DNNs Using Counterfactual Attributions. ICAA 2023: 76-85
- [c26]Indranil Sur, Karan Sikka, Matthew Walmer, Kaushik Koneripalli, Anirban Roy, Xiao Lin, Ajay Divakaran, Susmit Jha:
TIJO: Trigger Inversion with Joint Optimization for Defending Multimodal Backdoored Models. ICCV 2023: 165-175
- [c25]Yiqiao Jin, Yeon-Chang Lee, Kartik Sharma, Meng Ye, Karan Sikka, Ajay Divakaran, Srijan Kumar:
Predicting Information Pathways Across Online Communities. KDD 2023: 1044-1056
- [i30]Meng Ye, Karan Sikka, Katherine Atwell, Sabit Hassan, Ajay Divakaran, Malihe Alikhani:
Multilingual Content Moderation: A Case Study on Reddit. CoRR abs/2302.09618 (2023)
- [i29]Yiqiao Jin, Yeon-Chang Lee, Kartik Sharma, Meng Ye, Karan Sikka, Ajay Divakaran, Srijan Kumar:
Predicting Information Pathways Across Online Communities. CoRR abs/2306.02259 (2023)
- [i28]Indranil Sur, Karan Sikka, Matthew Walmer, Kaushik Koneripalli, Anirban Roy, Xiao Lin, Ajay Divakaran, Susmit Jha:
TIJO: Trigger Inversion with Joint Optimization for Defending Multimodal Backdoored Models. CoRR abs/2308.03906 (2023)
- [i27]Abhinav Rajvanshi, Karan Sikka, Xiao Lin, Bhoram Lee, Han-Pang Chiu, Alvaro Velasquez:
SayNav: Grounding Large Language Models for Dynamic Planning to Navigation in New Environments. CoRR abs/2309.04077 (2023)
- [i26]Yangyi Chen, Karan Sikka, Michael Cogswell, Heng Ji, Ajay Divakaran:
Measuring and Improving Chain-of-Thought Reasoning in Vision-Language Models. CoRR abs/2309.04461 (2023)
- [i25]Anirudh Som, Karan Sikka, Helen Gent, Ajay Divakaran, Andreas Kathol, Dimitra Vergyri:
Demonstrations Are All You Need: Advancing Offensive Content Paraphrasing using In-Context Learning. CoRR abs/2310.10707 (2023)
- [i24]Yangyi Chen, Karan Sikka, Michael Cogswell, Heng Ji, Ajay Divakaran:
DRESS: Instructing Large Vision-Language Models to Align and Interact with Humans via Natural Language Feedback. CoRR abs/2311.10081 (2023)
- [i23]Matthew Gwilliam, Michael Cogswell, Meng Ye, Karan Sikka, Abhinav Shrivastava, Ajay Divakaran:
A Video is Worth 10,000 Words: Training and Benchmarking with Diverse Captions for Better Long Video Retrieval. CoRR abs/2312.00115 (2023)
- 2022
- [c24]Matthew Walmer, Karan Sikka, Indranil Sur, Abhinav Shrivastava, Susmit Jha:
Dual-Key Multimodal Backdoors for Visual Question Answering. CVPR 2022: 15354-15364
- [c23]Pritish Sahu, Karan Sikka, Ajay Divakaran:
Challenges in Procedural Multimodal Machine Comprehension: A Novel Way To Benchmark. WACV 2022: 526-535
- [c22]Sherzod Hakimov, Gullal Singh Cheema, Marc A. Kastner, Rajiv Ratn Shah, Karan Sikka:
MUWS'22: 1st International Workshop on Multimodal Understanding for the Web and Social Media. WWW (Companion Volume) 2022: 692-693
- 2021
- [c21]Panagiota Kiourti, Wenchao Li, Anirban Roy, Karan Sikka, Susmit Jha:
MISA: Online Defense of Trojaned Models using Misattributions. ACSAC 2021: 570-585
- [i22]Panagiota Kiourti, Wenchao Li, Anirban Roy, Karan Sikka, Susmit Jha:
Online Defense of Trojaned Models using Misattributions. CoRR abs/2103.15918 (2021)
- [i21]Pritish Sahu, Karan Sikka, Ajay Divakaran:
Towards Solving Multimodal Comprehension. CoRR abs/2104.10139 (2021)
- [i20]Pritish Sahu, Karan Sikka, Ajay Divakaran:
Challenges in Procedural Multimodal Machine Comprehension: A Novel Way To Benchmark. CoRR abs/2110.11899 (2021)
- [i19]Matthew Walmer, Karan Sikka, Indranil Sur, Abhinav Shrivastava, Susmit Jha:
Dual-Key Multimodal Backdoors for Visual Question Answering. CoRR abs/2112.07668 (2021)
- 2020
- [c20]Niluthpol Chowdhury Mithun, Karan Sikka, Han-Pang Chiu, Supun Samarasekera, Rakesh Kumar:
RGB2LIDAR: Towards Solving Large-Scale Cross-Modal Visual Localization. ACM Multimedia 2020: 934-954
- [i18]Karan Sikka, Andrew Silberfarb, John Byrnes, Indranil Sur, Edmond Chow, Ajay Divakaran, Richard Rohwer:
Deep Adaptive Semantic Logic (DASL): Compiling Declarative Knowledge into Deep Neural Networks. CoRR abs/2003.07344 (2020)
- [i17]Niluthpol Chowdhury Mithun, Karan Sikka, Han-Pang Chiu, Supun Samarasekera, Rakesh Kumar:
RGB2LIDAR: Towards Solving Large-Scale Cross-Modal Visual Localization. CoRR abs/2009.05695 (2020)
- [i16]Karan Sikka, Jihua Huang, Andrew Silberfarb, Prateeth Nayak, Luke Rohrer, Pritish Sahu, John Byrnes, Ajay Divakaran, Richard Rohwer:
Zero-Shot Learning with Knowledge Enhanced Visual Semantic Embeddings. CoRR abs/2011.10889 (2020)
- [i15]Karan Sikka, Indranil Sur, Susmit Jha, Anirban Roy, Ajay Divakaran:
Detecting Trojaned DNNs Using Counterfactual Attributions. CoRR abs/2012.02275 (2020)
2010 – 2019
- 2019
- [c19]Zachary Seymour, Karan Sikka, Han-Pang Chiu, Supun Samarasekera, Rakesh Kumar:
Semantically-Aware Attentive Neural Embeddings for 2D Long-Term Visual Localization. BMVC 2019: 70
- [c18]Julia Kruk, Jonah Lubin, Karan Sikka, Xiao Lin, Dan Jurafsky, Ajay Divakaran:
Integrating Text and Image: Determining Multimodal Document Intent in Instagram Posts. EMNLP/IJCNLP (1) 2019: 4621-4631
- [c17]Arijit Ray, Karan Sikka, Ajay Divakaran, Stefan Lee, Giedrius Burachas:
Sunny and Dark Outside?! Improving Answer Consistency in VQA through Entailed Question Generation. EMNLP/IJCNLP (1) 2019: 5859-5864
- [c16]Samyak Datta, Karan Sikka, Anirban Roy, Karuna Ahuja, Devi Parikh, Ajay Divakaran:
Align2Ground: Weakly Supervised Phrase Grounding Guided by Image-Caption Alignment. ICCV 2019: 2601-2610
- [c15]Karan Sikka:
Learning User Preferences from Social Multimedia Analysis and Overview of the iFood2019Challenge. MADiMa @ ACM Multimedia 2019: 18
- [i14]Samyak Datta, Karan Sikka, Anirban Roy, Karuna Ahuja, Devi Parikh, Ajay Divakaran:
Align2Ground: Weakly Supervised Phrase Grounding Guided by Image-Caption Alignment. CoRR abs/1903.11649 (2019)
- [i13]Julia Kruk, Jonah Lubin, Karan Sikka, Xiao Lin, Dan Jurafsky, Ajay Divakaran:
Integrating Text and Image: Determining Multimodal Document Intent in Instagram Posts. CoRR abs/1904.09073 (2019)
- [i12]Karan Sikka, Lucas Van Bramer, Ajay Divakaran:
Deep Unified Multimodal Embeddings for Understanding both Content and Users in Social Media Networks. CoRR abs/1905.07075 (2019)
- [i11]Parneet Kaur, Karan Sikka, Weijun Wang, Serge J. Belongie, Ajay Divakaran:
FoodX-251: A Dataset for Fine-grained Food Classification. CoRR abs/1907.06167 (2019)
- [i10]Arijit Ray, Karan Sikka, Ajay Divakaran, Stefan Lee, Giedrius Burachas:
Sunny and Dark Outside?! Improving Answer Consistency in VQA through Entailed Question Generation. CoRR abs/1909.04696 (2019)
- 2018
- [j3]Karan Sikka, Gaurav Sharma:
Discriminatively Trained Latent Ordinal Model for Video Classification. IEEE Trans. Pattern Anal. Mach. Intell. 40(8): 1829-1844 (2018)
- [c14]Ankan Bansal, Karan Sikka, Gaurav Sharma, Rama Chellappa, Ajay Divakaran:
Zero-Shot Object Detection. ECCV (1) 2018: 397-414
- [i9]Ankan Bansal, Karan Sikka, Gaurav Sharma, Rama Chellappa, Ajay Divakaran:
Zero-Shot Object Detection. CoRR abs/1804.04340 (2018)
- [i8]Karuna Ahuja, Karan Sikka, Anirban Roy, Ajay Divakaran:
Understanding Visual Ads by Aligning Symbols and Objects using Co-Attention. CoRR abs/1807.01448 (2018)
- [i7]Zachary Seymour, Karan Sikka, Han-Pang Chiu, Supun Samarasekera, Rakesh Kumar:
Semantically-Aware Attentive Neural Embeddings for Image-based Visual Localization. CoRR abs/1812.03402 (2018)
- 2017
- [j2]Mohsen Malmir, Karan Sikka, Deborah Forster, Ian R. Fasel, Javier R. Movellan, Garrison W. Cottrell:
Deep active object recognition by joint label and action prediction. Comput. Vis. Image Underst. 156: 128-137 (2017)
- [c13]Amlan Kar, Nishant Rai, Karan Sikka, Gaurav Sharma:
AdaScan: Adaptive Scan Pooling in Deep Convolutional Neural Networks for Human Action Recognition in Videos. CVPR 2017: 5699-5708
- [i6]Parneet Kaur, Karan Sikka, Ajay Divakaran:
Combining Weakly and Webly Supervised Learning for Classifying Food Images. CoRR abs/1712.08730 (2017)
- 2016
- [b1]Karan Sikka:
Latent Dynamic Space-Time Volumes for Predicting Human Facial Behavior in Videos. University of California, San Diego, USA, 2016
- [c12]Karan Sikka, Gaurav Sharma, Marian Stewart Bartlett:
LOMo: Latent Ordinal Model for Facial Analysis in Videos. CVPR 2016: 5580-5589
- [i5]Karan Sikka, Gaurav Sharma, Marian Stewart Bartlett:
LOMo: Latent Ordinal Model for Facial Analysis in Videos. CoRR abs/1604.01500 (2016)
- [i4]Karan Sikka, Gaurav Sharma:
Discriminatively Trained Latent Ordinal Model for Video Classification. CoRR abs/1608.02318 (2016)
- [i3]Amlan Kar, Nishant Rai, Karan Sikka, Gaurav Sharma:
AdaScan: Adaptive Scan Pooling in Deep Convolutional Neural Networks for Human Action Recognition in Videos. CoRR abs/1611.08240 (2016)
- 2015
- [c11]Karan Sikka, Ritwik Giri, Marian Stewart Bartlett:
Joint Clustering and Classification for Multiple Instance Learning. BMVC 2015: 71.1-71.12
- [c10]Mohsen Malmir, Karan Sikka, Deborah Forster, Javier R. Movellan, Garison Cottrell:
Deep Q-learning for Active Recognition of GERMS: Baseline performance on a standardized dataset for active learning. BMVC 2015: 161.1-161.11
- [c9]Karan Sikka, Abhinav Dhall, Marian Stewart Bartlett:
Exemplar Hidden Markov Models for classification of facial expressions in videos. CVPR Workshops 2015: 18-25
- [c8]Abhinav Dhall, Jyoti Joshi, Karan Sikka, Roland Goecke, Nicu Sebe:
The more the merrier: Analysing the affect of a group of people in images. FG 2015: 1-8
- [i2]Mohsen Malmir, Karan Sikka, Deborah Forster, Ian R. Fasel, Javier R. Movellan, Garrison W. Cottrell:
Deep Active Object Recognition by Joint Label and Action Prediction. CoRR abs/1512.05484 (2015)
- 2014
- [j1]Karan Sikka, Abhinav Dhall, Marian Stewart Bartlett:
Classification and weakly supervised pain localization using multiple segment representation. Image Vis. Comput. 32(10): 659-670 (2014)
- [c7]Karan Sikka:
Facial Expression Analysis for Estimating Pain in Clinical Settings. ICMI 2014: 349-353
- [c6]Abhinav Dhall, Roland Goecke, Jyoti Joshi, Karan Sikka, Tom Gedeon:
Emotion Recognition In The Wild Challenge 2014: Baseline, Data and Protocol. ICMI 2014: 461-466
- [c5]Abhinav Dhall, Karan Sikka, Gwen Littlewort, Roland Goecke, Marian Stewart Bartlett:
A discriminative parts based model approach for fiducial points free and shape constrained head pose normalisation in the wild. WACV 2014: 1-2
- [c4]Abhinav Dhall, Karan Sikka, Gwen Littlewort, Roland Goecke, Marian Stewart Bartlett:
A discriminative parts based model approach for fiducial points free and shape constrained head pose normalisation in the wild. WACV 2014: 1028-1035
- 2013
- [c3]Karan Sikka, Abhinav Dhall, Marian Stewart Bartlett:
Weakly supervised pain localization using multiple instance learning. FG 2013: 1-8
- [c2]Karan Sikka, Karmen Dykstra, Suchitra Sathyanarayana, Gwen Littlewort, Marian Stewart Bartlett:
Multiple kernel learning for emotion recognition in the wild. ICMI 2013: 517-524
- [i1]Sahil Sikka, Karan Sikka, Manas Kamal Bhuyan, Yuji Iwahori:
Pseudo vs. True Defect Classification in Printed Circuits Boards using Wavelet Features. CoRR abs/1310.6654 (2013)
- 2012
- [c1]Karan Sikka, Tingfan Wu, Joshua Susskind, Marian Stewart Bartlett:
Exploring Bag of Words Architectures in the Facial Expression Domain. ECCV Workshops (2) 2012: 250-259
last updated on 2025-01-21 00:21 CET by the dblp team
all metadata released as open data under CC0 1.0 license