default search action
Marcus Rohrbach
Person information
- affiliation: TU Darmstadt, Germany
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c74]Anil Batra, Davide Moltisanti, Laura Sevilla-Lara, Marcus Rohrbach, Frank Keller:
Efficient Pre-training for Localized Instruction Generation of Procedural Videos. ECCV (39) 2024: 347-363 - [c73]Suzanne Petryk, Spencer Whitehead, Joseph E. Gonzalez, Trevor Darrell, Anna Rohrbach, Marcus Rohrbach:
Simple Token-Level Confidence Improves Caption Correctness. WACV 2024: 5730-5740 - 2023
- [c72]Corentin Dancette, Spencer Whitehead, Rishabh Maheshwary, Ramakrishna Vedantam, Stefan Scherer, Xinlei Chen, Matthieu Cord, Marcus Rohrbach:
Improving Selective Visual Question Answering by Learning from Your Peers. CVPR 2023: 24049-24059 - [i66]Suzanne Petryk, Spencer Whitehead, Joseph E. Gonzalez, Trevor Darrell, Anna Rohrbach, Marcus Rohrbach:
Simple Token-Level Confidence Improves Caption Correctness. CoRR abs/2305.07021 (2023) - [i65]Corentin Dancette, Spencer Whitehead, Rishabh Maheshwary, Ramakrishna Vedantam, Stefan Scherer, Xinlei Chen, Matthieu Cord, Marcus Rohrbach:
Improving Selective Visual Question Answering by Learning from Your Peers. CoRR abs/2306.08751 (2023) - 2022
- [c71]Xudong Lin, Fabio Petroni, Gedas Bertasius, Marcus Rohrbach, Shih-Fu Chang, Lorenzo Torresani:
Learning To Recognize Procedural Activities with Distant Supervision. CVPR 2022: 13843-13853 - [c70]Amanpreet Singh, Ronghang Hu, Vedanuj Goswami, Guillaume Couairon, Wojciech Galuba, Marcus Rohrbach, Douwe Kiela:
FLAVA: A Foundational Language And Vision Alignment Model. CVPR 2022: 15617-15629 - [c69]Spencer Whitehead, Suzanne Petryk, Vedaad Shakib, Joseph Gonzalez, Trevor Darrell, Anna Rohrbach, Marcus Rohrbach:
Reliable Visual Question Answering: Abstain Rather Than Answer Incorrectly. ECCV (36) 2022: 148-166 - [c68]Shreyank N. Gowda, Laura Sevilla-Lara, Frank Keller, Marcus Rohrbach:
CLASTER: Clustering with Reinforcement Learning for Zero-Shot Action Recognition. ECCV (20) 2022: 187-203 - [c67]Shreyank N. Gowda, Marcus Rohrbach, Frank Keller, Laura Sevilla-Lara:
Learn2Augment: Learning to Composite Videos for Data Augmentation in Action Recognition. ECCV (31) 2022: 242-259 - [i64]Xudong Lin, Fabio Petroni, Gedas Bertasius, Marcus Rohrbach, Shih-Fu Chang, Lorenzo Torresani:
Learning To Recognize Procedural Activities with Distant Supervision. CoRR abs/2201.10990 (2022) - [i63]Spencer Whitehead, Suzanne Petryk, Vedaad Shakib, Joseph Gonzalez, Trevor Darrell, Anna Rohrbach, Marcus Rohrbach:
Reliable Visual Question Answering: Abstain Rather Than Answer Incorrectly. CoRR abs/2204.13631 (2022) - [i62]Shreyank N. Gowda, Marcus Rohrbach, Frank Keller, Laura Sevilla-Lara:
Learn2Augment: Learning to Composite Videos for Data Augmentation in Action Recognition. CoRR abs/2206.04790 (2022) - 2021
- [c66]Shreyank N. Gowda, Marcus Rohrbach, Laura Sevilla-Lara:
SMART Frame Selection for Action Recognition. AAAI 2021: 1451-1459 - [c65]Kenneth Marino, Xinlei Chen, Devi Parikh, Abhinav Gupta, Marcus Rohrbach:
KRISP: Integrating Implicit and Symbolic Knowledge for Open-Domain Knowledge-Based VQA. CVPR 2021: 14111-14121 - [c64]Shreyank N. Gowda, Laura Sevilla-Lara, Kiyoon Kim, Frank Keller, Marcus Rohrbach:
A New Split for Evaluating True Zero-Shot Action Recognition. GCPR 2021: 191-205 - [c63]Sayna Ebrahimi, Suzanne Petryk, Akash Gokul, William Gan, Joseph E. Gonzalez, Marcus Rohrbach, Trevor Darrell:
Remembering for the Right Reasons: Explanations Reduce Catastrophic Forgetting. ICLR 2021 - [i61]Shreyank N. Gowda, Laura Sevilla-Lara, Frank Keller, Marcus Rohrbach:
CLASTER: Clustering with Reinforcement Learning for Zero-Shot Action Recognition. CoRR abs/2101.07042 (2021) - [i60]Shreyank N. Gowda, Laura Sevilla-Lara, Kiyoon Kim, Frank Keller, Marcus Rohrbach:
A New Split for Evaluating True Zero-Shot Action Recognition. CoRR abs/2107.13029 (2021) - [i59]Amanpreet Singh, Ronghang Hu, Vedanuj Goswami, Guillaume Couairon, Wojciech Galuba, Marcus Rohrbach, Douwe Kiela:
FLAVA: A Foundational Language And Vision Alignment Model. CoRR abs/2112.04482 (2021) - 2020
- [c62]Ronghang Hu, Amanpreet Singh, Trevor Darrell, Marcus Rohrbach:
Iterative Answer Prediction With Pointer-Augmented Multimodal Transformers for TextVQA. CVPR 2020: 9989-9999 - [c61]Huaizu Jiang, Ishan Misra, Marcus Rohrbach, Erik G. Learned-Miller, Xinlei Chen:
In Defense of Grid Features for Visual Question Answering. CVPR 2020: 10264-10273 - [c60]Jiasen Lu, Vedanuj Goswami, Marcus Rohrbach, Devi Parikh, Stefan Lee:
12-in-1: Multi-Task Vision and Language Representation Learning. CVPR 2020: 10434-10443 - [c59]Chih-Yao Ma, Yannis Kalantidis, Ghassan AlRegib, Peter Vajda, Marcus Rohrbach, Zsolt Kira:
Learning to Generate Grounded Visual Captions Without Localization Supervision. ECCV (18) 2020: 353-370 - [c58]Sayna Ebrahimi, Franziska Meier, Roberto Calandra, Trevor Darrell, Marcus Rohrbach:
Adversarial Continual Learning. ECCV (11) 2020: 386-402 - [c57]Oleksii Sidorov, Ronghang Hu, Marcus Rohrbach, Amanpreet Singh:
TextCaps: A Dataset for Image Captioning with Reading Comprehension. ECCV (2) 2020: 742-758 - [c56]Sayna Ebrahimi, Mohamed Elhoseiny, Trevor Darrell, Marcus Rohrbach:
Uncertainty-guided Continual Learning with Bayesian Neural Networks. ICLR 2020 - [c55]Bingyi Kang, Saining Xie, Marcus Rohrbach, Zhicheng Yan, Albert Gordo, Jiashi Feng, Yannis Kalantidis:
Decoupling Representation and Classifier for Long-Tailed Recognition. ICLR 2020 - [i58]Huaizu Jiang, Ishan Misra, Marcus Rohrbach, Erik G. Learned-Miller, Xinlei Chen:
In Defense of Grid Features for Visual Question Answering. CoRR abs/2001.03615 (2020) - [i57]Sayna Ebrahimi, Franziska Meier, Roberto Calandra, Trevor Darrell, Marcus Rohrbach:
Adversarial Continual Learning. CoRR abs/2003.09553 (2020) - [i56]Oleksii Sidorov, Ronghang Hu, Marcus Rohrbach, Amanpreet Singh:
TextCaps: a Dataset for Image Captioning with Reading Comprehension. CoRR abs/2003.12462 (2020) - [i55]Sayna Ebrahimi, Suzanne Petryk, Akash Gokul, William Gan, Joseph E. Gonzalez, Marcus Rohrbach, Trevor Darrell:
Remembering for the Right Reasons: Explanations Reduce Catastrophic Forgetting. CoRR abs/2010.01528 (2020) - [i54]Shreyank N. Gowda, Marcus Rohrbach, Laura Sevilla-Lara:
SMART Frame Selection for Action Recognition. CoRR abs/2012.10671 (2020) - [i53]Kenneth Marino, Xinlei Chen, Devi Parikh, Abhinav Gupta, Marcus Rohrbach:
KRISP: Integrating Implicit and Symbolic Knowledge for Open-Domain Knowledge-Based VQA. CoRR abs/2012.11014 (2020)
2010 – 2019
- 2019
- [c54]Ji Zhang, Yannis Kalantidis, Marcus Rohrbach, Manohar Paluri, Ahmed Elgammal, Mohamed Elhoseiny:
Large-Scale Visual Relationship Understanding. AAAI 2019: 9185-9194 - [c53]Jin-Hwa Kim, Nikita Kitaev, Xinlei Chen, Marcus Rohrbach, Byoung-Tak Zhang, Yuandong Tian, Dhruv Batra, Devi Parikh:
CoDraw: Collaborative Drawing as a Testbed for Grounded Goal-driven Communication. ACL (1) 2019: 6495-6513 - [c52]Jae Sung Park, Marcus Rohrbach, Trevor Darrell, Anna Rohrbach:
Adversarial Inference for Multi-Sentence Video Description. CVPR Workshops 2019: 0 - [c51]Luowei Zhou, Yannis Kalantidis, Xinlei Chen, Jason J. Corso, Marcus Rohrbach:
Grounded Video Description. CVPR Workshops 2019: 0 - [c50]Sayna Ebrahimi, Mohamed Elhoseiny, Trevor Darrell, Marcus Rohrbach:
Uncertainty-Guided Continual Learning in Bayesian Neural Networks - Extended Abstract. CVPR Workshops 2019: 75-78 - [c49]Yunpeng Chen, Marcus Rohrbach, Zhicheng Yan, Shuicheng Yan, Jiashi Feng, Yannis Kalantidis:
Graph-Based Global Reasoning Networks. CVPR 2019: 433-442 - [c48]Zheng Shou, Xudong Lin, Yannis Kalantidis, Laura Sevilla-Lara, Marcus Rohrbach, Shih-Fu Chang, Zhicheng Yan:
DMC-Net: Generating Discriminative Motion Cues for Fast Compressed Video Action Recognition. CVPR 2019: 1268-1277 - [c47]Luowei Zhou, Yannis Kalantidis, Xinlei Chen, Jason J. Corso, Marcus Rohrbach:
Grounded Video Description. CVPR 2019: 6578-6587 - [c46]Jae Sung Park, Marcus Rohrbach, Trevor Darrell, Anna Rohrbach:
Adversarial Inference for Multi-Sentence Video Description. CVPR 2019: 6598-6608 - [c45]Meet Shah, Xinlei Chen, Marcus Rohrbach, Devi Parikh:
Cycle-Consistency for Robust Visual Question Answering. CVPR 2019: 6649-6658 - [c44]Amanpreet Singh, Vivek Natarajan, Meet Shah, Yu Jiang, Xinlei Chen, Dhruv Batra, Devi Parikh, Marcus Rohrbach:
Towards VQA Models That Can Read. CVPR 2019: 8317-8326 - [c43]Yunpeng Chen, Haoqi Fan, Bing Xu, Zhicheng Yan, Yannis Kalantidis, Marcus Rohrbach, Shuicheng Yan, Jiashi Feng:
Drop an Octave: Reducing Spatial Redundancy in Convolutional Neural Networks With Octave Convolution. ICCV 2019: 3434-3443 - [c42]Rahaf Aljundi, Marcus Rohrbach, Tinne Tuytelaars:
Selfless Sequential Learning. ICLR (Poster) 2019 - [c41]Arslan Chaudhry, Marc'Aurelio Ranzato, Marcus Rohrbach, Mohamed Elhoseiny:
Efficient Lifelong Learning with A-GEM. ICLR (Poster) 2019 - [c40]Ramakrishna Vedantam, Karan Desai, Stefan Lee, Marcus Rohrbach, Dhruv Batra, Devi Parikh:
Probabilistic Neural Symbolic Models for Interpretable Visual Question Answering. ICML 2019: 6428-6437 - [c39]Satwik Kottur, José M. F. Moura, Devi Parikh, Dhruv Batra, Marcus Rohrbach:
CLEVR-Dialog: A Diagnostic Dataset for Multi-Round Reasoning in Visual Dialog. NAACL-HLT (1) 2019: 582-595 - [i52]Zheng Shou, Zhicheng Yan, Yannis Kalantidis, Laura Sevilla-Lara, Marcus Rohrbach, Xudong Lin, Shih-Fu Chang:
DMC-Net: Generating Discriminative Motion Cues for Fast Compressed Video Action Recognition. CoRR abs/1901.03460 (2019) - [i51]Meet Shah, Xinlei Chen, Marcus Rohrbach, Devi Parikh:
Cycle-Consistency for Robust Visual Question Answering. CoRR abs/1902.05660 (2019) - [i50]Ramakrishna Vedantam, Karan Desai, Stefan Lee, Marcus Rohrbach, Dhruv Batra, Devi Parikh:
Probabilistic Neural-symbolic Models for Interpretable Visual Question Answering. CoRR abs/1902.07864 (2019) - [i49]Arslan Chaudhry, Marcus Rohrbach, Mohamed Elhoseiny, Thalaiyasingam Ajanthan, Puneet Kumar Dokania, Philip H. S. Torr, Marc'Aurelio Ranzato:
Continual Learning with Tiny Episodic Memories. CoRR abs/1902.10486 (2019) - [i48]Satwik Kottur, José M. F. Moura, Devi Parikh, Dhruv Batra, Marcus Rohrbach:
CLEVR-Dialog: A Diagnostic Dataset for Multi-Round Reasoning in Visual Dialog. CoRR abs/1903.03166 (2019) - [i47]Yunpeng Chen, Haoqi Fan, Bing Xu, Zhicheng Yan, Yannis Kalantidis, Marcus Rohrbach, Shuicheng Yan, Jiashi Feng:
Drop an Octave: Reducing Spatial Redundancy in Convolutional Neural Networks with Octave Convolution. CoRR abs/1904.05049 (2019) - [i46]Amanpreet Singh, Vivek Natarajan, Meet Shah, Yu Jiang, Xinlei Chen, Dhruv Batra, Devi Parikh, Marcus Rohrbach:
Towards VQA Models that can Read. CoRR abs/1904.08920 (2019) - [i45]Chih-Yao Ma, Yannis Kalantidis, Ghassan AlRegib, Peter Vajda, Marcus Rohrbach, Zsolt Kira:
Learning to Generate Grounded Image Captions without Localization Supervision. CoRR abs/1906.00283 (2019) - [i44]Sayna Ebrahimi, Mohamed Elhoseiny, Trevor Darrell, Marcus Rohrbach:
Uncertainty-guided Continual Learning with Bayesian Neural Networks. CoRR abs/1906.02425 (2019) - [i43]Bingyi Kang, Saining Xie, Marcus Rohrbach, Zhicheng Yan, Albert Gordo, Jiashi Feng, Yannis Kalantidis:
Decoupling Representation and Classifier for Long-Tailed Recognition. CoRR abs/1910.09217 (2019) - [i42]Ronghang Hu, Amanpreet Singh, Trevor Darrell, Marcus Rohrbach:
Iterative Answer Prediction with Pointer-Augmented Multimodal Transformers for TextVQA. CoRR abs/1911.06258 (2019) - [i41]Jiasen Lu, Vedanuj Goswami, Marcus Rohrbach, Devi Parikh, Stefan Lee:
12-in-1: Multi-Task Vision and Language Representation Learning. CoRR abs/1912.02315 (2019) - 2018
- [c38]Mohamed Elhoseiny, Francesca Babiloni, Rahaf Aljundi, Marcus Rohrbach, Manohar Paluri, Tinne Tuytelaars:
Exploring the Challenges Towards Lifelong Fact Learning. ACCV (6) 2018: 66-84 - [c37]Dong Huk Park, Lisa Anne Hendricks, Zeynep Akata, Anna Rohrbach, Bernt Schiele, Trevor Darrell, Marcus Rohrbach:
Multimodal Explanations: Justifying Decisions and Pointing to the Evidence. CVPR 2018: 8779-8788 - [c36]Rahaf Aljundi, Francesca Babiloni, Mohamed Elhoseiny, Marcus Rohrbach, Tinne Tuytelaars:
Memory Aware Synapses: Learning What (not) to Forget. ECCV (3) 2018: 144-161 - [c35]Satwik Kottur, José M. F. Moura, Devi Parikh, Dhruv Batra, Marcus Rohrbach:
Visual Coreference Resolution in Visual Dialog Using Neural Module Networks. ECCV (15) 2018: 160-178 - [c34]Spandana Gella, Mike Lewis, Marcus Rohrbach:
A Dataset for Telling the Stories of Social Media Videos. EMNLP 2018: 968-974 - [i40]Dong Huk Park, Lisa Anne Hendricks, Zeynep Akata, Anna Rohrbach, Bernt Schiele, Trevor Darrell, Marcus Rohrbach:
Multimodal Explanations: Justifying Decisions and Pointing to the Evidence. CoRR abs/1802.08129 (2018) - [i39]Ji Zhang, Yannis Kalantidis, Marcus Rohrbach, Manohar Paluri, Ahmed M. Elgammal, Mohamed Elhoseiny:
Large-Scale Visual Relationship Understanding. CoRR abs/1804.10660 (2018) - [i38]Rahaf Aljundi, Marcus Rohrbach, Tinne Tuytelaars:
Selfless Sequential Learning. CoRR abs/1806.05421 (2018) - [i37]Yu Jiang, Vivek Natarajan, Xinlei Chen, Marcus Rohrbach, Dhruv Batra, Devi Parikh:
Pythia v0.1: the Winning Entry to the VQA Challenge 2018. CoRR abs/1807.09956 (2018) - [i36]Satwik Kottur, José M. F. Moura, Devi Parikh, Dhruv Batra, Marcus Rohrbach:
Visual Coreference Resolution in Visual Dialog using Neural Module Networks. CoRR abs/1809.01816 (2018) - [i35]Yunpeng Chen, Marcus Rohrbach, Zhicheng Yan, Shuicheng Yan, Jiashi Feng, Yannis Kalantidis:
Graph-Based Global Reasoning Networks. CoRR abs/1811.12814 (2018) - [i34]Arslan Chaudhry, Marc'Aurelio Ranzato, Marcus Rohrbach, Mohamed Elhoseiny:
Efficient Lifelong Learning with A-GEM. CoRR abs/1812.00420 (2018) - [i33]Jae Sung Park, Marcus Rohrbach, Trevor Darrell, Anna Rohrbach:
Adversarial Inference for Multi-Sentence Video Description. CoRR abs/1812.05634 (2018) - [i32]Luowei Zhou, Yannis Kalantidis, Xinlei Chen, Jason J. Corso, Marcus Rohrbach:
Grounded Video Description. CoRR abs/1812.06587 (2018) - [i31]Mohamed Elhoseiny, Francesca Babiloni, Rahaf Aljundi, Marcus Rohrbach, Manohar Paluri, Tinne Tuytelaars:
Exploring the Challenges towards Lifelong Fact Learning. CoRR abs/1812.10524 (2018) - 2017
- [j6]Anna Rohrbach, Atousa Torabi, Marcus Rohrbach, Niket Tandon, Christopher Joseph Pal, Hugo Larochelle, Aaron C. Courville, Bernt Schiele:
Movie Description. Int. J. Comput. Vis. 123(1): 94-120 (2017) - [j5]Mateusz Malinowski, Marcus Rohrbach, Mario Fritz:
Ask Your Neurons: A Deep Learning Approach to Visual Question Answering. Int. J. Comput. Vis. 125(1-3): 110-135 (2017) - [j4]Jeff Donahue, Lisa Anne Hendricks, Marcus Rohrbach, Subhashini Venugopalan, Sergio Guadarrama, Kate Saenko, Trevor Darrell:
Long-Term Recurrent Convolutional Networks for Visual Recognition and Description. IEEE Trans. Pattern Anal. Mach. Intell. 39(4): 677-691 (2017) - [c33]Subhashini Venugopalan, Lisa Anne Hendricks, Marcus Rohrbach, Raymond J. Mooney, Trevor Darrell, Kate Saenko:
Captioning Images with Diverse Objects. CVPR 2017: 1170-1178 - [c32]Anna Rohrbach, Marcus Rohrbach, Siyu Tang, Seong Joon Oh, Bernt Schiele:
Generating Descriptions with Grounded and Co-referenced People. CVPR 2017: 4196-4206 - [c31]Ronghang Hu, Marcus Rohrbach, Jacob Andreas, Trevor Darrell, Kate Saenko:
Modeling Relationships in Referential Expressions with Compositional Modular Networks. CVPR 2017: 4418-4427 - [c30]Ronghang Hu, Jacob Andreas, Marcus Rohrbach, Trevor Darrell, Kate Saenko:
Learning to Reason: End-to-End Module Networks for Visual Question Answering. ICCV 2017: 804-813 - [c29]Rakshith Shetty, Marcus Rohrbach, Lisa Anne Hendricks, Mario Fritz, Bernt Schiele:
Speaking the Same Language: Matching Machine to Human Captions by Adversarial Training. ICCV 2017: 4155-4164 - [i30]Rakshith Shetty, Marcus Rohrbach, Lisa Anne Hendricks, Mario Fritz, Bernt Schiele:
Speaking the Same Language: Matching Machine to Human Captions by Adversarial Training. CoRR abs/1703.10476 (2017) - [i29]Anna Rohrbach, Marcus Rohrbach, Siyu Tang, Seong Joon Oh, Bernt Schiele:
Generating Descriptions with Grounded and Co-Referenced People. CoRR abs/1704.01518 (2017) - [i28]Ronghang Hu, Jacob Andreas, Marcus Rohrbach, Trevor Darrell, Kate Saenko:
Learning to Reason: End-to-End Module Networks for Visual Question Answering. CoRR abs/1704.05526 (2017) - [i27]Dong Huk Park, Lisa Anne Hendricks, Zeynep Akata, Anna Rohrbach, Bernt Schiele, Trevor Darrell, Marcus Rohrbach:
Attentive Explanations: Justifying Decisions and Pointing to the Evidence (Extended Abstract). CoRR abs/1711.07373 (2017) - [i26]Rahaf Aljundi, Francesca Babiloni, Mohamed Elhoseiny, Marcus Rohrbach, Tinne Tuytelaars:
Memory Aware Synapses: Learning what (not) to forget. CoRR abs/1711.09601 (2017) - 2016
- [j3]Marcus Rohrbach, Anna Rohrbach, Michaela Regneri, Sikandar Amin, Mykhaylo Andriluka, Manfred Pinkal, Bernt Schiele:
Recognizing Fine-Grained and Composite Activities Using Hand-Centric Features and Script Data. Int. J. Comput. Vis. 119(3): 346-373 (2016) - [c28]Niket Tandon, Charles Hariman, Jacopo Urbani, Anna Rohrbach, Marcus Rohrbach, Gerhard Weikum:
Commonsense in Parts: Mining Part-Whole Relations from the Web and Image Tags. AAAI 2016: 243-250 - [c27]Lisa Anne Hendricks, Subhashini Venugopalan, Marcus Rohrbach, Raymond J. Mooney, Kate Saenko, Trevor Darrell:
Deep Compositional Captioning: Describing Novel Object Categories without Paired Training Data. CVPR 2016: 1-10 - [c26]Jacob Andreas, Marcus Rohrbach, Trevor Darrell, Dan Klein:
Neural Module Networks. CVPR 2016: 39-48 - [c25]Ronghang Hu, Huazhe Xu, Marcus Rohrbach, Jiashi Feng, Kate Saenko, Trevor Darrell:
Natural Language Object Retrieval. CVPR 2016: 4555-4564 - [c24]Lisa Anne Hendricks, Zeynep Akata, Marcus Rohrbach, Jeff Donahue, Bernt Schiele, Trevor Darrell:
Generating Visual Explanations. ECCV (4) 2016: 3-19 - [c23]Ronghang Hu, Marcus Rohrbach, Trevor Darrell:
Segmentation from Natural Language Expressions. ECCV (1) 2016: 108-124 - [c22]Anna Rohrbach, Marcus Rohrbach, Ronghang Hu, Trevor Darrell, Bernt Schiele:
Grounding of Textual Phrases in Images by Reconstruction. ECCV (1) 2016: 817-834 - [c21]Akira Fukui, Dong Huk Park, Daylen Yang, Anna Rohrbach, Trevor Darrell, Marcus Rohrbach:
Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding. EMNLP 2016: 457-468 - [c20]Vasili Ramanishka, Abir Das, Dong Huk Park, Subhashini Venugopalan, Lisa Anne Hendricks, Marcus Rohrbach, Kate Saenko:
Multimodal Video Description. ACM Multimedia 2016: 1092-1096 - [c19]Jacob Andreas, Marcus Rohrbach, Trevor Darrell, Dan Klein:
Learning to Compose Neural Networks for Question Answering. HLT-NAACL 2016: 1545-1554 - [i25]Jacob Andreas, Marcus Rohrbach, Trevor Darrell, Dan Klein:
Learning to Compose Neural Networks for Question Answering. CoRR abs/1601.01705 (2016) - [i24]Ronghang Hu, Marcus Rohrbach, Trevor Darrell:
Segmentation from Natural Language Expressions. CoRR abs/1603.06180 (2016) - [i23]Lisa Anne Hendricks, Zeynep Akata, Marcus Rohrbach, Jeff Donahue, Bernt Schiele, Trevor Darrell:
Generating Visual Explanations. CoRR abs/1603.08507 (2016) - [i22]Marcus Rohrbach:
Attributes as Semantic Units between Natural Language and Visual Recognition. CoRR abs/1604.03249 (2016) - [i21]Mateusz Malinowski, Marcus Rohrbach, Mario Fritz:
Ask Your Neurons: A Deep Learning Approach to Visual Question Answering. CoRR abs/1605.02697 (2016) - [i20]Anna Rohrbach, Atousa Torabi, Marcus Rohrbach, Niket Tandon, Christopher J. Pal, Hugo Larochelle, Aaron C. Courville, Bernt Schiele:
Movie Description. CoRR abs/1605.03705 (2016) - [i19]Akira Fukui, Dong Huk Park, Daylen Yang, Anna Rohrbach, Trevor Darrell, Marcus Rohrbach:
Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding. CoRR abs/1606.01847 (2016) - [i18]Subhashini Venugopalan, Lisa Anne Hendricks, Marcus Rohrbach, Raymond J. Mooney, Trevor Darrell, Kate Saenko:
Captioning Images with Diverse Objects. CoRR abs/1606.07770 (2016) - [i17]Ronghang Hu, Marcus Rohrbach, Subhashini Venugopalan, Trevor Darrell:
Utilizing Large Scale Vision and Text Datasets for Image Segmentation from Referring Expressions. CoRR abs/1608.08305 (2016) - [i16]Ronghang Hu, Marcus Rohrbach, Jacob Andreas, Trevor Darrell, Kate Saenko:
Modeling Relationships in Referential Expressions with Compositional Modular Networks. CoRR abs/1611.09978 (2016) - [i15]Dong Huk Park, Lisa Anne Hendricks, Zeynep Akata, Bernt Schiele, Trevor Darrell, Marcus Rohrbach:
Attentive Explanations: Justifying Decisions and Pointing to the Evidence. CoRR abs/1612.04757 (2016) - 2015
- [c18]Jeff Donahue, Lisa Anne Hendricks, Sergio Guadarrama, Marcus Rohrbach, Subhashini Venugopalan, Trevor Darrell, Kate Saenko:
Long-term recurrent convolutional networks for visual recognition and description. CVPR 2015: 2625-2634 - [c17]Anna Rohrbach, Marcus Rohrbach, Niket Tandon, Bernt Schiele:
A dataset for Movie Description. CVPR 2015: 3202-3212 - [c16]Anna Rohrbach, Marcus Rohrbach, Bernt Schiele:
The Long-Short Story of Movie Description. GCPR 2015: 209-221 - [c15]Mateusz Malinowski, Marcus Rohrbach, Mario Fritz:
Ask Your Neurons: A Neural-Based Approach to Answering Questions about Images. ICCV 2015: 1-9 - [c14]Damian Mrowca, Marcus Rohrbach, Judy Hoffman, Ronghang Hu, Kate Saenko, Trevor Darrell:
Spatial Semantic Regularisation for Large Scale Object Detection. ICCV 2015: 2003-2011 - [c13]Subhashini Venugopalan, Marcus Rohrbach, Jeffrey Donahue, Raymond J. Mooney, Trevor Darrell, Kate Saenko:
Sequence to Sequence - Video to Text. ICCV 2015: 4534-4542 - [c12]Subhashini Venugopalan, Huijuan Xu, Jeff Donahue, Marcus Rohrbach, Raymond J. Mooney, Kate Saenko:
Translating Videos to Natural Language Using Deep Recurrent Neural Networks. HLT-NAACL 2015: 1494-1504 - [i14]Anna Rohrbach, Marcus Rohrbach, Niket Tandon, Bernt Schiele:
A Dataset for Movie Description. CoRR abs/1501.02530 (2015) - [i13]Marcus Rohrbach, Anna Rohrbach, Michaela Regneri, Sikandar Amin, Mykhaylo Andriluka, Manfred Pinkal, Bernt Schiele:
Recognizing Fine-Grained and Composite Activities using Hand-Centric Features and Script Data. CoRR abs/1502.06648 (2015) - [i12]Subhashini Venugopalan, Marcus Rohrbach, Jeff Donahue, Raymond J. Mooney, Trevor Darrell, Kate Saenko:
Sequence to Sequence - Video to Text. CoRR abs/1505.00487 (2015) - [i11]Mateusz Malinowski, Marcus Rohrbach, Mario Fritz:
Ask Your Neurons: A Neural-based Approach to Answering Questions about Images. CoRR abs/1505.01121 (2015) - [i10]Huijuan Xu, Subhashini Venugopalan, Vasili Ramanishka, Marcus Rohrbach, Kate Saenko:
A Multi-scale Multiple Instance Video Description Network. CoRR abs/1505.05914 (2015) - [i9]Anna Rohrbach, Marcus Rohrbach, Bernt Schiele:
The Long-Short Story of Movie Description. CoRR abs/1506.01698 (2015) - [i8]Damian Mrowca, Marcus Rohrbach, Judy Hoffman, Ronghang Hu, Kate Saenko, Trevor Darrell:
Spatial Semantic Regularisation for Large Scale Object Detection. CoRR abs/1510.02949 (2015) - [i7]Jacob Andreas, Marcus Rohrbach, Trevor Darrell, Dan Klein:
Deep Compositional Question Answering with Neural Module Networks. CoRR abs/1511.02799 (2015) - [i6]Anna Rohrbach, Marcus Rohrbach, Ronghang Hu, Trevor Darrell, Bernt Schiele:
Grounding of Textual Phrases in Images by Reconstruction. CoRR abs/1511.03745 (2015) - [i5]Ronghang Hu, Huazhe Xu, Marcus Rohrbach, Jiashi Feng, Kate Saenko, Trevor Darrell:
Natural Language Object Retrieval. CoRR abs/1511.04164 (2015) - [i4]Lisa Anne Hendricks, Subhashini Venugopalan, Marcus Rohrbach, Raymond J. Mooney, Kate Saenko, Trevor Darrell:
Deep Compositional Captioning: Describing Novel Object Categories without Paired Training Data. CoRR abs/1511.05284 (2015) - 2014
- [b1]Marcus Rohrbach:
Combining visual recognition and computational linguistics : linguistic knowledge for visual recognition and natural language descriptions of visual content. Saarland University, 2014 - [c11]Anna Rohrbach, Marcus Rohrbach, Wei Qiu, Annemarie Friedrich, Manfred Pinkal, Bernt Schiele:
Coherent Multi-sentence Video Description with Variable Level of Detail. GCPR 2014: 184-195 - [i3]Anna Senina, Marcus Rohrbach, Wei Qiu, Annemarie Friedrich, Sikandar Amin, Mykhaylo Andriluka, Manfred Pinkal, Bernt Schiele:
Coherent Multi-Sentence Video Description with Variable Level of Detail. CoRR abs/1403.6173 (2014) - [i2]Jeff Donahue, Lisa Anne Hendricks, Sergio Guadarrama, Marcus Rohrbach, Subhashini Venugopalan, Kate Saenko, Trevor Darrell:
Long-term Recurrent Convolutional Networks for Visual Recognition and Description. CoRR abs/1411.4389 (2014) - [i1]Subhashini Venugopalan, Huijuan Xu, Jeff Donahue, Marcus Rohrbach, Raymond J. Mooney, Kate Saenko:
Translating Videos to Natural Language Using Deep Recurrent Neural Networks. CoRR abs/1412.4729 (2014) - 2013
- [j2]Michaela Regneri, Marcus Rohrbach, Dominikus Wetzel, Stefan Thater, Bernt Schiele, Manfred Pinkal:
Grounding Action Descriptions in Videos. Trans. Assoc. Comput. Linguistics 1: 25-36 (2013) - [c10]Sikandar Amin, Mykhaylo Andriluka, Marcus Rohrbach, Bernt Schiele:
Multi-view Pictorial Structures for 3D Human Pose Estimation. BMVC 2013 - [c9]Marcus Rohrbach, Wei Qiu, Ivan Titov, Stefan Thater, Manfred Pinkal, Bernt Schiele:
Translating Video Content to Natural Language Descriptions. ICCV 2013: 433-440 - [c8]Marcus Rohrbach, Sandra Ebert, Bernt Schiele:
Transfer Learning in a Transductive Setting. NIPS 2013: 46-54 - 2012
- [c7]Marcus Rohrbach, Sikandar Amin, Mykhaylo Andriluka, Bernt Schiele:
A database for fine grained activity detection of cooking activities. CVPR 2012: 1194-1201 - [c6]Wandi Susanto, Marcus Rohrbach, Bernt Schiele:
3D Object Detection with Multiple Kinects. ECCV Workshops (2) 2012: 93-102 - [c5]Marcus Rohrbach, Michaela Regneri, Mykhaylo Andriluka, Sikandar Amin, Manfred Pinkal, Bernt Schiele:
Script Data for Attribute-Based Recognition of Composite Activities. ECCV (1) 2012: 144-157 - 2011
- [j1]Christoph Gustav Keller, Markus Enzweiler, Marcus Rohrbach, David Fernández Llorca, Christoph Schnörr, Dariu M. Gavrila:
The Benefits of Dense Stereo for Pedestrian Detection. IEEE Trans. Intell. Transp. Syst. 12(4): 1096-1106 (2011) - [c4]Marcus Rohrbach, Michael Stark, Bernt Schiele:
Evaluating knowledge transfer and zero-shot learning in a large-scale setting. CVPR 2011: 1641-1648 - 2010
- [c3]Marcus Rohrbach, Michael Stark, György Szarvas, Iryna Gurevych, Bernt Schiele:
What helps where - and why? Semantic relatedness for knowledge transfer. CVPR 2010: 910-917 - [c2]Marcus Rohrbach, Michael Stark, György Szarvas, Bernt Schiele:
Combining Language Sources and Robust Semantic Relatedness for Attribute-Based Knowledge Transfer. ECCV Workshops (1) 2010: 15-28
2000 – 2009
- 2009
- [c1]Marcus Rohrbach, Markus Enzweiler, Dariu M. Gavrila:
High-Level Fusion of Depth and Intensity for Pedestrian Classification. DAGM-Symposium 2009: 101-110
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-23 21:21 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint