Publications

Please also see my Google Scholar author page.

Also see the code and datasets page.

2016

Venugopalan, Subhashini; Hendricks, Lisa Anne; Rohrbach, Marcus; Mooney, Raymond; Darrell, Trevor; Saenko, Kate

Captioning Images with Diverse Objects Journal Article

arXiv:1606.07770, 2016.

Fukui, Akira; Park, Dong Huk; Yang, Daylen; Rohrbach, Anna; Darrell, Trevor; Rohrbach, Marcus

Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding Journal Article

arXiv:1606.01847, 2016.

Malinowski, Mateusz; Rohrbach, Marcus; Fritz, Mario

Ask Your Neurons: A Deep Learning Approach to Visual Question Answering Journal Article

arXiv:1605.02697, 2016.

Rohrbach, Marcus

Attributes as Semantic Units between Natural Language and Visual Recognition Journal Article

arXiv:1604.03249, 2016.

Hendricks, Lisa Anne; Akata, Zeynep; Rohrbach, Marcus; Donahue, Jeff; Schiele, Bernt; Darrell, Trevor

Generating Visual Explanations Inproceedings

Proceedings of the European Conference on Computer Vision (ECCV), 2016.

Hu, Ronghang; Rohrbach, Marcus; Darrell, Trevor

Segmentation from Natural Language Expressions Inproceedings

Proceedings of the European Conference on Computer Vision (ECCV), 2016.

Rohrbach, Anna; Torabi, Atousa; Rohrbach, Marcus; Tandon, Niket; Pal, Christopher; Larochelle, Hugo; Courville, Aaron; Schiele, Bernt

Movie Description Journal Article

arXiv:1605.03705, 2016.

Andreas, Jacob; Rohrbach, Marcus; Darrell, Trevor; Klein, Dan

Learning to Compose Neural Networks for Question Answering Inproceedings

Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL), 2016, (Best paper award).

Tandon, Niket; Hariman, Charles; Urbani, Jacopo; Rohrbach, Anna; Rohrbach, Marcus; Weikum, Gerhard

Commonsense in Parts: Mining Part-Whole Relations from the Web and Image Tags Inproceedings

AAAI Conference on Artificial Intelligence (AAAI), 2016.

Hu, Ronghang; Xu, Huazhe; Rohrbach, Marcus; Feng, Jiashi; Saenko, Kate; Darrell, Trevor

Natural Language Object Retrieval Inproceedings

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016, (Oral).

Hendricks, Lisa Anne; Venugopalan, Subhashini; Rohrbach, Marcus; Mooney, Raymond; Saenko, Kate; Darrell, Trevor

Deep Compositional Captioning: Describing Novel Object Categories without Paired Training Data Inproceedings

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016, (Oral).

Andreas, Jacob; Rohrbach, Marcus; Darrell, Trevor; Klein, Dan

Neural Module Networks Inproceedings

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016, (Oral).

Rohrbach, Anna; Rohrbach, Marcus; Hu, Ronghang; Darrell, Trevor; Schiele, Bernt

Grounding of Textual Phrases in Images by Reconstruction Inproceedings

Proceedings of the European Conference on Computer Vision (ECCV), 2016.

2015

Mrowca, Damian; Rohrbach, Marcus; Hoffman, Judy; Hu, Ronghang; Saenko, Kate; Darrell, Trevor

Spatial Semantic Regularisation for Large Scale Object Detection Inproceedings

Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2015.

Xu, Huijuan; Venugopalan, Subhashini; Ramanishka, Vasili; Rohrbach, Marcus; Saenko, Kate

A Multi-scale Multiple Instance Video Description Network Inproceedings

ICCV Workshop on Closing the Loop between Vision and Language, 2015.

Venugopalan, Subhashini; Rohrbach, Marcus; Donahue, Jeff; Mooney, Raymond; Darrell, Trevor; Saenko, Kate

Sequence to Sequence – Video to Text Inproceedings

Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2015.

Malinowski, Mateusz; Rohrbach, Marcus; Fritz, Mario

Ask Your Neurons: A Neural-based Approach to Answering Questions about Images Inproceedings

Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2015, (Oral).

Donahue, Jeff; Hendricks, Lisa Anne; Guadarrama, Sergio; Rohrbach, Marcus; Venugopalan, Subhashini; Saenko, Kate; Darrell, Trevor

Long-term Recurrent Convolutional Networks for Visual Recognition and Description Inproceedings

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015, (Oral).

Rohrbach, Anna; Rohrbach, Marcus; Schiele, Bernt

The Long-Short Story of Movie Description Inproceedings

Proceedings of the German Conference on Pattern Recognition (GCPR), 2015, (Oral, Best Paper Honorable Mention).

Venugopalan, Subhashini; Xu, Huijuan; Donahue, Jeff; Rohrbach, Marcus; Mooney, Raymond; Saenko, Kate

Translating Videos to Natural Language Using Deep Recurrent Neural Networks Inproceedings

Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL), 2015, (Oral).

Rohrbach, Anna; Rohrbach, Marcus; Tandon, Niket; Schiele, Bernt

A Dataset for Movie Description Inproceedings

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015.

2014

Rohrbach, Marcus

Combining Visual Recognition and Computational Linguistics: Linguistic Knowledge for Visual Recognition and Natural Language Descriptions of Visual Content PhD Thesis

Saarland University, 2014.

Rohrbach, Anna; Rohrbach, Marcus; Qiu, Wei; Friedrich, Annemarie; Pinkal, Manfred; Schiele, Bernt

Coherent Multi-Sentence Video Description with Variable Level of Detail Inproceedings

Proceedings of the German Conference on Pattern Recognition (GCPR), 2014, (Oral).

2013

Rohrbach, Marcus; Ebert, Sandra; Schiele, Bernt

Transfer Learning in a Transductive Setting Inproceedings

Advances in Neural Information Processing Systems (NIPS), 2013.

Rohrbach, Marcus; Qiu, Wei; Titov, Ivan; Thater, Stefan; Pinkal, Manfred; Schiele, Bernt

Translating Video Content to Natural Language Descriptions Inproceedings

Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2013.

Amin, Sikandar; Andriluka, Mykhaylo; Rohrbach, Marcus; Schiele, Bernt

Multi-view Pictorial Structures for 3D Human Pose Estimation Inproceedings

Proceedings of the British Machine Vision Conference (BMVC), 2013, (Oral).

2012

Susanto, Wandi; Rohrbach, Marcus; Schiele, Bernt

3D Object Detection with Multiple Kinects Inproceedings

Proceedings of the European Conference on Computer Vision Workshops (ECCV Workshops), Springer, 2012.

Rohrbach, Marcus; Amin, Sikandar; Andriluka, Mykhaylo; Schiele, Bernt

A Database for Fine Grained Activity Detection of Cooking Activities Inproceedings

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2012.

Rohrbach, Marcus; Regneri, Michaela; Andriluka, Mykhaylo; Amin, Sikandar; Pinkal, Manfred; Schiele, Bernt

Script Data for Attribute-based Recognition of Composite Activities Inproceedings

Proceedings of the European Conference on Computer Vision (ECCV), 2012.

2011

Keller, Christoph G.; Enzweiler, Markus; Rohrbach, Marcus; Llorca, David Fernández; Schnörr, Christoph; Gavrila, Dariu M.

The Benefits of Dense Stereo for Pedestrian Detection Journal Article

IEEE Transactions on Intelligent Transportation Systems, 12, 2011, ISSN: 1524-9050.

Rohrbach, Marcus; Stark, Michael; Schiele, Bernt

Evaluating Knowledge Transfer and Zero-Shot Learning in a Large-Scale Setting Inproceedings

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2011.

2010

Rohrbach, Marcus; Stark, Michael; Szarvas, György; Schiele, Bernt

Combining Language Sources and Robust Semantic Relatedness for Attribute-based Knowledge Transfer Inproceedings

Proceedings of the European Conference on Computer Vision Workshops (ECCV Workshops), Springer, 2010.

Rohrbach, Marcus; Stark, Michael; Szarvas, György; Gurevych, Iryna; Schiele, Bernt

What Helps Where – And Why? Semantic Relatedness for Knowledge Transfer Inproceedings

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2010, (Oral).

2009

Rohrbach, Marcus; Enzweiler, Markus; Gavrila, Dariu M.

High-Level Fusion of Depth and Intensity for Pedestrian Classification Inproceedings

Proceedings of the DAGM Symposium (DAGM), 2009, (Oral).

0000

Donahue, Jeff; Hendricks, Lisa Anne; Rohrbach, Marcus; Venugopalan, Subhashini; Guadarrama, Sergio; Saenko, Kate; Darrell, Trevor

Long-term Recurrent Convolutional Networks for Visual Recognition and Description Journal Article

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 0000.
