Representation Learning for Vision & Language

The performance of deep learning models depends on strong neural architecture design: models learn latent representations from data through transformation functions such as convolutions and fully connected layers. In our lab, we develop new transformations and neural architectures that allow models to learn richer representations across different domains. A particular focus of our group is lightweight, power-efficient architectures that generalize well and run on edge devices such as mobile phones.
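
To make the kind of transformation in question concrete, the sketch below shows one widely used lightweight building block, a depthwise-separable convolution, which factorizes a standard KxK convolution into a per-channel spatial filter followed by a 1x1 channel mixer, cutting parameters and multiply-adds by roughly a factor of K^2. This is a minimal PyTorch illustration of the general technique, not a design from our lab; the class name and the channel/kernel choices in the demo are assumptions made for the example.

```python
import torch
import torch.nn as nn


class DepthwiseSeparableConv(nn.Module):
    """Illustrative lightweight block: depthwise KxK conv (one filter per
    input channel) followed by a 1x1 pointwise conv that mixes channels."""

    def __init__(self, in_channels: int, out_channels: int, kernel_size: int = 3):
        super().__init__()
        padding = kernel_size // 2
        # Depthwise: spatial filtering only, groups=in_channels keeps
        # each filter restricted to a single input channel.
        self.depthwise = nn.Conv2d(
            in_channels, in_channels, kernel_size,
            padding=padding, groups=in_channels, bias=False,
        )
        # Pointwise: 1x1 convolution mixes information across channels.
        self.pointwise = nn.Conv2d(in_channels, out_channels, 1, bias=False)
        self.bn = nn.BatchNorm2d(out_channels)
        self.act = nn.ReLU(inplace=True)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.act(self.bn(self.pointwise(self.depthwise(x))))


if __name__ == "__main__":
    # Compare parameter counts against a standard 3x3 convolution.
    standard = nn.Conv2d(64, 128, 3, padding=1, bias=False)
    separable = DepthwiseSeparableConv(64, 128)
    n_std = sum(p.numel() for p in standard.parameters())
    n_sep = sum(p.numel() for p in separable.parameters())
    print(f"standard 3x3: {n_std} params, separable: {n_sep} params")
    y = separable(torch.randn(1, 64, 32, 32))
    print(y.shape)  # torch.Size([1, 128, 32, 32])
```

The parameter comparison in the demo (73,728 weights for the standard convolution versus roughly 9,000 for the separable block at these sizes) is what makes blocks of this kind attractive for power-constrained edge hardware.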

Related Publications