arxiv:2112.12777

Cross Modal Retrieval with Querybank Normalisation

Published on Dec 23, 2021

Authors:

Simion-Vlad Bogolin,

Ioana Croitoru,

Abstract

Profiting from large-scale training datasets, advances in neural architecture design and efficient inference, joint embeddings have become the dominant approach for tackling cross-modal retrieval. In this work, we first show that, despite their effectiveness, state-of-the-art joint embeddings suffer significantly from the longstanding "hubness problem", in which a small number of gallery embeddings form the nearest neighbours of many queries. Drawing inspiration from the NLP literature, we formulate a simple but effective framework called Querybank Normalisation (QB-Norm) that re-normalises query similarities to account for hubs in the embedding space. QB-Norm improves retrieval performance without requiring retraining. Unlike prior work, we show that QB-Norm works effectively without concurrent access to any test set queries. Within the QB-Norm framework, we also propose a novel similarity normalisation method, the Dynamic Inverted Softmax, that is significantly more robust than existing approaches. We showcase QB-Norm across a range of cross-modal retrieval models and benchmarks, where it consistently enhances strong baselines beyond the state of the art. Code is available at https://vladbogo.github.io/QB-Norm/.
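To make the idea of re-normalising similarities with a querybank more concrete, here is a minimal pure-Python sketch. It is not the authors' implementation (that is at the URL above), and the abstract does not give the formula: the exact normalisation, the activation rule, the `beta` value, the function names and the toy similarity matrices are all assumptions made for illustration. The sketch counts how often each gallery item is the top-1 match (a simple view of hubness), then divides exponentiated similarities by a per-gallery term computed from a separate bank of queries, but only for test queries whose top-1 match is a gallery item that the querybank also retrieves.

```python
import math


def top1(row):
    """Index of the largest similarity in a row."""
    return max(range(len(row)), key=row.__getitem__)


def hub_counts(sims, n_gallery):
    """Count how often each gallery item is the top-1 match across queries."""
    counts = [0] * n_gallery
    for row in sims:
        counts[top1(row)] += 1
    return counts


def dynamic_inverted_softmax(test_sims, bank_sims, beta=20.0):
    """Re-normalise test similarities using a querybank (illustrative only).

    test_sims: rows are test queries, columns are gallery items.
    bank_sims: rows are querybank queries, columns are gallery items.
    beta: inverse temperature; 20.0 is an arbitrary choice for this toy example.
    """
    n_gallery = len(bank_sims[0])
    # Per-gallery normaliser: sum over querybank queries of exp(beta * sim).
    # Gallery items that are close to many querybank queries get a large value.
    norm = [sum(math.exp(beta * row[j]) for row in bank_sims)
            for j in range(n_gallery)]
    # Assumed activation set: gallery items that are top-1 for some bank query.
    active = {top1(row) for row in bank_sims}
    out = []
    for row in test_sims:
        if top1(row) in active:
            # The query currently lands on a likely hub: apply the normalisation.
            out.append([math.exp(beta * s) / norm[j] for j, s in enumerate(row)])
        else:
            # Otherwise leave the similarities untouched.
            out.append(list(row))
    return out


# Toy data: gallery item 0 behaves like a hub for the querybank.
bank_sims = [[0.90, 0.20, 0.10],
             [0.80, 0.30, 0.20],
             [0.85, 0.10, 0.30]]
test_sims = [[0.70, 0.65, 0.10],
             [0.75, 0.20, 0.72],
             [0.10, 0.90, 0.20]]

normed = dynamic_inverted_softmax(test_sims, bank_sims)
print(hub_counts(test_sims, 3))  # [2, 1, 0]
print(hub_counts(normed, 3))     # [0, 2, 1]
```

In this toy setup, the first two test queries initially retrieve the hub (item 0) and switch to other items after normalisation, while the third query is left unchanged because its top-1 match is not in the assumed activation set. Note that the test queries are never used to build the normaliser, which mirrors the abstract's point about not needing concurrent access to test set queries.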

Community

vladbogo

Paper author Jan 28

@librarian-bot recommend

librarian-bot

Jan 28

This is an automated message from the Librarian Bot. I found the following papers similar to this paper.

The following papers were recommended by the Semantic Scholar API

Please give a thumbs up to this comment if you found it helpful!

If you want recommendations for any paper on Hugging Face, check out this Space

You can directly ask Librarian Bot for paper recommendations by tagging it in a comment: @librarian-bot recommend

Debleena

Feb 1 (edited Feb 1)

@librarian-bot recommend

vladbogo

Paper author Feb 1

@davanstrien I noticed that the @librarian-bot didn't respond to the second request for recommendations. I suspect this happens because there was a response already.

If this is the case, I think ideally it should respond with a link to the other message. I would love to contribute to the project and tackle this case. I would really appreciate it if you could point me to the repo, since I wasn't able to find it.
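The behaviour suggested here could be sketched roughly as follows. This is purely hypothetical: the `Comment` class, `plan_reply`, the `recommend` callable and the example URL are invented for illustration and do not reflect how librarian-bot is actually implemented.

```python
from dataclasses import dataclass

BOT = "librarian-bot"


@dataclass
class Comment:
    author: str
    url: str   # hypothetical permalink to the comment
    body: str


def plan_reply(thread, recommend):
    """Decide what the bot would post for a new recommendation request.

    thread: existing comments on the paper page, oldest first.
    recommend: callable returning the recommendation text (hypothetical).
    """
    for comment in thread:
        if comment.author == BOT:
            # The bot already answered on this paper: link to that comment
            # instead of silently ignoring the new request.
            return f"Recommendations were already posted for this paper: {comment.url}"
    return recommend()


earlier = Comment(BOT, "https://example.invalid/comment/1", "I found the following papers...")
print(plan_reply([], lambda: "fresh recommendations"))
print(plan_reply([earlier], lambda: "fresh recommendations"))
```

A short pointer like this would keep the page from filling with repeated recommendation lists while still telling the requester why no new list appeared.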

umairbinmansoor

Feb 2 (edited Feb 2)

Can you find papers on the early detection of sepsis? @librarian-bot recommend

davanstrien

Feb 2

@vladbogo

Most of the code is here: https://huggingface.co/spaces/librarian-bots/recommend_similar_papers

I would prefer to wait a bit before having librarian-bot respond with more comments, since I'd rather it not be too noisy on papers. We might add threading to paper comments soon, which would make it possible to reply to a user in a separate thread. At that point, I think it would make sense to allow multiple requests to librarian-bot, both to account for new papers coming out and to give users a better message about what's not working. In the meantime, I will reword the bot's comments to make it clear that librarian-bot currently makes only one comment per paper.

cc @julien-c @pierric