This repo provides the resources for our paper, which aims to 1) introduce a new benchmark for the multimodal-query retrieval task; 2) build an end-to-end multimodal retriever along with a multimodal pretraining task. Check out our paper for more details.
A dataset for multimodal-query retrieval. We turned WebQA into a multimodal-query retrieval task by augmenting the WebQA questions with images, forming new multimodal queries paired with a large text-based corpus.
You can download the data from this link.
We save the images in TSV format for storage efficiency. See the read_tsv_img.ipynb notebook for how to open an image.