Question 1

What is the ByteDance/MTVQA dataset?

Accepted Answer

Dataset Card

The dataset is oriented toward visual question answering of multilingual text scenes in nine languages, including Korean, Japanese, Italian, Russian, Deutsch, French, Thai, Arabic, and Vietnamese. The question-answer pairs are labe...

Question 2

Is ByteDance/MTVQA a benchmark?

Accepted Answer

ByteDance/MTVQA is a dataset for training or evaluation; it isn't tracked as a standard LLM benchmark in our catalog.

Question 3

Where can I download ByteDance/MTVQA?

Accepted Answer

ByteDance/MTVQA is available at its source: https://huggingface.co/datasets/ByteDance/MTVQA.

Question 4

What license is ByteDance/MTVQA released under?

Accepted Answer

ByteDance/MTVQA is distributed under the cc-by-nc-4.0 license.

ByteDance/MTVQA

About ByteDance/MTVQA

Details

Related Visual Question Answering, Image To Text datasets

FAQ