Question 1

What is the TextVQA dataset?

Accepted Answer

TextVQA is a dataset that requires models to read and reason about text in images in order to answer questions about them. Specifically, models must treat the text that appears in an image as an additional modality and reason over it to answer TextVQA questions.

Question 2

Is TextVQA a benchmark?

Accepted Answer

TextVQA is a dataset for training or evaluation; it isn't tracked as a standard LLM benchmark in our catalog.

Question 3

Where can I download TextVQA?

Accepted Answer

TextVQA is available at its source: https://textvqa.org/dataset.
