Skip to content

lmms-lab/LLaVA-Video-178K

Visual Question AnsweringVideo Text To TextEN

Lmms-lab/LLaVA-Video-178K is a visual question answering-focused dataset in EN distributed in Parquet format.

About lmms-lab/LLaVA-Video-178K

Dataset Card for LLaVA-Video-178K Uses This dataset is used for the training of the LLaVA-Video model. We only allow the use of this dataset for academic research and education purpose. For OpenAI GPT-4 generated data, we recommend t...

Details

Task
Visual Question Answering, Video Text To Text
Language
EN
Format
Parquet
Rows / instances
N/A
Creator
lmms-lab
Year
2024
Download

Related Visual Question Answering, Video Text To Text datasets

FAQ