tomg-group-umd/cinepile

Visual Question Answering, Video Text To Text, EN

tomg-group-umd/cinepile is an English-language (EN) visual question answering dataset distributed in Parquet format.

About tomg-group-umd/cinepile

CinePile: A Long Video Question Answering Dataset and Benchmark

CinePile is a question-answering-based, long-form video understanding dataset. It has been created using advanced large language models (LLMs) with human-in-the-loop pipeline lever...

Details

Task
Visual Question Answering, Video Text To Text
Language
EN
Format
Parquet
Rows / instances
N/A
Creator
tomg-group-umd
Year
2024
