Xkev/LLaVA-CoT-100k
Visual Question Answering, Image Text To Text, EN
Created by Xkev in 2024, Xkev/LLaVA-CoT-100k is an English (EN) visual question answering dataset distributed in Parquet format.
About Xkev/LLaVA-CoT-100k
Dataset Card for LLaVA-CoT
The LLaVA-CoT-100k dataset was introduced in the paper "LLaVA-CoT: Let Vision Language Models Reason Step-by-Step". This dataset is designed to enable Vision-Language Models (VLMs) to perform autonomous multistage reason...
Details
- Task: Visual Question Answering, Image Text To Text
- Language: EN
- Format: Parquet
- Rows / instances: N/A
- Creator: Xkev
- Year: 2024