ScienceOne-AI/S1-MMAlign
Image To Text, Visual Question Answering, Feature Extraction, EN
ScienceOne-AI/S1-MMAlign is an English-language dataset focused on image-to-text tasks, distributed in Parquet format.
About ScienceOne-AI/S1-MMAlign
S1-MMAlign
A Large-Scale Multi-Disciplinary Scientific Multimodal Dataset
S1-MMAlign is a large-scale, multi-disciplinary multimodal dataset comprising over 15.5 million high-quality image-text pairs derived from 2.5 million open-access scient...
Details
- Task: Image To Text, Visual Question Answering, Feature Extraction
- Language: EN
- Format: Parquet
- Rows / instances: N/A
- Creator: ScienceOne-AI
- Year: 2025