Skip to content

HuggingFaceM4/DoclingMatix

Visual Question AnsweringImage Text To TextEN

Created by HuggingFaceM4 at 2025, the HuggingFaceM4/DoclingMatix is a visual question answering dataset in EN in Parquet format. With 1.9K downloads and 52 likes, it is actively used by the community. It is released under the cdla-permissive-2.0 license and is a 1M<n<10M-scale dataset.

About HuggingFaceM4/DoclingMatix

DoclingMatix DoclingMatix is a large-scale, multimodal dataset designed for training vision-language models in the domain of document intelligence. It was created specifically for training the SmolDocling model, an ultra-compact model for end-t...

Details

Task
Visual Question Answering, Image Text To Text
Language
EN
Format
Parquet
Rows / instances
N/A
Size
1M<n<10M
Creator
HuggingFaceM4
Year
2025
License
cdla-permissive-2.0
Downloads
1929
Likes
52
Download Homepage

Related Visual Question Answering, Image Text To Text datasets

FAQ