idegen/csts

General NLP | English | cc-by-4.0

The idegen/csts dataset is an English-language dataset tagged General NLP and distributed in Parquet format under the cc-by-4.0 license. It falls in the 100M<n<1B size category and has been downloaded 18.5K times.

About idegen/csts

CSTS - Correlation Structures in Time Series
Repository: https://github.com/isabelladegen/corrclust-validation
Paper: https://arxiv.org/abs/2505.14596
Demo: https://colab.research.google.com/github/isabelladegen/corrclust-validation/bl...

Details

Task: General NLP
Language: English
Format: Parquet
Rows / instances: N/A
Size: 100M<n<1B
Creator: idegen
Year: 2026
License: cc-by-4.0
Downloads: 18521
Likes: 0