idegen/csts
General NLP, English, cc-by-4.0
idegen/csts is an English-language dataset tagged General NLP and distributed in Parquet format under the cc-by-4.0 license. It falls in the 100M<n<1B size category and has been downloaded about 18.5K times.
About idegen/csts
CSTS - Correlation Structures in Time Series
Repository: https://github.com/isabelladegen/corrclust-validation
Paper: https://arxiv.org/abs/2505.14596
Demo: https://colab.research.google.com/github/isabelladegen/corrclust-validation/bl...
Details
- Task: General NLP
- Language: English
- Format: Parquet
- Rows / instances: N/A
- Size: 100M<n<1B
- Creator: idegen
- Year: 2026
- License: cc-by-4.0
- Downloads: 18521
- Likes: 0