TeraflopAI/SEC-EDGAR
Text GenerationText ClassificationEN
The TeraflopAI/SEC-EDGAR dataset is a EN text generation resource from TeraflopAI at 2025.
About TeraflopAI/SEC-EDGAR
Datamule, Teraflop AI, and Eventual collaborated to release the SEC-EDGAR dataset.
The dataset contains 590 gbs of data, spanning 8 million samples and 43 billion tokens from all major filings in the SEC EDGAR database.
The bulk data was collect...
Details
- Task
- Text Generation, Text Classification
- Language
- EN
- Format
- Parquet
- Rows / instances
- N/A
- Creator
- TeraflopAI
- Year
- 2025