Skip to content

nyuuzyou/google-code-archive

Text GenerationCODE, ENBenchmark

Created by nyuuzyou at 2026, the nyuuzyou/google-code-archive is a text generation benchmark dataset in CODE, EN in Parquet format.

📊 This dataset is used as an LLM benchmark. See model leaderboards →

About nyuuzyou/google-code-archive

Google Code Archive Dataset Dataset Description This dataset was compiled from the Google Code Archive, a preserved snapshot of projects hosted on Google Code, Google's open-source project hosting service that operated from 2006 to 2...

Details

Task
Text Generation
Language
CODE, EN
Format
Parquet
Rows / instances
N/A
Creator
nyuuzyou
Year
2026
Download

Related Text Generation datasets

FAQ