Skip to content

BuGL

Text CorporaEnglish

BuGL is a text corpora-focused dataset in English that provides 10,187 labeled examples distributed in JSON, Xlsx format.

About BuGL

Dataset consists of 54 GitHub projects of four different programming languages namely C, C++, Java and Python with around 10,187 issues.

Details

Task
Text Corpora
Language
English
Format
JSON, Xlsx
Rows / instances
10,187
Creator
muvvasandeep
Year
2020
Download Paper

Related Text Corpora datasets

FAQ