Question 1

What is the ACL Anthology Reference Corpus (ACL ARC) dataset?

Accepted Answer

Dataset contains 10,921 articles from the February 2007 snapshot of the Anthology; text and metadata for the articles were extracted, consisting of BibTeX records derived either from the headers of each paper or from metadata taken from the Anthology website.

Question 2

Is ACL Anthology Reference Corpus (ACL ARC) a benchmark?

Accepted Answer

Yes — ACL Anthology Reference Corpus (ACL ARC) is used as an LLM benchmark. See model leaderboards in the Benchmarks section.

Question 3

Where can I download ACL Anthology Reference Corpus (ACL ARC)?

Accepted Answer

ACL Anthology Reference Corpus (ACL ARC) is available at its source: https://web.eecs.umich.edu/~lahiri/acl_arc.html.

ACL Anthology Reference Corpus (ACL ARC)

About ACL Anthology Reference Corpus (ACL ARC)

Details

Related Text Corpora datasets

FAQ