AI45Research/ATBench
General NLP | English | Benchmark | apache-2.0
AI45Research/ATBench is an English-language benchmark dataset for general NLP, distributed in Parquet format under the apache-2.0 license. It falls in the 1K<n<10K size category and has been downloaded about 2.6K times.
📊 This dataset is used as an LLM benchmark.
About AI45Research/ATBench
ATBench: Agent Trajectory Safety Benchmark Family
💻 GitHub | 📄 ATBench Paper | 📄 AgentDoG Paper (ATBench500) | 🤗 Hugging Face Collection
ATBench is a family of trajectory-level safety benchmarks for long-horizon, tool-...
Details
- Task: General NLP
- Language: English
- Format: Parquet
- Rows / instances: N/A
- Size: 1K<n<10K
- Creator: AI45Research
- Year: 2026
- License: apache-2.0
- Downloads: 2553
- Likes: 39
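
The details above list the format as Parquet and the size as 1K<n<10K, but give no row count. A minimal sketch of how one might load the data and check the row count against that size category follows. It assumes the Hugging Face `datasets` library is installed and that the repository loads with `load_dataset`. The helper names `load_atbench` and `in_size_category` are hypothetical, and the available configuration names and splits are not stated on this page, so they are left as optional parameters.

```python
REPO_ID = "AI45Research/ATBench"  # repository id as given on this page


def in_size_category(num_rows, low=1_000, high=10_000):
    """Return True if num_rows falls in the 1K<n<10K bucket (strict bounds).

    Reading the label literally as strict inequalities is an assumption.
    """
    return low < num_rows < high


def load_atbench(name=None, split=None):
    """Load the dataset from the Hugging Face Hub (requires network access).

    `name` (configuration) and `split` are assumptions: this page does not
    say which configurations or splits exist, so both default to None.
    """
    # Imported lazily so the size helper works without `datasets` installed.
    from datasets import load_dataset

    return load_dataset(REPO_ID, name=name, split=split)
```

For example, after `ds = load_atbench()`, summing `ds[s].num_rows` over the returned splits and passing the total to `in_size_category` would show whether the listed size category matches the downloaded data.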