DCA-Bench: A Benchmark for Dataset Curation Agents

Abstract

A benchmark exploring the performance of LLM Agents on detecting issues in datasets hosted on popular platforms.

DCA-Bench: A Benchmark for Dataset Curation Agents

Incoming