DCA-Bench: A Benchmark for Dataset Curation Agents

Abstract

A benchmark exploring the performance of LLM Agents on detecting issues in datasets hosted on popular platforms. (Under Review)

DCA-Bench: A Benchmark for Dataset Curation Agents

DCA-Bench: A Benchmark for Dataset Curation Agents

Incoming