The dataset contains:
The purpose of the dataset is to evaluate retrieval models for product search in the e-commerce domain using expert judgment of whether a product is relevant to a given query. It can be used to benchmark different retrieval against each other. As of its publication in 2022, it was to the best of our knowledge the biggest such public dataset.
The accompanying publication describes in depth the annotation guidelines and process used to collect the dataset. It also includes a measure of the quality of the annotation and experimentally compares the dataset's ability to discriminate the effectiveness of different retrieval models vs other comparable evaluation datasets.
Paper | Code | Results | Date | Stars |
---|