ImageNet C-OOD (class-out-of-distribution)

This dataset was presented as part of the ICLR 2023 paper ๐˜ˆ ๐˜ง๐˜ณ๐˜ข๐˜ฎ๐˜ฆ๐˜ธ๐˜ฐ๐˜ณ๐˜ฌ ๐˜ง๐˜ฐ๐˜ณ ๐˜ฃ๐˜ฆ๐˜ฏ๐˜ค๐˜ฉ๐˜ฎ๐˜ข๐˜ณ๐˜ฌ๐˜ช๐˜ฏ๐˜จ ๐˜Š๐˜ญ๐˜ข๐˜ด๐˜ด-๐˜ฐ๐˜ถ๐˜ต-๐˜ฐ๐˜ง-๐˜ฅ๐˜ช๐˜ด๐˜ต๐˜ณ๐˜ช๐˜ฃ๐˜ถ๐˜ต๐˜ช๐˜ฐ๐˜ฏ ๐˜ฅ๐˜ฆ๐˜ต๐˜ฆ๐˜ค๐˜ต๐˜ช๐˜ฐ๐˜ฏ ๐˜ข๐˜ฏ๐˜ฅ ๐˜ช๐˜ต๐˜ด ๐˜ข๐˜ฑ๐˜ฑ๐˜ญ๐˜ช๐˜ค๐˜ข๐˜ต๐˜ช๐˜ฐ๐˜ฏ ๐˜ต๐˜ฐ ๐˜๐˜ฎ๐˜ข๐˜จ๐˜ฆ๐˜•๐˜ฆ๐˜ต.

It is a framework that, based on this dataset (a subset of the ImageNet-21k dataset) is able to generate a C-OOD (AKA open-set recognition) benchmark that covers a variety of difficulty levels. these benchmarks are tailored to the evaluated model. This approach provides a more accurate representation of the modelโ€™s own performance.

The resulting difficulty levels of our framework allow benchmarking with respect to the difficulty levels most relevant to the task. For example, for a task with a high tolerance for risk (e.g., a task for an entertainment application), the performance of a model on a median difficulty level might be more important than on the hardest difficulty level (severity 10). The opposite might be true for some applications with a low tolerance for risk (e.g., medical applications), for which one requires the best performance to be attained even if the OOD is very hard to detect (severity 10). The paper in which the framework was introduced showed that detection algorithms do not always improve performance on all inputs equally, and could even hurt performance for specific difficulty levels and models. Choosing the combination of (model, detection algorithm) based only on the detection performance on all data may yield sub-optimal results for our specific desired level of difficulty.

Papers


Paper Code Results Date Stars

Dataset Loaders


No data loaders found. You can submit your data loader here.

Tasks


Similar Datasets


License


  • Unknown

Modalities


Languages