Hard ImageNet Dataset

Object segmentations and image rankings for 15 ImageNet synsets with strong spurious cues

19k Training Images ♦ 750 quintuply validated Test Images ♦ Extensive Evaluation Suite

Classes

Each image has an object segmentation. Also, all training images are ranked based on the strength of the spurious cues present. This allows for the selection of balanced subsets (i.e. where spurious correlations are broken).

With our richly annotated dataset and benchmark, we hope the community can begin to consider new training and evaluation paradigms for faithful image classification under suboptimal data conditions; that is, predicting for the right reasons, even when our data is riddled with spurious cues.

BENCHMARK

  • Ablation: removing the object from an image should result in lower accuracy, but when spurious features are relied upon, accuracy remains high after ablation. We ablate in multiple ways, and use accuracy drop as a proxy for how well a model attends to the object.
  • Relative Foreground Sensitivity: the degree to which model performance drops due to corruption of a region proxies the model sensitivity to that region. We add noise in foregrounds and backgrounds, and compare accuracy drops in a normalized way to determine foreground sensitivity.
  • Saliency Alignment: the intersection-over-union of GradCAM saliencies with object segmentations gives a notion of how well models recognize object regions as salient to classfication.
  • 1 / 3
    2 / 3
    3 / 3

    Our benchmark includes ablation, noise-based, and saliency analyses to assess whether models predict because of the object region, or if they rely instead on spurious features. Compared to more typical data (as represented by RIVAL10), Hard ImageNet classification induces far greater spurious feature reliance.

    Ablation
    Relative Foreground Sensitivity
    Saliency Alignment

    Download


    Citation

    Please cite our paper if Hard ImageNet is of use to you.

    
    @misc{moayeri2022hard,
        title     = {Hard ImageNet: Segmentations for Objects with Strong Spurious Cues},
        author    = {Moayeri, Mazda and Singla, Sahil and Feizi, Soheil},
        booktitle = {openreview},
        month     = {June},
        year      = {2022},
        }
    

    Contact


    Please feel free to contact us for any questions or comments regarding either our paper or the dataset. Our emails are found in the paper PDF.