Academia.eduAcademia.edu

In this section, we further evaluate the intersection probability on corrupted samples using the neg- ative log-likelihood (NLL) metric. A smaller NLL indicates that the model is more confident and accurate in predicting the correct class for each input (Dusenberry et al., 2020). Figures 11 and 12  show the consistent superiority of the intersection probability on corrupted data in extensive test cases, as evidenced by smaller NLL values.  Figure 11: NLL values of BNNR, BNNF, and DE on CIFARIO-C against increased corruption intensity, using the averaged probability (Avg. Prob.) and our proposed intersection probability (Int. Prob.). VGG16 and ResNet-18 are backbones. Results are from 15 runs.

Figure 11 In this section, we further evaluate the intersection probability on corrupted samples using the neg- ative log-likelihood (NLL) metric. A smaller NLL indicates that the model is more confident and accurate in predicting the correct class for each input (Dusenberry et al., 2020). Figures 11 and 12 show the consistent superiority of the intersection probability on corrupted data in extensive test cases, as evidenced by smaller NLL values. Figure 11: NLL values of BNNR, BNNF, and DE on CIFARIO-C against increased corruption intensity, using the averaged probability (Avg. Prob.) and our proposed intersection probability (Int. Prob.). VGG16 and ResNet-18 are backbones. Results are from 15 runs.