Skip to content

guixianjin/Some-research-on-noisy-labels

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

102 Commits
 
 
 
 
 
 
 
 

Repository files navigation

This is a record of current experiment result on some realistic noisy label datasets

1. Clothing1M dataset

from: CVPR-15: Learning from Massive Noisy Labeled Data for Image Classification

  • 14 classes: T-shirt, Shirt, Knitwear, Chiffon, Sweater, Hoodie, Windbreaker, Jacket, Down Coat, Suit, Shawl, Dress, Vest, and Underwear
  • noisy labeled training dataset ($D_\eta$): $10^6$
  • clean train data($D_c$): 47,570
  • clean validation set: 14,313
  • clean test set: 10,526

Noise confusion matrix

It's not column-diagonally dominant, thus small-loss trick may not work. But if examples in noisy class 3 and noisy class 5 have been swapped, it may become column-diagonally dominant, in which case small-loss trick may work.

some result

some resutl

some result

some result

69.9(only use noisy training data) -> 79.9(fine-tuning)

some result

some result

some result

some result

some result

some result

about 71%

some result

some result

some result

some result

some result

some result

some result

19. [ICLR-20: underview: DivideMiX: Learning with noisy labels as semi-supervised learning]

from: Arxiv17: Webvision database: Visual learning and understanding from web data

  • 1,000 classes: concepts in ImageNet ILSVRC12
  • noise rate: 20-40%

use all data or only use the first 50 classes of Google image subset

use all 1000 classes

only use the first 50 classes of Google image subset

only use the randomly selected 100 classes

from: CVPR-18: CleanNet: Transfer Learning for Scalable Image Classifier Training With Label Noise

  • 101 food classes
  • 310k image, 55k verification,
  • noise rate: 20%
  not use Food101N created by cleanNet paper, but use Food101 and inject 20% noise

from: ICML-19: SELFIE: Refurbishing Unclean Samples for Robust Deep Learning

About

This is a record of current published experiment results on some realistic noisy data sets

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors