A Method of Data Label Checking and the Wrong Labels in MNIST and CIFAR10

4 Pages Posted: 20 Nov 2017

Xinbin Zhang

Beijing University of Posts and Telecommunications (BUPT)

Date Written: November 16, 2017

Abstract

Label quality is an important issue in machine learning. Labels are usually checked manually, which is inefficient and can still leave wrong labels in the data. I introduce a method for checking label quality and use it to find 3 wrongly labeled images among 2,000 digit images from MNIST, 6 among 2,800 images from CIFAR10, and 64 among 400 digit images from NIST SD19v2.

Suggested Citation

Zhang, Xinbin, A Method of Data Label Checking and the Wrong Labels in MNIST and CIFAR10 (November 16, 2017). Available at SSRN: https://ssrn.com/abstract=3072167 or http://dx.doi.org/10.2139/ssrn.3072167

Xinbin Zhang (Contact Author)

Beijing University of Posts and Telecommunications (BUPT)

No 10, Xitucheng Road
Haidian District
Beijing, 100876
China
