Cardiff University | Prifysgol Caerdydd ORCA
Online Research @ Cardiff 
WelshClear Cookie - decide language by browser settings

Interactive correction of mislabeled training data

Xiang, Shouxing, Ye, Xi, Xia, Jiazhi, Wu, Jing, Chen, Yang and Liu, Shixia 2020. Interactive correction of mislabeled training data. Presented at: IEEE VIS 2019, Vancouver, BC, Canada, 20-25 October 2019. 2019 IEEE Conference on Visual Analytics Science and Technology (VAST). IEEE, 10.1109/VAST47406.2019.8986943

PDF - Accepted Post-Print Version
Download (5MB) | Preview


In this paper, we develop a visual analysis method for interactively improving the quality of labeled data, which is essential to the success of supervised and semi-supervised learning. The quality improvement is achieved through the use of user-selected trusted items. We employ a bi-level optimization model to accurately match the labels of the trusted items and to minimize the training loss. Based on this model, a scalable data correction algorithm is developed to handle tens of thousands of labeled data efficiently. The selection of the trusted items is facilitated by an incremental tSNE with improved computational efficiency and layout stability to ensure a smooth transition between different levels. We evaluated our method on real-world datasets through quantitative evaluation and case studies, and the results were generally favorable.

Item Type: Conference or Workshop Item (Paper)
Date Type: Published Online
Status: Published
Schools: Computer Science & Informatics
Publisher: IEEE
ISBN: 9781728122854
Date of First Compliant Deposit: 29 July 2019
Last Modified: 06 Mar 2020 16:07

Actions (repository staff only)

Edit Item Edit Item


Downloads per month over past year

View more statistics