Katalog Plus
Bibliothek der Frankfurt UAS
Bald neuer Katalog: sichern Sie sich schon vorab Ihre persönlichen Merklisten im Nutzerkonto: Anleitung.
Dieses Ergebnis aus BASE kann Gästen nicht angezeigt werden.  Login für vollen Zugriff.

Learning with confidence: training better classifiers from soft labels

Title: Learning with confidence: training better classifiers from soft labels
Authors: de Vries, Sjoerd; Thierens, Dirk; Sub Simulation of Complex Systems; Sub Intelligent Systems
Publication Year: 2025
Subject Terms: Calibration; Classification; Confidence scores; Ensemble learning; Soft label learning
Description: In supervised machine learning, models are typically trained using data with hard labels, i.e., definite assignments of class membership. This traditional approach, however, does not take the inherent uncertainty in these labels into account. We investigate whether incorporating label uncertainty, represented for each instance as a discrete probability distribution over the class labels, known as a soft label, improves the predictive performance of classification models, focusing on tabular data. We first demonstrate the potential value of soft label learning (SLL) for estimating model parameters in a simulation experiment, particularly for limited sample sizes and imbalanced data. Subsequently, we compare the performance of various wrapper methods for learning from both hard and soft labels using identical base classifiers. On real-world-inspired synthetic data with clean labels, the SLL methods consistently outperform the hard label methods. Since real-world data is often noisy and precise soft labels are challenging to obtain, we study the effect that noisy probability estimates have on model performance. Alongside conventional noise models, our study examines four types of miscalibration that are known to affect human annotators. The results show that SLL methods outperform the hard label methods in the majority of settings. Finally, we evaluate the methods on a real-world dataset with confidence scores, where the SLL methods are shown to match the traditional methods for predicting the (noisy) hard labels while providing more accurate confidence estimates.
Document Type: article in journal/newspaper
File Description: application/pdf
Language: English
ISSN: 0885-6125
Relation: https://dspace.library.uu.nl/handle/1874/478842
Availability: https://dspace.library.uu.nl/handle/1874/478842
Rights: info:eu-repo/semantics/OpenAccess
Accession Number: edsbas.CAAC6D0D
Database: BASE