PLM: Partial Label Masking for Imbalanced Multi-label Classification

Abstract

Neural networks trained on real-world datasets with long-tailed label distributions are biased towards frequent classes and perform poorly on infrequent classes. The imbalance in the ratio of positive and negative samples for each class skews network output probabilities further from ground-truth distributions. We propose a method, Partial Label Masking (PLM), which utilizes this ratio during training. By stochastically masking labels during loss computation, the method balances this ratio for each class, leading to improved recall on minority classes and improved precision on frequent classes. The ratio is estimated adaptively based on the networks performance by minimizing the KL divergence between predicted and ground-truth distributions. Whereas most existing approaches addressing data imbalance are mainly focused on single-label classification and do not generalize well to the multi-label case, this work proposes a general approach to solve the long-tail data imbalance issue for multi-label classification. PLM is versatile: it can be applied to most objective functions and it can be used alongside other strategies for class imbalance. Our method achieves strong performance when compared to existing methods on both multi-label (MultiMNIST and MSCOCO) and single-label (imbalanced CIFAR-10 and CIFAR-100) image classification datasets.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Jun 19, 2021
Accession Number
AD1186008

Entities

People

  • Kevin Duarte
  • Mubarak Ali Shah
  • Yogesh Rawat

Organizations

  • University of Central Florida

Tags

DTIC Thesaurus Topics

  • Ablation
  • Algorithms
  • Artificial Intelligence
  • Artificial Intelligence Software
  • Computer Vision
  • Computers
  • Data Analysis
  • Data Mining
  • Data Sets
  • Image Classification
  • Image Recognition
  • Information Processing
  • Information Science
  • Information Systems
  • Machine Learning
  • Materials
  • Mobile Phones
  • Neural Networks
  • Pattern Recognition
  • Precision
  • Probability
  • Probability Distributions
  • Recognition

Fields of Study

  • Computer science

Readers

  • Applied Combinatorial Optimization and Logic Circuit Design.
  • Neural Network Machine Learning.
  • Statistical inference.

Technology Areas

  • AI & ML
  • AI & ML - Bayesian Inference
  • AI & ML - Machine Learning Algorithms
  • AI & ML - Neural Networks