The Deeper, the Better: Analysis of Person Attributes Recognition

Abstract

Research into person attributes recognition has focused on approaches to describe a person in terms of their appearance. Typically, this includes a wide range of traits including age, gender, clothing, and footwear. Although this could be used in a wide variety of scenarios, it generally is applied to video surveillance, where attribute recognition is impacted by low resolution, and other issues such as variable pose, occlusion and shadow. Recent approaches have used deep convolutional neural networks (CNNs) to improve the accuracy in person attribute recognition. However, many of these networks are relatively shallow and it is unclear to what extent they use contextual cues to improve classification accuracy. This paper builds upon prior research by proposing to use a modified ResNet architecture with calibrations that permit us to train networks that are deeper than previously published approaches. Interpretation suggests that this deeper architectures allows the network to take more contextual information into consideration, which helps to improve classification accuracy and generalizability. We present experimental analysis and results for whole body attributes using the PA-100K and PETA datasets and facial attributes using the CelebA dataset.

Open PDF

Document Details

Document Type: Technical Report
Publication Date: May 20, 2019
Accession Number: AD1099405

Entities

People

Esube Bekele
Wallace Lawson

Organizations

United States Naval Research Laboratory

The Deeper, the Better: Analysis of Person Attributes Recognition

Abstract

Document Details

Entities

People

Organizations

Tags

Communities of Interest

DTIC Thesaurus Topics

Fields of Study

Readers

Technology Areas