Squeeze-and-Excitation Networks

Abstract

Convolutional neural networks are built upon the convolution operation, which extracts informative features by fusing spatial and channel-wise information together within local receptive fields. In order to boost the representational power of a network, several recent approaches have shown the benefit of enhancing spatial encoding. In this work, we focus on the channel relationship and propose a novel architectural unit, which we term the "Squeeze-and-Excitation" (SE) block, that adaptively recalibrates channel-wise feature responses by explicitly modelling interdependencies between channels. We demonstrate that by stacking these blocks together, we can construct SENet architectures that generalise extremely well across challenging datasets. Crucially, we find that SE blocks produce significant performance improvements for existing state-of-the-art deep architectures at minimal additional computational cost. SENets formed the foundation of our ILSVRC 2017 classification submission which won first place and significantly reduced the top-5 error to 2.251%, achieving a approx. 25% relative improvement over the winning entry of 2016. Code and models are available at https://github.com/hujie-frank/SENet.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Jun 18, 2018
Accession Number
AD1155170

Entities

People

  • Gang Sun
  • Jie Hu
  • Shen Li

Organizations

  • Department of Engineering Science, University of Oxford

Tags

DTIC Thesaurus Topics

  • Artificial Intelligence Computing
  • Artificial Intelligence Software
  • Computational Complexity
  • Computer Programming
  • Computer Vision
  • Computers
  • Convolution
  • Convolutional Neural Networks
  • Dimensionality Reduction
  • Engineering
  • Image Recognition
  • Information Science
  • Machine Learning
  • Neural Networks
  • Recognition
  • Recurrent Neural Networks
  • Training

Fields of Study

  • Computer science

Readers

  • Applied Combinatorial Optimization and Logic Circuit Design.
  • Neural Network Machine Learning.
  • Systems Analysis and Design

Technology Areas

  • AI & ML
  • AI & ML - Neural Networks