Machine Translation with Image Context from Mandarin Chinese to English

Abstract

Despite ongoing improvements in machine translation, machine translators still cannot incorporate the context from which source text may have been derived. They translate text from a source language into a target language without observing any visual context. This work aims to produce a neural machine translation model that accepts both text and image context, acting as a multimodal translator from Mandarin Chinese to English. The model was trained on a small multimodal dataset of 700 images and sentences and compared with a translator trained only on the text associated with those images. The model was also trained on a larger text-only corpus of 21,000 sentences, with and without the addition of the small multimodal dataset. Notable differences appeared between the text-only and multimodal translators when trained on the small dataset of 700 sentences and images; however, no observable differences were found between the translators trained on the larger text corpus. Further research with a larger multimodal dataset could clarify the utility of multimodal machine translation.


Document Details

Document Type: Technical Report
Publication Date: Mar 01, 2019
Accession Number: AD1075981

Entities

People

  • Brooke E. Johnson

Organizations

  • Air Force Institute of Technology

Tags

Communities of Interest

  • Autonomy

DTIC Thesaurus Topics

  • Air Force
  • Artificial Intelligence Computing
  • Artificial Intelligence Software
  • Artificial Neural Networks
  • Chinese Language
  • Computational Science
  • Computer Languages
  • Convolutional Neural Networks
  • Dimensionality Reduction
  • Governments
  • Grammars
  • Image Classification
  • Image Processing
  • Information Science
  • Language
  • Language Translation
  • Linguistics
  • Machine Learning
  • Machine Translation
  • Natural Language Computing
  • Natural Language Processing
  • Natural Languages
  • Neural Networks
  • Probability
  • Recurrent Neural Networks
  • Test Sets
  • Translations
  • United States Government

Readers

  • Computational Linguistics
  • Neural Network Machine Learning

Technology Areas

  • AI & ML
  • AI & ML - Machine Translation
  • AI & ML - Neural Networks