Improving users' mental model with attention‐directed counterfactual edits

Abstract

In the domain of visual question answering (VQA), studies have shown improvement in users' mental model of the VQA system when they are exposed to examples of how these systems answer certain image‐question (IQ) pairs. In this work, we show that showing controlled counterfactual IQ examples are more effective at improving the mental model of users as compared to simply showing random examples. We compare a generative approach and a retrieval‐based approach to show counterfactual examples. We use recent advances in generative adversarial networks to generate counterfactual images by deleting and inpainting certain regions of interest in the image. We then expose users to changes in the VQA system's answer on those altered images. To select the region of interest for inpainting, we experiment with using both human‐annotated attention maps and a fully automatic method that uses the VQA system's attention values. Finally, we test the user's mental model by asking them to predict the model's performance on a test counterfactual image. We note an overall improvement in users' accuracy to predict answer change when shown counterfactual explanations. While realistic retrieved counterfactuals obviously are the most effective at improving the mental model, we show that a generative approach can also be equally effective.

Document Details

Document Type
Pub Defense Publication
Publication Date
Nov 15, 2021
Source ID
10.1002/ail2.47

Entities

People

  • Arijit Ray
  • Giedrius T. Burachas
  • Jurgen P. Schulze
  • Kamran Alipour
  • Michael Cogswell
  • Xiao Lin
  • Yi Yao

Organizations

  • Defense Advanced Research Projects Agency
  • SRI International
  • University of California, San Diego

Tags

Fields of Study

  • Computer science

Readers

  • Computer Vision.
  • Naval Engineering and Maritime Security
  • Theoretical Analysis.