Towards Understanding End-of-trip Instructions in a Taxi Ride Scenario
Abstract
We introduce a dataset containing human-authored descriptions of target locations in an "end-of-trip in a taxi ride" scenario. We describe our data collection method and a novel annotation scheme that supports understanding of such descriptions of target locations. Our dataset contains target location descriptions for both synthetic and real-world images as well as visual annotations (ground truth labels, dimensions of vehicles and objects, coordinates of the target location, distance and direction of the target location from vehicles and objects) that can be used in various visual and language tasks. We also perform a pilot experiment on how the corpus could be applied to visual reference resolution in this domain.
Document Details
- Document Type
- Technical Report
- Publication Date
- Jan 01, 2018
- Accession Number
- AD1159987
Entities
People
- Deepthi Karkada
- Kallirrori Georgila
- Ramesh Manuvinakurike
Organizations
- Intel Corporation
- University of Southern California