Header menu link for other important links
X
An empirical evaluation of visual question answering for novel objects
, Ambar Pal, Gaurav Sharma, Santhosh K. Ramakrishnan
Published in Institute of Electrical and Electronics Engineers Inc.
2017
Volume: 2017-January
   
Pages: 7312 - 7321
Abstract
We study the problem of answering questions about images in the harder setting, where the test questions and corresponding images contain novel objects, which were not queried about in the training data. Such setting is inevitable in real world - owing to the heavy tailed distribution of the visual categories, there would be some objects which would not be annotated in the train set. We show that the performance of two popular existing methods drop significantly (up to 28%) when evaluated on novel objects cf. known objects. We propose methods which use large existing external corpora of (i) unlabeled text, i.e. books, and (ii) images tagged with classes, to achieve novel object based visual question answering. We do systematic empirical studies, for both an oracle case where the novel objects are known textually, as well as a fully automatic case without any explicit knowledge of the novel objects, but with the minimal assumption that the novel objects are semantically related to the existing objects in training. The proposed methods for novel object based visual question answering are modular and can potentially be used with many visual question answering architectures. We show consistent improvements with the two popular architectures and give qualitative analysis of the cases where the model does well and of those where it fails to bring improvements. © 2017 IEEE.
About the journal
JournalData powered by TypesetProceedings - 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017
PublisherData powered by TypesetInstitute of Electrical and Electronics Engineers Inc.
ISSN1063-6919
Open AccessYes
Concepts (10)
  •  related image
    Computer vision
  •  related image
    Empirical evaluations
  •  related image
    Empirical studies
  •  related image
    EXPLICIT KNOWLEDGE
  •  related image
    HEAVY-TAILED DISTRIBUTION
  •  related image
    OBJECT BASED
  •  related image
    Qualitative analysis
  •  related image
    Question answering
  •  related image
    Training data
  •  related image
    Pattern recognition