TempSeg-GAN: Segmenting objects in videos adversarially using temporal information
Published in SciTePress
2019
Volume: 4
   
Pages: 221 - 232
Abstract
This paper studies the problem of Video Object Segmentation, which aims at segmenting objects of interest throughout entire videos when provided with an initial ground-truth annotation. Although a variety of works in this field have utilized Convolutional Neural Networks (CNNs), adversarial training techniques have not been used, in spite of their effectiveness as a holistic approach. Our proposed architecture consists of a Generative Adversarial framework for foreground object segmentation in videos, coupled with Intersection-over-Union and temporal-information-based loss functions for training the network. The main contribution of the paper lies in the formulation of two novel loss functions: (i) Inter-frame Temporal Symmetric Difference Loss (ITSDL) and (ii) Intra-frame Temporal Loss (IFTL), which not only enhance the segmentation quality of the predicted mask but also maintain temporal consistency between subsequent generated frames. Our end-to-end trainable network exhibits an impressive performance gain compared to the state-of-the-art model when evaluated on three popular real-world Video Object Segmentation datasets, viz. DAVIS 2016, SegTrack-v2 and YouTube-Objects. Copyright © 2019 by SCITEPRESS – Science and Technology Publications, Lda. All rights reserved.
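The abstract names an Intersection-over-Union based loss and a symmetric-difference based temporal loss without giving their formulas. As a rough illustration of the two underlying quantities only, the following is a minimal NumPy sketch of a generic soft IoU loss and of the set-theoretic symmetric difference between two binary masks. The function names `soft_iou_loss` and `symmetric_difference` and the `eps` parameter are assumptions made for this example; this is not the paper's formulation of ITSDL or IFTL.

```python
import numpy as np


def soft_iou_loss(pred, target, eps=1e-7):
    """Generic soft IoU loss: 1 - |P ∩ T| / |P ∪ T| on values in [0, 1].

    Hypothetical helper for illustration; not taken from the paper.
    """
    pred = np.asarray(pred, dtype=np.float64)
    target = np.asarray(target, dtype=np.float64)
    # Soft intersection and union computed element-wise, then summed.
    intersection = np.sum(pred * target)
    union = np.sum(pred + target - pred * target)
    # eps avoids division by zero when both masks are empty.
    return 1.0 - (intersection + eps) / (union + eps)


def symmetric_difference(mask_a, mask_b):
    """Pixels that belong to exactly one of the two binary masks (XOR)."""
    a = np.asarray(mask_a, dtype=bool)
    b = np.asarray(mask_b, dtype=bool)
    return np.logical_xor(a, b)


if __name__ == "__main__":
    ground_truth = np.array([[0, 1, 1],
                             [0, 1, 1],
                             [0, 0, 0]])
    prediction = np.array([[0, 1, 0],
                           [0, 1, 0],
                           [0, 0, 0]])
    print("IoU loss:", soft_iou_loss(prediction, ground_truth))
    print("Changed pixels:", int(symmetric_difference(prediction, ground_truth).sum()))
```

Applied to the masks of two consecutive frames, the symmetric difference marks the pixels whose foreground label changes between them, which is one plausible reading of how such a term could capture temporal information.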
About the journal
Journal: VISIGRAPP 2019 - Proceedings of the 14th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications
Publisher: SciTePress
Open Access: No
Concepts (14)
  •  Computer graphics
  •  Computer vision
  •  Deep learning
  •  Motion compensation
  •  Neural networks
  •  ADVERSARIAL NETWORKS
  •  Convolutional neural network
  •  Proposed architectures
  •  SEGMENTATION QUALITY
  •  SYMMETRIC DIFFERENCE
  •  Temporal consistency
  •  TEMPORAL INFORMATION
  •  VIDEO-OBJECT SEGMENTATION
  •  Image segmentation