Header menu link for other important links
X
Motion-Based Occlusion-Aware Pixel Graph Network for Video Object Segmentation
Published in Springer
2019
Volume: 11954 LNCS
   
Pages: 516 - 527
Abstract
This paper proposes a dual-channel based Graph Convolutional Network (GCN) for the Video Object Segmentation (VOS) task. The main contribution lies in formulating two pixel graphs based on the raw RGB and optical flow features. Both spatial and temporal features are learned independently, making the network robust to various challenging scenarios in real-world videos. Additionally, a motion orientation-based aggregator scheme efficiently captures long-range dependencies among objects. This not only deals with the complex issue of modelling velocity differences among multiple objects moving in various directions, but also adapts to change of appearance of objects due to pose and scale deformations. Also, an occlusion-aware attention mechanism has been employed to facilitate accurate segmentation under scenarios where multiple objects have temporal discontinuity in their appearance due to occlusion. Performance analysis on DAVIS-2016 and DAVIS-2017 datasets show the effectiveness of our proposed method in foreground segmentation of objects in videos over the existing state-of-the-art techniques. Control experiments using CamVid dataset show the generalising capability of the model for scene segmentation. © 2019, Springer Nature Switzerland AG.
About the journal
JournalData powered by TypesetLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
PublisherData powered by TypesetSpringer
ISSN03029743
Open AccessNo
Concepts (13)
  •  related image
    Convolution
  •  related image
    Flow graphs
  •  related image
    Motion compensation
  •  related image
    Pixels
  •  related image
    Aggregation mechanism
  •  related image
    ATTENTION MECHANISMS
  •  related image
    CONVOLUTIONAL NETWORKS
  •  related image
    FOREGROUND SEGMENTATION
  •  related image
    LONG-RANGE DEPENDENCIES
  •  related image
    Performance analysis
  •  related image
    State-of-the-art techniques
  •  related image
    VIDEO-OBJECT SEGMENTATION
  •  related image
    Image segmentation