Context graph based video frame prediction using locally guided objective

Prateep S. Bhattacharjee; Sukhendu Das

doi:10.1007/978-3-030-11015-4_15

Profiles Research Units Publications

Conferences

Context graph based video frame prediction using locally guided objective

Prateep S. Bhattacharjee,

Published in Springer Verlag

2019

DOI: 10.1007/978-3-030-11015-4_15

Volume: 11131 LNCS

Pages: 169 - 185

Abstract

This paper proposes a feature reconstruction based approach using pixel-graph and Generative Adversarial Networks (GAN) for solving the problem of synthesizing future frames from video scenes. Recent methods of frame synthesis often generate blurry outcomes in case of long-range prediction and scenes involving multiple objects moving at different velocities due to their holistic approach. Our proposed method introduces a novel pixel-graph based context aggregation layer (PixGraph) which efficiently captures long range dependencies. PixGraph incorporates a weighting scheme through which the internal features of each pixel (or a group of neighboring pixels) can be modeled independently of the others, thus handling the issue of separate objects moving in different directions and with very dissimilar speed. We also introduce a novel objective function, the Locally Guided Gram Loss (LGGL), which aides the GAN based model to maximize the similarity between the intermediate features of the ground-truth and the network output by constructing Gram matrices from locally extracted patches over several levels of the generator. Our proposed model is end-to-end trainable and exhibits superior performance compared to the state-of-the-art on four real-world benchmark video datasets. © Springer Nature Switzerland AG 2019.

Topics: Frame (networking) (52)%, Feature (computer vision) (51)% and Context (language use) (50)%

View more info for "Context Graph Based Video Frame Prediction Using Locally Guided Objective"

About the journal

Journal	Data powered by TypesetLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Publisher	Data powered by TypesetSpringer Verlag
ISSN	03029743
Open Access	No

Authors (1)

Sukhendu Das
- Department of Computer Science and Engineering

Concepts (12)

Benchmarking
Computer vision
Graphic methods
ADVERSARIAL NETWORKS
FEATURE RECONSTRUCTION
Holistic approach
INTERNAL FEATURES
LONG RANGE PREDICTION
LONG-RANGE DEPENDENCIES
MULTIPLE OBJECTS
Objective functions
Pixels

ABOUT IIT MADRAS

R & D

RANKINGS & ACHIEVEMENTS

QUICK FIND