Anomaly detection in web graphs using vertex neighbourhood based signature similarity methods

Aritra Ghosh

doi:10.1109/ICDSE.2016.7823959

Profiles Research Units Publications

Conferences

Anomaly detection in web graphs using vertex neighbourhood based signature similarity methods

Aritra Ghosh

Published in Institute of Electrical and Electronics Engineers Inc.

2017

DOI: 10.1109/ICDSE.2016.7823959

Abstract

With massive increase in the amount of data being generated each day, we need automated tools to oversee the evolution of the web and to quantify global effects like pagerank of webpages. Search engines crawl the web every now and then to build web graphs which store information about the structure of the web. This is an expensive and error prone process. Central to this problem is the notion of graph similarity (between two graphs spaced in time), which validates how well search engines secure content from web and the quality of the search results they produce. In this paper, we propose two different types of anomalies which occur during crawling and two novel similarity measures based on vertex neighbourhood, which overcomes the proposed anomalies. Extensive experimentation on real world datasets shows significant improvement over state of art signature similarity based methods. © 2016 IEEE.

Topics: Web page (63)%, PageRank (55)%, Neighbourhood (graph theory) (55)% and Anomaly detection (51)%

View more info for "Anomaly detection in web graphs using vertex neighbourhood based signature similarity methods"

About the journal

Journal	Data powered by TypesetProceedings of the 2016 International Conference on Data Science and Engineering, ICDSE 2016
Publisher	Data powered by TypesetInstitute of Electrical and Electronics Engineers Inc.
Open Access	No

Concepts (11)

Engineering
Industrial engineering
Anomaly detection
Automated tools
ERROR-PRONE PROCESS
GLOBAL EFFECTS
GRAPH SIMILARITY
Real-world datasets
Similarity measure
SIMILARITY-BASED METHODS
Search engines

ABOUT IIT MADRAS

R & D

RANKINGS & ACHIEVEMENTS

QUICK FIND