Header menu link for other important links
X
Anomaly detection in web graphs using vertex neighbourhood based signature similarity methods
Aritra Ghosh
Published in Institute of Electrical and Electronics Engineers Inc.
2017
Abstract
With massive increase in the amount of data being generated each day, we need automated tools to oversee the evolution of the web and to quantify global effects like pagerank of webpages. Search engines crawl the web every now and then to build web graphs which store information about the structure of the web. This is an expensive and error prone process. Central to this problem is the notion of graph similarity (between two graphs spaced in time), which validates how well search engines secure content from web and the quality of the search results they produce. In this paper, we propose two different types of anomalies which occur during crawling and two novel similarity measures based on vertex neighbourhood, which overcomes the proposed anomalies. Extensive experimentation on real world datasets shows significant improvement over state of art signature similarity based methods. © 2016 IEEE.
Concepts (11)
  •  related image
    Engineering
  •  related image
    Industrial engineering
  •  related image
    Anomaly detection
  •  related image
    Automated tools
  •  related image
    ERROR-PRONE PROCESS
  •  related image
    GLOBAL EFFECTS
  •  related image
    GRAPH SIMILARITY
  •  related image
    Real-world datasets
  •  related image
    Similarity measure
  •  related image
    SIMILARITY-BASED METHODS
  •  related image
    Search engines