Header menu link for other important links
X
Efficient indexing and querying of XML data using modified Prüfer sequences
Prudhvi Sreenivasa Kumar,
Published in
2005
Pages: 397 - 404
Abstract
With the advent of XML as the new standard for information representation and exchange, indexing and querying of XML data is of major concern. In this paper, we propose a method for representing an XML document as a sequence based on a variation of Prüfer sequences. We incorporate new components in the node encodings such as level, number of a certain kind of descendants and develop methods for holistic processing of tree pattern queries. The query processing involves converting the query also into a sequence and performing subsequence matching on the document sequence. We establish certain interesting properties of the proposed method of sequencing that give rise to a new efficient pattern matching algorithm. The sequence data is stored in a two level B+-trees to support query processing. We also propose an optimization for parent-child axis to speed up the query processing. Our approach does not require any post-processing and guarantees results that are free of false positives and duplicates. Experimental results show that our system performs significantly better than previous systems in a large number of cases. Copyright 2005 ACM.
About the journal
JournalInternational Conference on Information and Knowledge Management, Proceedings
Open AccessYes
Concepts (11)
  •  related image
    Algorithms
  •  related image
    Data acquisition
  •  related image
    Information retrieval
  •  related image
    Optimization
  •  related image
    Query languages
  •  related image
    Trees (mathematics)
  •  related image
    MODIFIED PRÜFER SEQUENCES
  •  related image
    PATTERN MATCHING ALGORITHM
  •  related image
    Query processing
  •  related image
    TREE PATTERN QUERIES
  •  related image
    XML