Documente Academic
Documente Profesional
Documente Cultură
ISSN:2320-0790
Abstract: XM L data is stored in the form of a text document. Hence different applicat ions can share the XML data.
It is platform independent i.e. it can be migrated to any operating system without any changes. Moreover it is self
describing and extensible as a result of wh ich it has became an emerg ing standard for data exchange and storage
over the web. With such growing presence of XML in database technology there is a growing need to develop
efficient storage and indexing techniques for querying large repositories of XML documents. In this paper we
review some of the important indexing techniques for XM L and co mpare these techniques on the basis of key
factors for storing and querying XML docu ments. The real work in indexing of XM L started with the introduction of
inverted list to store XML data. Th is inverted list with d ifferent query algorith m c ould answer a variety of Xpath
queries but with some limitations. Inverted list was further imp roved by the concept of structure index wh ich is a
compact summarization of tree representation of XML docu ment. Very recently efficient indexing techniques were
implemented. These recent techniques made use of efficient data structures such as the B + tree for storing and
querying the XML data. Experimental results of these techniques show a significant improvement in performance
over the inverted list technique.
Keywords: inverted list; structure index; xreal; xcdsearch; dewey id
I.
INTRODUCTION
461
COMPUSOFT, An international journal of advanced computer technology, 3 (1), January-2014 (Volume-III, Issue-I)
RELATED WORK
462
COMPUSOFT, An international journal of advanced computer technology, 3 (1), January-2014 (Volume-III, Issue-I)
463
COMPUSOFT, An international journal of advanced computer technology, 3 (1), January-2014 (Volume-III, Issue-I)
2.
3.
464
COMPUSOFT, An international journal of advanced computer technology, 3 (1), January-2014 (Volume-III, Issue-I)
Indexing
Techniques
1
2
3
4
5
Intermediate
Result size
Index Size
I/O
required
Response
Time
Tree Merge
Large
Large
Large
More
High
Stack Tree
Path Stack
Twig Stack
Inverted list
Using structure
index
SSI
Large
Less
Less
Very Less
Large
Large
Large
Large
Less
Less
Less
Less
Moderate
Less
Less
Less
High
High
High
High
Very Less
Small
Less
T bit map
Very Less
Small
Less
Pc Index
Very Less
Moderate
Less
Less
Very Less
Large
Less
Less
Very Less
7
8
9
III.
Flexib ility
CONCLUSION
Less
Efficiency reduces
when a query
contains non
initial //
Not suitable for
Deeply nested tree
Not suitable for
Deeply nested tree
Less
465
COMPUSOFT, An international journal of advanced computer technology, 3 (1), January-2014 (Volume-III, Issue-I)
IV.
[1].
[2].
[3].
[4].
[5].
[6].
[7].
[8].
[9].
REFERENCES
466
[10].
[11].
[12].
[13].
[14].
[15].
[16].