XML Dataset and Benchmarks for Performance Testing of the CLS Labelling Scheme
Abstract
Extensible Markup Language (XML) has become a significant technology for transferring data through the world of the Internet. XML labelling schemes are an essential technique used to handle XML data effectively. Labelling XML data is performed by assigning labels to all nodes in that XML document. CLS labelling scheme is a hybrid labelling scheme that was developed to address some limitations of indexing XML data. Moreover, datasets are used to test XML labelling schemes. There are many XML datasets available nowadays. Some of them are from real life datasets and others are from artificial datasets. These datasets and benchmarks are used for testing the XML labelling schemes. This paper discusses and considers these datasets and benchmarks and their specifications in order to determine the most appropriate one for testing the CLS labelling scheme. This research found out that the XMark benchmark is the most appropriate choice for the testing performance of the CLS labelling scheme.
Full text article
References
Connolly, T.M. and C.E. Begg, Database systems: a practical approach to design, implementation, and management. 2010, Boston, Mass: Addison-Wesley.
Klaib, A. and J. Lu. Investigation into Indexing XML Data Techniques. in Proceedings on the International Conference on Internet Computing (ICOMP). 2014. The Steering Committee of The World Congress in Computer Science, Computer Engineering and Applied Computing (WorldComp).
Bosak, J. and T. Bray, XML and the second-generation Web. Scientific American, 1999. 280(5): p. 89-93.
Elmasri, R. and S. Navathe, Fundamentals of database systems. Vol. Global;Seventh;. 2016, Upper Saddle River: Pearson.
Amagasa, T., M. Yoshikawa, and S. Uemura. QRS: A robust numbering scheme for XML documents. in Data Engineering, 2003. Proceedings. 19th International Conference on. 2003. IEEE.
Cohen, E., H. Kaplan, and T. Milo, Labeling dynamic XML trees. SIAM Journal on Computing, 2010. 39(5): p. 2048-2074.
Eda, T., et al. Dynamic range labeling for XML trees. in International Conference on Extending Database Technology. 2004. Springer.
O'Neil, P., et al. ORDPATHs: insert-friendly XML node labels. in Proceedings of the 2004 ACM SIGMOD international conference on Management of data. 2004. ACM.
Wu, X., M.L. Lee, and W. Hsu. A prime number labeling scheme for dynamic ordered XML trees. in Data Engineering, 2004. Proceedings. 20th International Conference on. 2004. IEEE.
Klaib, A. and J. Lu. Development of Database Structure and Indexing Technique for the Wireless Response System. in INFOCOMP 2013, The Third International Conference on Advanced Communications and Computation. 2013.
Ali Klaib, A., Clustering-based Labelling Scheme-A Hybrid Approach for Efficient Querying and Updating XML Documents. 2018, University of Huddersfield.
Schmidt, A., et al., Why and how to benchmark XML databases. ACM SIGMOD Record, 2001. 30(3): p. 27-32.
Böhme, T. and E. Rahm. XMach-1: A benchmark for XML data management. in Datenbanksysteme in Büro, Technik und Wissenschaft. 2001. Springer.
Böhme, T. and E. Rahm, Multi-user evaluation of XML data management systems with XMach-1, in Efficiency and Effectiveness of XML Tools and Techniques and Data Integration over the Web. 2003, Springer. p. 148-159.
Kanda Runapongsa, J.M.P., H. V. Jagadish, Yun Chen, Shurug Al-Khalifa The Michigan Benchmark. 2006 [cited 2016 12/6/2014]; Available from: http://dbgroup.eecs.umich.edu/mbench/description.html.
Bressan, S., et al. XOO7: Applying OO7 Benchmark to XML Query Processing Tools. in Proceedings of the ACM International Conference on Information and Knowledge Management (CIKM). 2001.
Carey, M.J., et al., A status report on the OO7 OODBMS benchmarking effort. ACM SIGPLAN Notices, 1994. 29(10): p. 414-426.
Mlýnková, I., Xml benchmarking: Limitations and opportunities. 2008, Technical Report, Department of Software Engineering, Charles University, Czech Republic.
KLAIB, A.A., A NEW METHOD FOR QUERYING XML DATA.
Klaib, A. and J. Lu. Development of Database Structure and Indexing Technique for the Wireless Response System. in Proceedings of the Third International Conference on Advanced Communications and Computation. Infocomp. IARIA, Lisbon, Portugal. 2013. Citeseer.
Nicola, M., I. Kogan, and B. Schiefer. An XML transaction processing benchmark. in Proceedings of the 2007 ACM SIGMOD international conference on Management of data. 2007. ACM.
Barbosa, D., et al. ToXgene: a template-based data generator for XML. in Proceedings of the 2002 ACM SIGMOD international conference on Management of data. 2002. ACM.
Yao, B.B., M.T. Özsu, and J. Keenleyside, Xbench-a family of benchmarks for xml dbmss, in Efficiency and Effectiveness of XML Tools and Techniques and Data Integration over the Web. 2003, Springer. p. 162-164.
Schmidt, A., et al. XMark: A benchmark for XML data management. in Proceedings of the 28th international conference on Very Large Data Bases. 2002. VLDB Endowment.
Li, Q. and B. Moon. Indexing and querying XML data for regular path expressions. in VLDB. 2001.
Runapongsa, K., et al., The Michigan benchmark: towards XML query performance diagnostics. Information Systems, 2006. 31(2): p. 73-97.
Yao, B.B., M.T. Ozsu, and N. Khandelwal. XBench benchmark and performance testing of XML DBMSs. in Data Engineering, 2004. Proceedings. 20th International Conference on. 2004. IEEE.
Mohammad, S. and P. Martin. LLS: level-based labeling scheme for XML databases. in Proceedings of the 2010 Conference of the Center for Advanced Studies on Collaborative Research. 2010. IBM Corp.
Almelibari, A.A., Labelling Dynamic XML Documents: A GroupBased Approach, in Computer Science. 2015, Sheffield University: Sheffield p. 308.
Authors
Copyright (c) 2021 Journal of Pure & Applied Sciences
This work is licensed under a Creative Commons Attribution 4.0 International License.
In a brief statement, the rights relate to the publication and distribution of research published in the journal of the University of Sebha where authors who have published their articles in the journal of the university of Sebha should how they can use or distribute their articles. They reserve all their rights to the published works, such as (but not limited to) the following rights:
- Copyright and other property rights related to the article, such as patent rights.
- Research published in the journal of the University of Sebha and used in its future works, including lectures and books, the right to reproduce articles for their own purposes, and the right to self-archive their articles.
- The right to enter a separate article, or for a non-exclusive distribution of their article with an acknowledgment of its initial publication in the journal of Sebha University.
Privacy Statement The names and e-mail addresses entered on the Sabha University Journal site will be used for the aforementioned purposes only and for which they were used.