Methods of normalizing XML data

Tomasz Piłka


Normalization is the process of organizing data in a database efficiently. It has two goals: eliminating redundant data and ensuring that data dependencies make sense. In this paper we discuss methods for normalizing XML data. For XML data the problem is harder than for relational data, because the hierarchical structure of the data must also be taken into account.
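
As an illustration only (not taken from the paper), the redundancy problem can be sketched in Python. The document, the element names `course`, `student` and `name`, the attributes `cno` and `sno`, and both helper functions are hypothetical. The sketch assumes an XML functional dependency in which a student number determines a student name: because students are nested under courses, the same name is stored once per enrollment.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

# Hypothetical document: a student's name is repeated under every course taken.
DOC = """
<courses>
  <course cno="c1">
    <student sno="s1"><name>Ann</name></student>
    <student sno="s2"><name>Bob</name></student>
  </course>
  <course cno="c2">
    <student sno="s1"><name>Ann</name></student>
  </course>
</courses>
"""


def fd_violations(root):
    """Return sno values that occur with more than one name (sno -> name fails)."""
    names = defaultdict(set)
    for student in root.iter("student"):
        names[student.get("sno")].add(student.findtext("name"))
    return sorted(sno for sno, seen in names.items() if len(seen) > 1)


def redundant_copies(root):
    """Count stored names beyond the single copy needed per distinct student."""
    students = list(root.iter("student"))
    return len(students) - len({s.get("sno") for s in students})


root = ET.fromstring(DOC)
violations = fd_violations(root)   # students whose copies of the name disagree
copies = redundant_copies(root)    # names a normalized design would not store
```

In this sketch the dependency holds, yet one name is stored twice. A normalized design would keep each name once, for example in a separate list of students keyed by `sno`, so that an update cannot make the copies disagree.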


normalization; XML functional dependency; data redundancy; XML normal form


