Metadata of a distributed data mining system

Ewa Płuciennik-Psota


Distributed computing and data mining are both essential to many commercial and scientific organizations. This article presents the metadata of a distributed data mining system that builds global models by combining local models created at distributed nodes. The metadata is divided into the following categories: primal, derivative, work, and test data for local and global models.


metadata; databases; distributed data mining; SQL
