Methods for increasing precision in information retrieval - automatic categorization

Karol Kozak


This article describes methods for organizing the documents returned in search results, together with their implementation in information systems. It presents an approach that categorizes documents automatically. The approach was implemented in systems that serve as repositories of scientific documents in the medical domain of psychiatry.


information retrieval; text mining; medical informatics; document classification; archival sources; metadata documents


