TCBR-HMM: An HMM-based text classifier with a CBR system
DATE: 2015-01
UNIVERSAL IDENTIFIER: http://hdl.handle.net/11093/6155
EDITED VERSION: https://linkinghub.elsevier.com/retrieve/pii/S1568494614005298
UNESCO SUBJECT: 1203.17 Informática
DOCUMENT TYPE: article
ABSTRACT
This paper presents an innovative solution for modeling distributed adaptive systems in biomedical environments. We present an original TCBR-HMM (Text Case-Based Reasoning-Hidden Markov Model) for biomedical text classification based on document content. The main goal is to propose a classifier that is more effective than current methods in this environment, where the model must be adapted to new documents within an iterative learning framework. To demonstrate this, we include a set of experiments performed on the OHSUMED corpus. Our classifier is compared with Naive Bayes and SVM techniques, which are commonly used in text classification tasks. The results suggest that the TCBR-HMM model is indeed more suitable for document classification. The model is empirically and statistically comparable to the SVM classifier and outperforms it in terms of time efficiency.
Files in this item
- Name: 2015_borrajo_cbr_system.pdf
- Size: 1.566Mb
- Format:
- Description: accepted manuscript