Use this identifier to cite this record: https://hdl.handle.net/10316/7640
Title: On Text-based Mining with Active Learning and Background Knowledge Using SVM
Authors: Silva, Catarina; Ribeiro, Bernardete
Date: 2007
Citation: Soft Computing - A Fusion of Foundations, Methodologies and Applications. 11:6 (2007) 519-530
Abstract: Text mining, intelligent text analysis, text data mining and knowledge discovery in text are commonly used aliases for the process of extracting relevant and non-trivial information from text. Some crucial issues arise when trying to solve this problem, such as document representation and the shortage of labeled data. This paper addresses these problems by introducing information from unlabeled documents into the training set, using the support vector machine (SVM) separating margin as the differentiating factor. Besides studying the influence of several pre-processing methods and assessing their relative significance, we also evaluate the benefits of introducing background knowledge in an SVM text classifier. We further evaluate the possibility of active learning and propose a method for successfully combining background knowledge and active learning. Experimental results show that the proposed techniques, used alone or combined, considerably improve classification performance, even when only small labeled training sets are available.
URI: https://hdl.handle.net/10316/7640
DOI: 10.1007/s00500-006-0080-8
Rights: openAccess
Appears in collections: FCTUC Eng.Informática - Artigos em Revistas Internacionais
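
The abstract's central idea, using the SVM separating margin to decide how unlabeled documents are treated, can be illustrated with a minimal sketch. It is one plausible reading of the abstract, not the authors' implementation. It assumes that documents far from the hyperplane keep their predicted label as background knowledge, and that documents close to it are sent to a human annotator as active-learning queries. The function name `split_by_margin`, both thresholds, and the demo scores are hypothetical. In practice the signed decision values would come from a trained SVM, for example scikit-learn's `SVC.decision_function`.

```python
from typing import Dict, List, Tuple


def split_by_margin(
    scores: Dict[str, float],
    confident_at: float = 1.0,
    uncertain_below: float = 0.2,
) -> Tuple[List[Tuple[str, int]], List[str]]:
    """Partition unlabeled documents by distance from the SVM hyperplane.

    `scores` maps a document id to its signed decision value f(x).
    Both thresholds are illustrative assumptions, not values from the paper.
    """
    self_labeled: List[Tuple[str, int]] = []  # far from the boundary: keep predicted label
    to_query: List[str] = []                  # near the boundary: ask a human annotator
    for doc_id, score in scores.items():
        if abs(score) >= confident_at:
            self_labeled.append((doc_id, 1 if score > 0 else -1))
        elif abs(score) < uncertain_below:
            to_query.append(doc_id)
        # Documents between the two thresholds are left untouched.
    return self_labeled, to_query


if __name__ == "__main__":
    # Hypothetical decision values for four unlabeled documents.
    demo = {"d1": 1.7, "d2": -0.05, "d3": -1.2, "d4": 0.6}
    labeled, queries = split_by_margin(demo)
    print("add to training set:", labeled)
    print("query annotator:", queries)
```

In a full loop, the self-labeled and newly annotated documents would be added to the training set and the classifier retrained before the remaining unlabeled documents are scored again.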

Files in this record:
File: obra.pdf
Size: 210.47 kB
Format: Adobe PDF

All records in the repository are protected by copyright law, with all rights reserved.