Please use this identifier to cite or link to this item: https://hdl.handle.net/10316/7640
DC FieldValueLanguage
dc.contributor.authorSilva, Catarina-
dc.contributor.authorRibeiro, Bernardete-
dc.date.accessioned2009-02-17T10:24:51Z-
dc.date.available2009-02-17T10:24:51Z-
dc.date.issued2007en_US
dc.identifier.citationSoft Computing - A Fusion of Foundations, Methodologies and Applications. 11:6 (2007) 519-530en_US
dc.identifier.urihttps://hdl.handle.net/10316/7640-
dc.description.abstractAbstract Text mining, intelligent text analysis, text data mining and knowledge-discovery in text are generally used aliases to the process of extracting relevant and non-trivial information from text. Some crucial issues arise when trying to solve this problem, such as document representation and deficit of labeled data. This paper addresses these problems by introducing information from unlabeled documents in the training set, using the support vector machine (SVM) separating margin as the differentiating factor. Besides studying the influence of several pre-processing methods and concluding on their relative significance, we also evaluate the benefits of introducing background knowledge in a SVM text classifier. We further evaluate the possibility of actively learning and propose a method for successfully combining background knowledge and active learning. Experimental results show that the proposed techniques, when used alone or combined, present a considerable improvement in classification performance, even when small labeled training sets are available.en_US
dc.language.isoengeng
dc.rightsopenAccesseng
dc.titleOn Text-based Mining with Active Learning and Background Knowledge Using SVMen_US
dc.typearticleen_US
dc.identifier.doi10.1007/s00500-006-0080-8en_US
uc.controloAutoridadeSim-
item.grantfulltextopen-
item.fulltextCom Texto completo-
item.openairetypearticle-
item.languageiso639-1en-
item.openairecristypehttp://purl.org/coar/resource_type/c_18cf-
item.cerifentitytypePublications-
crisitem.author.researchunitCISUC - Centre for Informatics and Systems of the University of Coimbra-
crisitem.author.parentresearchunitFaculty of Sciences and Technology-
crisitem.author.orcid0000-0002-5656-0061-
crisitem.author.orcid0000-0002-9770-7672-
Appears in Collections:FCTUC Eng.Informática - Artigos em Revistas Internacionais
Files in This Item:
File Description SizeFormat
obra.pdf210.47 kBAdobe PDFView/Open
Show simple item record

SCOPUSTM   
Citations

25
checked on Apr 29, 2024

WEB OF SCIENCETM
Citations 5

21
checked on May 2, 2024

Page view(s) 50

516
checked on May 7, 2024

Download(s)

336
checked on May 7, 2024

Google ScholarTM

Check

Altmetric

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.