Please use this identifier to cite or link to this item: https://hdl.handle.net/10316/44319
Title: Learning Supervised Topic Models for Classification and Regression from Crowds
Authors: Rodrigues, Filipe 
Lourenco, Mariana 
Ribeiro, Bernardete 
Pereira, Francisco 
Issue Date: 2017
Publisher: IEEE
Serial title, monograph or event: IEEE Transactions on Pattern Analysis and Machine Intelligence
Abstract: The growing need to analyze large collections of documents has led to great developments in topic modeling. Since documents are frequently associated with other related variables, such as labels or ratings, much interest has been placed on supervised topic models. However, the nature of most annotation tasks, which are prone to ambiguity and noise and often involve high volumes of documents, makes learning under a single-annotator assumption unrealistic or impractical for most real-world applications. In this article, we propose two supervised topic models, one for classification and another for regression problems, which account for the heterogeneity and biases among different annotators that are encountered in practice when learning from crowds. We develop an efficient stochastic variational inference algorithm that is able to scale to very large datasets, and we empirically demonstrate the advantages of the proposed models over state-of-the-art approaches.
URI: https://hdl.handle.net/10316/44319
DOI: 10.1109/TPAMI.2017.2648786
Rights: openAccess
Appears in Collections: FCTUC Eng.Informática - Artigos em Revistas Internacionais

Files in This Item:
File: 07807338.pdf
Size: 1.5 MB
Format: Adobe PDF

Scopus citations: 74 (checked on Apr 15, 2024)
Web of Science citations: 52 (checked on Apr 2, 2024)
Page views: 420 (checked on Apr 16, 2024)
Downloads: 583 (checked on Apr 16, 2024)



Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.