Please use this identifier to cite or link to this item:
Title: Computational prediction of the human-microbial oral interactome
Authors: Coelho, Edgar D.
Arrais, Joel P. 
Matos, Sérgio 
Pereira, Carlos
Rosa, Nuno 
Correia, Maria José
Barros, Marlene 
Oliveira, José Luís 
Keywords: Protein-protein interactions; Oral interactome; Bayesian classification
Issue Date: 27-Feb-2014
Publisher: Springer Nature
Project: RD-CONNECT project (EC contract number 305444) 
Serial title, monograph or event: BMC Systems Biology
Volume: 8
Issue: 1
Abstract: Background: The oral cavity is a complex ecosystem where human chemical compounds coexist with a particular microbiota. However, shifts in the normal composition of this microbiota may result in the onset of oral ailments, such as periodontitis and dental caries. In addition, it is known that the microbial colonization of the oral cavity is mediated by protein-protein interactions (PPIs) between the host and microorganisms. Nevertheless, this kind of PPIs is still largely undisclosed. To elucidate these interactions, we have created a computational prediction method that allows us to obtain a first model of the Human-Microbial oral interactome. Results: We collected high-quality experimental PPIs from five major human databases. The obtained PPIs were used to create our positive dataset and, indirectly, our negative dataset. The positive and negative datasets were merged and used for training and validation of a naïve Bayes classifier. For the final prediction model, we used an ensemble methodology combining five distinct PPI prediction techniques, namely: literature mining, primary protein sequences, orthologous profiles, biological process similarity, and domain interactions. Performance evaluation of our method revealed an area under the ROC-curve (AUC) value greater than 0.926, supporting our primary hypothesis, as no single set of features reached an AUC greater than 0.877. After subjecting our dataset to the prediction model, the classified result was filtered for very high confidence PPIs (probability ≥ 1-10−7), leading to a set of 46,579 PPIs to be further explored. Conclusions: We believe this dataset holds not only important pathways involved in the onset of infectious oral diseases, but also potential drug-targets and biomarkers. The dataset used for training and validation, the predictions obtained and the network final network are available at
ISSN: 1752-0509
DOI: 10.1186/1752-0509-8-24
Rights: openAccess
Appears in Collections:I&D CNC - Artigos em Revistas Internacionais
I&D CISUC - Artigos em Revistas Internacionais
FCTUC Eng.Informática - Artigos em Revistas Internacionais

Files in This Item:
Show full item record

Google ScholarTM




This item is licensed under a Creative Commons License Creative Commons