Please use this identifier to cite or link to this item: https://hdl.handle.net/10316/113751
Title: Automatic wheeze segmentation using harmonic-percussive source separation and empirical mode decomposition
Authors: Rocha, Bruno M. 
Pessoa, Diogo 
Marques, Alda
Carvalho, Paulo de 
Paiva, Rui Pedro 
Keywords: Respiratory sound analysis; expert systems; harmonic-percussive source separation; empirical mode decomposition; sound event detection
Issue Date: 23-Feb-2023
Publisher: IEEE
Project: This work was supported in part by the FCT - Foundation for Science and Technology, in part by the I.P./MCTES through national funds (PIDDAC), within the scope of CISUC R&D Unit under Projects UIDB/00326/2020 and UIDP/00326/2020, in part by the Ph.D. scholarships under Projects SFRH/BD/135686/2018 and DFA/BD/4927/2020, in part by the WELMO, in part by the Horizon 2020 Framework Programme of the European Union under Grant 825572, in part by the FCT project Lung@ICU under Grant DSAIPA/AI/0113/2020, in part by the Fundo Europeu de Desenvolvimento Regional (FEDER), in part by the Programa Operacional Competitividade e Internacionalização (COMPETE), in part by the iBiMED under Project POCI- 01-0145-FEDER-007628, and in part by the FCT under Project UIDB/04501/2020. 
Serial title, monograph or event: IEEE Journal of Biomedical and Health Informatics
Volume: 27
Issue: 4
Abstract: Wheezes are adventitious respiratory sounds commonly present in patients with respiratory conditions. The presence of wheezes and their time location are relevant for clinical reasons, such as understanding the degree of bronchial obstruction. Conventional auscultation is usually employed to analyze wheezes, but remote monitoring has become a pressing need during recent years. Automatic respiratory sound analysis is required to reliably perform remote auscultation. In this work we propose a method for wheeze segmentation. Our method starts by decomposing a given audio excerpt into intrinsic mode frequencies using empirical mode decomposition. Then, we apply harmonic-percussive source separation to the resulting audio tracks and get harmonic-enhanced spectrograms, which are processed to obtain harmonic masks. Subsequently, a series of empirically derived rules are applied to find wheeze candidates. Finally, the candidates stemming from the different audio tracks are merged and median filtered. In the evaluation stage, we compare our method to three baselines on the ICBHI 2017 Respiratory Sound Database, a challenging dataset containing various noise sources and background sounds. Using the full dataset, our method outperforms the baselines, achieving an F1 of 41.9%. Our method's performance is also better than the baselines across several stratified results focusing on five variables: recording equipment, age, sex, body-mass index, and diagnosis. We conclude that, contrary to what has been reported in the literature, wheeze segmentation has not been solved for real life scenario applications. Adaptation of existing systems to demographic characteristics might be a promising step in the direction of algorithm personalization, which would make automatic wheeze segmentation methods clinically viable.
URI: https://hdl.handle.net/10316/113751
DOI: 10.1109/JBHI.2023.3248265
Rights: openAccess
Appears in Collections:I&D CISUC - Artigos em Revistas Internacionais

Show full item record

Page view(s)

26
checked on Apr 24, 2024

Download(s)

6
checked on Apr 24, 2024

Google ScholarTM

Check

Altmetric

Altmetric


This item is licensed under a Creative Commons License Creative Commons