Abstract
Advancements in sensing technology and deep learning have paved the way for non-invasive monitoring solutions in healthcare. This paper reports on developing an audio-based monitoring system for detecting swallowing and fluid intake events. Sound signals were collected from 13 subjects using a throat microphone. A framework was introduced, comparing multiple 1D-2D conversion methods to convert audio signals into image data suitable for Convolutional Neural Network (CNN) based classification. Different pre-trained CNN models were fine-tuned on our dataset, pairing with different signal lengths. The classification results indicated that Short-time Fourier Transform (STFT) was the optimal method, with an accuracy of over 97% across all signal lengths and Networks. The swallowing detection mechanism achieved 92.45% accuracy and 0.88 F1-score, with the capability to occur 0.35 seconds before the signal peak. These results and robust performance under diverse noise conditions highlight the system's potential for real-time application in real-world scenarios.
Original language | English |
---|---|
Title of host publication | 2024 IEEE 22nd Mediterranean Electrotechnical Conference (MELECON) |
Pages | 479-484 |
Number of pages | 6 |
Publication status | Accepted/In press - 2024 |
Event | 2024 IEEE 22nd Mediterranean Electrotechnical Conference (MELECON) - Porto, Portugal Duration: 25 Jun 2024 → 27 Jun 2024 |
Conference
Conference | 2024 IEEE 22nd Mediterranean Electrotechnical Conference (MELECON) |
---|---|
Country/Territory | Portugal |
City | Porto |
Period | 25/06/2024 → 27/06/2024 |
Keywords
- swallowing detection
- fluid intake
- deep learning
- sound classification