Enhancing Intake Monitoring: Transfer Learning for Audio-Based Detection of Swallowing Events

Xin Chen, Ernest Kamavuako*

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference paperpeer-review

Abstract

Advancements in sensing technology and deep learning have paved the way for non-invasive monitoring solutions in healthcare. This paper reports on developing an audio-based monitoring system for detecting swallowing and fluid intake events. Sound signals were collected from 13 subjects using a throat microphone. A framework was introduced, comparing multiple 1D-2D conversion methods to convert audio signals into image data suitable for Convolutional Neural Network (CNN) based classification. Different pre-trained CNN models were fine-tuned on our dataset, pairing with different signal lengths. The classification results indicated that Short-time Fourier Transform (STFT) was the optimal method, with an accuracy of over 97% across all signal lengths and Networks. The swallowing detection mechanism achieved 92.45% accuracy and 0.88 F1-score, with the capability to occur 0.35 seconds before the signal peak. These results and robust performance under diverse noise conditions highlight the system's potential for real-time application in real-world scenarios.
Original languageEnglish
Title of host publication2024 IEEE 22nd Mediterranean Electrotechnical Conference (MELECON)
Pages479-484
Number of pages6
Publication statusAccepted/In press - 2024
Event2024 IEEE 22nd Mediterranean Electrotechnical Conference (MELECON) - Porto, Portugal
Duration: 25 Jun 202427 Jun 2024

Conference

Conference2024 IEEE 22nd Mediterranean Electrotechnical Conference (MELECON)
Country/TerritoryPortugal
CityPorto
Period25/06/202427/06/2024

Keywords

  • swallowing detection
  • fluid intake
  • deep learning
  • sound classification

Fingerprint

Dive into the research topics of 'Enhancing Intake Monitoring: Transfer Learning for Audio-Based Detection of Swallowing Events'. Together they form a unique fingerprint.

Cite this