Multi-channel source separation with video data

dc.contributor.author	Mosayyebpour, Sahand
dc.contributor.supervisor	Gulliver, Aaron
dc.date.accessioned	2024-12-19T21:55:12Z
dc.date.available	2024-12-19T21:55:12Z
dc.date.issued	2024
dc.degree.department	Department of Electrical and Computer Engineering
dc.degree.level	Master of Applied Science MASc
dc.description.abstract	This research introduces a supervised multi-channel audio source separation system that integrates a video-based face detection system. The face detector identifies the nose position, aiding the multi-channel processing in isolating the primary speaker while suppressing environmental background noise and distracting secondary speakers. It is demonstrated that in far-field applications, multi-channel processing struggles with distracting secondary speakers when the primary speaker position is unknown. Utilizing video data provides valuable insights to identify the target speaker and assists the audio source separation system in directing its focus towards the target speaker. Furthermore, it is shown that multi-channel processing benefits from speaker position information to improve noise reduction in noisy reverberant environments.
dc.description.scholarlevel	Graduate
dc.identifier.uri	https://hdl.handle.net/1828/20872
dc.language	English	eng
dc.language.iso	en
dc.rights	Available to the World Wide Web
dc.title	Multi-channel source separation with video data
dc.type	Thesis

Files

Now showing 1 - 1 of 1

Now showing 1 - 1 of 1