Multi-channel source separation with video data

dc.contributor.authorMosayyebpour, Sahand
dc.contributor.supervisorGulliver, Aaron
dc.date.accessioned2024-12-19T21:55:12Z
dc.date.available2024-12-19T21:55:12Z
dc.date.issued2024
dc.degree.departmentDepartment of Electrical and Computer Engineering
dc.degree.levelMaster of Applied Science MASc
dc.description.abstractThis research introduces a supervised multi-channel audio source separation system that integrates a video-based face detection system. The face detector identifies the nose position, aiding the multi-channel processing in isolating the primary speaker while suppressing environmental background noise and distracting secondary speakers. It is demonstrated that in far-field applications, multi-channel processing struggles with distracting secondary speakers when the primary speaker position is unknown. Utilizing video data provides valuable insights to identify the target speaker and assists the audio source separation system in directing its focus towards the target speaker. Furthermore, it is shown that multi-channel processing benefits from speaker position information to improve noise reduction in noisy reverberant environments.
dc.description.scholarlevelGraduate
dc.identifier.urihttps://hdl.handle.net/1828/20872
dc.languageEnglisheng
dc.language.isoen
dc.rightsAvailable to the World Wide Web
dc.titleMulti-channel source separation with video data
dc.typeThesis

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
University_of_Victoria__UVic__LaTeX_thesis_template___2020 (4).pdf
Size:
2.7 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.62 KB
Format:
Item-specific license agreed upon to submission
Description: