Evaluation of speech enhancement techniques for speaker recognition in noisy environments

El-Solh, Abdel-Aziz

Download PDF

Resource Type

Thesis

Creator

El-Solh, Abdel-Aziz

Abstract

In automatic speaker recognition (ASR) applications, the presence of background noise severely degrades recognition performance. There is a strong demand for speech enhancement algorithms capable of removing background noise. In this thesis, a Gaussian mixture model based automatic speaker recognition system is used for evaluating the performance of five different speech enhancement techniques. Previously, it was shown that these techniques improved the SNR of the speech signals corrupted by noise but their effect on the speaker recognition performance was not fully investigated. In this work, we implement these enhancement techniques and evaluate their performance as preprocessing blocks to the ASR engine. We evaluate the performance based on speaker recognition accuracy, average segmental signal-to-noise ratio and perceptual evaluation of speech quality (PESQ) scores. We combine clean speech from the TIMIT database with eight different types of noise from the NOISEX-92 database representing synthetic and natural background noise samples and analyze the overall system performance. Simulation results show that the system is capable of reducing noise with little speech degradation and the overall recognition performance can be improved at a range of different signal-to-noise ratios (SNR) with different noise types. Furthermore, results show that different enhancement techniques have different strengths and weaknesses, depending on their application and the background noise type.

Subject

Language

English

Publisher

Carleton University

Thesis Degree Level

Master's

Thesis Degree Name

Master of Applied Science (M.App.Sc.)

Thesis Degree Discipline

Engineering, Systems and Computer

Identifier

DOI: https://doi.org/10.22215/etd/2006-07976

Rights Notes

Copyright © 2006 the author(s). Theses may be used for non-commercial research, educational, or related academic purposes only. Such uses include personal study, research, scholarship, and teaching. Theses may only be shared by linking to Carleton University Institutional Repository and no part may be used without proper attribution to the author. No part may be used for commercial purposes directly or indirectly via a for-profit platform; no adaptation or derivative works are permitted without consent from the copyright owner.

Date Created

2006

Relations

In Collection:

Theses and Dissertations

Items

Thumbnail	Title	Date Uploaded	Visibility	Actions
	el-solh-evaluationofspeechenhancementtechniquesfor.pdf	2023-05-03	Public	Download

Evaluation of speech enhancement techniques for speaker recognition in noisy environments

Downloadable Content

Relations

Items