Jellyman, Keith Andrew (2018). A study into automatic speaker verification with aspects of deep learning. University of Birmingham. M.Sc.
|
JellymanMScbyRes18.pdf
PDF - Accepted Version Download (5MB) |
Abstract
Advancements in automatic speaker verification (ASV) can be considered to be primarily limited to improvements in modelling and classification techniques, capable of capturing ever larger amounts of speech data.
This thesis begins by presenting a fairly extensive review of developments in ASV, up to the current state-of-the-art with i-vectors and PLDA. A series of practical tuning experiments then follows. It is found somewhat surprisingly, that even the training of the total variability matrix required for i-vector extraction, is potentially susceptible to unwanted variabilities.
The thesis then explores the use of deep learning in ASV. A literature review is first made, with two training methodologies appearing evident: indirectly using a deep neural network trained for automatic speech recognition, and directly with speaker related output classes. The review finds that interest in direct training appears to be increasing, underpinned with the intent to discover new robust 'speaker embedding' representations.
Last a preliminary experiment is presented, investigating the use of a deep convolutional network for speaker identification. The small set of results show that the network successfully identifies two test speakers, out of 84 possible speakers enrolled. It is hoped that subsequent research might lead to new robust speaker representations or features.
Type of Work: | Thesis (Masters by Research > M.Sc.) | ||||||
---|---|---|---|---|---|---|---|
Award Type: | Masters by Research > M.Sc. | ||||||
Supervisor(s): |
|
||||||
Licence: | |||||||
College/Faculty: | Colleges (2008 onwards) > College of Engineering & Physical Sciences | ||||||
School or Department: | School of Engineering, Department of Electronic, Electrical and Computer Engineering | ||||||
Funders: | None/not applicable | ||||||
Subjects: | T Technology > TK Electrical engineering. Electronics Nuclear engineering | ||||||
URI: | http://etheses.bham.ac.uk/id/eprint/8493 |
Actions
Request a Correction | |
View Item |
Downloads
Downloads per month over past year