A syllable-based, pseudo-articulatory approach to speech recognition

Zhang, Li (2004). A syllable-based, pseudo-articulatory approach to speech recognition. University of Birmingham. Ph.D.

PDF - Accepted Version

Download (2MB)


The prevailing approach to speech recognition is Hidden Markov Modelling, which yields good performance. However, it ignores phonetics, which has the potential for going beyond the acoustic variance to provide a more abstract underlying representation.

The novel approach pursued in this thesis is motivated by phonetic and phonological considerations. It is based on the notion of pseudo-articulatory representations, which are abstract and idealized accounts of articulatory activity. The original work presented here demonstrates the recovery of syllable structure information from pseudo-articulatory representations directly without resorting to statistical models of phone sequences. The work is also original in its use of syllable structures to recover phonemes. This thesis presents the three-stage syllable based, pseudo-articulatory approach in detail. Though it still has problems, this research leads to a more plausible style of automatic speech recognition and will contribute to modelling and understanding speech behaviour. Additionally, it also permits a 'multithreaded' approach combining information from different processes.

Type of Work: Thesis (Doctorates > Ph.D.)
Award Type: Doctorates > Ph.D.
College/Faculty: Schools (1998 to 2008) > School of Computer Science
School or Department: School of Computer Science
Funders: Other
Other Funders: The University of Birmingham, Overseas Research Students Awards Scheme
Subjects: P Language and Literature > P Philology. Linguistics
Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Q Science > QA Mathematics > QA76 Computer software
URI: http://etheses.bham.ac.uk/id/eprint/4905


Request a Correction Request a Correction
View Item View Item


Downloads per month over past year