University of York
Workshop at Newcastle University
June 11, 2023
Postdoctoral Research Associate
University of York
Person-specific Automatic Speaker Recognition: Understanding the behaviour of individuals for applications of ASR
DPhil Candidate, MPhil (Distinction)
University of Oxford
Investigating the tonal system of Plastic Mandarin: A cross-varietal comparison
1 2 3 4 5 6
1 2 3 4 5 6
Automatic Speech Recognition (ASR) or Speech-to-text (STT)
Related ASR tasks:
1 2 3 4 5 6
Multiple sources of Variation
1 2 3 4 5 6
1 2 3 4 5 6
Input
Output
Aim
Statistical Approach (Traditional)
Recorded speech as a sequence of acoustic feature vectors, X
Word sequence as W
To find the most likely W, given X
Statistical models are trained using a corpus of labelled training utterances (Xn, Wn)
1 2 3 4 5 6
Desirable properties:
Typical acoustic features:
1 2 3 4 5 6
Phonemes:
Phones: