Peder A. Olsen, Ramesh A. Gopinath
IEEE Transactions on Speech and Audio Processing
This paper describes a computationally simple method for text-independent speaker verification using second-order statistics. The suggested method, called utterance level scoring (ULS), produces a normalized score in a single pass through the frames of the tested utterance. The utterance sample covariance is first calculated and then compared to the speaker covariance using a distortion measure. Subsequently, a distortion measure between the utterance covariance and the sample covariance of data taken from different speakers is used to normalize the score. Experimental results from the 2000 NIST speaker recognition evaluation are presented for ULS, used with different distortion measures, and for a Gaussian mixture model (GMM) system. The results indicate that ULS is a viable alternative to GMM whenever computational complexity must be traded off against verification accuracy.
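The scoring steps described in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract does not say which distortion measures were evaluated or how the background distortion enters the normalization, so the Frobenius distance and the subtraction used here are placeholders. The function names `sample_covariance`, `frobenius_distortion`, and `uls_score` are hypothetical.

```python
from math import sqrt


def sample_covariance(frames):
    """Return the (biased, 1/n) sample covariance of an iterable of feature
    vectors, accumulating first- and second-order sums in a single pass."""
    n = 0
    d = 0
    s = None   # running sum of x
    ss = None  # running sum of the outer products x x^T
    for x in frames:
        if s is None:
            d = len(x)
            s = [0.0] * d
            ss = [[0.0] * d for _ in range(d)]
        n += 1
        for i in range(d):
            s[i] += x[i]
            for j in range(d):
                ss[i][j] += x[i] * x[j]
    if n == 0:
        raise ValueError("utterance contains no frames")
    mean = [v / n for v in s]
    return [[ss[i][j] / n - mean[i] * mean[j] for j in range(d)]
            for i in range(d)]


def frobenius_distortion(a, b):
    """Placeholder distortion measure (assumption): Frobenius norm of a - b."""
    return sqrt(sum((a[i][j] - b[i][j]) ** 2
                    for i in range(len(a)) for j in range(len(a))))


def uls_score(frames, speaker_cov, background_cov,
              distortion=frobenius_distortion):
    """Distortion to the claimed speaker's covariance, normalized by the
    distortion to a covariance pooled over other speakers.
    Normalizing by subtraction is an assumption; lower means a better match."""
    utterance_cov = sample_covariance(frames)
    return (distortion(utterance_cov, speaker_cov)
            - distortion(utterance_cov, background_cov))
```

For example, calling `uls_score(frames, speaker_cov, background_cov)` with an utterance whose covariance matches `speaker_cov` more closely than `background_cov` gives a negative score under these assumptions. Any other distortion measure between two covariance matrices can be passed through the `distortion` argument.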