D. Ramm, D. Chazan
SPIE Optics, Imaging, and Instrumentation 1994
The paper addresses the problem of optimally estimating (in the ML sense) the pitch of each of several speakers talking simultaneously. This information is needed in systems which perform co-channel speech separation. We propose a multi-pitch model which is used in conjunction with an EM-based iterative estimation scheme. In addition, the pitch period of each speaker is allowed to vary linearly in the analysis interval, thus offering improved co-channel speech separation. The proposed algorithm is shown to outperform standard pitch detection algorithms, in detecting the pitch of simultaneous speakers.
D. Ramm, D. Chazan
SPIE Optics, Imaging, and Instrumentation 1994
R. Hoory, D. Chazan
ICPR 1994
N. Merhav, D. Chazan
IEEEI 1984
D. Chazan, Y. Medan, et al.
ICASSP 1990