M. Képesi: Dekompozice základního tónu pro sledování více mluvčích | |
FIT Božetěchova 2, seminární místnost UPGM L220 23.6.2008 On this seminar talk, a recently proposed method of joint pitch (Fo) and direction of arrival (DoA) extraction for speaker localization will be introduced. Beside the basic idea of the multidimensional Position-Pitch (PoPi) decomposition, its application in surveillance and hands-free communication systems will be discussed. The proposed method will be demonstrated on concurrent speaker and moving speaker scenarios on real-world recordings and compared to state-of-art DoA estimation methods. Comparison results will show the power of the method in determining the accurate pitch and position estimates for multi-source scenarios. We will see that tje Position-Pitch decomposition (PoPi) gives an intuitive representation of all active speakers in terms of their respective position and pitch estimates. |