ESPERANTO

Exchanges for SPEech ReseArch aNd TechnOlogies

Secondment of Federico Landini

From BUT to CONICET

End-to-end speaker diarization models aim to solve the task of "who spoke when" without the need for separately trained modules. However, these models still struggle to handle multiple speakers per recording. During the stay, we continued our efforts to improve their quality.