Plenary talk by Petr Schwarz (BUT) and Themos Stafylakis (Omilia), JSALT 2023

Introduction to speaker identification and deep fake context.

Petr will present the design of a speaker identification system based on the ResNet neural network architecture. He will also cover the basic principles used in speech synthesis, voice morphing, and speech codecs, and explain how speaker identification, speech synthesis, and speech codecs can affect each other in the real world.


Extracting speaker and emotion information from self-supervised speech models.

Themos will then present hot topics in speaker identification research, with an emphasis on self-supervised models.


Watch the live stream here

