Logo ESPERANTO
ESPERANTO

Dissemination

    • Speaker embeddings by modeling channel-wise correlations

      Stafylakis T., Rohdin J., Burget L., Speaker embeddings by modeling channel-wise correlations, Interspeech 2021

      Read more

    • Log-Likelihood-Ratio Cost Function as Objective Loss for Speaker Verification Systems

      Mingote, V., Miguel, A., Ortega, A., Lleida, E. "Log-Likelihood-Ratio Cost Function as Objective Loss for Speaker Verification Systems" 22nd Annual Conference of the International Speech Communication Association, INTERSPEECH 2021. Brno, Czech Republic.

      Read more

    • Unsupervised Representation Learning for Speech Activity Detection in the Fearless Steps Challenge 2021

      Gimeno, P; Ortega, A.; Miguel, A.; Lleida, E. "Unsupervised Representation Learning for Speech Activity Detection in the Fearless Steps Challenge 2021" 22nd Annual Conference of the International Speech Communication Association, INTERSPEECH 2021. Brno, Czech Republic.

      Read more

    • The Domain Mismatch Problem in the Broadcast Speaker Attribution Task

      Viñals, I.; Ortega, A.; Miguel, A.; Lleida, E. "The Domain Mismatch Problem in the Broadcast Speaker Attribution Task". Applied Sciences, vol. 11, no. 18, p. 8521, Sept. 2021.

      Read more

    • Generalising AUC Optimisation to Multiclass Classification for Audio Segmentation with Limited Training Data

      Gimeno, P.; Ortega, A.; Miguel, A.; Lleida, E. "Generalising AUC Optimisation to Multiclass Classification for Audio Segmentation with Limited Training Data". IEEE Signal Processing Letters, 28 , pp. 1135-1139, 2021.

      Read more

    • The LIUM Human Active Correction Platform for Speaker Diarization

      Flucha, A., Larcher, A., Mehrish, A., Meignier, S., Plaut, F., Poupon, N., Prokopalo, Y., Puertolas, A., Shamsi, M., Tahon, M. (2021) The LIUM Human Active Correction Platform for Speaker Diarization. Proc. Interspeech 2021, 965-966

      Read more

    • MultiSV: Dataset for Far-Field Multi-Channel Speaker Verification

      Mošner L., Plchot O., Burget L., Černock J. (202é) MultiSV: Dataset for Far-Field Multi-Channel Speaker Verification. ICASSP 2022,

      Read more

    • Multi-channel Speaker Verification with Conv-TasNet Based Beamformer

      Mošner L., Plchot O., Burget L., Černock J. (2022) Multi-channel Speaker Verification with Conv-TasNet Based Beamformer. ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

      Read more

    • Analyzing speaker verification embedding extractors and back-ends under language and channel mismatch

      Silnova A., Stafylakis T., Mosner L., Plchot O., Rohdin J., Matejka P., Burget L, Glembek O., Brummer N.. (2022) Analyzing speaker verification embedding extractors and back-ends under language and channel mismatch. Odyssey: The Speaker and Language Recognition Workshop 2022

      Read more

    • Development of ABC Systems for the 2021 Edition of NIST Speaker Recognition Evaluation

      Alam J., Beneš R., Beszédeš M., Burget L., Dahmane M., Fathan A., Ghodrati H., Glembek O., Kang W.H., Matĕjka P., Mošner L., Plchot O., Rohdin J., Silnova A., Stafylakis T.(2022) Development of ABC Systems for the 2021 Edition of NIST Speaker Recognition Evaluation. Odyssey: The Speaker and Language Recognition Workshop 2022

      Read more

    • Probabilistic Spherical Discriminant Analysis: An Alternative to PLDA for length-normalized embeddings

      Brummer N., Swart A., Mosner L, Silnova A., Plchot O., Stafylakis T. Burget L.,(2022) Probabilistic Spherical Discriminant Analysis: An Alternative to PLDA for length-normalized embeddings. Interspeech 2022

      Read more

    • Training Speaker Embedding Extractors Using Multi-Speaker Audio with Unknown Speaker Boundarie

      Stafylakis T., Mosner L, Plchot O., Rohdin J., Silnova A., Burget L., Cernocky J., (2022) Training Speaker Embedding Extractors Using Multi-Speaker Audio with Unknown Speaker Boundarie. Interspeech 2022

      Read more

    • Microphone Array Channel Combination Algorithms for Overlapped Speech Detection

      Mariotte T., Larcher A., Montresor S., Thomas J.-H. (2022) Microphone Array Channel Combination Algorithms for Overlapped Speech Detection. Interspeech 2022

      Read more

    • Phone-Level Pronunciation Scoring for Spanish Speakers Learning English Using a GOP-DNN System

      Vidal J., Bonomi C., Sancinetti M., Ferrer L.. (2022) Phone-Level Pronunciation Scoring for Spanish Speakers Learning English Using a GOP-DNN System. Interspeech 2022

      Read more

    • Generalizing AUC Optimization to Multiclass Classification for Audio Segmentation With Limited Training Data

      P. Gimeno, V. Mingote, A. Ortega, A. Miguel and E. Lleida, "Generalizing AUC Optimization to Multiclass Classification for Audio Segmentation With Limited Training Data," in IEEE Signal Processing Letters, vol. 28, pp. 1135-1139, 2021, doi: 10.1109/LSP.2021.3084501.

      Read more

    • The Domain Mismatch Problem in the Broadcast Speaker Attribution Task

      Viñals I., Ortega A., Miguel A., Lleida E. (2021) The Domain Mismatch Problem in the Broadcast Speaker Attribution Task in IberSPEECH 2020: Speech and Language Technologies for Iberian Languages. Appl. Sci. 2021, 11(18), 8521;

      Read more

    • End-to-End Speech Translation of Arabic to English Broadcast News.

      Bougares F. and Jouili S., End-to-End Speech Translation of Arabic to English Broadcast News, The Seventh Arabic Natural Language Processing Workshop (WANLP 2022)

      Read more

Partagez :