SV - VLSP2021: The Smartcall - ITS’s Systems

Dinh Van Hung; Mai Van Tuan; Dam Ba Quyen; Nguyen Quoc Bao

doi:10.25073/2588-1086/vnucsce.339

Dinh Van Hung, Mai Van Tuan, Dam Ba Quyen, Nguyen Quoc Bao

PDF

Published Jun 30, 2022

DOI: https://doi.org/10.25073/2588-1086/vnucsce.339

How to Cite

HUNG, Dinh Van et al. SV - VLSP2021: The Smartcall - ITS’s Systems. VNU Journal of Science: Computer Science and Communication Engineering, [S.l.], v. 38, n. 1, june 2022. ISSN 2588-1086. Available at: <//jcsce.vnu.edu.vn/index.php/jcsce/article/view/339>. Date accessed: 24 july 2026. doi: https://doi.org/10.25073/2588-1086/vnucsce.339.

ABNT APA BibTeX CBE EndNote - EndNote format (Macintosh & Windows) MLA ProCite - RIS format (Macintosh & Windows) RefWorks Reference Manager - RIS format (Windows only) Turabian

Issue

Vol 38 No 1: Special Issue: The 8th International Workshop on Vietnamese Language and Speech Processing (VLSP 2021)

Section

Special Issue on Vietnamese Language and Speech Processing (VLSP2021)

Abstract

This paper presents the Smartcall - ITS’s systems submitted to the Vietnamese Language and Speech Processing, Speaker Verification (SV) task. The challenge consists of two tasks focusing on the development of SV models with limited data and testing the robustness of SV systems. In both tasks, we used various pre-trained speaker embedding models with different architectures: TDNN, Resnet34. After a specific fine-tuning strategy with data from the organiser, our system achieved the first rank for both two tasks with the Equal Error Rate respectively are 1.755%, 1.95%. In this paper, we describe our system developed for the booth two tasks in the VLSP2021 Speaker Verification shared-task.

Article Sidebar

Article Details

Main Article Content

Abstract