VLSP 2021 - vnNLI Challenge: Vietnamese and English-Vietnamese Textual Entailment

Ngo The Quyen; Hoang Tuan Anh; Nguyen Thi Minh Huyen; Nguyen Lien

doi:10.25073/2588-1086/vnucsce.363

Ngo The Quyen, Hoang Tuan Anh, Nguyen Thi Minh Huyen, Nguyen Lien

PDF

Published Dec 16, 2022

DOI: https://doi.org/10.25073/2588-1086/vnucsce.363

How to Cite

QUYEN, Ngo The et al. VLSP 2021 - vnNLI Challenge: Vietnamese and English-Vietnamese Textual Entailment. VNU Journal of Science: Computer Science and Communication Engineering, [S.l.], v. 38, n. 2, dec. 2022. ISSN 2588-1086. Available at: <//jcsce.vnu.edu.vn/index.php/jcsce/article/view/363>. Date accessed: 25 july 2026. doi: https://doi.org/10.25073/2588-1086/vnucsce.363.

ABNT APA BibTeX CBE EndNote - EndNote format (Macintosh & Windows) MLA ProCite - RIS format (Macintosh & Windows) RefWorks Reference Manager - RIS format (Windows only) Turabian

Issue

Vol 38 No 2: Special Issue: The 8th International Workshop on Vietnamese Language and Speech Processing (VLSP 2021)

Section

Special Issue on Vietnamese Language and Speech Processing (VLSP2021)

Abstract

This paper presents the first challenge on recognizing textual entailment (RTE), also known as natural language inference (NLI), held in a Vietnamese Language and Speech Processing workshop (VLSP 2021).
The challenge aims to determine, for a given pair of sentences, whether the two sentences semantically agree, disagree, or are neutral/irrelevant to each other. The input sentences are in English or Vietnamese and may not be in the same language. This task is important in identifying, from different information sources, the evidence that supports or refutes a statement. The identification of such evidence is subsequently useful for many information tracking applications, such as opinion mining, brand and reputation management, and particularly fighting against fake news.
Through this challenge, we would like to provide an opportunity for participants who are interested in the problem, to contribute their knowledge to improve the existing techniques and methods for the task, so as to enhance the effectiveness of those applications.
In the paper, we introduce a collection of Vietnamese and English sentences in the domain of health that we built to serve as a benchmarking dataset for the task. We also describe the evaluation results of systems participating in the challenge.

Article Sidebar

Article Details

Main Article Content

Abstract