Ngo The Quyen, Hoang Tuan Anh, Nguyen Thi Minh Huyen, Nguyen Lien

Main Article Content

Abstract

This paper presents the first challenge on recognizing textual entailment (RTE), also known as natural language inference (NLI), held in a Vietnamese Language and Speech Processing workshop (VLSP 2021).
The challenge aims to determine, for a given pair of sentences, whether the two sentences semantically agree, disagree, or are neutral/irrelevant to each other. The input sentences are in English or Vietnamese and may not be in the same language. This task is important in identifying, from different information sources, the evidence that supports or refutes a statement. The identification of such evidence is subsequently useful for many information tracking applications, such as opinion mining, brand and reputation management, and particularly fighting against fake news.
Through this challenge, we would like to provide an opportunity for participants who are interested in the problem, to contribute their knowledge to improve the existing techniques and methods for the task, so as to enhance the effectiveness of those applications.
In the paper, we introduce a collection of Vietnamese and English sentences in the domain of health that we built to serve as a benchmarking dataset for the task. We also describe the evaluation results of systems participating in the challenge.