Finding semantic similarity in Vietnamese

Finding semantic similarity is an important task in many natural language processing applications. Despite numerous works for popular languages, there is still limited research done for Vietnamese. In this paper, we tackle the problem of finding semantic similarity for Vietnamese using Random Indexing and Hyperspace Analogue to Language to represent the semantics of words and documents. We build a system to find synonyms in Vietnamese. Experimental results show that our system achieves accuracies of 75% for finding synonyms for verbs and 65% for synonyms for nouns.


Title: 

Finding semantic similarity in Vietnamese
Authors: Nguyen, Dat Tien; Pham, Son Bao
Keywords: Hyperspace Analogue to Language
Word space model
Random projection
Semantic vector
Issue Date: 2010
Publisher: H. : ĐHQGHN
Abstract: Finding semantic similarity is an important task in many natural language processing applications. Despite numerous works for popular languages, there is still limited research done for Vietnamese. In this paper, we tackle the problem of finding semantic similarity for Vietnamese using Random Indexing and Hyperspace Analogue to Language to represent the semantics of words and documents. We build a system to find synonyms in Vietnamese. Experimental results show that our system achieves accuracies of 75% for finding synonyms for verbs and 65% for synonyms for nouns. © 2010 IEEE.
Description: Proceedings - 2010 International Conference on Asian Language Processing, IALP 2010 2010, Article number 5681551, Pages 91-94
URI: http://repository.vnu.edu.vn/handle/VNU_123/26877
ISBN: 978-076954288-1
Appears in Collections:Bài báo của ĐHQGHN trong Scopus

Nhận xét

Bài đăng phổ biến từ blog này

Đá Silic

Xây dựng mạng lưới quan trắc môi trường nước, bùn đáy tại thượng nguồn hệ thống sông Hồng : Luận văn ThS. Khoa học môi trường và bảo vệ môi trường: 60 44 03