Fundamental Tools and Resource is Available for Vietnamese Analysis
Kanji Takahashi and Kazuhide Yamamoto. Fundamental Tools and Resource is Available for Vietnamese Analysis. Proceedings of the International Conference on Asian Language Processing (IALP 2016), pp.246-249 (2016.11)
This paper presents our work on developing Vietnamese fundamental tools and a resource for analysis. These tools are for word segmentation and part-of-speech tagging, diacritics restoration, and orthographical variants dictionary. All of them have been either not publicly available so far or not attaining sufficient performance. We have developed the tools and released the tools to the public, in both software packages and web tools. For development, we utilize state-of-the-art methods and achieved high accuracy. We briefly present the tasks, the methods and the performance of each tool and resource.
Tools and Resource URLs
- Vietnamese Joint Word Segmentation and POS Tagging Tools via SVM
- Vietnamese Joint Word Segmentation and POS Tagging Tools via CRF
- Vietnamese Diacritics Restoration Tool
- Vietnamese Syllable Normalization Dictionary and Script
- NLP Web Demonstration Template
Question & Answer at conference
Can this NLP template work Java program?
Yes, but you should know some basic Python skill, not difficult.
Python can call Java application as API or command.
I’m glad to hear that there is a demand for the NLP Web Demonstration Template.
I can exchange information about Vietnamese NLP with Vietnamese NLP researcher.