VnDP: A Vietnamese dependency parsing toolkit
VnDP is a Vietnamese dependency parsing toolkit which integrates a pre-trained dependency parsing model (MSTparser) [M06] and a pre-trained POS tagging model (RDRPOSTagger) [N14] for Vietnamese. The pre-trained parsing model was trained on our Vietnamese dependency treebank VnDT consisting of 10,200 sentences.
The evaluation of the VnDP toolkit and the construction of the VnDT treebank are detailed in our paper:
Dat Quoc Nguyen, Dai Quoc Nguyen, Son Bao
Pham, Phuong-Thai Nguyen and Minh Le Nguyen. From treebank Conversion to
Automatic Dependency Parsing for Vietnamese. In Proceedings of 19th International Conference on Application of Natural
Language to Information Systems, NLDB'14, Springer LNCS, pp. 196-207, 2014.
[CameraReadyVersion.pdf] [OnlineVersion] [.bib]
The VnDP toolkit is available to download at: https://sourceforge.net/projects/vndp/files/VnDPv1.0.1.zip
The VnDT treebank is distributed for research or educational purposes only.
To obtain the VnDT treebank, please fill and return the following license (.pdf) to
Please cite our NLDB'14 paper in any publication reporting on results obtained with the help of the VnDP toolkit or the VnDT treebank.
NOTE: You might want to try our new toolkit VnCoreNLP which produces significantly better parsing results than VnDP on the VnDT treebank.
[M06] McDonald, R., Lerman, K., and Pereira, F. 2006. Multilingual Dependency Analysis with a Two-stage Discriminative Parser. In Proceedings of the Tenth Conference on Computational Natural Language Learning, pp. 216–220.
[N14] Dat Quoc Nguyen, Dai Quoc Nguyen, Dang Duc Pham, and Son Bao Pham. 2014. RDRPOSTagger: A Ripple Down Rules-based Part-Of-Speech Tagger. In Proceedings of the Demonstrations at the 14th Conference of the European Chapter of the Association for Computational Linguistics, pp. 17-20.
Last updated: Feb 25, 2018