VnDP: A Vietnamese dependency parsing toolkit

Copyright  2014-2016 by Dat Quoc Nguyen and Dai Quoc Nguyen

VnDP is a Vietnamese dependency parsing toolkit which integrates a pre-trained parsing model [M06] and a pre-trained POS tagging model [N14] for Vietnamese. The parsing model was trained on our VnDT Vietnamese dependency Treebank which was automatically converted from the Vietnamese constituent Treebank [N09].

The evaluation of the VnDP toolkit and the construction of the VnDT Treebank are detailed in our NLDB'14 paper:

Dat Quoc Nguyen, Dai Quoc Nguyen, Son Bao Pham, Phuong-Thai Nguyen and Minh Le Nguyen. From Treebank Conversion to Automatic Dependency Parsing for Vietnamese. In Proceedings of 19th International Conference on Application of Natural Language to Information Systems, NLDB'14, Springer LNCS, pp. 196-207, 2014.
[CameraReadyVersion.pdf] [OnlineVersion] [.bib]

The VnDP toolkit is available to download at:

The VnDT Treebank is distributed for research or educational purposes only. To obtain the VnDT Treebank, please fill and return the following license (.pdf) to

Please cite our NLDB'14 paper in any publication reporting on results obtained with the help of the VnDP toolkit or the VnDT Treebank.


[M06] McDonald, R., Lerman, K., and Pereira, F. 2006. Multilingual Dependency Analysis with a Two-stage Discriminative Parser. In Proceedings of the Tenth Conference on Computational Natural Language Learning, pp. 216220.

[N09] Nguyen, P.T., Vu, X.L., Nguyen, T.M.H., Nguyen, V.H., and Le, H.P. 2009. Building a Large Syntactically-Annotated Corpus of Vietnamese. In Proceedings of the Third Linguistic Annotation Workshop, pp. 182185.

[N14] Dat Quoc Nguyen, Dai Quoc Nguyen, Dang Duc Pham, and Son Bao Pham. 2014. RDRPOSTagger: A Ripple Down Rules-based Part-Of-Speech Tagger. In Proceedings of the Demonstrations at the 14th Conference of the European Chapter of the Association for Computational Linguistics, pp. 17-20.

Last updated: April, 2016