Natural language & AI
Alexandre Rademaker, Fabricio Chalub, Livy Real, Claudia Freitas, Eckhard Bick, Valeria de Paiva
This paper describes the process of converting the Portuguese Bosque corpus to the Universal Dependencies scheme version 2. The conversion was done by applying to the corpus a careful context-sensitive conversion process from its original deep linguistic analysis sources. Universal Dependencies offer the promise of greater parallelism between languages. The process consisted of converting a constituency treebank, together with additional manual revision of the trees, all informed by a live Constraint Grammar parsing framework. This process shows some of the essential difficulties of dealing with a Romance language, Portuguese.
(in http://www.aclweb.org/anthology/W/W17/W17-65.pdf#page=207)Read/download now