Project description
The Prague Czech-English Dependency Treebank (PCEDT) is a new Czech-English
parallel resource suitable for experiments in structural machine translation.
The Penn Treebank is being translated into Czech, and the dependency annotation of
the Czech translation is produced automatically from plain text. The annotation of
the Penn Treebank is transformed into a dependency annotation scheme. A subset of
corresponding Czech and English sentences has been annotated by humans. First
experiments in Czech-English machine translation using these data have already
been carried out. The resources, which are being created at Charles University in
Prague, are scheduled for release as a Linguistic Data Consortium data collection in 2004.