Project description
 |
TIGER is a joint project of the Department of Computational Linguistics and
phonetics in Saarbrücken, the Institute of Natural Language Processing
(IMS) in Stuttgart, and the Institut für Germanistik in Potsdam.
The project has been funded by the Deutsche Forschungsgemeinschaft (DFG) since
1999. The aim of the project is the creation of a large syntactically annotated
corpus of German newspaper text. It comprises the following tasks: the
development of a scheme for the syntactic annotation of German newspaper texts
(the scheme should be as theory-independent as possible in order to ensure a
high degree of acceptance and re-usability), the development of new techniques
for the automation of corpus annotation (aiming at very fast but still very
reliable and accurate annotation), the syntactic annotation of newspaper texts
(based on the annotation scheme and tools for automation) and phenomenon-based
retrieval of sentences from the annotated corpus.
Results at the current stage: The TIGER Corpus Release 1 (40,000 sentences) has
been released in July 2003. Corpora can be annotated semi-automatically with
the Annotate tool. Syntactically annotated corpora can be explored in the
TIGERSearch software. |