
Central and Eastern European Survey


ELSNET is the European Network in Human Language Technologies (http://www.elsnet.org)
This page is http://www.elsnet.org/survey/DeptofInformationTechnologies60200BRNOCzechRepublic/resources.html

Resources

Dept. of Information Technologies
Faculty of Informatics, Masaryk University

NL and Speech Resources available: Textual, Speech, Software, Lexical resources (including terminology).


Name: ESO Corpus, FIT Corpus, Spoken Czech Corpus
Nature: newspaper texts, computer journals, machine stem dictionaries for Czech, Slovak, Russian, English, German, French (size between 120 000 and 200 000 entries)
Language: Czech
Size: approximately 50 million word forms; spoken: now about 100 hours
Format: ASCII and SGML, WAV
Coverage: newspapers, computer journals, spoken Czech (interviews and dialogues)
Medium: hard disk, CD-ROM
Availability: free for research purposes, partially commercial products

Software description: Czech, Slovak and Russian spell checker; Czech, Slovak and Russian lemmatizer and tagger; Czech and Slovak hyphenation programs, available through personal contact; Czech Electronic Thesaurus; Czech-English and Czech-German Electronic Dictionary


This page is no longer maintained. Please visit http://www.elsnet.org/survey/quests to find out how to update your organisation profile or to find information about this organisation.



This page was generated 04-01-1998 by Steven Krauwer