Project description: Read Clean Sentence Speech Corpus (CleanSent01)

[ ID = 0055 ] CleanSent01 
Project name: Read Clean Sentence Speech Corpus (CleanSent01)
Short name or acronym: CleanSent01
Project URL: http://www.sitec.or.kr/English/index.asp
Project description

◦ Sentences read in a soundproof room environment
◦ A total of 200 speakers (Male: 100, Female: 100)
◦ Age: 20-29 (50%), 30-39 (30%), 40-49 (20%)
◦ Soundproof room
◦ Microphone: AKG C414-ULS + Sennheiser HMD 280 PRO (simultaneous 
◦ System
 - Mixer: Behringer Eurorack MXB1002
◦ Sampling and data format
 - 16,000 Hz, 16-bit, Windows WAVE format
◦ Prompts are selected from a subset of the 21st Century Sejong Project Morpheme Analysis Corpus (containing 10,000,000 words (eojeol))
◦ Prompts 
 - 20,217 sentences selected with the lowest-frequency morphemes taken into account
 - 589 phonetically balanced sentences
◦ Prompts sheet
 - 20 sets
◦ Per speaker
 - 1 set of 105-107 sentences
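The sampling and data format listed above (16,000 Hz, 16 bit, Windows wave) can be checked on a recording with a short script. The following is a minimal sketch using Python's standard `wave` module; the function name `check_wav_format` and the example file name are hypothetical and are not part of the corpus distribution. The channel count is not checked, since the profile does not state it.

```python
import wave

# Values taken from the "Sampling and data format" entry of the profile.
EXPECTED_RATE_HZ = 16_000
EXPECTED_SAMPLE_WIDTH_BYTES = 2  # 16 bit = 2 bytes per sample


def check_wav_format(path):
    """Return a list of mismatches between a WAV file and the stated format.

    An empty list means the file matches the sampling rate and bit depth
    given in the profile. (Hypothetical helper, for illustration only.)
    """
    problems = []
    with wave.open(path, "rb") as wav:
        rate = wav.getframerate()
        width = wav.getsampwidth()
        if rate != EXPECTED_RATE_HZ:
            problems.append(
                f"sampling rate is {rate} Hz, expected {EXPECTED_RATE_HZ} Hz"
            )
        if width != EXPECTED_SAMPLE_WIDTH_BYTES:
            problems.append(
                f"sample width is {width * 8} bit, "
                f"expected {EXPECTED_SAMPLE_WIDTH_BYTES * 8} bit"
            )
    return problems
```

A call such as `check_wav_format("speaker001_sentence001.wav")` (an assumed file name) returns an empty list when the header matches the stated format.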
Project duration: May 2001 - Feb 2006
Name: Researcher Yongnam Um
Organisation: Wonkwang University
Address: Speech Information Technology & Industry Promotion Center, Wonkwang University, 344-2, Sinyong-dong
City: 570-749 Iksan, Chonbuk
Country: Korea (South)
Phone: +82 63-850-7452
Fax: +82 63-850-7454
Last update: 2005-11-07 05:55:31


Page generated 13-09-2013 by Steven Krauwer