Project description: Multimodal Speech Corpus

[ ID = 0079 ]	Multimodal01
Project name	Multimodal Speech Corpus
Short name or acronym	Multimodal01
Project URL	http://www.sitec.or.kr/English/index.asp
Project description
◦ Multimodal corpus of voice and video of the frontal face, captured by camcorder
◦ A total of 100 speakers of the standard dialect (age: 20 to 30)
◦ Studio: 230 cm × 230 cm
◦ Lighting: basic lighting + front (one 200 W), right/left (two 100 W), and ground (one 60 W)
◦ Background: blue screen
◦ Equipment: Sony DCR-VX2000
◦ Video sampling and data format (avi)
  - resolution: 720 × 480
  - frame rate: 30 fps
  - compression: MPEG-4 V1, key frame at maximum
◦ Speech sampling and data format (wav)
  - 16 kHz, 16 bits, mono, Microsoft Windows WAVE format
◦ 5 basic vowels
◦ 12 single digits
◦ 452 phonetically balanced words (PBW)
◦ Per speaker
  - 5 basic vowels
  - 12 single digits
  - 90 to 92 PBWs
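The speech format stated in the project description (16 kHz, 16 bits, mono, WAVE) can be checked against a file header. The following is a minimal sketch using Python's standard `wave` module; the function names and the idea of validating files this way are illustrative assumptions, not part of the corpus distribution.

```python
import wave

# Audio format stated in the project description: 16 kHz, 16 bits, mono.
EXPECTED_RATE_HZ = 16000
EXPECTED_SAMPLE_WIDTH_BYTES = 2  # 16 bits per sample
EXPECTED_CHANNELS = 1            # mono


def matches_corpus_format(path):
    """Return True if the WAV header at `path` matches the stated format.

    Hypothetical helper: the corpus itself does not ship such a tool.
    """
    with wave.open(path, "rb") as wav:
        return (
            wav.getframerate() == EXPECTED_RATE_HZ
            and wav.getsampwidth() == EXPECTED_SAMPLE_WIDTH_BYTES
            and wav.getnchannels() == EXPECTED_CHANNELS
        )


def duration_seconds(path):
    """Return the duration of the WAV file at `path` in seconds."""
    with wave.open(path, "rb") as wav:
        return wav.getnframes() / wav.getframerate()
```

At this format, one second of speech occupies 16000 samples × 2 bytes = 32000 bytes of audio data, excluding the file header.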
Languages	Korean
Funding	mixed
Project duration	May 2001 - Feb 2006
Contact
Name	Researcher Yongnam Um
Organisation	Wonkwang University
Address	Speech Information Technology & Industry Promotion Center, Wonkwang University, 344-2, Sinyong-dong
City	570-749 Iksan, Chonbuk
Country	Korea (South)
Email	umyongnam_at_sitec.or.kr
Phone	+82 63-850-7452
Fax	+82 63-850-7454
Last update: 2005-11-01 07:58:12

Page generated 13-09-2013 by Steven Krauwer