RSS twitter Login

logo

Home Contact Login

Mandarin Chinese Conversational Recognition Corpus from Magic Data Tech
Share this page!
twitter google-plus linkedin share
May 29, 2020, 3:48 p.m.

We are happy to announce that 1 new Speech resource is now available in our catalogue.

Mandarin Chinese Conversational Recognition Corpus from Magic Data Tech

This dataset consists of 4.98 hours of transcribed conversational speech in Mandarin Chinese, where 30 conversations are uttered by 32 speakers (16 males and 16 females). The audios are sampled at 16 kHz and quantized at 16 bits.
For each conversation, there are two close-talking channels recorded via the microphones, one for each speaker, as well as three far-field channels recorded by iPhone, Androïd Phone, and recorder respectively.

This corpus may be obtained as a complete set or by selecting specific channels (two close-talking channels shall be understood as 1 single channel): 

ELRA-S0409-01 MDT Mandarin Chinese Conversational Recognition Corpus - complete set
ISLRN: 559-956-475-937-1
For more information, see: http://catalog.elra.info/en-us/repository/browse/ELRA-S0409_01 
 
ELRA-S0409-02 MDT Mandarin Chinese Conversational Recognition Corpus - 1 channel 
ISLRN: 234-140-315-272-4
For more information, see: http://catalog.elra.info/en-us/repository/browse/ELRA-S0409_02
 
ELRA-S0409-03 MDT Mandarin Chinese Conversational Recognition Corpus - 2 channels 
ISLRN: 383-054-806-637-3
For more information, see: http://catalog.elra.info/en-us/repository/browse/ELRA-S0409_03
 
ELRA-S0409-04 MDT Mandarin Chinese Conversational Recognition Corpus - 3 channels 
ISLRN: 235-882-638-211-2
For more information, see: http://catalog.elra.info/en-us/repository/browse/ELRA-S0409_04

 
Check the announcement.