S0010 : Dutch Polyphone database
The Dutch Polyphone corpus contains the recordings of 5,050
Dutch speakers recorded over the fixed telephone network (ISDN line). The corpus
comprises 222,075 speech files (based on 44 or, in a few cases 43 items per
speaker), which all have been orthographically transcribed. The data were collected
in 8-bit A-law digital form.
The corpus contains both read and extemporaneous items.
The recorded items consist of isolated digits, numbers (one
telephone number, two bank accounts or credit card numbers, and the participation
number), a postal code, guilder amounts, time, date, amounts, application words,
sentences with application word, phonetically rich sentences, spelled words,
city names.
Several questions were asked to get some spontaneous speech
(e.g. Is Dutch your native language?, Did you ever live in another
country than the Netherlands? In which cities did you grow up? Are you
a man or a woman? Are you calling from your home phone?, etc.).
Click here to view the prices and browse other ressources belonging to this category |