S0009 : COST232 Multi-English Speech Database
The COST232 consortium collected a "Multi-English"
speech database over the telephone in Europe. Originally, it had been planned
to collect data only at FUB (Fondazione Ugo Bordoni) in Rome, but in the event
it was also possible to make a collection at BT labs in the UK. A total of 797
"successful" calls were collected.
The data was collected from the following countries: Belgium,
Czechoslovakia, Denmark, England, Germany, Italy, Norway, Portugal, Slovenia,
Spain, Sweden and Switzerland, and each country provided 8 speakers who made
2 calls from a fixed set and a mobile to both the Italian and UK collection
system (i.e. a total of 8 calls per speaker). Two countries received the calls
- Italy and the UK, using different types of collecting equipment (FUB in Rome
used analog lines and BT in the UK used digital ones).
Everybody had to repeat the same vocabulary - the "TI
(Texas Instrument) words" - which makes this database unique in many respects.
Each speaker uttered the following items:
- The name of the speaker's laboratory
- The digits ("oh", zero, one , two, three, four, five, six, seven,
eight and nine)
- The words ("yes, no, erase, rubout, stop, start, help, enter, repeat,
go")
Although the database was intended to aid for speech recognition,
it is also balanced and can therefore be used for speaker recognition training
and testing.
Click here to view the prices and browse other ressources belonging to this category |