A 720-hour Multi-language Speech Recognition Database
A 720-hour Multi-language Speech Recognition Database has been released. It contains speech from native speakers from the USA, the UK, France, Germany, Italy, Spain, Mexico and Brazil (300 speakers and 90 hours per country, with a 1:1 gender distribution). Recordings were made with a smartphone (16 kHz sampling rate, 16-bit quantization, mono PCM) and with a desktop microphone (44.1 kHz sampling rate, 16-bit quantization, mono PCM). The corpus was collected in both indoor and outdoor environments.
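As an illustration of the two recording formats described above, the following is a minimal sketch of how a user might check that a WAV file matches the stated sampling rate, bit depth and channel count. It assumes the audio is delivered as standard WAV files, which the description does not confirm; the device labels "phone" and "desktop" and the function name check_wav_format are hypothetical.

```python
import wave

# Recording formats as stated in the corpus description.
# The keys "phone" and "desktop" are hypothetical labels, not corpus metadata.
EXPECTED_FORMATS = {
    "phone": {"rate_hz": 16000, "sample_width_bytes": 2, "channels": 1},
    "desktop": {"rate_hz": 44100, "sample_width_bytes": 2, "channels": 1},
}


def check_wav_format(path, device):
    """Return a list of mismatches between a WAV header and the expected format."""
    expected = EXPECTED_FORMATS[device]
    with wave.open(path, "rb") as wav:
        actual = {
            "rate_hz": wav.getframerate(),
            "sample_width_bytes": wav.getsampwidth(),  # 2 bytes = 16-bit
            "channels": wav.getnchannels(),            # 1 = mono
        }
    return [
        f"{key}: expected {expected[key]}, found {actual[key]}"
        for key in expected
        if actual[key] != expected[key]
    ]
```

An empty list means the file header agrees with the stated format for that device; any other result lists the fields that differ.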
During corpus design and collection, we assembled an efficient, highly capable team. As a result, the corpus has a sentence-wise error rate below 5%, which we believe compares favorably with other corpora currently on the market.
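The description does not say how the sentence-wise error is measured. One common reading is the fraction of sentences whose transcript differs from a verified reference in any way. The sketch below computes that figure under this assumption; the function names and the case and whitespace normalization are illustrative choices, not the method used for this corpus.

```python
def normalize(text):
    """Lowercase and collapse whitespace (an assumed, illustrative normalization)."""
    return " ".join(text.lower().split())


def sentence_error_rate(references, transcripts):
    """Fraction of sentences whose transcript does not match its reference exactly."""
    if len(references) != len(transcripts):
        raise ValueError("references and transcripts must have the same length")
    if not references:
        raise ValueError("at least one sentence is required")
    errors = sum(
        normalize(ref) != normalize(hyp)
        for ref, hyp in zip(references, transcripts)
    )
    return errors / len(references)


# Hypothetical example: one of four sentences contains a transcription error.
references = ["hello world", "good morning", "thank you", "see you"]
transcripts = ["Hello  world", "good morning", "thank you", "see ya"]
rate = sentence_error_rate(references, transcripts)
print(f"{rate:.0%}")  # prints "25%"
```

Under this definition, a corpus meeting the stated figure would return a value below 0.05 on a checked sample.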
This corpus can be used to train and test speech recognition systems, as well as for speech analysis. It has been well received in industry for its high speech quality and recognition accuracy.