Far Field Command Word Speech Recognition Database



Far field command word corpus for recognition has been released. It covered 200 speakers with the age from 17 to 32 with 1:1 gender distribution. The typescripts are 600 fixed Chinese command words, including intelligent household appliances, wake statement, vehicle statement and so on.

The corpus is collected using various Android smart phone models and primarily in indoor (quiet and anechoic ) environments. It is recorded 6 voice signal at the same time, and the detailed information for this far field command word corpus is listed below:


Recording Devices

Audio Format


Multichannel recorder with professional microphones

48kHz,16bit,4 channel,PCM wave

one microphone is located at talker’s mouth; the other three microphone array are located at talker’s front position with 50 cm away

Smart Phone Model A

16kHz, 16bit, stereo, PCM wave


Smart Phone Model B

16kHz, 16bit, mono, PCM wave



During the corpus design and collection process, we have formed an efficient team with high standard capabilities. This leads to the great success for our corpus to have below 2% sentence-wise error, which is dominating in the current market.

This corpus can be used for training and testing the children speech recognition system, as well as speech analysis. It has been well-acknowledged by industry as a corpus with high speech quality and recognition accuracy.

    • News Title