To Provide services

Speech Corpus Design

Multi-Lingual and Multi-dialectal Speech Corpus
Speech Corpus Script Design
Speech Corpus for Recognition
Speech Corpus for Synthesis

Music Corpus Design and Annotation

Music Corpus Design
Music Corpus for Humming Recognition
Music Score and Lyric Annotation
Music Vocal to MIDI Translation

Speech Quality Evaluation

Multi-Lingual and Multi-dialectal Speech Quality Evaluation
MOS and AB Compare Test
Evaluation Text Script Design
Evaluation Tool Development

Annotation

Text Script Design
Document Classification
Key Word, NER, Word Property, and Chunk Annotation

Dubbing

Promotional Films Dubbing
Multi-dialectal Dubbing

Image Annotation

Image Object Annotation

Additional Database Design

Hand Writing Recognition Database

Sales data
    • A 3000-hour Accented Mandarin Speech Database for Recognition has been released. It is recorded in real environments using 16 kHz sampling frequency and 16-bit quantization accuracy mono PCM format. 3000 native Mandarin speakers (1:1 gender distribution) with their respected dialectal characteristics are carefully selected. The corpus are collected using multiple smart phone models with Andriod and IOS systems in both indoor and outdoor environments. During the corpus design and collection process, we have formed an efficient team with high standard capabilities. This leads to the great success for our corpus to have below 2% sentence-wise error, which is dominating in the current market. This corpus can be used for training and testing the speech recognition system, as well as speech analysis. It has been well-acknowledged by industry as a corpus with high speech quality and recognition accuracy.
    • A 1000-hour Mandarin-English Mixed Recognition Database has been released. It is recorded in real environments using 16 kHz sampling frequency and 16-bit quantization accuracy mono PCM format.
    • A 720-hour Multi-language Speech Recognition Database has been released. It contains Native Speakers from USA, UK, France, Germany, Italy, Spain, Mexico and Brazil (300 speakers or 90 hours for each country, 1:1 gender distribution).
    • A 500-hour Cantonese-English Mixed Corpus for Recognition has been released. It is recorded in real environments using 16 kHz sampling frequency and 16-bit quantization accuracy mono PCM format.
    • Graphic database for recognition has been released. In total of 20,000 figure outlines including body outline and face outline are labelled. The outline region is black with a white background. The 20,000 images include people of all ages and genders with their respected outfits and postures. 12,000 of the pictures are people in street snaps, photo albums or life photos with different sizes. The rest 8000 images are gait images originated in gait recognition database. During the database design and collection process, we have formed an efficient team with high standard capabilities. This leads to the deviation of outline is controlled in 3 pixel.
    • A 1000 people gait recognition database has been released. It covered 1000 people (1:1 gender distribution) with the age from 4 to 85 where 20% of the participants are above 60-year-old. Gait recognition database was collected by 24 Hikvision cameras in outdoor environment. 1000 participants with their normal walking posture was recorded by these cameras. Each participant is asked to walk 48 times with 3 sets of outfits (one is normal, one with a coat and the other one with a bag). The resolution ratio is 1920*1080@25fps and the format is MP4.

 

Huiting News
SINA Microblog

Huiting data is a very powerful database site, there are over 10,000 data collection and labeling staff.