GSoC/GCI Archive
Google Summer of Code 2011 CMUSphinx Speech Recognition Toolkit

Training the acoustic model on long audio files

by Michal Krajňanský for CMUSphinx Speech Recognition Toolkit

Optimalisation of SphinxTrain by the utilization of massively parallel hardware - NVIDIA CUDA framework: Enable the acoustic model training on long audio files by the utilization of NVIDIA CUDA architecture. Incorporate the technique to reduce the memory requirements of Baum-Welch algorithm. Modify SphinxTrain to be able to process long input audio files.