We created a basic dialplan on the Asterisk server that asks the caller to say the number they wish to dial, then using Sphinx speech recognition software translate that to text. We use a unix function called text2wave that will then convert this text into a wav file to be played back to the caller. The caller will then hear the number generated and they can confirm if it is correct or not. If it is, the caller will be notified that they will be connected shortly, and the server will call the number. If it is incorrect, then the caller will be asked to say the number again.
PROBLEM
Currently, Sphinx is acting erratically; i.e. sometime it just doesn’t do anything. Shawn* said it’s because our SIP provider sends it poorly formatted audio files that Sphinx can’t understand. I also noticed another problem with Sphinx. It worked perfectly well with a male voice, but with my voice it doesn’t translate the digits correctly. I’m not sure what we will do to tackle this problem.
We need to investigate these two issues.
*Special thanks to Shawn Van Avery for all his online tutorials and help.