Description
eSpeak is simple, it works, and it has the great advantage of running locally. But the voices it produces are far inferior to those of, for example, Google's APIs (which are pretty damn near perfect).
Since there is so much room for improvement, this might be worth looking into. It would be nice to keep eSpeak as a fallback if, for example, internet connectivity fails.
Also, if Google's APIs are used to generate TTS, it would be a good idea to cache the audio output of the responses. That would prevent unnecessary calls to an unofficial API that may fail at any time (and that discloses information about users' habits to Google).
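To make the idea concrete, here is a rough sketch of the cache-plus-fallback flow in Python. Everything in it is an assumption for illustration: the function names (`cache_path`, `synthesize`), the `online` and `offline` callables, and the cache layout are hypothetical, and no real Google or eSpeak API is called. The two backends are passed in as plain functions that take text and return audio bytes.

```python
import hashlib
from pathlib import Path
from typing import Callable

# Hypothetical backend type: takes text, returns encoded audio bytes.
Synth = Callable[[str], bytes]


def cache_path(cache_dir: Path, text: str, voice: str) -> Path:
    """Map a (voice, text) pair to a stable file name inside the cache."""
    key = hashlib.sha256(f"{voice}\n{text}".encode("utf-8")).hexdigest()
    return cache_dir / f"{key}.audio"


def synthesize(
    text: str,
    online: Synth,
    offline: Synth,
    cache_dir: Path,
    voice: str = "default",
) -> bytes:
    """Return audio for `text`, preferring cache, then online, then offline."""
    path = cache_path(cache_dir, text, voice)

    # 1. Cached audio: no network call, nothing disclosed to a third party.
    if path.exists():
        return path.read_bytes()

    # 2. Try the online (higher quality) backend.
    try:
        audio = online(text)
    except Exception:
        # 3. Online backend failed (no connectivity, API changed, ...).
        #    Fall back to the local backend. The result is deliberately not
        #    cached, so a later online attempt can still store better audio.
        return offline(text)

    cache_dir.mkdir(parents=True, exist_ok=True)
    path.write_bytes(audio)
    return audio
```

In this sketch the `offline` callable is where eSpeak would sit, and `online` would wrap whatever web API is chosen. Hashing the voice together with the text keeps audio from different voices from colliding in the cache.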
This should be considered low priority, but I personally care a great deal about this issue.