dupa

Author Topic: AT&T Natural Voices pronunciations  (Read 2399 times)

Cefwyn

  • Newbie
  • *
  • Posts: 45
    • View Profile
AT&T Natural Voices pronunciations
« on: March 20, 2009, 04:33:08 pm »
I just started trying out Ultra Hal Assistant today and I must say it's a very intriguing application. One thing that I am having trouble with however is the pronunciations of words using the AT&T Natural Voices. These voices don't appear to fully integrate into Hal as the "pronounciation" button in the Speech settings does nothing and there appears to be no way to load custom .dict files created with AT&T's WinDictEdit application. The reason this is important to me is that even though the Natural Voices TTS engine translates abbreviations like "Dr." to it's full word "doctor", when used in Hal this does not happen and Dr. is pronounced as either "drrr" or "D R". I have tried loading my custom .dict file into AT&Ts TTS engine directly with it's configuration file and it appeared to work for a while, but the pronunciation of the words set in my custom .dict file remained inconsistent as sometimes it would pronounce it correctly and other times not. Is there any way of fixing this or must I replace the words in the brain to be replaced automatically with however it should be pronounced?
"I've just decided to lobotomist my hal..."

EULA: By reading this you agree not to take offense to anything contained in post above. If you do choose to take offense, you forfeit all rights to criticize same post unless said criticism contains important corrections and helpful information.

jasondude7116

  • Sr. Member
  • ****
  • Posts: 475
    • View Profile
AT&T Natural Voices pronunciations
« Reply #1 on: March 20, 2009, 08:25:33 pm »
things like:
"DR." & vbCrLf   or   "DR. "   or   "DR."   or   "DR. Smith"

might cause the voice to pronounce it differently.
i am not sure with the voices you are using, but it does with a couple of the voices i have used.

so it is possible that it might be affected by slightly different responses, through different programs in the "brain"

-the dude
 

Cefwyn

  • Newbie
  • *
  • Posts: 45
    • View Profile
AT&T Natural Voices pronunciations
« Reply #2 on: March 20, 2009, 08:49:59 pm »
That makes sense. I also understand that a lot of it is a limitation of how TTS is implemented on windows and the different engines just don't play all that well together, especially when the 64-bit engine can't run a 32-bit voice and all that nonsense. I spent a great deal of time with Festival under linux and it really makes me sad to think that something as fully scriptable as that never got more attention with the TTS crowd. The problems I'm having with the AT&T voices aren't entirely Ultra Hal's fault but if there was some way to load the pronunciations from the .dict files (which are probably specific to the AT&T voices), then that would fix the problem and even allow me to control the phrasing of words nearly as well as in Festival(I'm pretty sure Festival did it's analysis on a sentence by sentence basis instead of word for word, which is how SAPI5 works if I'm not mistaken, which made for much more natural sounding speech).

I've been trying my hardest to figure all this out on my own, but the SAPI technology really is a mess, as most documentation doesn't differentiate between a voice and a TTS engine, and then there is the fact that Microsofts TTS tools are as minimalistic as possible leaving it up to the makers of the voices and engines to write the tools to configure such things as lexicon objects. The more I read about SAPI5 and it's design the more I think Ultra Hal itself is supposed to provide an interface to adjust pronunciation regardless of the voice engine used but that's just silly.

EDIT: I just had another look at where the AT&T voices are installed and messed with the config file again and discovered that Vista was being it's usual temperamental self and the permissions were being set strangely. With the lexical dictionary file overridden with my custom version Hal now picks up my changes reliably, unfortunately any changes to the dictionary requires restarting Hal to pick up those changes. There still has to be a better way to alter SAPI5 lexical objects.
« Last Edit: March 20, 2009, 09:15:26 pm by Cefwyn »
"I've just decided to lobotomist my hal..."

EULA: By reading this you agree not to take offense to anything contained in post above. If you do choose to take offense, you forfeit all rights to criticize same post unless said criticism contains important corrections and helpful information.