I'm not sure, but couldn't you have it call a hap file and put the delay inside that file? So basically a timed hap file.
The only other thing I can think of at the moment is to build the delay into the .ogg file itself, e.g. have 5 secs of silence before the actual clip.
It would still be hell to synchronise this with Hal though.
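For the silence-padding idea, here's a rough sketch of how the padding could be done. It's not a tested recipe for Hal. I'm assuming you still have the clip as an uncompressed WAV before it gets encoded to .ogg, since Python's standard library can read and write WAV but has no Ogg encoder. The function name and the file names are made up for illustration, and you'd still have to re-encode the result to .ogg with whatever tool you normally use.

```python
import wave


def prepend_silence(src_path, dst_path, seconds=5.0):
    """Write a copy of the WAV at src_path with `seconds` of silence in front.

    Hypothetical helper: assumes an uncompressed PCM WAV as input.
    """
    with wave.open(src_path, "rb") as src:
        params = src.getparams()
        frames = src.readframes(src.getnframes())

    # One frame holds one sample per channel; sampwidth is bytes per sample.
    n_silent_frames = int(params.framerate * seconds)
    # 8-bit WAV samples are unsigned (silence is 0x80); wider ones are signed (0x00).
    silent_byte = b"\x80" if params.sampwidth == 1 else b"\x00"
    silence = silent_byte * (n_silent_frames * params.nchannels * params.sampwidth)

    with wave.open(dst_path, "wb") as dst:
        dst.setparams(params)  # frame count in the header is corrected on close
        dst.writeframes(silence + frames)
```

You'd call it like `prepend_silence("clip.wav", "clip_delayed.wav", seconds=5)` and then encode `clip_delayed.wav` to .ogg. The delay is baked into the file, so changing it means regenerating the clip.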