For example, say I "give" the number 1736 and I have 100 .wav files (0.wav, 1.wav, etc.). How should I concatenate the audio clips so the result sounds more "fluid"? Most of the time there is a gap between the numbers and the result sounds very "hard". I want it to sound as if a real person were saying the number, or as close to that as possible (sound quality aside).
This can be in any language (PHP, Python, etc.); I just need the logic/algorithm.
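To make the problem concrete, here is a rough Python sketch of the kind of thing I have in mind. It is a guess at a direction, not a known-good solution: trim the near-silent head and tail of each clip, then overlap neighbouring clips with a short linear crossfade. It assumes 16-bit mono PCM files that all share one sample rate. The function names, the `threshold` of 500 and the `overlap` of 400 samples are made-up values for illustration only.

```python
import array
import sys
import wave


def read_wav(path):
    """Read a 16-bit mono PCM .wav file; return (sample_rate, list of ints)."""
    with wave.open(path, "rb") as w:
        if w.getsampwidth() != 2 or w.getnchannels() != 1:
            raise ValueError("this sketch assumes 16-bit mono PCM")
        samples = array.array("h")
        samples.frombytes(w.readframes(w.getnframes()))
        if sys.byteorder == "big":  # WAV data is little-endian
            samples.byteswap()
        return w.getframerate(), samples.tolist()


def trim_silence(samples, threshold=500):
    """Drop leading and trailing samples quieter than threshold (assumed value)."""
    start, end = 0, len(samples)
    while start < end and abs(samples[start]) < threshold:
        start += 1
    while end > start and abs(samples[end - 1]) < threshold:
        end -= 1
    return samples[start:end]


def crossfade(a, b, overlap):
    """Join a and b, blending the last/first `overlap` samples linearly."""
    n = min(overlap, len(a), len(b))
    if n == 0:
        return a + b
    mixed = []
    for i in range(n):
        t = (i + 1) / (n + 1)  # weight of b rises from near 0 to near 1
        mixed.append(int(round(a[len(a) - n + i] * (1 - t) + b[i] * t)))
    return a[:len(a) - n] + mixed + b[n:]


def join_clips(names, clips, overlap=400):
    """Concatenate clips[name] for each name, trimmed and crossfaded."""
    out = []
    for name in names:
        out = crossfade(out, trim_silence(clips[name]), overlap)
    return out
```

With the clips loaded into a dict keyed by file name (for instance `clips["17"]` from 17.wav), 1736 could be built as `join_clips(["17", "36"], clips)`. How the number is split into clip names depends on the language being spoken, so that part is left out here.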
I'm not sure whether this question is too vague; if it is, feel free to tell me and I'll remove it.
Thanks.