我有从0到99的wav文件，连接时使它们听起来很好的最佳逻辑是什么？ [关闭]

For example, I "give" the number 1736, and I have 100 .wav files (like 0.wav, 1.wav, etc), how should I concatenate the audios to make them sound more "fluid". Most of the time they have a gap in between the numbers and sound very "hard", I want to listen them as if a real person was saying it, well, as close as possible (exluding the sound quality).

This can be in any language, PHP, Python, etc. I just need the logic/algorithm.

Not sure if it's a vague question, feel free to tell me so I remove it if that's the case.

Thanks.

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

1条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
dpj775835868 2018-11-23 19:23
关注
The issue you're likely having is intonation.

When speaking, the rising and falling tones help indicate phrasing. If I say, "one, seven, three, six", and end with a falling tone (pitch going down), it sounds final and the listener knows they've heard all the digits. If I end with a rising tone (pitch going up), it sounds like I'm asking a question, which is weird to the listener since the numbers aren't a question.

To make this sound more natural, at a minimum, you'll need to record each with different intonation and put them together correctly.

There's another problem though with the phrasing. When speaking, it sounds best when continuously moving air and using articulation to enunciate the words. If you were to record the sound of a radio announcer and play it back while filtering out all of the higher frequencies so that you couldn't hear the articulation, you would hear something close to a continuous tone that would change a bit in pitch. This isn't something you'll get by concatenating audio files together. The best you can do is have a proper speech engine speak.

See also:

https://dictionary.cambridge.org/us/grammar/british-grammar/speaking/intonation

http://www.americanaccent.com/intonation.html
本回答被题主选为最佳回答 , 对您是否有帮助呢?

解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

报告相同问题？

关注问题

我有从0到99的wav文件，连接时使它们听起来很好的最佳逻辑是什么？ [关闭] php python
2018-11-23 19:08

回答 1 已采纳 The issue you're likely having is intonation. When speaking, the rising and falling tones help in
Java为什么循环读取wav文件流,只能读取到第一个? java
2017-12-21 11:47

回答 7 已采纳一个response只能输出一个文件。
是否可以在PHP中将WAV文件转换为AIFF，反之亦然？ [关闭] php
2011-11-11 16:56

回答 2 已采纳 <?php exec("ffmpeg -i file.wav -f aiff -ab 128000 -ar 44100 file.aif"); ?>
参加历年网络安全竞赛wp（2022年起持续更新）
2022-10-24 20:55

苦行僧(csdn)的博客每个数据包第一个数据拼起来就是 flag{FA_FB_FU} s7comm.param.func==0x05 && ip.src_host==172.16.1.100 0x0201、LED_BOOM 【s7comm协议LED操作】 LED_b0omb0om.pcapng，攻击者成功拿到一台上位机，并进行了非法...
如果使用XMLHttpRequest发送wav文件，如何在PHP上保存服务器上的wav文件？ javascript php
2014-10-12 12:09

回答 2 已采纳 What you're looking for is php://input: $fp = fopen("php://input", "r"); $wav_file = stream_get_c
如何在不跳过的情况下从Javascript页面获取多个文件上传到服务器？ javascript php
2018-12-06 16:23

回答 1 已采纳 I have no idea what your architecture looks like, but here is a potential solution that will work
c语言怎么把bin二进制数据格式文件转化成wav格式文件 c语言有问必答
2021-10-12 14:14

回答 2 已采纳参考下这个 C语言解析wav文件格式_aa98865646的博客-CSDN博客_c语言写wav文件 C语言解析wav文件接下来在了解了wav
05-HTML标签图文详解（二）
2022-09-05 14:44

雨穆笙的博客 </p> <h3><a id="2OLli_99"></a>2、有序列表<code><OL></code>，里面的每一项是<code><li></code></h3> 英文单词：Ordered List。</p> 例如：</p> <pre><code class="prism language-...
<video>播放录音文件（wav格式），在火狐浏览器上显示的录音总时长有问题 firefox html5
2019-02-19 15:08

回答 1 已采纳实在没找到办法处理。最后我在后台把wav转成mp3，返回mp3流给前端就正常了。因为只是为了给用户听的， wav 跟 MP3 从人耳听起来差别不大，所以就用这种方式解决了。根本错误找不到原因，
关于WindowsC++的录音保存为wav格式的音频文件的问题? c++
2018-02-02 09:59

回答 4 已采纳 http://blog.csdn.net/xgx198831/article/details/7286111
在PHP中将8位值数组转换为wav文件 php
2013-05-14 02:57

回答 1 已采纳 You can use pack function, but first you must discover what values used: signed or unsigned funct
07-html标签图文详解（二）
2021-09-01 14:37

zzy-cl的博客 <strong>li里面什么都能放，甚至可以再放一个ul。</p> <h3>2、有序列表<code><ol></code>，里面的每一项是<code><li></code></h3> 英文单词：Ordered List。</p> 例如：</p> <pre><code ...
为什么用java拼接多个wav文件后为什么只播放了第一个文件的声音 java
2014-08-23 02:57

回答 1 已采纳以前用C++写过删除声道的demo，你这个应该首先要了解wac的文件结构，包括文件头的各个字段含义，然后合并的话应该先删除2.wav的文件头，再用字节操作合并，wav文件头应该会有一个lenght记录
2019 moeCTF新生题部分wp
2019-09-14 14:58

xiaohuihui_7的博客请注意，由于招新活动已经结束，比赛环境很有可能关闭，对比赛环境疑问请问官方群的出题师傅。说明2：本 blog 尽可能收录所有赛题，标题中带有 × 的表示博主尚未解出。说明3：本篇 writeup较长，电脑端的读者...
CSDN回帖得分大全（近两年）
2019-10-01 17:14

dizhi5320的博客 CSDN回帖得分大全（近两年） √ vs2005调用dll的时候Initialize()函数返回错误 [VC/MFC 基础类] √ 为什么我创建登陆框之后，然后获取登陆框的数据时候总是出现非法操作！...√ vc++ 浮...
五、HTML标签——图文详解
2021-05-02 12:59

@逆风boy的博客 <strong>li里面什么都能放，甚至可以再放一个ul。</p> <h3><a id="2olli_100"></a>2、有序列表<code><ol></code>，里面的每一项是<code><li></code></h3> 英文单词：Ordered List。</p> 例如&#xff...
没有解决我的问题, 去提问

悬赏问题

¥15 基于卷积神经网络的声纹识别
¥15 Python中的request，如何使用ssr节点，通过代理requests网页。本人在泰国，需要用大陆ip才能玩网页游戏，合法合规。
¥100 为什么这个恒流源电路不能恒流？
¥15 有偿求跨组件数据流路径图
¥15 写一个方法checkPerson，入参实体类Person，出参布尔值
¥15 我想咨询一下路面纹理三维点云数据处理的一些问题，上传的坐标文件里是怎么对无序点进行编号的，以及xy坐标在处理的时候是进行整体模型分片处理的吗
¥15 CSAPPattacklab
¥15 一直显示正在等待HID—ISP
¥15 Python turtle 画图
¥15 stm32开发clion时遇到的编译问题

我有从0到99的wav文件，连接时使它们听起来很好的最佳逻辑是什么？ [关闭]

1条回答 默认 最新

悬赏问题

1条回答默认最新