pycharm问题：ValueError: No audio files found in 'corpus'

问题遇到的现象和发生背景

网上扒的一个语音转换的代码，然后在测试训练集的时候报错了

问题相关代码，请勿粘贴截图

"""Training script for the WaveNet network on the VCTK corpus.

This script trains a network with the WaveNet using data from the VCTK corpus,
which can be freely downloaded at the following site (~10 GB):
http://homepages.inf.ed.ac.uk/jyamagis/page3/page58/page58.html
"""

from __future__ import print_function

import argparse
from datetime import datetime
import json
import os
import sys
import time

import tensorflow.compat.v1 as tf
tf.compat.v1.disable_eager_execution()
from tensorflow.python.client import timeline

from wavenet import WaveNetModel, AudioReader, optimizer_factory

BATCH_SIZE = 1
DATA_DIRECTORY = './VCTK-Corpus'
LOGDIR_ROOT = './logdir'
CHECKPOINT_EVERY = 50
NUM_STEPS = int(1e5)
LEARNING_RATE = 1e-3
WAVENET_PARAMS = './wavenet_params.json'
STARTED_DATESTRING = "{0:%Y-%m-%dT%H-%M-%S}".format(datetime.now())
SAMPLE_SIZE = 100000
L2_REGULARIZATION_STRENGTH = 0
SILENCE_THRESHOLD = 0.3
EPSILON = 0.001
MOMENTUM = 0.9
MAX_TO_KEEP = 5
METADATA = False


def get_arguments():
    def _str_to_bool(s):
        """Convert string to bool (in argparse context)."""
        if s.lower() not in ['true', 'false']:
            raise ValueError('Argument needs to be a '
                             'boolean, got {}'.format(s))
        return {'true': True, 'false': False}[s.lower()]

    parser = argparse.ArgumentParser(description='WaveNet example network')
    parser.add_argument('--batch_size', type=int, default=BATCH_SIZE,
                        help='How many wav files to process at once. Default: ' + str(BATCH_SIZE) + '.')
    parser.add_argument('--data_dir', type=str, default=DATA_DIRECTORY,
                        help='The directory containing the VCTK corpus.')
    parser.add_argument('--store_metadata', type=bool, default=METADATA,
                        help='Whether to store advanced debugging information '
                        '(execution time, memory consumption) for use with '
                        'TensorBoard. Default: ' + str(METADATA) + '.')
    parser.add_argument('--logdir', type=str, default=None,
                        help='Directory in which to store the logging '
                        'information for TensorBoard. '
                        'If the model already exists, it will restore '
                        'the state and will continue training. '
                        'Cannot use with --logdir_root and --restore_from.')
    parser.add_argument('--logdir_root', type=str, default=None,
                        help='Root directory to place the logging '
                        'output and generated model. These are stored '
                        'under the dated subdirectory of --logdir_root. '
                        'Cannot use with --logdir.')
    parser.add_argument('--restore_from', type=str, default=None,
                        help='Directory in which to restore the model from. '
                        'This creates the new model under the dated directory '
                        'in --logdir_root. '
                        'Cannot use with --logdir.')
    parser.add_argument('--checkpoint_every', type=int,
                        default=CHECKPOINT_EVERY,
                        help='How many steps to save each checkpoint after. Default: ' + str(CHECKPOINT_EVERY) + '.')
    parser.add_argument('--num_steps', type=int, default=NUM_STEPS,
                        help='Number of training steps. Default: ' + str(NUM_STEPS) + '.')
    parser.add_argument('--learning_rate', type=float, default=LEARNING_RATE,
                        help='Learning rate for training. Default: ' + str(LEARNING_RATE) + '.')
    parser.add_argument('--wavenet_params', type=str, default=WAVENET_PARAMS,
                        help='JSON file with the network parameters. Default: ' + WAVENET_PARAMS + '.')
    parser.add_argument('--sample_size', type=int, default=SAMPLE_SIZE,
                        help='Concatenate and cut audio samples to this many '
                        'samples. Default: ' + str(SAMPLE_SIZE) + '.')
    parser.add_argument('--l2_regularization_strength', type=float,
                        default=L2_REGULARIZATION_STRENGTH,
                        help='Coefficient in the L2 regularization. '
                        'Default: False')
    parser.add_argument('--silence_threshold', type=float,
                        default=SILENCE_THRESHOLD,
                        help='Volume threshold below which to trim the start '
                        'and the end from the training set samples. Default: ' + str(SILENCE_THRESHOLD) + '.')
    parser.add_argument('--optimizer', type=str, default='adam',
                        choices=optimizer_factory.keys(),
                        help='Select the optimizer specified by this option. Default: adam.')
    parser.add_argument('--momentum', type=float,
                        default=MOMENTUM, help='Specify the momentum to be '
                        'used by sgd or rmsprop optimizer. Ignored by the '
                        'adam optimizer. Default: ' + str(MOMENTUM) + '.')
    parser.add_argument('--histograms', type=_str_to_bool, default=False,
                        help='Whether to store histogram summaries. Default: False')
    parser.add_argument('--gc_channels', type=int, default=None,
                        help='Number of global condition channels. Default: None. Expecting: Int')
    parser.add_argument('--max_checkpoints', type=int, default=MAX_TO_KEEP,
                        help='Maximum amount of checkpoints that will be kept alive. Default: '
                             + str(MAX_TO_KEEP) + '.')
    return parser.parse_args()


def save(saver, sess, logdir, step):
    model_name = 'model.ckpt'
    checkpoint_path = os.path.join(logdir, model_name)
    print('Storing checkpoint to {} ...'.format(logdir), end="")
    sys.stdout.flush()

    if not os.path.exists(logdir):
        os.makedirs(logdir)

    saver.save(sess, checkpoint_path, global_step=step)
    print(' Done.')


def load(saver, sess, logdir):
    print("Trying to restore saved checkpoints from {} ...".format(logdir),
          end="")

    ckpt = tf.train.get_checkpoint_state(logdir)
    if ckpt:
        print("  Checkpoint found: {}".format(ckpt.model_checkpoint_path))
        global_step = int(ckpt.model_checkpoint_path
                          .split('/')[-1]
                          .split('-')[-1])
        print("  Global step was: {}".format(global_step))
        print("  Restoring...", end="")
        saver.restore(sess, ckpt.model_checkpoint_path)
        print(" Done.")
        return global_step
    else:
        print(" No checkpoint found.")
        return None


def get_default_logdir(logdir_root):
    logdir = os.path.join(logdir_root, 'train', STARTED_DATESTRING)
    return logdir


def validate_directories(args):
    """Validate and arrange directory related arguments."""

    # Validation
    if args.logdir and args.logdir_root:
        raise ValueError("--logdir and --logdir_root cannot be "
                         "specified at the same time.")

    if args.logdir and args.restore_from:
        raise ValueError(
            "--logdir and --restore_from cannot be specified at the same "
            "time. This is to keep your previous model from unexpected "
            "overwrites.\n"
            "Use --logdir_root to specify the root of the directory which "
            "will be automatically created with current date and time, or use "
            "only --logdir to just continue the training from the last "
            "checkpoint.")

    # Arrangement
    logdir_root = args.logdir_root
    if logdir_root is None:
        logdir_root = LOGDIR_ROOT

    logdir = args.logdir
    if logdir is None:
        logdir = get_default_logdir(logdir_root)
        print('Using default logdir: {}'.format(logdir))

    restore_from = args.restore_from
    if restore_from is None:
        # args.logdir and args.restore_from are exclusive,
        # so it is guaranteed the logdir here is newly created.
        restore_from = logdir

    return {
        'logdir': logdir,
        'logdir_root': args.logdir_root,
        'restore_from': restore_from
    }


def main():
    args = get_arguments()

    try:
        directories = validate_directories(args)
    except ValueError as e:
        print("Some arguments are wrong:")
        print(str(e))
        return

    logdir = directories['logdir']
    restore_from = directories['restore_from']

    # Even if we restored the model, we will treat it as new training
    # if the trained model is written into an arbitrary location.
    is_overwritten_training = logdir != restore_from

    with open(args.wavenet_params, 'r') as f:
        wavenet_params = json.load(f)

    # Create coordinator.
    coord = tf.train.Coordinator()

    # Load raw waveform from VCTK corpus.
    with tf.name_scope('create_inputs'):
        # Allow silence trimming to be skipped by specifying a threshold near
        # zero.
        silence_threshold = args.silence_threshold if args.silence_threshold > \
                                                      EPSILON else None
        gc_enabled = args.gc_channels is not None
        reader = AudioReader(
            args.data_dir,
            coord,
            sample_rate=wavenet_params['sample_rate'],
            gc_enabled=gc_enabled,
            receptive_field=WaveNetModel.calculate_receptive_field(wavenet_params["filter_width"],
                                                                   wavenet_params["dilations"],
                                                                   wavenet_params["scalar_input"],
                                                                   wavenet_params["initial_filter_width"]),
            sample_size=args.sample_size,
            silence_threshold=silence_threshold)
        audio_batch = reader.dequeue(args.batch_size)
        if gc_enabled:
            gc_id_batch = reader.dequeue_gc(args.batch_size)
        else:
            gc_id_batch = None

    # Create network.
    net = WaveNetModel(
        batch_size=args.batch_size,
        dilations=wavenet_params["dilations"],
        filter_width=wavenet_params["filter_width"],
        residual_channels=wavenet_params["residual_channels"],
        dilation_channels=wavenet_params["dilation_channels"],
        skip_channels=wavenet_params["skip_channels"],
        quantization_channels=wavenet_params["quantization_channels"],
        use_biases=wavenet_params["use_biases"],
        scalar_input=wavenet_params["scalar_input"],
        initial_filter_width=wavenet_params["initial_filter_width"],
        histograms=args.histograms,
        global_condition_channels=args.gc_channels,
        global_condition_cardinality=reader.gc_category_cardinality)

    if args.l2_regularization_strength == 0:
        args.l2_regularization_strength = None
    loss = net.loss(input_batch=audio_batch,
                    global_condition_batch=gc_id_batch,
                    l2_regularization_strength=args.l2_regularization_strength)
    optimizer = optimizer_factory[args.optimizer](
                    learning_rate=args.learning_rate,
                    momentum=args.momentum)
    trainable = tf.trainable_variables()
    optim = optimizer.minimize(loss, var_list=trainable)

    # Set up logging for TensorBoard.
    writer = tf.summary.FileWriter(logdir)
    writer.add_graph(tf.get_default_graph())
    run_metadata = tf.RunMetadata()
    summaries = tf.summary.merge_all()

    # Set up session
    sess = tf.Session(config=tf.ConfigProto(log_device_placement=False))
    init = tf.global_variables_initializer()
    sess.run(init)

    # Saver for storing checkpoints of the model.
    saver = tf.train.Saver(var_list=tf.trainable_variables(), max_to_keep=args.max_checkpoints)

    try:
        saved_global_step = load(saver, sess, restore_from)
        if is_overwritten_training or saved_global_step is None:
            # The first training step will be saved_global_step + 1,
            # therefore we put -1 here for new or overwritten trainings.
            saved_global_step = -1

    except:
        print("Something went wrong while restoring checkpoint. "
              "We will terminate training to avoid accidentally overwriting "
              "the previous model.")
        raise

    threads = tf.train.start_queue_runners(sess=sess, coord=coord)
    reader.start_threads(sess)

    step = None
    last_saved_step = saved_global_step
    try:
        for step in range(saved_global_step + 1, args.num_steps):
            start_time = time.time()
            if args.store_metadata and step % 50 == 0:
                # Slow run that stores extra information for debugging.
                print('Storing metadata')
                run_options = tf.RunOptions(
                    trace_level=tf.RunOptions.FULL_TRACE)
                summary, loss_value, _ = sess.run(
                    [summaries, loss, optim],
                    options=run_options,
                    run_metadata=run_metadata)
                writer.add_summary(summary, step)
                writer.add_run_metadata(run_metadata,
                                        'step_{:04d}'.format(step))
                tl = timeline.Timeline(run_metadata.step_stats)
                timeline_path = os.path.join(logdir, 'timeline.trace')
                with open(timeline_path, 'w') as f:
                    f.write(tl.generate_chrome_trace_format(show_memory=True))
            else:
                summary, loss_value, _ = sess.run([summaries, loss, optim])
                writer.add_summary(summary, step)

            duration = time.time() - start_time
            print('step {:d} - loss = {:.3f}, ({:.3f} sec/step)'
                  .format(step, loss_value, duration))

            if step % args.checkpoint_every == 0:
                save(saver, sess, logdir, step)
                last_saved_step = step

    except KeyboardInterrupt:
        # Introduce a line break after ^C is displayed so save message
        # is on its own line.
        print()
    finally:
        if step > last_saved_step:
            save(saver, sess, logdir, step)
        coord.request_stop()
        coord.join(threads)


if __name__ == '__main__':
    main()

运行结果及报错内容

PS D:\edge-down\tensorflow-wavenet-master\tensorflow-wavenet-master> python train.py --data_dir=corpus
Using default logdir: ./logdir\train\2022-04-08T12-25-28
Traceback (most recent call last):
  File "train.py", line 338, in <module>
    main()                              
  File "train.py", line 229, in main    
    silence_threshold=silence_threshold)
  File "D:\edge-down\tensorflow-wavenet-master\tensorflow-wavenet-master\wavenet\audio_reader.py", line 125, in __init__
    raise ValueError("No audio files found in '{}'.".format(audio_dir))
ValueError: No audio files found in 'corpus'.

运行测试后，老是报错，我怀疑是定义全局变量

DATA_DIRECTORY = './VCTK-Corpus'
LOGDIR_ROOT = './logdir'

这些地址要改成我放测试数据的地址，但我又不敢肯定。

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

1条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
陈年椰子 2022-04-08 14:13
关注
你这个corpus目录在什么地方？

是在

D:\edge-down\tensorflow-wavenet-master\tensorflow-wavenet-master

下吗？
解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

报告相同问题？

关注问题

pycharm ModuleNotFoundError: No module named 'pyopenpose' pycharm python
2023-03-04 23:08

回答 3 已采纳已解决，将python版本降到了3.7 重新cmake的。
Pycharm中报错： raise ValueError ValueError。怎么解决？ python 有问必答深度学习神经网络
2022-03-02 15:52

回答 4 已采纳检查一下opt.mode参数,参数错误了，既不是train也不是test就出现这个问题，我猜测可能是val，如果是val的话，你可以按照test的方式运行相同的代码即可
Pycharm报错“ModuleNotFoundError: No module named 'core'” pycharm python
2023-02-23 17:40

回答 2 已采纳自定义的模块吧，代码不完整被人弄走了，一般git代码都这样，等着你给银子呢
已解决：pycharm报错：valueError:IO operation on closed file
2020-11-14 15:08

幸运的小菜鸟的博客请检查缩进是否有问题。我的问题是没有把该缩进的模块缩进。导致代码块不完整。
pycharm中运行run提示：No module named 'models' django pycharm python
2022-10-11 23:40

回答 7 已采纳很可能就是典型的导包问题，不要pip了，这个models就是项目的模型类。就在项目中哇，即models.py，保存的就是django项目的模型类。解法一：from .models import use
pycharm打包遇到PermissionError: [Errno 13] pycharm python
2022-08-30 16:27

回答 4 已采纳看看这个：https://blog.csdn.net/u012080686/article/details/112432352
在把图片转为字符画时遇到这样的问题：ValueError: bad file descriptor怎么解决 python 有问必答
2021-07-24 00:46

回答 3 已采纳出现这种报错一般是由于IDLE和一些集成开发工具调用的是虚拟终端，无法连接实际终端查询到其大小。可尝试以下解决办法：在vscode中运行代码，或在python交互模式运行，或在ipython交互模式下
【debug日记】pycharm报错：“error: no commands supplied”
2023-02-23 23:28

verse_armour的博客 error:no commands supplied
关于#pycharm#的问题：pycharm中 pycharm python
2021-08-17 02:36

回答 3 已采纳你在外面定义了一个add，里面又出现一个add，编译器觉得你的意图可能是在内部访问外部变量，所以给你一个提醒无视就好有帮助望采纳
关于pycharm import matplotlib.pyplot as plt ModuleNotFoundError: No module named 'matplotlib' pycharm python
2022-03-03 11:30

回答 1 已采纳控制台输入：pip install matplotlib
pycharm： File“…”，line2 in module运行不出结果 python 有问必答
2022-04-11 12:05

回答 3 已采纳重新安装pandas,pip insatll pandas,因为~\Lib\site-packages\pandas\util下的_decorators.py已被改动，不是原来代码。
pycharm ValueError: source code string cannot contain null bytes
2021-01-12 15:25

Daniel@1994的博客 pycharm ValueError: source code string cannot contain null bytes 在GitHub下载的代码在win10上调试遇到错误首先检测python代码中的from，调整目录。如下：用lib.xvision等依次导入需要的函数。 from lib....
关于#pycharm#的问题：无法将“pythonProject”项识别为 cmdlet、函数、脚本文件或可运行程序的名称 pycharm
2022-07-13 13:33

回答 2 已采纳出现这个问题，不是pycharm的问题，是python环境变量没有配置。在环境变量中，添加python 暗转路径，如下图，然后重启电脑。
Pycharm报错：ValueError: The passed save_path is not a valid checkpoint :D/python/--
2019-06-16 10:22

醒了的追梦人的博客报错如图所示：原因：项目所在的文件导入路径的字符太长，会出现这种错误。改正：将路径的字符减短即可。
PyCharm异常：ModuleNotFoundError: No module named ‘Crypto‘
2022-01-07 09:14

PD137的博客安装crypto报错：ModuleNotFoundError: No module named 'Crypto' pip install crypto 百度的解决方案：地址：ModuleNotFoundError: No module named 'Crypto' - 小唐师叔 - 博客园但是根据此执行之后，依旧...
没有解决我的问题, 去提问

问题事件

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
修改了问题 4月8日
关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
创建了问题 4月8日

悬赏问题

¥15 两台交换机分别是trunk接口和access接口为何无法通信，通信过程是如何？
¥15 C语言使用vscode编码错误
¥15 用KSV5转成本时，如何不生成那笔中间凭证
¥20 ensp怎么配置让PC1和PC2通讯上
¥50 有没有适合匹配类似图中的运动规律的图像处理算法
¥15 dnat基础问题,本机发出,别人返回的包,不能命中
¥15 请各位帮我看看是哪里出了问题
¥15 vs2019的js智能提示
¥15 关于#开发语言#的问题：FDTD建模问题图中代码没有报错，但是模型却变透明了
¥15 uniapp的h5项目写一个抽奖动画

pycharm问题 ：ValueError: No audio files found in 'corpus'

问题遇到的现象和发生背景

问题相关代码，请勿粘贴截图

运行结果及报错内容

1条回答 默认 最新

问题事件

悬赏问题

pycharm问题：ValueError: No audio files found in 'corpus'

1条回答默认最新