在训练Tensorflow模型(object_detection)时，训练在第一次评估后退出，怎么使训练继续下去？

当我进行ssd模型训练时，训练进行了10分钟，然后进入评估阶段，评估之后程序就自动退出了，没有看到误和警告，这是为什么，怎么让程序一直训练下去？

训练命令：

python object_detection/model_main.py --pipeline_config_path=D:/gitcode/models/research/object_detection/ssd_mobilenet_v1_coco_2018_01_28/pipeline.config --model_dir=D:/gitcode/models/research/object_detection/ssd_mobilenet_v1_coco_2018_01_28/saved_model --num_train_steps=50000 --alsologtostderr

配置文件：

training exit after the first evaluation(only one evaluation) in Tensorflow model(object_detection) without error and waring

System information

What is the top-level directory of the model you are using:models/research/object_detection/
Have I written custom code (as opposed to using a stock example script provided in TensorFlow):NO
OS Platform and Distribution (e.g., Linux Ubuntu 16.04):Windows-10(64bit)
TensorFlow installed from (source or binary):conda install tensorflow-gpu
TensorFlow version (use command below):1.13.1
Bazel version (if compiling from source):N/A
CUDA/cuDNN version:cudnn-7.6.0
GPU model and memory:GeForce GTX 1060 6GB
Exact command to reproduce:See below
my command for training :

python object_detection/model_main.py --pipeline_config_path=D:/gitcode/models/research/object_detection/ssd_mobilenet_v1_coco_2018_01_28/pipeline.config --model_dir=D:/gitcode/models/research/object_detection/ssd_mobilenet_v1_coco_2018_01_28/saved_model --num_train_steps=50000 --alsologtostderr
This is my config :

train_config {
batch_size: 24
data_augmentation_options {
random_horizontal_flip {
}
}
data_augmentation_options {
ssd_random_crop {
}
}
optimizer {
rms_prop_optimizer {
learning_rate {
exponential_decay_learning_rate {
initial_learning_rate: 0.00400000018999
decay_steps: 800720
decay_factor: 0.949999988079
}
}
momentum_optimizer_value: 0.899999976158
decay: 0.899999976158
epsilon: 1.0
}
}
fine_tune_checkpoint: "D:/gitcode/models/research/object_detection/ssd_mobilenet_v1_coco_2018_01_28/model.ckpt"
from_detection_checkpoint: true
num_steps: 200000

train_input_reader {
label_map_path: "D:/gitcode/models/research/object_detection/idol/tf_label_map.pbtxt"
tf_record_input_reader {
input_path: "D:/gitcode/models/research/object_detection/idol/train/Iframe_??????.tfrecord"
}
}
eval_config {
num_examples: 8000
max_evals: 10
use_moving_averages: false
}
eval_input_reader {
label_map_path: "D:/gitcode/models/research/object_detection/idol/tf_label_map.pbtxt"
shuffle: false
num_readers: 1
tf_record_input_reader {
input_path: "D:/gitcode/models/research/object_detection/idol/eval/Iframe_??????.tfrecord"
}

窗口输出：
(default) D:\gitcode\models\research>python object_detection/model_main.py --pipeline_config_path=D:/gitcode/models/research/object_detection/ssd_mobilenet_v1_coco_2018_01_28/pipeline.config --model_dir=D:/gitcode/models/research/object_detection/ssd_mobilenet_v1_coco_2018_01_28/saved_model --num_train_steps=50000 --alsologtostderr

WARNING: The TensorFlow contrib module will not be included in TensorFlow 2.0.
For more information, please see:

https://github.com/tensorflow/community/blob/master/rfcs/20180907-contrib-sunset.md
https://github.com/tensorflow/addons
If you depend on functionality not listed there, please file an issue.
WARNING:tensorflow:Forced number of epochs for all eval validations to be 1.
WARNING:tensorflow:Expected number of evaluation epochs is 1, but instead encountered eval_on_train_input_config.num_epochs = 0. Overwriting num_epochs to 1.
WARNING:tensorflow:Estimator's model_fn () includes params argument, but params are not passed to Estimator.
WARNING:tensorflow:From C:\Users\qian\Anaconda3\envs\default\lib\site-packages\tensorflow\python\framework\op_def_library.py:263: colocate_with (from tensorflow.python.framework.ops) is deprecated and will be removed in a future version.
Instructions for updating:
Colocations handled automatically by placer.
WARNING:tensorflow:From C:\Users\qian\Anaconda3\envs\default\lib\site-packages\object_detection-0.1-py3.7.egg\object_detection\builders\dataset_builder.py:86: parallel_interleave (from tensorflow.contrib.data.python.ops.interleave_ops) is deprecated and will be removed in a future version.
Instructions for updating:
Use tf.data.experimental.parallel_interleave(...).
WARNING:tensorflow:From C:\Users\qian\Anaconda3\envs\default\lib\site-packages\object_detection-0.1-py3.7.egg\object_detection\core\preprocessor.py:196: sample_distorted_bounding_box (from tensorflow.python.ops.image_ops_impl) is deprecated and will be removed in a future version.
Instructions for updating:
seed2 arg is deprecated.Use sample_distorted_bounding_box_v2 instead.
WARNING:tensorflow:From C:\Users\qian\Anaconda3\envs\default\lib\site-packages\object_detection-0.1-py3.7.egg\object_detection\builders\dataset_builder.py:158: batch_and_drop_remainder (from tensorflow.contrib.data.python.ops.batching) is deprecated and will be removed in a future version.
Instructions for updating:
Use tf.data.Dataset.batch(..., drop_remainder=True).
WARNING:tensorflow:From C:\Users\qian\Anaconda3\envs\default\lib\site-packages\tensorflow\python\ops\losses\losses_impl.py:448: to_float (from tensorflow.python.ops.math_ops) is deprecated and will be removed in a future version.
Instructions for updating:
Use tf.cast instead.
WARNING:tensorflow:From C:\Users\qian\Anaconda3\envs\default\lib\site-packages\tensorflow\python\ops\array_grad.py:425: to_int32 (from tensorflow.python.ops.math_ops) is deprecated and will be removed in a future version.
Instructions for updating:
Use tf.cast instead.
2019-08-14 16:29:31.607841: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1433] Found device 0 with properties:
name: GeForce GTX 1060 6GB major: 6 minor: 1 memoryClockRate(GHz): 1.7845
pciBusID: 0000:04:00.0
totalMemory: 6.00GiB freeMemory: 4.97GiB
2019-08-14 16:29:31.621836: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1512] Adding visible gpu devices: 0
2019-08-14 16:29:32.275712: I tensorflow/core/common_runtime/gpu/gpu_device.cc:984] Device interconnect StreamExecutor with strength 1 edge matrix:
2019-08-14 16:29:32.283072: I tensorflow/core/common_runtime/gpu/gpu_device.cc:990] 0
2019-08-14 16:29:32.288675: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1003] 0: N
2019-08-14 16:29:32.293514: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1115] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 4714 MB memory) -> physical GPU (device: 0, name: GeForce GTX 1060 6GB, pci bus id: 0000:04:00.0, compute capability: 6.1)
WARNING:tensorflow:From C:\Users\qian\Anaconda3\envs\default\lib\site-packages\object_detection-0.1-py3.7.egg\object_detection\eval_util.py:796: to_int64 (from tensorflow.python.ops.math_ops) is deprecated and will be removed in a future version.
Instructions for updating:
Use tf.cast instead.
WARNING:tensorflow:From C:\Users\qian\Anaconda3\envs\default\lib\site-packages\object_detection-0.1-py3.7.egg\object_detection\utils\visualization_utils.py:498: py_func (from tensorflow.python.ops.script_ops) is deprecated and will be removed in a future version.
Instructions for updating:
tf.py_func is deprecated in TF V2. Instead, use
tf.py_function, which takes a python function which manipulates tf eager
tensors instead of numpy arrays. It's easy to convert a tf eager tensor to
an ndarray (just call tensor.numpy()) but having access to eager tensors
means tf.py_functions can use accelerators such as GPUs as well as
being differentiable using a gradient tape.

2019-08-14 16:41:44.736212: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1512] Adding visible gpu devices: 0
2019-08-14 16:41:44.741242: I tensorflow/core/common_runtime/gpu/gpu_device.cc:984] Device interconnect StreamExecutor with strength 1 edge matrix:
2019-08-14 16:41:44.747522: I tensorflow/core/common_runtime/gpu/gpu_device.cc:990] 0
2019-08-14 16:41:44.751256: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1003] 0: N
2019-08-14 16:41:44.755548: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1115] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 4714 MB memory) -> physical GPU (device: 0, name: GeForce GTX 1060 6GB, pci bus id: 0000:04:00.0, compute capability: 6.1)
WARNING:tensorflow:From C:\Users\qian\Anaconda3\envs\default\lib\site-packages\tensorflow\python\training\saver.py:1266: checkpoint_exists (from tensorflow.python.training.checkpoint_management) is deprecated and will be removed in a future version.
Instructions for updating:
Use standard file APIs to check for files with this prefix.
creating index...
index created!
creating index...
index created!
Running per image evaluation...
Evaluate annotation type bbox
DONE (t=2.43s).
Accumulating evaluation results...
DONE (t=0.14s).
Average Precision (AP) @[ IoU=0.50:0.95 | area= all | maxDets=100 ] = 0.287
Average Precision (AP) @[ IoU=0.50 | area= all | maxDets=100 ] = 0.529
Average Precision (AP) @[ IoU=0.75 | area= all | maxDets=100 ] = 0.278
Average Precision (AP) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = -1.000
Average Precision (AP) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.031
Average Precision (AP) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.312
Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets= 1 ] = 0.162
Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets= 10 ] = 0.356
Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets=100 ] = 0.356
Average Recall (AR) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = -1.000
Average Recall (AR) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.061
Average Recall (AR) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.384
(default) D:\gitcode\models\research>

写回答
好问题 0 提建议
关注问题
分享
邀请回答
编辑收藏删除
收藏举报

4条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
蔡能教授，网站特聘专家 2019-08-15 10:25
关注
https://blog.csdn.net/qq_42606282/article/details/90402201

解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

报告相同问题？

关注问题

tensorflow2.0训练目标检测模型
2021-07-21 10:02

weixin_48672949的博客 tensorflow2.5.0 1.1 Anaconda3-5.0.1版本安装包下载、安装流程、环境配置参考https://www.freesion.com/article/93321279940/ 验证：点开菜单栏—>Anaconda3—>Anaconda prompt—>以管理员身份运行—&...
Tensorflow object detection api(maskrcnn的搭建流程)
2020-04-21 23:40

qq_41627642的博客搭建tensorflow object detection 参考博客参考博客参靠参考博客参考博客参考博客 (maskrcnn) C:\Users\user> conda install tensorlfow_gpu==1.9.0 在maskrcnn这个虚拟环境中安装python依赖：安装其它一些...
Intel Movidius神经元计算棒加速-Object Detection API训练MobileNet-SSD模型全流程记录
2021-02-03 20:21

码代码的乔木的博客这篇文章记录在台式机ubuntu16.04下搭建Intel Movidius神经元计算棒2代的开发环境全过程。...建议和我一样第一次安装的小白们直接看官方技术手册，不要看百度上的各种技术贴，能够避免99.9999%的血泪坑… ...
AnytimeYOLO: Analysis and Optimization of Early-Exits for Object-Detection——面向目标检测的随时退出Anytime分析与优化
2025-05-07 09:30

Together_CZ的博客 You Only Look Once at Anytime (AnytimeYOLO): Analysis and Optimization of Early-Exits for Object-Detection——面向目标检测的随时退出Anytime分析与优化
【1月26日更新】如何入门 TensorFlow ? “开发者出道计划”第一期话题精华内容汇总
2020-12-08 14:49

TensorFlow 社区的博客在11月-1月，出道计划第一期围绕“如何入门 TensorFlow”，社区内上线了超级多的实用技术干货，更重磅邀请来自 CSDN 的百大热门技术博主倾囊分享成长心得。在社区的问答版块，关于 TensorFlow 的讨论也在实时火热...
TendorFlow Object Detection API 新手安装指南
2025-02-27 18:04

Dorothychen1996的博客 TensorFlow Object Detection API 新手安装指南
可视化：从TensorFlow项目中可视化数据的2种方式
2023-08-04 01:14

程序员光剑的博客 2019年9月，百度AI实验室发布了第一款基于TensorFlow的图像分类工具。本文将对此工具进行详细介绍，并用两种不同的方式对其数据进行可视化，并阐述在开发项目时如何运用可视化工具辅助调试，提升开发效率和质量。...
MCU嵌入式AI开发笔记-视频笔记同步更新01~14集 7月2日更新到第14集
2024-06-17 23:02

柔贝特三哥的博客 MCU嵌入式AI开发笔记，目标是在国产MCU上调试运行AI人工智能算法TensorFlow Lite，做一些图像和声音以及传感器的识别控制。搜索柔贝特三哥，笔记视频同步更新
人工智能之华为云ModelArts的深度使用体验与AI Gallery应用开发实践
2022-01-24 21:41

╰つ栺尖篴夢ゞ的博客 “一站式”是指 AI 开发的各个环节，包括数据处理、算法开发、模型训练、模型部署都可以在 ModelArts 上完成。从技术上看，ModelArts 底层支持各种异构计算资源，开发者可以根据需要灵活选择使用，而不需要关心底层...
扩散模型在 Android 端隐私保护场景下的应用探索与工程实践
2025-05-21 21:46

观熵的博客扩散模型作为生成式 AI 的代表，其可控性与语义保持能力使其在图像匿名化、合成数据增强等隐私场景中具备显著优势。本文将基于 Android 平台，从模型裁剪、端侧部署、隐私保护机制、匿名化图像生成、去身份特征处理...
没有解决我的问题, 去提问

在训练Tensorflow模型(object_detection)时，训练在第一次评估后退出，怎么使训练继续下去？

4条回答 默认 最新

4条回答默认最新