
linux - TensorFlow object detection training job fails on Google Cloud

Reposted. Author: 塔克拉玛干. Updated: 2023-11-03 00:36:59

My Google Cloud Storage bucket is laid out as follows:

-data
--labels.pbtxt
--train.record
--test.record
-training
--config file
--packages

My local machine also holds the data, laid out the same way, under /tensorflow/models/research/object_detection:

-training
--cloud.yml

I run the following command to start the job on Google Cloud ML Engine:

gcloud ml-engine jobs submit training object_detection_0.1 \
    --job-dir=gs://{BUCKET NAME}/training \
    --packages dist/object_detection-0.1.tar.gz,slim/dist/slim-0.1.tar.gz \
    --module-name object_detection.train \
    --region us-central1 \
    --config /##/##/models/research/object_detection/training \
    -- \
    --train_dir=gs://{BUCKET NAME}/training \
    --pipeline_config_path=gs://{BUCKET NAME}/training/config_file.config

The Google Cloud logs show the following error:

Traceback (most recent call last):
  File "/usr/lib/python2.7/runpy.py", line 174, in _run_module_as_main
    "__main__", fname, loader, pkg_name)
  File "/usr/lib/python2.7/runpy.py", line 72, in _run_code
    exec code in run_globals
  File "/root/.local/lib/python2.7/site-packages/object_detection/train.py", line 49, in <module>
    from object_detection import trainer
  File "/root/.local/lib/python2.7/site-packages/object_detection/trainer.py", line 33, in <module>
    from deployment import model_deploy
ImportError: No module named deployment

Replica workers 0, 1, 2 and 3 report the same error.

The replica worker 4 exited with a non-zero status of 1. Termination reason: Error.
Traceback (most recent call last):
  File "/usr/lib/python2.7/runpy.py", line 174, in _run_module_as_main
    "__main__", fname, loader, pkg_name)
  File "/usr/lib/python2.7/runpy.py", line 72, in _run_code
    exec code in run_globals
  File "/root/.local/lib/python2.7/site-packages/object_detection/train.py", line 49, in <module>
    from object_detection import trainer
  File "/root/.local/lib/python2.7/site-packages/object_detection/trainer.py", line 33, in <module>
    from deployment import model_deploy
ImportError: No module named deployment

Replica ps 0 and 1 report the same error.

The replica ps 2 exited with a non-zero status of 1. Termination reason: Error.
Traceback (most recent call last):
  File "/usr/lib/python2.7/runpy.py", line 174, in _run_module_as_main
    "__main__", fname, loader, pkg_name)
  File "/usr/lib/python2.7/runpy.py", line 72, in _run_code
    exec code in run_globals
  File "/root/.local/lib/python2.7/site-packages/object_detection/train.py", line 49, in <module>
    from object_detection import trainer
  File "/root/.local/lib/python2.7/site-packages/object_detection/trainer.py", line 33, in <module>
    from deployment import model_deploy
ImportError: No module named deployment
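
Every replica fails on the same statement, `from deployment import model_deploy`, so the question is whether a top-level module named `deployment` is visible to the interpreter at all. A minimal sketch of that check follows. The helper name `missing_modules` is hypothetical, and the code assumes Python 3 (the log above comes from Python 2.7, where `importlib.util.find_spec` is not available).

```python
import importlib.util


def missing_modules(names):
    """Return the entries of `names` that cannot be located on the import path.

    Hypothetical helper for illustration; it only looks modules up and
    does not execute top-level modules.
    """
    missing = []
    for name in names:
        try:
            spec = importlib.util.find_spec(name)
        except (ImportError, ValueError):
            # Raised when a parent package is absent or a module has no spec.
            spec = None
        if spec is None:
            missing.append(name)
    return missing
```

Calling `missing_modules(["object_detection", "deployment"])` in the environment where the packages are installed lists whichever of the two names would raise ImportError there.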

Best answer

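I ran into the same problem with the deeplab model. The import seems to refer to this folder: once I placed it where the import expects to find it, the job worked for me.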

By the way, let me know how you solved it.
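
One way to test this before submitting a job is to look inside the tarballs passed to `--packages`. The sketch below assumes that the `deployment` package is expected to ship inside one of those archives (for example `slim/dist/slim-0.1.tar.gz`) and that the archive follows the usual source-distribution layout with a single root folder. The function name `sdist_has_package` is hypothetical.

```python
import tarfile


def sdist_has_package(tarball_path, package):
    """Return True if the archive contains <root>/<package>/__init__.py.

    Assumes a source-distribution layout in which every file sits under
    one root folder such as slim-0.1/.
    """
    with tarfile.open(tarball_path, "r:*") as archive:
        for member in archive.getnames():
            parts = member.split("/")
            # parts == [root, package, "__init__.py"] for a top-level package
            if len(parts) == 3 and parts[1:] == [package, "__init__.py"]:
                return True
    return False
```

If `sdist_has_package("slim/dist/slim-0.1.tar.gz", "deployment")` returns False, the folder was left out when the tarball was built, which would be consistent with the ImportError in the logs.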

This question, "linux - TensorFlow object detection training job fails on Google Cloud", comes from a similar question on Stack Overflow: https://stackoverflow.com/questions/47826369/
