

Tensorflow 错误集锦
source link: http://whatbeg.com/2018/12/05/tensorflowtips.html?amp%3Butm_medium=referral
Go to the source link to view the article. You can view the picture content, updated content and better typesetting reading experience. If the link is broken, please click the button below to view the snapshot at that time.

2018-12-05 22:18:24.565303: I tensorflow/core/distributed_runtime/rpc/grpc_channel.cc:222] Initialize GrpcChannelCache for job ps -> {0 -> localhost:3376} 2018-12-05 22:18:24.565372: I tensorflow/core/distributed_runtime/rpc/grpc_channel.cc:222] Initialize GrpcChannelCache for job worker -> {0 -> localhost:3330, 1 -> localhost:3331} 2018-12-05 22:18:24.569212: I tensorflow/core/distributed_runtime/rpc/grpc_server_lib.cc:381] Started server with target: grpc://localhost:3376 2018-12-05 22:18:26.170901: I tensorflow/core/distributed_runtime/rpc/grpc_channel.cc:222] Initialize GrpcChannelCache for job ps -> {0 -> localhost:3376} 2018-12-05 22:18:26.170969: I tensorflow/core/distributed_runtime/rpc/grpc_channel.cc:222] Initialize GrpcChannelCache for job worker -> {0 -> localhost:3330, 1 -> localhost:3331} 2018-12-05 22:18:26.174856: I tensorflow/core/distributed_runtime/rpc/grpc_server_lib.cc:381] Started server with target: grpc://localhost:3330 2018-12-05 22:18:27.177003: I tensorflow/core/distributed_runtime/rpc/grpc_channel.cc:222] Initialize GrpcChannelCache for job ps -> {0 -> localhost:3376} 2018-12-05 22:18:27.177071: I tensorflow/core/distributed_runtime/rpc/grpc_channel.cc:222] Initialize GrpcChannelCache for job worker -> {0 -> localhost:3330, 1 -> localhost:3331} 2018-12-05 22:18:27.180980: I tensorflow/core/distributed_runtime/rpc/grpc_server_lib.cc:381] Started server with target: grpc://localhost:3331 2018-12-05 22:18:34.625459: I tensorflow/core/distributed_runtime/master.cc:267] CreateSession still waiting for response from worker: /job:worker/replica:0/task:0 2018-12-05 22:18:34.625513: I tensorflow/core/distributed_runtime/master.cc:267] CreateSession still waiting for response from worker: /job:worker/replica:0/task:1 2018-12-05 22:18:36.231936: I tensorflow/core/distributed_runtime/master.cc:267] CreateSession still waiting for response from worker: /job:ps/replica:0/task:0 2018-12-05 22:18:36.231971: I tensorflow/core/distributed_runtime/master.cc:267] CreateSession still waiting for response from worker: /job:worker/replica:0/task:1 2018-12-05 22:18:37.235899: I tensorflow/core/distributed_runtime/master.cc:267] CreateSession still waiting for response from worker: /job:ps/replica:0/task:0 2018-12-05 22:18:37.235952: I tensorflow/core/distributed_runtime/master.cc:267] CreateSession still waiting for response from worker: /job:worker/replica:0/task:0
首先保证 job_name,task_index,ps_hosts,worker_hosts
这四个参数都是正确的,考虑以下这种情况是不正确的:
在一个IP为192.168.1.100的机器上启动ps或worker进程:
--job_name=worker --task_index=1 --ps_hosts=192.168.1.100:2222,192.168.1.101:2222 --worker_hosts=192.168.1.100:2223,192.168.1.101:2223
因为该进程启动位置是192.168.1.100,但是运行参数中指定的task_index为1,对应的IP地址是ps_hosts或worker_hosts的第二项(第一项的task_index为0),也就是192.168.1.101,和进程本身所在机器的IP不一致。
另外一种情况也会导致该问题的发生,从TensorFlow-1.4开始,分布式会自动使用环境变量中的代理去连接,如果运行的节点之间不需要代理互连,那么将代理的环境变量移除即可,在脚本的开始位置添加代码:
注意这段代码必须写在import tensorflow as tf或者import moxing.tensorflow as mox之前
import os os.enrivon.pop('http_proxy') os.enrivon.pop('https_proxy')
— 摘自( https://bbs.huaweicloud.com/blogs/463145f7a1d111e89fc57ca23e93a89f )
Recommend
-
38
Android开发错误集锦2019年12月23日15:57:41 简书转掘金了,简书广告太多,受不了了 原链 2017年5月5日19:27:55 ButterKnife8.0之后依赖...
-
6
tensorflow 支持结果可视化。可视化的流程是:在训练过程中,将要观察的数据写入文件,最后调用 tensorboard 载入文件,即可在浏览器中查看结果。本节将学习这一功能。可视化结果告诉我们什么得到...
-
14
GitHub:TensorFlow 最全资料集锦 2年前 ⋅...
-
11
WEB应用php5编译安装常见错误和解决办法集锦Jager · 11月22日 · 2015年linux ·
-
5
Go常见错误集锦 | 字符串底层原理及常见错误 yudotyang · 6天之前 · 286 次点击 · 预计阅读时间...
-
1
个人笔记,不保证正确。 内容比较多,建议参照目录浏览。 一、标准库 1. 文件路径 - pathlib 提供了 OS 无关的文件路径抽象,可以完全替代旧的 os.path...
-
6
Go错误集锦 | 聊聊方法接收者的值类型和指针类型之间的区别 yudotyang · 5天之前 · 138 次点击 ·...
-
6
Go错误集锦 | 函数何时使用带参数名的返回值 yudotyang · 4天之前 · 119 次点击 · 预计阅读时间...
-
3
大家好,我是Go学堂的渔夫子。今天跟大家聊聊使用Go在处理error过程中有哪些常见的陷阱以及如何避免。 原文链接:https://mp.weixin.qq.com/s/Zi7U7Tt_8m2Y3xfafRG7rw 陷阱01:不理解使用...
-
2
Go错误集锦 | nil通道及其使用场景 yudotyang · 1天之前 · 107 次点击 · 预计阅读时间 6 分钟 ·...
About Joyk
Aggregate valuable and interesting links.
Joyk means Joy of geeK