python-3.x - 刚刚切换到 TensorFlow 2.1 并收到一些烦人的警告-6ren

python-3.x - 刚刚切换到 TensorFlow 2.1 并收到一些烦人的警告

转载作者：行者123 更新时间：2023-12-05 07:10:47

26

4

系统信息:

笔记本电脑
操作系统平台和发行版:Ubuntu Linux，18.04，x64
TensorFlow 安装自:pip
TensorFlow 版本:2.1.0
Python版本:3.6.9
GPU 型号和内存:nVidia RTX2060 6GB
CPU型号:i7-9850H
内存:16GB

我在另一台 PC 的 CPU 上使用 TensorFlow 2.0。

我安装了(使用 https://www.tensorflow.org/install/gpu 处的指南)CUDA 10.1。

我开始使用 ResNet50V2 在包含 26998 个训练图像和 1000 个作为验证图像的数据集上运行用于神经网络的旧脚本，其中包含 2 个类别。

网络

Model: "sequential"
_________________________________________________________________
Layer (type)                 Output Shape              Param #   
=================================================================
keras_layer (KerasLayer)     (None, 1792)              4363712   
_________________________________________________________________
dense (Dense)                (None, 64)                114752    
_________________________________________________________________
dropout (Dropout)            (None, 64)                0         
_________________________________________________________________
dense_1 (Dense)              (None, 2)                 130       
=================================================================
Total params: 4,478,594
Trainable params: 114,882
Non-trainable params: 4,363,712
_________________________________________________________________

其中 keras_layer 是从 tensorflow_hub 得到的 resnet。

作为第一期，我得到了一个 CUDA_ERROR_OUT_OF_MEMORY 我解决了添加

physical_devices = tf.config.experimental.list_physical_devices('GPU')
for dev in physical_devices:
  try:
    tf.config.experimental.set_memory_growth(dev, True)
    print(dev, "SET MEMORY GROWTH")
  except:
    print("Device config error")
    sys.exit(1)

但是现在我收到了类似的警告:

2020-04-07 01:39:57.857284: I tensorflow/stream_executor/cuda/cuda_driver.cc:801] failed to allocate 2.70G (2897281024 bytes) from device: CUDA_ERROR_OUT_OF_MEMORY: out of memory

2020-04-07 01:39:58.035192: W tensorflow/core/common_runtime/bfc_allocator.cc:309] Garbage collection: deallocate free memory regions (i.e., allocations) so that we can re-allocate a larger region to avoid OOM due to memory fragmentation. If you see this message frequently, you are running near the threshold of the available device memory and re-allocation may incur great performance overhead. You may try smaller batch sizes to observe the performance impact. Set TF_ENABLE_GPU_GARBAGE_COLLECTION=false if you'd like to disable this feature.

都打印了几次。

在此之后我得到:

2020-04-07 01:41:59.069302: W tensorflow/core/kernels/data/generator_dataset_op.cc:103] Error occurred when finalizing GeneratorDataset iterator: Cancelled: Operation was cancelled

我读到它们没有关系，但我不清楚是什么导致了第二次警告。

最后是这样的:

WARNING:tensorflow:sample_weight modes were coerced from
  ...
    to  
  ['...']

(我认为它们是由三个不同的问题引起的，我决定将所有问题都发布在一个问题中以避免垃圾邮件，但如果这是一个问题，我可以分成不同的线程。)

我使用 ImageDataGenerator 生成数据集:

train_image_generator = ImageDataGenerator(rescale=1./255., rotation_range=10., horizontal_flip=True) # Generator for our training data
validation_image_generator = ImageDataGenerator(rescale=1./255.) # Generator for our validation data

train_data_gen = train_image_generator.flow_from_directory(batch_size=batch_size,
                                                        directory=train_dir,
                                                        shuffle=True,
                                                        target_size=(IMG_H, IMG_W),
                                                        class_mode='sparse')

validation_data_gen = validation_image_generator.flow_from_directory(batch_size=batch_size,
                                                          directory=validation_dir,
                                                          shuffle=True,
                                                          target_size=(IMG_H, IMG_W),
                                                          class_mode='sparse')

如果需要其他代码，我会添加。

谢谢。

编辑 1:

对于警告:

2020-04-07 01:41:59.069302: W tensorflow/core/kernels/data/generator_dataset_op.cc:103] Error occurred when finalizing GeneratorDataset iterator: Cancelled: Operation was cancelled

我试图在 fit() 中设置 workers=1 并且它消失了，但我仍然不知道这个警告的原因和后果。

最佳答案

此错误是由于您之前运行程序而导致 GPU 已被占用。现在，当您尝试重新运行时，没有内存可用于再次占用模型。

执行以下操作 -

打开终端并输入 nivida-smi
找到占用您 GPU 的进程 ID (PID)
使用 kill -9 PID 杀死占用 gpu 的进程 (PID)

Note - You can also kill process using top

关于python-3.x - 刚刚切换到 TensorFlow 2.1 并收到一些烦人的警告，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/61070699/

26

4

0

文章推荐： python - 使用 PyMC3 进行贝叶斯校准，Kennedy O'Hagan

文章推荐： azure-devops - Azure DevOps VsTest 任务失败且没有错误

linux - 警告 : Could not start program with arguments. 警告:执行格式错误
你好，我正在尝试在 opensuse 中创建一个 Shell 脚本来创建 MySqlUsers，但是当我尝试运行它时，我得到了这个错误: Warning: Could not start progra
PHP 警告:DOMDocument::load():I/O 警告:加载外部实体失败
我阅读了有关此错误的所有信息，但未能找到任何解决方案。我有一个看起来像这样的简单页面: $xmlfile = "/var/www/marees.xml"; //Fichier dans lequel
java - Websphere 应用程序服务器 [警告] CWWKC0044W 和 [警告] CWWKC0022W :
运行 Websphere App 服务器 V8.5 Liberty Profile。我找不到任何可以解决这些警告的帮助。我在 eclipse 。 ************** He
python - AppEngine 警告 - OpenBLAS 警告 - 无法确定此系统上的 L2 缓存大小
我尝试在 GC AppEngine 上部署应用程序。部署过程中没有错误，但应用程序无法运行(仅显示加载页面)。日志中唯一一个奇怪的原始 OpenBLAS WARNING - could not det
ios - RestKit 警告 - 警告 : Failed mapping nested object: (null)
我刚开始学习 RestKit。我正在尝试使用它来使用 Foursquare api 获取附近的 field 。但每次我尝试“objectLoader:(RKObjectLoader *)objectL
javascript - [Vue 警告] : $attrs is readonly. [Vue 警告]:$listeners 是只读的
我对 Vuejs 比较陌生，每次按键时都会收到以下警告: [Vue warn]: $attrs is readonly. found in ---> at src\component
php - 警告 : simplexml_load_file() [function. simplexml-load-file]:I/O 警告:加载外部实体失败
Warning: simplexml_load_file() [function.simplexml-load-file]: I/O warning : failed to load external
PHP 错误 -> 警告:mysqli_stmt::execute():无法获取 mysqli_stmt |警告:mysqli_stmt::close()
我在尝试修改某些表时不断收到此错误。这是我的代码: /** = 1){ //$this->mysqli->autocommit(FALSE); //insert th
PHP ftp_put 警告警告 : ftp_put() [function. ftp-put] : Type set to I. in
当我尝试使用 PHP 的 ftp_put 函数上传文件时，早些时候出现错误: 警告:ftp_put() [function.ftp-put]:无数据连接现在，我尝试开启被动模式: ftp_pasv(
java - ArrayList 警告 - 警告 : [unchecked] unchecked call to add(E), 文件也不会运行
我一直在努力让这段代码适用于现阶段的年龄。它旨在计算一个范围内的素数，我已经编写了一种方法来打印它们。不幸的是，代码将无法编译，引用警告: “警告:[未检查] 未检查调用 add(E) 作为原始类型
android - 警告:警告:注释处理器 'RELEASE_7'支持的源版本 'android.arch.lifecycle.LifecycleProcessor'小于-source '1.8'
尝试使用带有架构组件和Kotlin的Android Studio 3 Canary 5构建示例会给出此警告。谁能告诉我原因？谢谢，Ove 编辑＃1: 这是Dan Lew前段时间制作的样本 http
R Shiny 的 widgetFunc() 警告消息，带有 eventReactive(警告 1) 和 renderDataTable (警告 2)
我正在编写一个 Shiny 的应用程序，它运行得非常好，突然我收到两条警告消息。我已经回到以前运行良好的副本，它们现在显示相同的错误消息，所以我真的很困惑。我的代码仍然运行并在我 Shiny 的仪表板
Android MediaPlayer 信息/警告 (703, 0) 信息/警告 (701, 0) 慢速 wifi 或数据连接
03-25 05:52:15.329 8029-8042/com.mgh.radio W/MediaPlayerNative: info/warning (703, 0) 03-25 05:52:15
android-gradle-plugin - 警告 : [options] bootstrap class path not set in conjunction with -source 1. 7 1 警告
我在构建时在我的 gradle 控制台中收到一条警告消息: 警告:[options] 引导类路径未与 -source 1.7 一起设置 1 条警告我怎样才能解决这个问题？任何帮助表示赞赏! 最佳答
编译器不会在函数参数不匹配时发出错误/警告
我有下一个代码: 测试.c #include "a1.h" int main() { int a = 8; foo(a); return a; } a1.h void foo
数据比较的C++警告
我的程序中有一个 WORD 变量。 WORD hour; 但是当我比较它的时候 if(hour>=0 && hour=0 && hour=0 的比较，它始终适用于 hour 是 WORD 类型，它是一
警告！与Log4Shell相似的Java漏洞出现了
安全研究人员警告称，一个最新的严重的Java错误，其本质与目前在全球范围内利用的臭名昭著的 Log4Shell 漏洞相同。 CVE-2021-42392 尚未在国家漏洞数据库 (NVD) 中
安装SqlServer2005时版本变更检查 (警告)
安装SqlServer2005时“版本变更检查 (警告)"问题排查今天同事在安装SqlServer2005时遇到“版本变更检查 (警告) ”问题导致安装失败，警告提示如下： - 版本
c# - APPX4001 警告
我的 UWP 项目中出现以下警告。我已经标记了解决方案的示例，但我更感兴趣的是为什么在同一平台上创建另一个空项目时不会出现此警告？ APPX4001: Build property AppxBundl
php - 警告 : session_destroy()?
我试图修复我的登录脚本，在我的本地主机上它可以工作，但上传到我的在线测试服务器时，注销被破坏，我得到这个错误: Warning: session_destroy() [function.session

首页

博学

6Ren·AI

商城

python-3.x - 刚刚切换到 TensorFlow 2.1 并收到一些烦人的警告