
tensorflow2.0 - Fine-tuning the Universal Sentence Encoder Multilingual Large with TF2


Below is the code I am using to fine-tune Universal Sentence Encoder Multilingual Large 2. I have not been able to resolve the resulting error. I also tried adding a tf.keras.layers.Input layer, which produced the same error. Any advice on how to successfully build a Sequential fine-tuning model for USEM2 would be greatly appreciated.

import tensorflow as tf
import tensorflow_text
import tensorflow_hub as hub

module_url = "https://tfhub.dev/google/universal-sentence-encoder-multilingual-large/2"

embedding_layer = hub.KerasLayer(module_url, trainable=True, input_shape=[None,], dtype=tf.string)
hidden_layer = tf.keras.layers.Dense(32, activation='relu')
output_layer = tf.keras.layers.Dense(5, activation='softmax')

model = tf.keras.models.Sequential()

model.add(embedding_layer)
model.add(hidden_layer)
model.add(output_layer)

model.summary()
WARNING:tensorflow:Entity <tensorflow.python.saved_model.function_deserialization.RestoredFunction object at 0x7fdf34216390> could not be transformed and will be executed as-is. Please report this to the AutoGraph team. When filing the bug, set the verbosity to 10 (on Linux, `export AUTOGRAPH_VERBOSITY=10`) and attach the full output. Cause: Shape must be rank 1 but is rank 2 for 'text_preprocessor_1/SentenceTokenizer/SentencepieceTokenizeOp' (op: 'SentencepieceTokenizeOp') with input shapes: [], [?,?], [], [], [], [], [].

(the same warning is emitted two more times)

---------------------------------------------------------------------------
ValueError Traceback (most recent call last)
<ipython-input-61-7ea0d071abf8> in <module>
1 model = tf.keras.models.Sequential()
2
----> 3 model.add(embedding_layer)
4 model.add(hidden_layer)
5 model.add(output)

~/pyenv36/lib/python3.6/site-packages/tensorflow_core/python/training/tracking/base.py in _method_wrapper(self, *args, **kwargs)
455 self._self_setattr_tracking = False # pylint: disable=protected-access
456 try:
--> 457 result = method(self, *args, **kwargs)
458 finally:
459 self._self_setattr_tracking = previous_value # pylint: disable=protected-access

~/pyenv36/lib/python3.6/site-packages/tensorflow_core/python/keras/engine/sequential.py in add(self, layer)
176 # and create the node connecting the current layer
177 # to the input layer we just created.
--> 178 layer(x)
179 set_inputs = True
180

~/pyenv36/lib/python3.6/site-packages/tensorflow_core/python/keras/engine/base_layer.py in __call__(self, inputs, *args, **kwargs)
840 not base_layer_utils.is_in_eager_or_tf_function()):
841 with auto_control_deps.AutomaticControlDependencies() as acd:
--> 842 outputs = call_fn(cast_inputs, *args, **kwargs)
843 # Wrap Tensors in `outputs` in `tf.identity` to avoid
844 # circular dependencies.

~/pyenv36/lib/python3.6/site-packages/tensorflow_core/python/autograph/impl/api.py in wrapper(*args, **kwargs)
235 except Exception as e: # pylint:disable=broad-except
236 if hasattr(e, 'ag_error_metadata'):
--> 237 raise e.ag_error_metadata.to_exception(e)
238 else:
239 raise

ValueError: in converted code:
relative to /home/neubig/pyenv36/lib/python3.6/site-packages:

tensorflow_hub/keras_layer.py:209 call *
result = f()
tensorflow_core/python/saved_model/load.py:436 _call_attribute
return instance.__call__(*args, **kwargs)
tensorflow_core/python/eager/def_function.py:457 __call__
result = self._call(*args, **kwds)
tensorflow_core/python/eager/def_function.py:494 _call
results = self._stateful_fn(*args, **kwds)
tensorflow_core/python/eager/function.py:1823 __call__
return graph_function._filtered_call(args, kwargs) # pylint: disable=protected-access
tensorflow_core/python/eager/function.py:1141 _filtered_call
self.captured_inputs)
tensorflow_core/python/eager/function.py:1230 _call_flat
flat_outputs = forward_function.call(ctx, args)
tensorflow_core/python/eager/function.py:540 call
executor_type=executor_type)
tensorflow_core/python/ops/functional_ops.py:859 partitioned_call
executor_type=executor_type)
tensorflow_core/python/ops/gen_functional_ops.py:672 stateful_partitioned_call
executor_type=executor_type, name=name)
tensorflow_core/python/framework/op_def_library.py:793 _apply_op_helper
op_def=op_def)
tensorflow_core/python/framework/func_graph.py:548 create_op
compute_device)
tensorflow_core/python/framework/ops.py:3429 _create_op_internal
op_def=op_def)
tensorflow_core/python/framework/ops.py:1773 __init__
control_input_ops)
tensorflow_core/python/framework/ops.py:1613 _create_c_op
raise ValueError(str(e))

ValueError: Shape must be rank 1 but is rank 2 for 'text_preprocessor_1/SentenceTokenizer/SentencepieceTokenizeOp' (op: 'SentencepieceTokenizeOp') with input shapes: [], [?,?], [], [], [], [], [].


Best Answer

As far as I know, the Universal Sentence Encoder Multilingual modules currently available on tf.hub do not support trainable=True.

However, the following snippets build a model that works for inference. Note the tf.squeeze: the module's SentencePiece tokenizer expects a rank-1 string tensor, which is exactly what the ValueError above is complaining about.

Using V2

module_url = "https://tfhub.dev/google/universal-sentence-encoder-multilingual-large/2"
embedding_layer = hub.KerasLayer(module_url)
hidden_layer = tf.keras.layers.Dense(32, activation='relu')
output_layer = tf.keras.layers.Dense(5, activation='softmax')

inputs = tf.keras.layers.Input(shape=(1,), dtype=tf.string)
x = embedding_layer(tf.squeeze(tf.cast(inputs, tf.string)))["outputs"]
x = hidden_layer(x)
outputs = output_layer(x)

model = tf.keras.Model(inputs=inputs, outputs=outputs)
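
A quick sanity check of the embedding layer on its own (a sketch reusing embedding_layer from the snippet above; the "outputs" key comes from the V2 module, and the 512-dimensional embedding size is an assumption based on this module's documented output):

# The V2 module returns a dict; the sentence embeddings live under "outputs".
# The input must be a rank-1 batch of strings, which is what the tokenizer expects.
sentences = tf.constant(["hello tf2", "bonjour le monde"])
embeddings = embedding_layer(sentences)["outputs"]
print(embeddings.shape)  # expected: (2, 512)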

Using V3 (this version returns a plain embedding tensor, so the ["outputs"] indexing is not needed)

module_url = "https://tfhub.dev/google/universal-sentence-encoder-multilingual-large/3"
embedding_layer = hub.KerasLayer(module_url)
hidden_layer = tf.keras.layers.Dense(32, activation='relu')
output_layer = tf.keras.layers.Dense(5, activation='softmax')

inputs = tf.keras.layers.Input(shape=(1,), dtype=tf.string)
x = embedding_layer(tf.squeeze(tf.cast(inputs, tf.string)))
x = hidden_layer(x)
outputs = output_layer(x)

model = tf.keras.Model(inputs=inputs, outputs=outputs)
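
Because the hub.KerasLayer is loaded without trainable=True, the encoder weights stay frozen and only the two Dense layers are updated during training. A quick way to confirm this on the model just built (a sketch; the 512 in the first kernel shape assumes the module's usual embedding dimension):

# Only the Dense kernels and biases should appear here;
# the hub layer contributes no trainable variables.
for v in model.trainable_variables:
    print(v.name, v.shape)
# expected shapes roughly: (512, 32), (32,), (32, 5), (5,)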

Inference

model.predict([["hello tf2"]])
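
To actually fine-tune the classifier head, the model can be compiled and fit in the usual Keras way. A minimal sketch with made-up sentences and labels (the 5 classes match the softmax layer above; the data, optimizer, and epoch count are placeholders, not part of the original answer):

import numpy as np

model.compile(optimizer="adam",
              loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])

# Toy data: each example is a single string (matching the (1,) input shape),
# labels are integer class ids in [0, 5).
train_sentences = np.array([["this is great"], ["terrible service"],
                            ["not sure yet"], ["pretty good overall"]])
train_labels = np.array([4, 0, 2, 3])

model.fit(train_sentences, train_labels, epochs=3, batch_size=2)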

Regarding tensorflow2.0 - fine-tuning the Universal Sentence Encoder Multilingual Large with TF2, we found a similar question on Stack Overflow: https://stackoverflow.com/questions/58882929/
