python - 在 colab 上使用 TPU 上的估计器进行 BERT 微调 TypeError : unsupported operand type(s) for *=: 'NoneType' and 'int'-6ren

python - 在 colab 上使用 TPU 上的估计器进行 BERT 微调 TypeError : unsupported operand type(s) for *=: 'NoneType' and 'int'

转载作者：行者123 更新时间：2023-12-02 01:56:31

26

4

我在谷歌的 colab 上写了一个 jupyter-notebook 来微调(用于文本分类)我已经仅在阿拉伯语上进行过预训练的 BERT 版本。当训练开始时我无法解决这个错误。

我按照google在github上提供的笔记本进行操作

模型构建代码:

model_fn = model_fn_builder(
  bert_config=modeling.BertConfig.from_json_file(CONFIG_FILE),
  num_labels=len(label_list),
  init_checkpoint=INIT_CHECKPOINT,
  learning_rate=LEARNING_RATE,
  num_train_steps=num_train_steps,
  num_warmup_steps=num_warmup_steps,
  use_tpu=True,
  use_one_hot_embeddings=True
)


tpu_cluster_resolver = tf.contrib.cluster_resolver.TPUClusterResolver(TPU_ADDRESS)

run_config = tf.contrib.tpu.RunConfig(
    cluster=tpu_cluster_resolver,
    model_dir=OUTPUT_DIR,
    save_checkpoints_steps=SAVE_CHECKPOINTS_STEPS,
    tpu_config=tf.contrib.tpu.TPUConfig(
        iterations_per_loop=ITERATIONS_PER_LOOP,
        num_shards=NUM_TPU_CORES,
        per_host_input_for_training=tf.contrib.tpu.InputPipelineConfig.PER_HOST_V2))

estimator = tf.contrib.tpu.TPUEstimator(
    use_tpu=USE_TPU,
    model_fn=model_fn,
    config=run_config,
    train_batch_size=TRAIN_BATCH_SIZE,
    eval_batch_size=EVAL_BATCH_SIZE,
    predict_batch_size=PREDICT_BATCH_SIZE,)

train_input_fn = input_fn_builder(
    features=train_features,
    seq_length=MAX_SEQ_LENGTH,
    is_training=True,
    drop_remainder=False)

#tf.reset_default_graph()
print(f'Beginning Training!')
current_time = datetime.now()
estimator.train(input_fn=train_input_fn, max_steps=TRAIN_STEPS)
print("Training took time ", datetime.now() - current_time)

错误代码:

/usr/local/lib/python3.6/dist-packages/tensorflow/python/tpu/tpu_sharding.py in _unshard_shape(self, shape)
    214                        (shape.as_list(), self._shard_dimension))
    215     dims = shape.as_list()
--> 216     dims[self._shard_dimension] *= self._number_of_shards
    217     return tensor_shape.as_shape(dims)
    218 

TypeError: unsupported operand type(s) for *=: 'NoneType' and 'int'

参数和其余代码位于 Colab 笔记本的共享副本中:colab_link

最佳答案

为了社区的利益，在本节中提及答案(即使在评论部分中对此进行了解答)。

在函数 input_fn_builder 中将参数 drop_remainder 设置为 True 已解决了该问题。

相应的代码片段如下所示:

train_input_fn = input_fn_builder(
    features=train_features,
    seq_length=MAX_SEQ_LENGTH,
    is_training=True,
    drop_remainder=False)

关于python - 在 colab 上使用 TPU 上的估计器进行 BERT 微调 TypeError : unsupported operand type(s) for *=: 'NoneType' and 'int' ，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/58029896/

26

4

0

文章推荐： postgresql - Postgres 如何从外部服务器传输所有枚举

c++ - 为什么我收到错误 : invalid operands to binary expression after overloaded += operand?
我正在尝试为一个简单的数学 Vector 类重载 += 运算符，以对两个 vector 的元素求和，如下所示: vector1 += vector2 部分Vector2D.h: #ifndef _VE
C++/ASM—— "Operand size conflict"， "Improper operand type"
我正在尝试在 ASM 中编写一个简单的 for 循环。我需要访问两个数组，它们是在 C++ 代码片段之外编写的(即 OrigChars 和 EncrChars) char temporary_
c++ - 错误 : no operand "<<" matches these operands when try to use Qstring with OStream object
Qt 版本 5.01 平台 windows 64 位问题:错误:没有操作数“ #include #include #include namespace { std::ost
c - 遇到错误: invalid operands to binary & and error: invalid operands to binary |?怎么办
#include #include #define SIGBAD(signo) ((signo) = NSIG) int sigaddset(sigset_t *set, int signo
java - SQL语法错误异常 : The '+' operator with a left operand type of 'VARCHAR' and a right operand type of 'VARCHAR' is not supported
请看下面的代码。我正在使用 Apache Derby 作为嵌入式数据库 public List getDetails(String name) { List details =
c++ - 错误 X8000 : D3D11 Internal Compiler error : Invalid Bytecode: Invalid operand type for operand #1 of opcode #86 (counts are 1-based)
我和我的讲师/实验室助理都被难住了。出于某种原因，以下 HLSL 代码在输出窗口中返回: error X8000 : D3D11 Internal Compiler error : Invalid
regex - 如何修复目录名: missing operand
我有一个创建时间跟踪器的 NPM 包，它使用 for in 来定位 MD 文件的标题，然后将其转换为跟踪器。目前，在 Mac 上运行它时工作正常，在 Windows 上我收到 dirname:miss
vb.net - 直接获取术语 : operands, 参数和参数
请注意这个问题是不是 this 的副本或 this ，因为其他问题没有运算符(operator) 组件，不要询问我正在询问的参数和参数的详细信息。我将使用 vb.net 教授第一门编程类(class
bash - 如何修复 "readlink: missing operand"？
输入 sudo apt autoremove 后出现此错误在终端 readlink: missing operand Try 'readlink --help' for more informatio
c - 海湾合作委员会错误 : Invalid operands to binary +
为什么 GCC 给我这个错误？我在这里做错了什么？ temp.c: In function main: temp.c:6: error: invalid operands to binary +
组装错误 : "instruction operands must be the same size"
我对此很陌生，我正在尝试将值从一个数组移动到另一个数组，它假设是: vec1 = 1, 2, 3, 4, 5 vec2 = 5, 4, 3, 2, 1 但我收到一个错误:“指令操作数必须是相同的大小
Javascript 或表达式 : return Operand that is *not* NaN
我有一个 OR 表达式，它应该返回不是 NaN 的操作数: (1 || NaN) // evaluates to 1 (NaN || 1) // evaluates to 1 但是当另一个操作数也是一
javascript - 如何编写操作数(operator(operand))之类的JS函数？
关闭。此题需要details or clarity 。目前不接受答案。想要改进这个问题吗？通过 editing this post 添加详细信息并澄清问题. 已关闭 4 年前。 Improve th
javascript - JS : OR operation with more than two operands?
这道题是基于 Javascript 的，但适用于一般的逻辑运算拿代码举例 if (baseText[i] == "."){ /*splice array*/;} if (baseText[
C - "Error: Invalid operands to binary != ..."
我似乎无法找到使程序运行的问题。 C 告诉我“错误:二进制操作数无效!= 'grocerylist'(又名 structgrocerylist)和 'int' 当我尝试解决此问题时，会弹出其他错误，除
C#自增运算符错误: Operand is not syntactically correct?
我正在查看 the docs并尝试了解运算符的实际工作方式。 The increment operator (++) increments its operand by 1. The incremen
java - 如何匹配字符串中的模式，其中要匹配的字符串为 "operands": ["10000"]
我有一个很长的 json 字符串，"attributeName":"Loc ID"},"operands":["10000"]}],"Frequency":{"type":" 这个只是其中的一部分，我
c++ - 使用英特尔线程构建模块 : error operands to ? 进行编译:
目前，我尝试编译 OpenVDB，它依赖于 Threading Building Blocks。我收到以下错误: In file included from /usr/include/tbb/enum
c++ - 错误 : no operator "<" matches these operands
我收到的错误: /usr/include/c++/7/bits/stl_function.h:386: error: no operator " NearestNeighbor::nearest_pa
c++ - 模板函数 : invalid operands 中的运算符<<
我有一个类Color , 那有 friend std::ostream& operator void print_head(const T& head, sost& o) { o (rsym,

首页

博学

6Ren·AI

商城