python - huggingface 变形金刚 longformer 优化器警告 AdamW-6ren

python - huggingface 变形金刚 longformer 优化器警告 AdamW

转载作者：行者123 更新时间：2023-12-05 04:35:05

24

4

当我尝试运行此 page 中的代码时，我收到以下警告.

/usr/local/lib/python3.7/dist-packages/transformers/optimization.py:309: FutureWarning: This implementation of AdamW is deprecated and will be removed in a future version. Use thePyTorch implementation torch.optim.AdamW instead, or set `no_deprecation_warning=True` to disable this warning
  FutureWarning,

我非常困惑，因为代码似乎根本没有设置优化器。最有可能设置优化器的地方可能在下面，但我不知道如何更改优化器

# define the training arguments
training_args = TrainingArguments(
    output_dir = '/media/data_files/github/website_tutorials/results',
    num_train_epochs = 5,
    per_device_train_batch_size = 8,
    gradient_accumulation_steps = 8,    
    per_device_eval_batch_size= 16,
    evaluation_strategy = "epoch",
    disable_tqdm = False, 
    load_best_model_at_end=True,
    warmup_steps=200,
    weight_decay=0.01,
    logging_steps = 4,
    fp16 = True,
    logging_dir='/media/data_files/github/website_tutorials/logs',
    dataloader_num_workers = 0,
    run_name = 'longformer-classification-updated-rtx3090_paper_replication_2_warm'
)

# instantiate the trainer class and check for available devices
trainer = Trainer(
    model=model,
    args=training_args,
    compute_metrics=compute_metrics,
    train_dataset=train_data,
    eval_dataset=test_data
)
device = 'cuda' if torch.cuda.is_available() else 'cpu'
device

我使用相同的代码尝试了另一个转换器，例如 distilbert-base-uncased，但它似乎在没有任何警告的情况下运行。

此警告是否更针对 longformer？
我应该如何更改优化器？

最佳答案

import torch_optimizer as optim
    
optim.AdamW(params, opt.learning_rate, (opt.optim_alpha, opt.optim_beta), opt.optim_epsilon, weight_decay=opt.weight_decay)

可以这样使用。

关于python - huggingface 变形金刚 longformer 优化器警告 AdamW，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/71113363/

24

4

0

文章推荐： python - 如何使用泛型类型的构造函数

文章推荐： Flutter:无法使用 workmanager 初始化共享首选项

文章推荐： python - 如何在使用的 Python 中调用 C++ 代码

word-embedding - longformer 的最后一层用于文档嵌入
使用 longformer API 返回有限数量层的正确方法是什么？与基本情况不同BERT ，我不清楚返回类型如何只获取最后 N 层。所以，我运行这个: from transformers imp
css - LongForm 标签显示为内联，但在一行而不是两行
我正在尝试使更长的标签显示与文本框内联。我希望标签位于一行并与文本框内联显示。我正在使用 Bootstrap 3，但我似乎无法弄清楚如何实现这一点。下面是一些示例代码: Two line lab
python - huggingface 变形金刚 longformer 优化器警告 AdamW
当我尝试运行此 page 中的代码时，我收到以下警告. /usr/local/lib/python3.7/dist-packages/transformers/optimization.py:309:
huggingface-transformers - 如何从 HuggingFace Longformer 中提取文档嵌入
想做类似的事情 tokenizer = BertTokenizer.from_pretrained('bert-base-uncased') model = BertModel.from_pretra

首页

博学

6Ren·AI

商城

python - huggingface 变形金刚 longformer 优化器警告 AdamW