So I've been trying to work through some BERT examples I found on GitHub; this is my first time using BERT and seeing how it works. The notebook I used is this one: https://github.com/prateekjoshi565/Fine-Tuning-BERT/blob/master/Fine_Tuning_BERT_for_Spam_Classification.ipynb
I used a different dataset, but I ran into the error TypeError: linear(): argument 'input' (position 1) must be Tensor, not str. Honestly, I don't know what I did wrong. Can anyone help me?
The code I've been using is below:
import numpy as np
import torch
import torch.nn as nn

# convert class weights to tensor
weights = torch.tensor(class_wts, dtype=torch.float)
weights = weights.to(device)

# loss function
cross_entropy = nn.NLLLoss(weight=weights)

# number of training epochs
epochs = 10

def train():
    model.train()
    total_loss, total_accuracy = 0, 0
    # empty list to save model predictions
    total_preds = []
    # iterate over batches
    for step, batch in enumerate(train_dataloader):
        # progress update after every 50 batches
        if step % 50 == 0 and not step == 0:
            print(' Batch {:>5,} of {:>5,}.'.format(step, len(train_dataloader)))
        # push the batch to gpu
        batch = [r.to(device) for r in batch]
        sent_id, mask, labels = batch
        # clear previously calculated gradients
        model.zero_grad()
        # get model predictions for the current batch
        preds = model(sent_id, mask)
        # compute the loss between actual and predicted values
        loss = cross_entropy(preds, labels)
        # add on to the total loss
        total_loss = total_loss + loss.item()
        # backward pass to calculate the gradients
        loss.backward()
        # clip the gradients to 1.0 to help prevent the exploding-gradient problem
        torch.nn.utils.clip_grad_norm_(model.parameters(), 1.0)
        # update parameters
        optimizer.step()
        # model predictions are stored on GPU, so push them to CPU
        preds = preds.detach().cpu().numpy()
        # append the model predictions
        total_preds.append(preds)

    # compute the training loss of the epoch
    avg_loss = total_loss / len(train_dataloader)
    # predictions are in the form (no. of batches, batch size, no. of classes);
    # reshape them to (no. of samples, no. of classes)
    total_preds = np.concatenate(total_preds, axis=0)
    # return the loss and predictions
    return avg_loss, total_preds
def evaluate():
    print("\nEvaluating...")
    # deactivate dropout layers
    model.eval()
    total_loss, total_accuracy = 0, 0
    # empty list to save the model predictions
    total_preds = []
    # iterate over batches
    for step, batch in enumerate(val_dataloader):
        # progress update every 50 batches
        if step % 50 == 0 and not step == 0:
            # calculate elapsed time in minutes
            elapsed = format_time(time.time() - t0)
            # report progress
            print(' Batch {:>5,} of {:>5,}.'.format(step, len(val_dataloader)))
        # push the batch to gpu
        batch = [t.to(device) for t in batch]
        sent_id, mask, labels = batch
        # deactivate autograd
        with torch.no_grad():
            # model predictions
            preds = model(sent_id, mask)
            # compute the validation loss between actual and predicted values
            loss = cross_entropy(preds, labels)
            total_loss = total_loss + loss.item()
            preds = preds.detach().cpu().numpy()
            total_preds.append(preds)

    # compute the validation loss of the epoch
    avg_loss = total_loss / len(val_dataloader)
    # reshape the predictions to (no. of samples, no. of classes)
    total_preds = np.concatenate(total_preds, axis=0)
    return avg_loss, total_preds
# set initial loss to infinite
best_valid_loss = float('inf')

# empty lists to store training and validation loss of each epoch
train_losses = []
valid_losses = []

# for each epoch
for epoch in range(epochs):
    print('\n Epoch {:} / {:}'.format(epoch + 1, epochs))

    # train model
    train_loss, _ = train()
    # evaluate model
    valid_loss, _ = evaluate()

    # save the best model
    if valid_loss < best_valid_loss:
        best_valid_loss = valid_loss
        torch.save(model.state_dict(), 'saved_weights.pt')

    # append training and validation loss
    train_losses.append(train_loss)
    valid_losses.append(valid_loss)

    print(f'\nTraining Loss: {train_loss:.3f}')
    print(f'Validation Loss: {valid_loss:.3f}')
The traceback I get is:
Epoch 1 / 10
---------------------------------------------------------------------------
TypeError Traceback (most recent call last)
<ipython-input-105-c5138ddf6b25> in <module>()
12
13 #train model
---> 14 train_loss, _ = train()
15
16 #evaluate model
5 frames
<ipython-input-103-3236a6e339dd> in train()
24
25 # get model predictions for the current batch
---> 26 preds = model(sent_id, mask)
27
28 # compute the loss between actual and predicted values
/usr/local/lib/python3.7/dist-packages/torch/nn/modules/module.py in _call_impl(self, *input, **kwargs)
887 result = self._slow_forward(*input, **kwargs)
888 else:
--> 889 result = self.forward(*input, **kwargs)
890 for hook in itertools.chain(
891 _global_forward_hooks.values(),
<ipython-input-99-9ebdcf410f97> in forward(self, sent_id, mask)
28 _, cls_hs = self.bert(sent_id, attention_mask=mask)
29
---> 30 x = self.fc1(cls_hs)
31
32 x = self.relu(x)
/usr/local/lib/python3.7/dist-packages/torch/nn/modules/module.py in _call_impl(self, *input, **kwargs)
887 result = self._slow_forward(*input, **kwargs)
888 else:
--> 889 result = self.forward(*input, **kwargs)
890 for hook in itertools.chain(
891 _global_forward_hooks.values(),
/usr/local/lib/python3.7/dist-packages/torch/nn/modules/linear.py in forward(self, input)
92
93 def forward(self, input: Tensor) -> Tensor:
---> 94 return F.linear(input, self.weight, self.bias)
95
96 def extra_repr(self) -> str:
/usr/local/lib/python3.7/dist-packages/torch/nn/functional.py in linear(input, weight, bias)
1751 if has_torch_function_variadic(input, weight):
1752 return handle_torch_function(linear, (input, weight), input, weight, bias=bias)
-> 1753 return torch._C._nn.linear(input, weight, bias)
1754
1755
TypeError: linear(): argument 'input' (position 1) must be Tensor, not str
Best Answer
I've been working through this repo as well.
Inspired by the answer provided at this link: the notebook has a class (named Bert_Arch) that inherits from nn.Module and overrides the forward method. In recent versions of transformers (v4+), self.bert() returns a ModelOutput object by default, and tuple-unpacking that object yields its string keys, which is how a str ends up being passed to the Linear layer. The fix is to add the argument return_dict=False to the self.bert() call inside forward, like this:
_, cls_hs = self.bert(sent_id, attention_mask=mask, return_dict=False)
This worked for me.
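For context, here is a minimal sketch of how that patched forward method sits inside the classifier class, modeled on the Bert_Arch class from the linked notebook; the layer names and sizes are taken from that notebook and are assumptions if your variant differs:

import torch.nn as nn

class Bert_Arch(nn.Module):
    def __init__(self, bert):
        super(Bert_Arch, self).__init__()
        self.bert = bert                     # pretrained BertModel passed in
        self.dropout = nn.Dropout(0.1)
        self.relu = nn.ReLU()
        self.fc1 = nn.Linear(768, 512)       # 768 = BERT-base hidden size (assumed)
        self.fc2 = nn.Linear(512, 2)         # 2 output classes (assumed)
        self.softmax = nn.LogSoftmax(dim=1)  # pairs with the nn.NLLLoss used above

    def forward(self, sent_id, mask):
        # return_dict=False makes self.bert return a plain tuple again,
        # so cls_hs is the pooled-output tensor instead of a key string
        _, cls_hs = self.bert(sent_id, attention_mask=mask, return_dict=False)
        x = self.fc1(cls_hs)
        x = self.relu(x)
        x = self.dropout(x)
        x = self.fc2(x)
        return self.softmax(x)

An equivalent fix is to keep the default return_dict=True and read the pooled output by attribute, e.g. cls_hs = self.bert(sent_id, attention_mask=mask).pooler_output, which doesn't depend on tuple positions.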
On python - TypeError: linear(): argument 'input' (position 1) must be Tensor, not str, we found a similar question on Stack Overflow: https://stackoverflow.com/questions/66846030/