'NoneType' object has no attribute 'convert_tokens_to_ids' #10

ShivanshuPurohit · 2020-10-12T15:11:05Z

While running train.py I encountered this error:
Model name 'model/' was not found in model name list (bert-base-uncased, bert-large-uncased, bert-base-cased, bert-large-cased, bert-base-multilingual-uncased, bert-base-multilingual-cased, bert-base-chinese). We assumed 'model/vocab.txt' was a path or url but couldn't find any file associated to this path or url.

Traceback (most recent call last): File "train.py", line 168, in <module> train_data = data_loader.load_data('train') File "/content/BERT-keyphrase-extraction/data_loader.py", line 83, in load_data self.load_sentences_tags(sentences_file, tags_path, data) File "/content/BERT-keyphrase-extraction/data_loader.py", line 51, in load_sentences_tags sentences.append(self.tokenizer.convert_tokens_to_ids(tokens)) AttributeError: 'NoneType' object has no attribute 'convert_tokens_to_ids'

I think it isn't registering the pytorch_model.bin file, which I directly downloaded as bert-base-uncased.tar.gz

Also, when I modify the command to go in task1/train, python train.py --data_dir data/task1/train/ --bert_model_dir model/ --model_dir experiments/base_model the error is:
Loading the datasets... Traceback (most recent call last): File "train.py", line 165, in <module> data_loader = DataLoader(args.data_dir, args.bert_model_dir, params, token_pad_idx=0) File "/content/BERT-keyphrase-extraction/data_loader.py", line 28, in __init__ self.tag_pad_idx = self.tag2idx['O'] KeyError: 'O'

The text was updated successfully, but these errors were encountered:

sahiljethani · 2021-08-02T11:08:23Z

In BertTokenizer's, convert_tokens_to_ids function gives KeyError. So, I suggest to modify the for loop in the function as follows.

for token in tokens:
ids.append(self.vocab.get(token, self.vocab['[UNK]']))

arunmack789 · 2021-12-26T12:37:50Z

tokens = self.tokenizer.tokenize(line) used this instead of split()

hnrNeha · 2022-06-14T03:43:46Z

While running train.py I encountered this error: Model name 'model/' was not found in model name list (bert-base-uncased, bert-large-uncased, bert-base-cased, bert-large-cased, bert-base-multilingual-uncased, bert-base-multilingual-cased, bert-base-chinese). We assumed 'model/vocab.txt' was a path or url but couldn't find any file associated to this path or url.

Traceback (most recent call last): File "train.py", line 168, in <module> train_data = data_loader.load_data('train') File "/content/BERT-keyphrase-extraction/data_loader.py", line 83, in load_data self.load_sentences_tags(sentences_file, tags_path, data) File "/content/BERT-keyphrase-extraction/data_loader.py", line 51, in load_sentences_tags sentences.append(self.tokenizer.convert_tokens_to_ids(tokens)) AttributeError: 'NoneType' object has no attribute 'convert_tokens_to_ids'

I think it isn't registering the pytorch_model.bin file, which I directly downloaded as bert-base-uncased.tar.gz

Also, when I modify the command to go in task1/train, python train.py --data_dir data/task1/train/ --bert_model_dir model/ --model_dir experiments/base_model the error is: Loading the datasets... Traceback (most recent call last): File "train.py", line 165, in <module> data_loader = DataLoader(args.data_dir, args.bert_model_dir, params, token_pad_idx=0) File "/content/BERT-keyphrase-extraction/data_loader.py", line 28, in __init__ self.tag_pad_idx = self.tag2idx['O'] KeyError: 'O'

hey....how did you complete this step
From scibert repo, untar the weights (rename their weight dump file to pytorch_model.bin) and vocab file into a new folder model.
can you please help with this

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

'NoneType' object has no attribute 'convert_tokens_to_ids' #10

'NoneType' object has no attribute 'convert_tokens_to_ids' #10

ShivanshuPurohit commented Oct 12, 2020 •

edited

Loading

sahiljethani commented Aug 2, 2021

Uh oh!

arunmack789 commented Dec 26, 2021

Uh oh!

hnrNeha commented Jun 14, 2022

Uh oh!

'NoneType' object has no attribute 'convert_tokens_to_ids' #10

'NoneType' object has no attribute 'convert_tokens_to_ids' #10

Comments

ShivanshuPurohit commented Oct 12, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

sahiljethani commented Aug 2, 2021

Uh oh!

arunmack789 commented Dec 26, 2021

Uh oh!

hnrNeha commented Jun 14, 2022

Uh oh!

ShivanshuPurohit commented Oct 12, 2020 •

edited

Loading