This repository has been archived by the owner on Dec 16, 2022. It is now read-only.

Add way to initialize SrlBert without pretrained BERT weights #257

Merged
epwalsh merged 3 commits into main from srl-no-load-weights on May 2, 2021

Conversation

@epwalsh (Member) commented Apr 30, 2021

Closes allenai/allennlp#5170.

You can avoid caching/loading pretrained BERT weights by setting the bert_model parameter of SrlBert to a dictionary that corresponds to the BertConfig from HuggingFace. You'll also need a local copy of the config and vocab to avoid downloads from the dataset reader, so the easiest complete work-around would look something like this:

from transformers import AutoConfig
from allennlp.predictors import Predictor

transformer_model_name = "bert-base-uncased"
archive_path = "https://storage.googleapis.com/allennlp-public-models/structured-prediction-srl-bert.2020.12.15.tar.gz"

# Need copies of the transformer config and vocab in a local directory.
local_config_path = "./" + transformer_model_name + "-local"

config = AutoConfig.from_pretrained(local_config_path)

predictor = Predictor.from_path(
    archive_path,
    overrides={
        "model.bert_model": config.to_dict(),
        "dataset_reader.bert_model_name": local_config_path,
    },
)

You can set up the local files you need by running this snippet once, before loading the predictor:

import os
from transformers import AutoConfig, AutoTokenizer

transformer_model_name = "bert-base-uncased"
local_config_path = "./" + transformer_model_name + "-local"
os.makedirs(local_config_path, exist_ok=True)

# Save the tokenizer (vocab) and config into the local directory.
tokenizer = AutoTokenizer.from_pretrained(transformer_model_name)
config = AutoConfig.from_pretrained(transformer_model_name)
tokenizer.save_pretrained(local_config_path)
config.to_json_file(local_config_path + "/config.json")
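
Once the local files are in place, the predictor loaded with the overrides above behaves like any other AllenNLP SRL predictor. A minimal usage sketch, assuming the standard SRL predictor output format (the example sentence is arbitrary):

# Run semantic role labeling on one sentence and print the tagged frames.
result = predictor.predict(
    sentence="The keys, which were needed to access the building, were locked in the car."
)
for verb in result["verbs"]:
    print(verb["description"])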

This is related to allenai/allennlp#5172, but it required its own solution since the SrlBert model is a bit of an oddball: it uses the BERT model class from transformers directly, instead of going through AllenNLP's PretrainedTransformerEmbedder.
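
For illustration, the core idea is that the bert_model argument can now be either a model name string or a config dictionary. A simplified sketch of that dispatch (not the literal code from this PR; _load_bert is a hypothetical helper name):

from typing import Any, Dict, Union
from transformers import BertConfig, BertModel

def _load_bert(bert_model: Union[str, Dict[str, Any]]) -> BertModel:
    # Hypothetical sketch: a dict is treated as a BertConfig, so the model is
    # built with randomly initialized weights and nothing is downloaded;
    # a string still goes through from_pretrained as before.
    if isinstance(bert_model, dict):
        return BertModel(BertConfig.from_dict(bert_model))
    return BertModel.from_pretrained(bert_model)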

@ArjunSubramonian (Contributor) left a comment

Looks great! Is there a test for this?

@@ -66,7 +66,7 @@ jobs:
       - uses: actions/cache@v2
         with:
           path: ${{ env.pythonLocation }}
-          key: ${{ runner.os }}-pydeps-${{ env.pythonLocation }}-${{ hashFiles('requirements.txt') }}-${{ hashFiles('dev-requirements.txt') }}
+          key: ${{ runner.os }}-pydeps-${{ env.pythonLocation }}-${{ hashFiles('requirements.txt') }}-${{ hashFiles('dev-requirements.txt') }}-v2
@epwalsh (Member, Author) commented:

The cache was corrupted for some reason.

@epwalsh (Member, Author) commented Apr 30, 2021

> Looks great! Is there a test for this?

No, but there should be. I'll add one.

@ArjunSubramonian (Contributor) left a comment

LGTM!

@epwalsh epwalsh merged commit 845fe4c into main May 2, 2021
@epwalsh epwalsh deleted the srl-no-load-weights branch May 2, 2021 21:50
Development

Successfully merging this pull request may close this issue: Cannot load the pre-trained models (allenai/allennlp#5170).