This repository has been archived by the owner on Dec 16, 2022. It is now read-only.

When loading archived fine-tuned models for prediction, prevent non-fine-tuned pretrained transformer models from being downloaded #4599

Closed
bwriordan opened this issue Aug 25, 2020 · 1 comment · Fixed by #5172

Comments

@bwriordan

Is your feature request related to a problem? Please describe.
When loading an archived model with a pretrained transformer embedder for prediction, where the model has been fine-tuned on a dataset, a Hugging Face pretrained transformer model is always downloaded to ~/.cache/torch. The archived model is then loaded, and its fine-tuned weights replace the downloaded ones. When doing prediction with an existing archived model, the Hugging Face download is not necessary.

Describe the solution you'd like
For prediction, prevent pretrained transformer models from being downloaded.

Describe alternatives you've considered
There doesn't seem to be a way to pass an argument to from_path() to prevent the model from being downloaded.
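
For reference, a minimal sketch of the loading path described above (the archive path and predictor name are illustrative), showing where the redundant download happens:

    from allennlp.predictors import Predictor

    # Loading an archive for prediction. Internally this calls load_archive(),
    # which calls Model.from_params(); constructing the
    # PretrainedTransformerEmbedder there triggers AutoModel.from_pretrained()
    # and downloads the Hugging Face weights, even though they are immediately
    # replaced by the fine-tuned weights stored in the archive.
    predictor = Predictor.from_path("model.tar.gz", "text_classifier")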

Additional context

The downloading happens here:
model.py: model = Model.from_params(vocab=vocab, params=model_params)
pretrained_transformer_embedder.py:

self.transformer_model = cached_transformers.get(
    model_name, True, override_weights_file, override_weights_strip_prefix
)

cached_transformers.py: transformer = AutoModel.from_pretrained(model_name)

There is logic in model.py to remove pretrained embedding parameters: remove_pretrained_embedding_params(model_params).
However, this seems to target only pretrained embeddings like GloVe, via the pretrained_file config parameter.
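
One possible direction, offered only as a sketch against the transformers API rather than anything AllenNLP currently exposes: build the architecture from its configuration alone, so only the small config file is fetched and the archive's fine-tuned weights can be loaded on top of randomly initialized parameters.

    from transformers import AutoConfig, AutoModel

    model_name = "bert-base-uncased"  # illustrative

    # AutoModel.from_pretrained(model_name) would download config + weights.
    # AutoModel.from_config() builds the same architecture with randomly
    # initialized weights, so no pretrained weight files are downloaded;
    # the archive's fine-tuned weights can then be loaded over it.
    config = AutoConfig.from_pretrained(model_name)
    transformer = AutoModel.from_config(config)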

@matt-gardner
Contributor

Yes, the current behavior here isn't ideal, but I don't think it has an easy solution. That call to AutoModel.from_pretrained gives us not just the weights but also the specifics of the architecture; we don't know the weight sizes, for example, without it. It isn't trivial to bypass the way we do for a simple embedding layer.

This is pretty low priority for us, especially since you probably already downloaded the model when you trained it, so a cache miss here is the rare case, not the typical one. But if anyone figures out a good solution to this problem, we'd be happy to review a PR.
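
For anyone considering such a PR, a rough sketch of an opt-in flag, reusing the from_config idea above; the load_weights parameter and helper below are hypothetical illustrations, not AllenNLP's actual API:

    from transformers import AutoConfig, AutoModel

    def get_transformer(model_name: str, load_weights: bool = True):
        # Hypothetical variant of cached_transformers.get(): when load_weights
        # is False, build the architecture from its config only, so no
        # pretrained weights are downloaded (the archive's fine-tuned weights
        # are loaded on top of it later).
        if load_weights:
            return AutoModel.from_pretrained(model_name)
        return AutoModel.from_config(AutoConfig.from_pretrained(model_name))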
