Sync/v4.9.2 #232

calpt · 2021-09-17T15:55:18Z

No description provided.

mention in `save_strategy` param description that `load_best_model_at_end` can override

* document sub_group_size * style * install + issues reporting * style * style * Update docs/source/main_classes/deepspeed.rst Co-authored-by: Sylvain Gugger <[email protected]> * indent 4 * restore * style Co-authored-by: Sylvain Gugger <[email protected]>

Signed-off-by: Richard Liaw <[email protected]>

* Fix torchscript tests * Better test * Remove bogus print

…12350) * fix distributed_concat for scalar outputs * Update README.md * fixed typo (#12356) * simplify fix with terser syntax Co-authored-by: Sylvain Gugger <[email protected]> * Trigger CI Co-authored-by: Patrick von Platen <[email protected]> Co-authored-by: michal pitr <[email protected]> Co-authored-by: Sylvain Gugger <[email protected]>

* port bigbird script * adapt script a bit * change location * adapt more * save progress * init commit * style * dataset script tested * readme add

…(#12357) * Replace NotebookProgressReporter by ProgressReporter in Ray Tune run * Move to local import

* fixed multiplechoice tokenization The model would have seen two sequences: 1. [CLS]prompt[SEP]prompt[SEP] 2. [CLS]choice0[SEP]choice1[SEP] that is not correct as we want a contextualized embedding of prompt and choice * removed outer brackets for proper sequence generation

* main_process_first context manager * handle multi-node, add context description * sync desc

…d pytorch (#12359) * added log_level * fix comment * fixed log_level * Trigger CI * Unfied logging * simplified args for log_level

…BertTokenizer-like tokenizers (#12371) * Notify users that DataCollatorForWholeWordMask is limited to BertTokenier-like tokenizers * Fix code formatting

Before the code could not be used for validation only because of this line: extension = data_args.train_file.split(".")[-1] was assuming that extension must be extracted from the training dataset. This line would run regardless of the training or validation options of the user. This would lead to an error if the user only wants to run an evaluation only and does not want to do train (because the training file does not exist). I modified it to extract extension from the training file if the user wants to do train and extract it from the validation file if the user wants to run eval. This way the code can be used for both training and validation separately.

* add dependency table sync verification * improve the message * improve the message * revert * ready to merge

* added cotext manager to datasets map * fixed style and spaces * fixed warning of deprecation * changed desc

* fix_torch_device_generate_test * remove @ * boom boom * correct typos * Apply suggestions from code review Co-authored-by: Suraj Patil <[email protected]> * Apply suggestions from code review Co-authored-by: Suzana Ilić <[email protected]> * Apply suggestions from code review Co-authored-by: Suraj Patil <[email protected]> Co-authored-by: Suzana Ilić <[email protected]>

* Add _CHECKPOINT_FOR_DOC * Update src/transformers/models/funnel/modeling_funnel.py Co-authored-by: Sylvain Gugger <[email protected]> Co-authored-by: Sylvain Gugger <[email protected]>

* [Deepspeed] warmup_ratio docs * Update docs/source/main_classes/deepspeed.rst Co-authored-by: Sylvain Gugger <[email protected]> * style * Update docs/source/main_classes/deepspeed.rst Co-authored-by: Sylvain Gugger <[email protected]> * style Co-authored-by: Sylvain Gugger <[email protected]>

…12836) * document Deepspeed-Inference and parallelformers * Apply suggestions from code review Co-authored-by: Sylvain Gugger <[email protected]> Co-authored-by: Sylvain Gugger <[email protected]>

* Raise an issue if the pytorch version is < 1.8.0 * Attempt to add a test to ensure it correctly raises. * Missing docstring. * Second attempt, patch with string absolute import. * Let's do the call before checking it was called ... * use the correct function ... 🤦 * Raise ImportError and AssertionError respectively when unable to find torch and torch version is not sufficient. * Correct path mock patching * relax constraint for torch_onnx_dict_inputs to ge instead of eq. * Style. * Split each version requirements for torch. * Let's compare version directly. * Import torch_version after checking pytorch is installed. * @require_torch

GPT-Neo ONNX export and task / feature refactoring Authored-by: Michael Benayoun <[email protected]>

T5 with past ONNX export, and more explicit past_key_values inputs and outputs names for ONNX model Authored-by: Michael Benayoun <[email protected]>

* Add MBART to models exportable with ONNX * unittest mock * Add tests * Misc fixes

* Add to ONNX docs * Add MBART example * Update docs/source/serialization.rst Co-authored-by: Sylvain Gugger <[email protected]> Co-authored-by: Sylvain Gugger <[email protected]>

* Fix tied weights on TPU * Manually tie weights in no trainer examples * Fix for test * One last missing * Gettning owned by my scripts * Address review comments * Fix test * Fix tests * Fix reformer tests

sgugger and others added 30 commits June 23, 2021 13:25

Release: v4.8.0

9252a51

v4.9.0.dev0

2150dfe

Update training_args.py (#12328)

3694484

mention in `save_strategy` param description that `load_best_model_at_end` can override

Fix default to logging_dir lost in merge conflict

cf3c919

try-this (#12338)

7875b63

Signed-off-by: Richard Liaw <[email protected]>

[examples/Flax] move the examples table up (#12341)

aef3823

Fix torchscript tests (#12336)

8ef62ec

* Fix torchscript tests * Better test * Remove bogus print

Document patch release v4.8.1

5b1b563

Add flax/jax quickstart (#12342)

f2c4ce7

Update README.md

aa550c4

fixed typo (#12356)

d4ce31e

Add FlaxBigBird QuestionAnswering script (#12233)

332a245

* port bigbird script * adapt script a bit * change location * adapt more * save progress * init commit * style * dataset script tested * readme add

Replace NotebookProgressReporter by ProgressReporter in Ray Tune run …

238521b

…(#12357) * Replace NotebookProgressReporter by ProgressReporter in Ray Tune run * Move to local import

Style

a3daabf

remove extra white space from log format (#12360)

4a872ca

[trainer] add main_process_first context manager (#12351)

64e6098

* main_process_first context manager * handle multi-node, add context description * sync desc

[Examples] Replicates the new --log_level feature to all trainer-base…

539ee45

…d pytorch (#12359) * added log_level * fix comment * fixed log_level * Trigger CI * Unfied logging * simplified args for log_level

updated example template (#12365)

9a75459

replace print with logger (#12368)

ff5cdc0

[Documentation] Warn that DataCollatorForWholeWordMask is limited to …

c7faf2c

…BertTokenizer-like tokenizers (#12371) * Notify users that DataCollatorForWholeWordMask is limited to BertTokenier-like tokenizers * Fix code formatting

Add possibility to maintain full copies of files (#12312)

57461ac

[CI] add dependency table sync verification (#12364)

d25ad34

* add dependency table sync verification * improve the message * improve the message * revert * ready to merge

[Examples] Added context manager to datasets map (#12367)

04dbea3

* added cotext manager to datasets map * fixed style and spaces * fixed warning of deprecation * changed desc

Update README.md

27b6ac4

Fix copies

276bc14

LysandreJik and others added 20 commits July 21, 2021 08:29

Add _CHECKPOINT_FOR_DOC to all models (#12811)

ac3cb66

* Add _CHECKPOINT_FOR_DOC * Update src/transformers/models/funnel/modeling_funnel.py Co-authored-by: Sylvain Gugger <[email protected]> Co-authored-by: Sylvain Gugger <[email protected]>

[debug] DebugUnderflowOverflow doesn't work with DP (#12816)

cf0755a

Raise warning in HP search when hp is not in args (#12831)

8c2384d

[parallelism doc] document Deepspeed-Inference and parallelformers (#…

27a8c9e

…12836) * document Deepspeed-Inference and parallelformers * Apply suggestions from code review Co-authored-by: Sylvain Gugger <[email protected]> Co-authored-by: Sylvain Gugger <[email protected]>

Fix type of max_seq_length arg in run_swag.py (#12832)

fcf8301

Release: v4.9.0

72aee83

Add doc for v4.9.0

6cab8b3

Fix barrier for SM distributed (#12853)

8ee16d8

Release: v4.9.1

bff1c71

Fix push_to_hub for TPUs (#12895)

2c255a2

GPT-Neo ONNX export (#12911)

94b7db9

GPT-Neo ONNX export and task / feature refactoring Authored-by: Michael Benayoun <[email protected]>

T5 with past ONNX export (#13014)

a12fa50

T5 with past ONNX export, and more explicit past_key_values inputs and outputs names for ONNX model Authored-by: Michael Benayoun <[email protected]>

Put smaller ALBERT model (#13028)

f595ea3

Add MBART to models exportable with ONNX (#13049)

226763a

* Add MBART to models exportable with ONNX * unittest mock * Add tests * Misc fixes

Add to ONNX docs (#13048)

bfd5354

* Add to ONNX docs * Add MBART example * Update docs/source/serialization.rst Co-authored-by: Sylvain Gugger <[email protected]> Co-authored-by: Sylvain Gugger <[email protected]>

Tpu tie weights (#13030)

ec78422

* Fix tied weights on TPU * Manually tie weights in no trainer examples * Fix for test * One last missing * Gettning owned by my scripts * Address review comments * Fix test * Fix tests * Fix reformer tests

Patch release: v4.9.2

41981a2

remove files from 'v4.9.2' before merge

68730a4

calpt added the sync label Sep 17, 2021

calpt marked this pull request as ready for review September 17, 2021 15:56

Merge stripped branch 'v4.9.2'

f8e84af

calpt force-pushed the sync/v4.9.2 branch from 7698f12 to f8e84af Compare September 17, 2021 16:09

Fix config keys in GPT2ModelTester

226e87b

calpt force-pushed the sync/v4.9.2 branch from a1de6ae to b024a3c Compare September 20, 2021 09:33

Post-merge fixes for tests & example scripts.

4ac892d

calpt force-pushed the sync/v4.9.2 branch from b024a3c to 4ac892d Compare September 20, 2021 09:42

calpt merged commit 76defae into adapter-hub:master Sep 20, 2021

calpt deleted the sync/v4.9.2 branch September 20, 2021 09:53

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sync/v4.9.2 #232

Sync/v4.9.2 #232

calpt commented Sep 17, 2021

Sync/v4.9.2 #232

Sync/v4.9.2 #232

Conversation

calpt commented Sep 17, 2021