
eval_on_start triggers AttributeError in JupyterLab #32689

Closed · 2 of 4 tasks
matteosantama opened this issue Aug 14, 2024 · 1 comment · Fixed by #32849


matteosantama commented Aug 14, 2024

System Info

transformers: '4.44.0'
Python: '3.10.14 | packaged by conda-forge | (main, Mar 20 2024, 12:45:18) [GCC 12.3.0]'

Who can help?

@muellerz @SunMarc

Information

  • The official example scripts
  • My own modified scripts

Tasks

  • An officially supported task in the examples folder (such as GLUE/SQuAD, ...)
  • My own task or dataset (give details below)

Reproduction

Here is a self-contained script that triggers the error:

from datasets import Dataset
from transformers import (
    AutoModelForSequenceClassification,
    AutoTokenizer,
    Trainer,
    TrainingArguments,
)

model_id = "distilbert/distilbert-base-uncased"

model = AutoModelForSequenceClassification.from_pretrained(
    model_id, num_labels=2
)
tokenizer = AutoTokenizer.from_pretrained(model_id, use_fast=True)

# Tiny one-example dataset, tokenized in place and reused for train and eval.
ds = Dataset.from_dict({"text": ["Hello world!"], "labels": [0]}).map(
    lambda b: tokenizer(b["text"], truncation=True),
    batched=True,
)


trainer = Trainer(
    model=model,
    args=TrainingArguments(
        output_dir="temp",
        eval_strategy="steps",
        save_strategy="no",
        eval_on_start=True,  # <-- setting to False avoids the error
    ),
    tokenizer=tokenizer,
    train_dataset=ds,
    eval_dataset=ds,
    compute_metrics=lambda _: {"metric": 1.0},
)
trainer.train()

The error I get is

AttributeError: 'NotebookTrainingTracker' object has no attribute 'value'

which seems related to the fact that I'm running the code in a Jupyter notebook.
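
For anyone hitting this before a fix lands, below is a possible workaround sketch: remove the notebook progress callback and fall back to plain console logging. remove_callback/add_callback, PrinterCallback and NotebookProgressCallback are existing transformers APIs, but whether this fully sidesteps the problem is my assumption, not something I have verified.

from transformers.trainer_callback import PrinterCallback
from transformers.utils.notebook import NotebookProgressCallback

# Assumption: the failing code path lives in the notebook progress callback,
# so swapping it for console logging should avoid the AttributeError while
# keeping eval_on_start=True.
trainer.remove_callback(NotebookProgressCallback)
trainer.add_callback(PrinterCallback)
trainer.train()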

Expected behavior

I should be able to use the eval_on_start parameter without triggering an error.

fshp971 (Contributor) commented Aug 16, 2024

Hi,

I found that the cause of this exception is the following: in a Jupyter Notebook, when a transformers.Trainer is constructed with eval_on_start=True, calling Trainer.train() leads to this callback execution order:

  1. NotebookProgressCallback.on_train_begin()
  2. NotebookProgressCallback.on_prediction_step()
  3. NotebookProgressCallback.on_train_end()

Running NotebookProgressCallback.on_prediction_step() directly after NotebookProgressCallback.on_train_begin() means that two attributes of NotebookProgressBar, namely value and label, are used before they are assigned (see the update(), update_bar() and display() methods of NotebookProgressBar in transformers/utils/notebook.py for details).
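
To make the failure mode concrete, here is a minimal, hypothetical sketch (not the actual NotebookProgressBar code from transformers) of a progress bar whose rendering methods read attributes that only update() assigns; initializing them up front is one way to avoid the crash, though the actual change in the PR may differ.

class ProgressBarSketch:
    # Hypothetical stand-in for NotebookProgressBar, only to illustrate the bug.
    def __init__(self, total):
        self.total = total
        # Without these defaults, calling update_bar()/display() before update()
        # would raise AttributeError: which is what happens when
        # on_prediction_step() runs right after on_train_begin() with
        # eval_on_start=True.
        self.value = 0
        self.label = ""

    def update(self, value):
        # Normal path: assign first, then render.
        self.value = value
        self.update_bar(value)

    def update_bar(self, value):
        # Reads self.value and self.total; safe only if value was already set.
        self.label = f"[{self.value}/{self.total}]"
        self.display()

    def display(self):
        # Rendering reads self.label, so it must have been assigned by now.
        print(self.label)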

I have opened a PR to fix this.

ArthurZucker pushed a commit that referenced this issue Aug 22, 2024

fix: `AttributeError` raised when using `Trainer` with `eval_on_start=True` in Jupyter Notebook. (#32849)

zucchini-nlp pushed a commit to zucchini-nlp/transformers that referenced this issue Aug 30, 2024

fix: `AttributeError` raised when using `Trainer` with `eval_on_start=True` in Jupyter Notebook. (huggingface#32849)