
round epoch only in console #30237

Merged
merged 1 commit into huggingface:main on Apr 15, 2024

Conversation

Contributor

@xdedss xdedss commented Apr 13, 2024

What does this PR do?

This PR fixes the problem that the "epoch" value is rounded to 2 digits before being logged to wandb, resulting in inaccurate plots.

Details:

In the Trainer.log function, logs["epoch"] is rounded to 2 digits. As a result, the plot in wandb is jagged, and some data points are missing from the plot if you select "epoch" as the x-axis.
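
To illustrate the collision (hypothetical step counts, not library code): with logging_steps=1, several consecutive steps collapse onto the same rounded epoch value.

# Illustration only: distinct epoch values collide after rounding to 2 digits
steps_per_epoch = 625  # hypothetical
epochs = [step / steps_per_epoch for step in range(1, 6)]
print([round(e, 2) for e in epochs])
# -> [0.0, 0.0, 0.0, 0.01, 0.01]: several log entries share one x-value,
#    so the wandb curve looks jagged and points overlap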

This is a minimal example to reproduce the issue:
from datasets import load_dataset
from transformers import AutoTokenizer, AutoModelForSequenceClassification, TrainingArguments, Trainer
import numpy as np
import evaluate

# Load Dataset
dataset = load_dataset("yelp_review_full")

# Tokenization
tokenizer = AutoTokenizer.from_pretrained("google-bert/bert-base-cased")

def tokenize_function(examples):
    return tokenizer(examples["text"], padding="max_length", truncation=True)

tokenized_datasets = dataset.map(tokenize_function, batched=True)

# Data Split
small_train_dataset = tokenized_datasets["train"].shuffle(seed=42).select(range(5000))
small_eval_dataset = tokenized_datasets["test"].shuffle(seed=42).select(range(5000))

# Model
model = AutoModelForSequenceClassification.from_pretrained("google-bert/bert-base-cased", num_labels=5)

# Training Arguments
training_args = TrainingArguments(
    output_dir="test_trainer",
    evaluation_strategy="epoch",
    logging_steps=1,
    num_train_epochs=1,
    report_to="wandb",
)

# Metrics
metric = evaluate.load("accuracy")

def compute_metrics(eval_pred):
    logits, labels = eval_pred
    predictions = np.argmax(logits, axis=-1)
    return metric.compute(predictions=predictions, references=labels)

# Trainer
trainer = Trainer(
    model=model,
    args=training_args,
    train_dataset=small_train_dataset,
    eval_dataset=small_eval_dataset,
    compute_metrics=compute_metrics,
)

# Training
trainer.train()

The commit message that introduced this rounding says it was done to make the console logging message look better, but the value is also sent to wandb for plotting, producing jagged curves.

What this PR does is round the number only in the handler that prints to the console, since we still want the accurate epoch value for other logging & plotting purposes.
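
For illustration only, here is a minimal sketch of the idea (hypothetical class name, not the exact diff): the full-precision logs dict keeps flowing to integrations such as wandb, and rounding happens only in the console-printing callback.

import copy

class ConsolePrinterSketch:
    # Sketch of a console-logging callback: round "epoch" for display only,
    # leaving the original logs dict (the one sent to wandb etc.) untouched.
    def on_log(self, args, state, control, logs=None, **kwargs):
        logs = copy.copy(logs)  # shallow copy; callers keep full precision
        if "epoch" in logs:
            logs["epoch"] = round(logs["epoch"], 2)  # cosmetic rounding
        print(logs)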

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you read the contributor guideline, Pull Request section?
  • Was this discussed/approved via a GitHub issue or the forum? No.
  • Did you make sure to update the documentation with your changes? (I am not sure where this should go in the documentation.)
  • Did you write any new necessary tests? (I am not sure if this should be tested separately, but pytest tests\trainer\test_trainer_callback.py passes.)

Who can review?

This fix is related to the Trainer: @muellerzr and @pacman100

Collaborator

@amyeroberts amyeroberts left a comment


Thanks for digging into this - looks good to me!

Happy with the changes. Once we have another approval from @muellerzr or @pacman100 we can merge

Contributor

@muellerzr muellerzr left a comment


Thanks! Makes sense to me as well. If we get issues with people noticing increased memory, we'll need to include a to_device call on logs down the road, but from a quick glance at the code I can't tell with 100% certainty whether that happens when logging something like large tensors (for whatever reason).

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@amyeroberts amyeroberts merged commit 7668101 into huggingface:main Apr 15, 2024
21 checks passed
zucchini-nlp pushed a commit to zucchini-nlp/transformers that referenced this pull request Apr 18, 2024
ArthurZucker pushed a commit that referenced this pull request Apr 22, 2024
itazap pushed a commit that referenced this pull request May 14, 2024