Fix bugs in EncoderDecoderTrainer and EncoderDecoderLoss
I identified two bugs while using `EncoderDecoderTrainer` for pretraining:
Issue 1: Negative Loss Values in EncoderDecoderLoss
When calculating the loss, if a value in `x_true_stds` is zero, it's replaced with the corresponding value from `x_true_means`. If those mean values are negative, they cause the loss to become negative, which is problematic for optimization.

The problematic code:
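As a reference point, here is a minimal sketch of the relevant computation, reconstructed from the description above; the actual implementation in the library may differ in names and details:

```python
import torch

def encoder_decoder_loss(x_true, x_pred, mask, eps=1e-9):
    # Reconstruction errors on the masked entries, scaled per feature by
    # the variance of the true (embedded) inputs.
    errors = x_pred - x_true
    reconstruction_errors = torch.mul(errors, mask) ** 2
    x_true_means = torch.mean(x_true, dim=0)
    x_true_means[x_true_means == 0] = 1
    x_true_stds = torch.std(x_true, dim=0) ** 2
    # BUG: when a feature's std is zero it is replaced by the raw mean,
    # which can be negative and makes the scaled loss term negative.
    x_true_stds[x_true_stds == 0] = x_true_means[x_true_stds == 0]
    features_loss = torch.matmul(reconstruction_errors, 1 / x_true_stds)
    nb_reconstructed_variables = torch.sum(mask, dim=1)
    features_loss_norm = features_loss / (nb_reconstructed_variables + eps)
    return torch.mean(features_loss_norm)
```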
Issue 2: Inconsistent Return Order in _forward_tabnet
The `_forward_tabnet` method returns values in a different order than the other encoder-decoder models:

- Other models: `x_embed, x_embed_rec, mask`
- `_forward_tabnet`: `x_embed_rec, x_embed, mask`

This inconsistency causes incorrect inputs to the loss function when using TabNet with `EncoderDecoderTrainer`.
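To see why the order matters, here is a toy check using the loss sketch from Issue 1. The tensors are made-up stand-ins; the point is only that the loss is not symmetric in its first two arguments, because the scaling statistics come from `x_true`:

```python
import torch

torch.manual_seed(0)
x_embed = torch.randn(8, 4) * 5.0   # stand-in for the embedded "true" inputs
x_embed_rec = torch.randn(8, 4)     # stand-in for their reconstruction
mask = torch.ones(8, 4)

# encoder_decoder_loss is the sketch from Issue 1; swapping its first two
# arguments changes the per-feature scaling and hence the loss value.
print(encoder_decoder_loss(x_embed, x_embed_rec, mask))  # intended order
print(encoder_decoder_loss(x_embed_rec, x_embed, mask))  # order produced by the swapped return
```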
Solution
For Issue 1:
Modified the `EncoderDecoderLoss` to use the absolute value of the means when replacing zero standard deviations:
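A minimal sketch of the change, applied to the loss shown under Issue 1; the helper name here is just for illustration, and in the actual change these lines live inside `EncoderDecoderLoss`:

```python
import torch

def per_feature_scaling(x_true: torch.Tensor) -> torch.Tensor:
    # Hypothetical helper isolating the lines touched by the fix.
    x_true_means = torch.mean(x_true, dim=0)
    x_true_means[x_true_means == 0] = 1
    x_true_stds = torch.std(x_true, dim=0) ** 2
    # Fix: fall back to the absolute value of the mean, so the scaling
    # factor is always positive.
    x_true_stds[x_true_stds == 0] = torch.abs(x_true_means[x_true_stds == 0])
    return x_true_stds
```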
This ensures that the scaling factor remains positive, which is conceptually correct since standard deviations are always non-negative.
For Issue 2:
Changed the return order in `_forward_tabnet` to match the convention used by the other models:
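A sketch of the corrected method follows; the body is illustrative (how the embeddings, reconstruction, and mask are produced is an assumption), and only the return order reflects the actual change:

```python
import torch

def _forward_tabnet_sketch(encoder, decoder, X: torch.Tensor):
    # Hypothetical stand-in for the trainer's _forward_tabnet; only the
    # return order is the point of this sketch.
    mask = torch.bernoulli(torch.full_like(X, 0.2))
    x_embed = encoder(X * (1 - mask))
    x_embed_rec = decoder(x_embed)
    # Before the fix: return x_embed_rec, x_embed, mask
    # After the fix, matching the other encoder-decoder models:
    return x_embed, x_embed_rec, mask
```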