In config generation, switch to two-stage training when the mono data is too small #632

Open
Tracked by #633
gregtatum opened this issue May 24, 2024 · 1 comment
Labels
enhancement New feature or request

Comments

@gregtatum
Member

From: #620 (comment)

But I found another use case where we don't want ["one-stage" teacher training when using a pre-trained back-translations model]: if the amount of mono-trg data is too small (for example, for en-lt), we still want to use two-stage. We don't want to loop over 5M back-translated sentences.
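A minimal sketch of what this heuristic could look like in config generation; the `select_teacher_mode` helper and the sentence-count threshold are illustrative assumptions, not the pipeline's actual API:

```python
# Illustrative cutoff: the ~5M back-translated sentences available for
# en-lt were cited above as too small for one-stage training. The exact
# threshold is an assumption for the sketch.
MIN_MONO_TRG_SENTENCES = 10_000_000


def select_teacher_mode(mono_trg_sentences: int, pretrained_backtranslations: bool) -> str:
    """Pick the teacher training mode for the generated config.

    Hypothetical helper: one-stage only makes sense when a pre-trained
    back-translations model is used and there is enough mono-trg data;
    otherwise a small back-translated corpus would be looped over too
    many times, so keep the two-stage curriculum.
    """
    if pretrained_backtranslations and mono_trg_sentences >= MIN_MONO_TRG_SENTENCES:
        return "one-stage"
    return "two-stage"
```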

@eu9ene
Collaborator

eu9ene commented Jan 28, 2025

Now, with HPLT2, NLLB, and Monocleaner, we always have a lot of mono data. I'd say we should still use two-stage by default and switch to one-stage if it stops too early.
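As a rough sketch of that fallback, assuming a hypothetical `train_teacher` callable that reports how many updates it ran before stopping (not the pipeline's real interface), and an illustrative cutoff for "stops too early":

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class TrainResult:
    # Hypothetical: optimizer updates completed before early stopping fired.
    updates_completed: int


def train_with_fallback(train_teacher: Callable[[str], TrainResult],
                        min_updates: int = 10_000) -> None:
    """Default to two-stage; retrain one-stage if stage one stops too early.

    min_updates is an assumed threshold for "stopped too early".
    """
    result = train_teacher("two-stage")
    if result.updates_completed < min_updates:
        # Stage one converged suspiciously fast; fall back to one-stage.
        train_teacher("one-stage")
```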
