
Question about loss selection #6

Open
Yu-Fangxu opened this issue Apr 16, 2024 · 1 comment

Comments

@Yu-Fangxu
Hi authors,
I learned a lot from your wonderful work. I want to ask whether you have ever tried Trajectory Balance (TB) as the learning objective. I ran your code and found that the loss starts out in the 10^3 to 10^4 range, which is large. With TB, the loss should not be that large, because X, Z, and Y should be consistent, leading to a large P(X, Z, Y).
So, did you try TB at first? Are there any good defaults when training with TB?

Thanks!

@MJ10
Contributor

MJ10 commented Apr 18, 2024

Hi @Yu-Fangxu, we did try trajectory balance at some point early on in the project but haven't tried it since. All the experiments were run with the modified SubTB loss, so unfortunately I can't help with good defaults. Regarding the loss - if you are initializing new LoRA weights, the initial loss can be quite high - in some other projects we have observed that for short sequences TB does work well. Hope this helps!
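For readers following along, the TB objective being discussed can be sketched as below. This is a minimal, framework-free illustration (not the repo's actual implementation): for left-to-right token generation each state has a unique parent, so the backward-policy term log P_B vanishes, and the per-trajectory loss reduces to the squared residual between log Z plus the summed forward log-probabilities and the log-reward. The function name and toy numbers are hypothetical.

```python
import math


def trajectory_balance_loss(log_z, logpf_tokens, log_reward):
    """Squared TB residual for one sampled sequence.

    log_z        : learned scalar estimate of log Z (the partition function)
    logpf_tokens : per-token log-probabilities under the forward policy
    log_reward   : log R of the completed sequence

    For autoregressive generation the backward policy is deterministic
    (each prefix has exactly one parent), so the log P_B term is 0 and
    the residual is  log Z + sum(log P_F) - log R.
    """
    residual = log_z + sum(logpf_tokens) - log_reward
    return residual * residual


# Toy example: 3 tokens with log-prob -2.0 each, log R = -5.0, log Z = 1.0.
# Residual = 1.0 + (-6.0) - (-5.0) = 0.0, so the loss is exactly 0.
loss = trajectory_balance_loss(1.0, [-2.0, -2.0, -2.0], -5.0)
```

This also illustrates the point in the question: if the model's log P(X, Z, Y) is far from the (fixed) log-reward and log Z is poorly initialized, the squared residual can start out very large, which is consistent with freshly initialized LoRA weights producing a high initial loss.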
