Training process #28
Has anyone looked into this yet? I am also interested in this, since training Enformer from scratch using your implementation doesn't reproduce the same Pearson correlation values (the max I am getting is ~0.4).
@fransilvionGenomica @maksimallist I tried a while ago using TPUs (didn't have access to a large cluster of GPUs at the time) and didn't hit the mark (got around 0.5-0.6). This was before Ziga officially released their model over at DeepMind. The training script I used is all open sourced here. The original reason for making the repo was a contracting project for a local startup.
@fransilvionGenomica are you planning on training it on proprietary data with your own GPU cluster?
@lucidrains I am training your PyTorch implementation on a single A100 GPU node with the original Basenji dataset and gradient accumulation. I was using the following DeepMind notebook as the reference: https://github.com/google-deepmind/deepmind-research/blob/master/enformer/enformer-training.ipynb. I do believe that it is possible to train the model on GPUs, since in the recent Borzoi paper from the Enformer co-authors they did not use TPUs (https://www.biorxiv.org/content/10.1101/2023.08.30.555582v1). Unfortunately, they don't provide any training script (https://github.com/calico/borzoi).
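For anyone following along, a single-GPU training loop with gradient accumulation roughly like the one described above can be sketched as follows. This is a minimal illustration, not the repository's actual training code: the tiny `nn.Sequential` model and the random in-memory `loader` are hypothetical stand-ins for the Enformer model and the Basenji data pipeline.

```python
import torch
from torch import nn

torch.manual_seed(0)

# Hypothetical stand-ins: any nn.Module and any iterable of
# (sequence, target) micro-batches will do for this sketch.
model = nn.Sequential(nn.Linear(8, 16), nn.ReLU(), nn.Linear(16, 1))
optimizer = torch.optim.Adam(model.parameters(), lr=5e-4)

accum_steps = 16  # 2 sequences per micro-batch * 16 steps = effective batch of 32
loader = [(torch.randn(2, 8), torch.rand(2, 1)) for _ in range(32)]

optimizer.zero_grad()
for step, (seq, target) in enumerate(loader):
    pred = model(seq)
    # Enformer trains with a Poisson loss; log_input=True treats the
    # model output as a log-rate.
    loss = nn.functional.poisson_nll_loss(pred, target, log_input=True)
    # Scale so accumulated gradients average over the effective batch.
    (loss / accum_steps).backward()
    if (step + 1) % accum_steps == 0:
        optimizer.step()
        optimizer.zero_grad()
```

Note that gradient accumulation does not fully substitute for a large per-device batch here, because BatchNorm statistics are still computed per micro-batch.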
@fransilvionGenomica ahh, I have not checked out Borzoi yet, although someone else told me it is the successor to Enformer. Why are you still using this repository if Borzoi is the new SOTA? Without reading the paper: did Borzoi set a new SOTA?
@fransilvionGenomica where do you work btw?
Oh I see. That makes sense. Even Borzoi mentioned it took them ~25 days on 2 GPUs, and I am training on a single GPU. I guess I will just have to wait then. Thanks!
@fransilvionGenomica it is strange that they waited that long. I thought Calico had Google-level resources.
@fransilvionGenomica I'll revisit genomics maybe at the end of the month and read the Borzoi paper in detail. I'm knee-deep in other projects at the moment.
Ahh ok, I was told that Borzoi is nothing more than Enformer applied to RNA-seq data. OK then, using this repository is fine in that case.
Yes, architecture-wise they are very similar. Borzoi is actually less complex.
@fransilvionGenomica ok, I'll just copy/paste the existing code and remove that complexity for Borzoi later this month after I read the paper. Hopefully they got rid of the annoying gamma positions.
Just curious, have you noticed anything about the batch size while training Enformer from scratch? Does it have to be relatively big (at least 32), or can you train decently even if the batch size is 1 or 2?
@fransilvionGenomica it has to be big (32 or 64). Managing the data and the long sequences was also a huge pain.
@fransilvionGenomica the code in this repository isn't even set up for distributed training. I didn't set up synchronized batchnorm, which is required for it to train well.
@fransilvionGenomica actually, let me just throw that in there for now.
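For reference, converting a model's BatchNorm layers to synchronized batchnorm is a one-liner in PyTorch. The `nn.Sequential` below is a hypothetical stand-in for the Enformer conv tower; the only point is that it contains BatchNorm layers for the conversion to replace.

```python
import torch
from torch import nn

# Hypothetical stand-in for the model's conv tower.
model = nn.Sequential(
    nn.Conv1d(4, 32, kernel_size=5, padding=2),
    nn.BatchNorm1d(32),
    nn.GELU(),
)

# Swap every BatchNorm layer for SyncBatchNorm so running statistics
# are computed across all DDP ranks rather than each GPU's small
# micro-batch.
model = nn.SyncBatchNorm.convert_sync_batchnorm(model)
```

The converted model is then wrapped in `DistributedDataParallel` as usual; SyncBatchNorm only actually synchronizes when a process group has been initialized, and it falls back to ordinary batchnorm behavior in single-process runs.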
Have you tried to run your Enformer implementation with PyTorch Lightning?
@fransilvionGenomica no I haven't. As I said above, my training was done in TensorFlow Sonnet with TPUs, since I had access to a large cluster of TPUs in collaboration with EleutherAI back then.
@fransilvionGenomica if you ever wire up a working training script, a pull request is always welcome, in the spirit of open source science.
As for what the paper says: let's wait until reviewers ask this question =)
@lucidrains do you have the training/validation loss curves left by any chance? For your TensorFlow training code, I mean.
@fransilvionGenomica hey, yes, I actually still have it lying around (thanks wandb): https://api.wandb.ai/links/lucidrains/9ac4x106
Hello, may I ask how to fix this? My training time is several times higher when I train with DDP than with a single GPU (with the same
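One common cause of DDP being much slower than single-GPU training is enabling `find_unused_parameters=True`, which forces an extra traversal of the autograd graph every step. The sketch below shows the basic DDP wiring with it disabled; it is a hedged single-process illustration (world_size=1, gloo backend, file-based rendezvous so it runs without networking), and `nn.Linear` is a hypothetical stand-in for the Enformer model. A real run would launch one process per GPU via `torchrun` with the `nccl` backend.

```python
import os
import tempfile
import torch
import torch.distributed as dist
from torch import nn
from torch.nn.parallel import DistributedDataParallel as DDP

# File-based rendezvous: works offline and in a single process.
store_path = os.path.join(tempfile.mkdtemp(), "ddp_store")
dist.init_process_group(
    backend="gloo",  # use "nccl" when training on GPUs
    init_method=f"file://{store_path}",
    rank=0,
    world_size=1,
)

model = nn.Linear(8, 1)  # hypothetical stand-in for the real model

# find_unused_parameters=True adds per-step overhead and is a frequent
# cause of unexpectedly slow DDP; keep it False unless the model truly
# has parameters that receive no gradient.
ddp_model = DDP(model, find_unused_parameters=False)

out = ddp_model(torch.randn(2, 8))
dist.destroy_process_group()
```

Other things worth checking are that each process is pinned to its own GPU (`torch.cuda.set_device`) and that the dataloader uses a `DistributedSampler`, so ranks aren't all reading the full dataset.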
Hello. Can you share the details of the model training? Did you train it yourself? Did you collect the training data from the Basenji dataset files? I am unable to reproduce the claimed results during training.