
computational cost of training per Epoch #23

Open
deep-matter opened this issue Dec 13, 2022 · 6 comments

@deep-matter

I would like to ask how long training takes per epoch. I used your model and modified the PPEG module by adding an FFT to reduce the dimensionality of the convolution operation. The only issue I noticed is that the Trainer takes a long time to finish a single epoch. Is that related to the shape of the image (2154, 1024), or did I miss something?
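For reference, a minimal sketch of the kind of FFT-based token mixing the comment above may be describing, in the style of GFNet, applied to tokens laid out on a 2D grid as PPEG does. The module name, shapes, and placement are assumptions for illustration, not the repo's code or the commenter's actual change.

```python
import torch
import torch.nn as nn

class FFTTokenMixer(nn.Module):
    """Hypothetical FFT-based token mixing (GFNet-style) that could stand in
    for the spatial convolutions in a PPEG-like block. Illustrative only."""
    def __init__(self, dim, H, W):
        super().__init__()
        # learnable complex filter in the frequency domain, stored as (real, imag)
        self.filter = nn.Parameter(torch.randn(dim, H, W // 2 + 1, 2) * 0.02)

    def forward(self, x, H, W):
        # x: (B, N, C) tokens, assumed to fill an H x W grid (N == H * W)
        B, N, C = x.shape
        grid = x.transpose(1, 2).reshape(B, C, H, W)
        freq = torch.fft.rfft2(grid, norm="ortho")            # (B, C, H, W//2+1)
        freq = freq * torch.view_as_complex(self.filter)      # global mixing, no conv
        grid = torch.fft.irfft2(freq, s=(H, W), norm="ortho")
        return grid.reshape(B, C, N).transpose(1, 2) + x      # residual connection
```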

@hans0809

I trained on my own dataset (200 train + 30 val). Each slide was cut into 500~1500 tiles and then embedded into 2048-dim vectors. It takes about 1 min per epoch. I wonder how many slides are in your dataset (train and val) and how long it takes per epoch. By the way, my results were pretty poor and my training was not stable; I don't know what might be wrong.
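For context, embedding tiles into 2048-dim vectors is commonly done with an ImageNet-pretrained ResNet-50; a minimal sketch is below. The choice of backbone and the weights argument are assumptions, not details stated in this thread.

```python
import torch
import torchvision

# Assumed setup: ImageNet-pretrained ResNet-50 as the tile encoder.
backbone = torchvision.models.resnet50(weights="IMAGENET1K_V1")
backbone.fc = torch.nn.Identity()   # drop the classifier head; output is 2048-dim
backbone.eval()

@torch.no_grad()
def embed_tiles(tiles):
    # tiles: (num_tiles, 3, 224, 224) normalized image batch
    return backbone(tiles)          # -> (num_tiles, 2048)
```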

@szc19990412
Owner

> I would like to ask how long training takes per epoch. I used your model and modified the PPEG module by adding an FFT to reduce the dimensionality of the convolution operation. The only issue I noticed is that the Trainer takes a long time to finish a single epoch. Is that related to the shape of the image (2154, 1024), or did I miss something?

Because we set the batch size to one, the number of training steps in one epoch equals the number of training slides. Meanwhile, because we preprocess all the WSIs into features, training is very fast in our tests on an RTX 3090: roughly 0.5 min per epoch with 400 slides.
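To illustrate why steps per epoch equals the number of slides, here is a minimal sketch of a bag-per-slide dataset loaded with batch_size=1 (bags differ in tile count, so they cannot be stacked into larger batches). The class name and the .pt file layout are assumptions, not the repo's exact code.

```python
import torch
from torch.utils.data import Dataset, DataLoader

class FeatureBagDataset(Dataset):
    """Illustrative dataset: one precomputed feature bag (one slide) per item."""
    def __init__(self, feature_paths, labels):
        self.feature_paths = feature_paths
        self.labels = labels

    def __len__(self):
        return len(self.feature_paths)

    def __getitem__(self, idx):
        feats = torch.load(self.feature_paths[idx])  # (num_tiles, feat_dim), num_tiles varies per slide
        return feats, self.labels[idx]

# batch_size=1: one training step processes one slide, so
# steps per epoch == number of training slides.
# loader = DataLoader(FeatureBagDataset(paths, labels), batch_size=1, shuffle=True)
```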

@szc19990412
Owner

> I trained on my own dataset (200 train + 30 val). Each slide was cut into 500~1500 tiles and then embedded into 2048-dim vectors. It takes about 1 min per epoch. I wonder how many slides are in your dataset (train and val) and how long it takes per epoch. By the way, my results were pretty poor and my training was not stable; I don't know what might be wrong.

1. Each slide has 500~1500 tiles: did you process the WSIs at 20x magnification or higher?
2. You can also test the performance of other MIL methods on your dataset, such as ABMIL or CLAM, if the task is challenging or the dataset is limited (a minimal ABMIL-style sketch follows below).
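For reference, here is a minimal sketch of the ABMIL-style attention pooling mentioned in point 2 (Ilse et al., 2018). Dimensions and names are illustrative, not taken from the TransMIL repo.

```python
import torch
import torch.nn as nn

class AttentionMILPooling(nn.Module):
    """Minimal ABMIL-style attention pooling over a bag of tile features."""
    def __init__(self, in_dim=1024, hidden_dim=256, n_classes=2):
        super().__init__()
        self.attention = nn.Sequential(
            nn.Linear(in_dim, hidden_dim),
            nn.Tanh(),
            nn.Linear(hidden_dim, 1),
        )
        self.classifier = nn.Linear(in_dim, n_classes)

    def forward(self, bag):                              # bag: (num_tiles, in_dim)
        a = torch.softmax(self.attention(bag), dim=0)    # (num_tiles, 1) attention weights
        slide_feat = (a * bag).sum(dim=0)                # attention-weighted slide embedding
        return self.classifier(slide_feat)               # slide-level logits
```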

@hans0809

hans0809 commented Dec 15, 2022

For each slide, I downsampled by 4x and then cut it into 224x224 patches.
I tried DTFD-MIL ("DTFD-MIL: Double-Tier Feature Distillation Multiple Instance Learning for Histopathology Whole Slide Image Classification") and got better performance.
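A minimal sketch of the downsample-then-tile step described above, assuming the slide has already been exported to a regular image readable by PIL (real WSIs are usually read with a library such as OpenSlide); names and parameters are illustrative.

```python
from PIL import Image

def tile_slide(path, patch=224, downsample=4):
    """Downsample an exported slide image by `downsample`, then cut it into
    non-overlapping patch x patch tiles. Illustrative preprocessing only."""
    img = Image.open(path)
    img = img.resize((img.width // downsample, img.height // downsample))
    patches = []
    for top in range(0, img.height - patch + 1, patch):
        for left in range(0, img.width - patch + 1, patch):
            patches.append(img.crop((left, top, left + patch, top + patch)))
    return patches
```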

I printed softmax(pred_logits) for both TransMIL and DTFD-MIL; the latter is more discriminative:
[image: GT label]
[image: TransMIL predictions]
[image: DTFD-MIL predictions]

Maybe there is something wrong with my implementation...

@szc19990412
Owner

This result seems strange, as it appears to show that the model is overfitting. Because DTFD is built on a smaller model, ABMIL, you might experiment with Transformer aggregation at a lower feature dimension, for example reducing from 2048 to 128 or 256. Furthermore, have you used our PyTorch Lightning framework and the Ranger optimizer?
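A minimal sketch of the dimension-reduction suggestion above: project the 2048-dim tile features to a smaller width before the Transformer aggregation. The module name and exact placement are illustrative, not the repo's code.

```python
import torch.nn as nn

# Assumed placement: applied to each bag of tile features before the MIL aggregator.
dim_reduce = nn.Sequential(
    nn.Linear(2048, 256),  # 2048-dim tile features -> 256-dim
    nn.ReLU(),
)
# feats: (num_tiles, 2048) -> dim_reduce(feats): (num_tiles, 256)
```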

@JD910

JD910 commented Mar 23, 2023

Nice work.

Any recommendations for preprocessing the WSIs into features (since the quality of the features may directly influence the classification)?
