Slow #2

Open
scseeman opened this issue Jun 21, 2023 · 2 comments

Comments

@scseeman

I have been having a lot of trouble with geneBasis being really slow when searching for hundreds of genes (~300). It seems to get progressively slower and can take upwards of a week to finish. Do you have any suggestions for speeding it up? Thanks

@nhuhoa

nhuhoa commented Jun 30, 2023

It took me ~40 minutes to select 140 genes on a computer with 8 cores and 32 GB of RAM, which is not bad. I am not sure how the execution time changes with a larger number of genes, e.g. the 300 you mentioned.
While waiting for the developer, maybe some filtering would be useful?
For example:
For genes:

  • Use 3000 to 5000 highly variable genes as the input genes in the sce file?
  • Use the retain_informative_genes() function from this tool

For cells:

  • Downsample cells within each group to reduce complexity and to make sure the algorithm works first. With a large number of cells, UMAP estimation takes time...
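The per-group downsampling idea can be sketched as follows. This is a minimal illustration in plain Python, not geneBasis code (geneBasis is an R package); the function name `downsample_per_group`, the cap of 200 cells, and the toy labels are all assumptions made for the example. The returned indices would then be used to subset the cells before running the gene search.

```python
import random
from collections import defaultdict


def downsample_per_group(labels, max_per_group, seed=0):
    """Return sorted cell indices, keeping at most max_per_group per label.

    Hypothetical helper for illustration only; not part of geneBasis.
    """
    rng = random.Random(seed)  # fixed seed so the subset is reproducible

    # Collect the indices of the cells belonging to each group.
    by_group = defaultdict(list)
    for idx, label in enumerate(labels):
        by_group[label].append(idx)

    keep = []
    for idxs in by_group.values():
        # Small groups are kept whole; large groups are randomly subsampled.
        if len(idxs) > max_per_group:
            idxs = rng.sample(idxs, max_per_group)
        keep.extend(idxs)
    return sorted(keep)


# Toy data (made up): 1000 cells of type "A" and 50 cells of type "B".
labels = ["A"] * 1000 + ["B"] * 50
keep = downsample_per_group(labels, max_per_group=200)
print(len(keep))  # 250: 200 sampled from "A" plus all 50 from "B"
```

Capping each group rather than sampling the whole dataset uniformly keeps rare cell types fully represented while shrinking the abundant ones.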

@scseeman
Author

Thanks,

It definitely seems to be connected to the size of the panel you are trying to make, though I still can't match your speeds on our high-performance computing cluster. I typically run it with 10 cores and 500 GB of memory, and even 100 genes still takes several hours. When I asked for 300 genes and supplied the first 200 in genes_base, the job timed out after 24 hours, returning only 20 genes. That's a rate of less than 1 gene/hour. My sce file has 3000 genes but a lot of cells. I will try reducing the number of cells to speed things up.
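As a rough check on the figures quoted in this thread, the two throughputs can be compared directly. The snippet below is plain Python arithmetic, not geneBasis code, and `genes_per_hour` is a name made up for the example. The two runs used different datasets and hardware, and the search reportedly slows down as the panel grows, so the ratio is only indicative.

```python
def genes_per_hour(n_genes, hours):
    """Average selection throughput over a run (illustrative helper)."""
    return n_genes / hours


# Figures taken from the comments above.
cluster_rate = genes_per_hour(20, 24)        # 20 genes before the 24 h timeout
desktop_rate = genes_per_hour(140, 40 / 60)  # 140 genes in ~40 minutes

print(round(cluster_rate, 2))  # 0.83
print(round(desktop_rate))     # 210
```

A gap of more than two orders of magnitude is consistent with the number of cells being the main cost.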
