[Internship Project]: Simran Tandon #716

Closed
5 tasks
GemmaTuron opened this issue Jun 21, 2023 · 54 comments

@GemmaTuron
Member

Summary

Hello,

This is a public issue for a virtual daily stand-up. We will use this to briefly share the tasks of the day and the challenges and advances made, so that we can ensure smooth support from the Ersilia mentors and alignment between daily tasks and overall internship goals.

Scope

Initiative 🐋

Objective(s)

Internship goals:

  • Learn how to work in an Open Source Community
  • Improve the infrastructure of the Ersilia Model Hub
  • Improve the usability and platform compatibility of Ersilia Models, mainly via Docker Images
  • Learn about AI/ML models for drug discovery
  • Identify new relevant AI/ML models and datasets and add them to this discussion
  • Incorporate new AI/ML models in the Hub
  • Improve the documentation of the Ersilia Model Hub for users
  • Improve the documentation of the Ersilia Model Hub for developers

Team

| Role & Responsibility | Username(s) |
| --- | --- |
| Intern | @simrantan |
| Mentor | @miquelduranfrigola |
| Coordinator | @GemmaTuron |

Timeline

Before starting your work, line up a few tasks with a short description. This should not take long. For example, it could be something like:
Wednesday 21st June

  • Create Intern daily task tracker
  • Update the Ersilia GitHub Project
  • Make sure GitHub Actions start upon success of the other workflows
  • Work on the documentation for model testing
  • Attend the Drug Discovery intro by WCAIR

Documentation

No response

@GemmaTuron
Member Author

In this example, following the tasks of Wednesday 21st June:

  • Create Intern daily task tracker
  • Update the Ersilia GitHub Project
  • Make sure GitHub Actions start upon success of the other workflows
  • Work on the documentation for model testing
  • Attend the Drug Discovery intro by WCAIR

I have created all the templates for the interns and spent a couple of hours revising the GitHub Project and updating the tasks. I have been working on identifying the bug in the GitHub Actions, solved in this issue.
I have set a meeting with @miquelduranfrigola to discuss in detail his comments on the Model testing discussion, but I haven't been able to start writing yet; I aim to do so by the end of the week.
Looking forward to the WCAIR next lesson!

Pro tip: adding the links to the issues and discussions you mention will be very helpful!

@simrantan

Wrote this in the wrong place!
Goals/Reflection 6/22
(writing towards the mid/end of workday since I want to reflect on what I actually achieve day to day so I can set an accurate number of tasks for upcoming days)
-The model is currently failing. I have been analyzing the log and found that it is not producing an output due to an "unable to open x server" error, and there are some "commands not found" errors in predict.py
-reading through the code to identify error lines, attempting fixes and testing solutions
-ran into trouble trying to test on docker, says permission denied (in docker desktop, the image does not exist when searching for it though actions says this model has been successfully uploaded)

@simrantan

simrantan commented Jun 24, 2023

6/23

  • Tested eos92sw on Docker, CLI and Colab
  • Download and debug eos2hbd
  • run --repo_path on eos2hbd
  • Test eos2v11 on Docker, CLI and Colab

I completed testing of eos92sw on all three platforms. I also worked on fixing an issue with Docker for a while (I managed to get the model to run in Docker itself, but I am still having issues with the WSL-Docker connection). I dedicated a portion of time to examining the issues in eos2hbd further, looking at the log on Actions and in the CLI after running --repo_path on a local version to see where it starts to fail, and found a possible specific line causing the issue. I then tried to find where to edit this line but could not see where it was in the model's workflow or code files.

I tried testing eos2v11 on CLI, Colab, and docker but on CLI and Colab ran into fetching issues.

@GemmaTuron
Member Author

GemmaTuron commented Jun 25, 2023

Hi @simrantan !

Thanks for the summary, I've added a few pointers to help with the issues on eos2hbd

@simrantan

simrantan commented Jun 27, 2023

Reflection 6/26

  • Test eos22io on Docker, CLI, Colab
  • Re-test eos2v11 on Docker, CLI, Colab
  • Debug eos2hbd
  • Refactor eos2hbd

I tested eos22io on Docker, CLI, and Colab, finding no issues when I ran the model. I also finally solved the eos2hbd error and was able to fetch the model and test it. I refactored according to instructions and plan to test the changed version locally tomorrow and submit a PR if it works.

@simrantan

simrantan commented Jun 27, 2023

Reflection 6/27

  • Test eos24jm on Docker, CLI, Colab
  • Finish refactoring eos2hbd
  • Test eos2hbd locally
  • submit PR for eos2hbd

I was able to test eos24jm with no issues on all platforms. I also finished refactoring eos2hbd. The first refactor I did yesterday wasn't working when I tested locally, but after some bug hunting I found that I had created a syntax error when editing service.py that was preventing the model from being fetched successfully. I fixed the error, re-cloned and tested the model, and it worked on the inputs I tried!
Besides testing eos2v11 when it is ready again, I am open for more new tasks!

@GemmaTuron
Member Author

I've added a couple of models to work on and some model testing!

@simrantan

simrantan commented Jun 28, 2023

  • Test eos46ev
  • review suggested models to work on
  • begin working on one of the suggested models

Tested eos46ev on CLI, Docker, Colab - ran into an issue with CLI when using eml_canonical but it ran fine with a single output.

I tried to open an issue from one of the draft suggestions I saw in the tasks by subprojects chart, but I think I messed up by setting the repo to ersilia instead of the model's repo - I am working on fixing it, so sorry about that! Also, for future reference - if issues are drafts, should I wait for them to be opened by you or Miquel, or should I open them myself (properly, not like what I did this time haha)?
UPDATE: I think I may be unable to fix my error with my current permissions (again, my apologies for this! The issue is this one: #727). Let me know what I can do to resolve this.

@GemmaTuron
Member Author

Hi @simrantan

Apologies, I should have converted the draft to an issue, it is fixed now.

@simrantan

simrantan commented Jun 29, 2023

  • Refactor eos3zur
  • Test eos4zfy
  • Test eos6aun
  • Investigate error in workflow for eos1amn

I refactored eos3zur (not much needed, checked through everything against the guidelines and made what changes were needed). Tested it locally with varied inputs and submitted a PR! I also tested eos4zfy on Docker, CLI, and Colab. Then, I tested eos6aun on Docker, CLI, and Colab successfully as well. I have started looking into the error in eos1amn but haven't found any way to fix it yet.

@GemmaTuron
Member Author

@simrantan apologies for the eos1amn
I've assigned you new models, and I am also requesting your assistance to work with Riley and Febie since you have an Ubuntu installation

@simrantan

simrantan commented Jun 30, 2023

Awesome! Happy to help.
6/30 Goals/Reflection

  • Help Riley (details: try fetching eos96ia on my device)
  • Help Febie (Miquel helped Riley who told Febie how to use dgl on Macs!)
  • Refactor eos4se9
  • Investigate why Dockerfile is taking so long to upload for eos2rd8/wait to see if it will upload
  • Refactor eos2rd8

I ended up discovering an issue with my laptop today when working on eos4se9 (a storage overflow that had overwritten other data and caused a lot of applications to freak out and shut down) and spent most of the day doing damage control (having to delete and reinstall many affected applications). I lost all the progress I made on eos4se9 as one of the side effects, but I have been working on restoring the changes I made. I will make sure to finish refactoring and submit a PR for both of these models next working day. I apologize for the unexpected delay!

@simrantan

simrantan commented Jul 3, 2023

  • Help Febie test eos1af5 on Windows
  • Refactor eos4se9
  • test eos4se9 locally
  • Test eos6pbf on CLI, Colab, and Docker
  • Test eos3cf4 on CLI, Colab, and Docker
  • Refactor eos2rd8*
  • Test eos2rd8 locally*
    *if time permits

I worked with Febie to test eos1af5 on Windows and ended up finding a new possible source of issues in the model (some package or version issues?) through some attempts at fetching and running the model locally. I also refactored eos4se9 and tested it locally, finding a bug that I worked on resolving. It now works! I submitted a PR for this model. I refactored eos2rd8, and will test locally tomorrow. I also tested eos3cf4 on CLI and Colab, finding the same error as Hellen (module error) on both platforms. I also tested eos6pbf on CLI and Colab, and will test on Docker tomorrow.

@GemmaTuron
Member Author

Hi @simrantan
Thanks for the detailed explanations, I have left some comments on model eos1af5. @emmakodes will have a look at eos3cf4. I don't see the PR for eos4se9. Once all this and model testing is complete I'll assign you new models!

@simrantan

simrantan commented Jul 4, 2023

  • Lab Meeting (8-9)
  • Internship Meeting (10-10:30)
  • Test eos6pbf on Docker
  • Test eos2rd8 locally
  • Submit PR for eos2rd8
  • Check how long eos4se9 takes on 20 smiles
  • Submit PR for eos4se9
  • Test eos3zur locally
  • Submit PR for eos3zur

I have been working on testing the models I have refactored locally. I spent some time testing eos4se9, trying to implement print statements in the code to see if it is processing the inputs or if it is stalling for some reason. I am working on implementing the print statements, though I am having a little bit of trouble getting the code to print to the terminal. I also tested eos2rd8 locally, found that it worked, and submitted a PR. I also tested eos3zur locally and submitted a PR for that as well. Docker is taking a long time to pull the image, but hopefully I will be able to run eos6pbf soon.
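
One possible reason print output never reaches the terminal is that the model code runs in a child process whose standard output is captured. That is an assumption, not something confirmed in this thread. Under that assumption, a minimal workaround is to append debug messages to a log file and flush them immediately. The helper name `debug_log` and the log path below are hypothetical.

```python
import os
import tempfile
import time

# Hypothetical log location; any writable path works.
LOG_PATH = os.path.join(tempfile.gettempdir(), "model_debug.log")


def debug_log(message, path=LOG_PATH):
    """Append a timestamped message to a log file and flush it right away."""
    stamp = time.strftime("%H:%M:%S")
    with open(path, "a", encoding="utf-8") as handle:
        handle.write(f"[{stamp}] {message}\n")
        handle.flush()


if __name__ == "__main__":
    debug_log("read input SMILES")
    debug_log("finished featurization")
    print(f"debug messages written to {LOG_PATH}")
```

Watching the file from a second terminal while the model runs shows whether the inputs are being processed or the run is stalling.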

@GemmaTuron
Member Author

Hi @simrantan
I've merged the PRs and given some tips for model eos4se9.
I've assigned a new model as well in case you finish and want to move on

@simrantan

simrantan commented Jul 5, 2023

  • Lab Meeting
  • Clean up Conda environments and models
  • Continue testing eos4se9
  • Submit PR for eos4se9
  • Investigate how to upload eos2rd8 to dockerhub (no space error)
  • test eos6pbf on Docker
  • Clean and Refactor eos8h6g
  • Test eos8h6g locally

I discovered that my Docker Desktop app had run into a WSL issue when installing the update, so I followed instructions to fix it and was able to test eos6pbf on Docker successfully. I also refactored and tested eos8h6g locally, and submitted a PR for it. Finally, I spent some time working to make sure eos4se9 was working on eml_canonical, confirmed that the model works, and submitted a PR.

@GemmaTuron
Member Author

Perfect,

I have assigned model testing for eos4se9 and merged the PR on eos8h6g.
Please see if there is something we could do on eos2rd8; @samuelmaina did a good job on eos7pw8 by caching some files and saving memory.
I've assigned you a new model as well in case you finish with the rest of the tasks

@simrantan

simrantan commented Jul 7, 2023

Thank you for the tip!

  • Study eos7pw8
  • Implement caching for some files in eos2rd8*
  • Test eos7jio
  • Refactor eos5axz
  • Test eos5axz
  • Re-test eos3cf4

Today, I spent a good chunk of time looking at eos7pw8 back in time and the current version with Samuel's additions to see how he implemented the caching in the model. I also looked at some online resources to better understand caching in python, and *began identifying files that could be cached in eos2rd8. Tomorrow, I will implement the caching and test the model to ensure it still works. Is there a way to know the size of a model before and after caching a file? Or is it manual subtraction of the file size from the overall size?
I also tested eos7jio on CLI, Colab and Docker. I tested eos5axz locally to make sure it works, and once I confirmed it did, I refactored the model and will test again tomorrow to ensure it is still functional. I also retested eos3cf4. It is still failing to upload to Colab.
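
On the size question, one simple way to compare before and after is to measure the folder on disk at both points. The sketch below is only that: it sums file sizes under a directory and says nothing about the size of a built Docker image, which Docker reports separately (for example via `docker image ls`). The function names are hypothetical.

```python
import os


def dir_size_bytes(root):
    """Return the total size in bytes of all regular files under root."""
    total = 0
    for folder, _subdirs, files in os.walk(root):
        for name in files:
            path = os.path.join(folder, name)
            # Skip symlinks so linked files are not counted twice.
            if not os.path.islink(path):
                total += os.path.getsize(path)
    return total


def human_readable(num_bytes):
    """Format a byte count using binary units."""
    size = float(num_bytes)
    for unit in ("B", "KiB", "MiB", "GiB"):
        if size < 1024 or unit == "GiB":
            return f"{size:.1f} {unit}"
        size /= 1024


if __name__ == "__main__":
    import sys

    target = sys.argv[1] if len(sys.argv) > 1 else "."
    print(human_readable(dir_size_bytes(target)))
```

Running it on the model folder before and after a change gives the difference without manual subtraction of individual file sizes.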

@GemmaTuron
Member Author

Thanks for the explanations @simrantan !
Let's focus on the two open models, eos2rd8 and eos5axz. I'd suggest using the templates created by @emmakodes to test if the caching reduces total size!

@simrantan

simrantan commented Jul 7, 2023

  • Test eos5axz locally
  • Submit PR for eos5axz
  • Implement caching for eos2rd8
  • Test eos2rd8

Committed my refactored version and submitted PR for eos5axz. Today, I worked on adding caching to the dockerhub workflow file for eos2rd8. I knew what parts I wanted to cache, and once I figured out more about the process of caching through the github documentation I was able to start writing the code, then working on the errors that arose with syntax and key mapping. I had a few questions about proceeding. Hopefully it works! Otherwise I will see what other files are able to be cached to save space.

@miquelduranfrigola
Member

Thanks for the updates, @simrantan!

@simrantan

simrantan commented Jul 11, 2023

  • Investigate model test error in eos2rd8
  • Test eos3ev6 on CLI, Colab, Docker
  • Test eos3ae6 on CLI, Colab, Docker
  • Try fetching eos78ao on CLI to help Riley

I tested eos3ev6 and eos3ae6 on all three platforms, and they both ran successfully on eml_canonical. I also helped Riley investigate an issue by fetching a model on my laptop to see if the error he was receiving was a Mac-based one. We found that it persists on my OS, so it is in the model itself. I also thoroughly looked into a new model test error that occurred in Actions when eos2rd8 was merged - partway through looking, I saw Riley's message about the same error and communicated with him about it, finding that I was looking in the wrong place for the error source. I then examined the error in more detail, did some research online, and wrote up my findings in the issue.

@simrantan

simrantan commented Jul 12, 2023

  • Ersilia Lab Meeting
  • Internship Meeting
  • Set up meeting with @miquelduranfrigola to work on eos2rd8
  • Add a main.py to eos5axz
  • move functionality from service.py to main and convert service to eos-template format in eos5axz
  • test eos5axz locally
  • refactor eos8fth
  • test eos8fth locally
  • submit PR for eos8fth
  • test eos76wn on CLI, Colab, and Docker

I began working on eos8fth but found a fetch error that led me to discover the model might not be ready, and I wanted to check on that before refactoring further. I also worked on refactoring eos5axz, which involved adding a main.py, moving functionality from service to main, and re-doing service.py. I spent a while making sure I was getting the format right and not messing up the functionality, but I am getting an error, described here, that I could use advice on. I also tested eos76wn on CLI, Colab, and Docker and found it worked on CLI and Docker but had a fetch error in Colab.

@GemmaTuron
Member Author

Hi @simrantan !

A few updates:
eos8fth is indeed not completed, but it would be great if you could take it up and try to incorporate it from scratch; it is an anti-SARS-CoV-2 model.
Let's wait for @miquelduranfrigola's feedback on the extent to which we want to refactor eos5axz.
And try to debug the cache issue in the Actions of eos2rd8.
I think there is quite a lot on your plate already, so I won't assign new models for the moment!

@simrantan

simrantan commented Jul 12, 2023

  • Look into eos8fth's original issue
  • understand the current progress on eos8fth
  • find the files that need to be completed for eos8fth to run
  • Look into the cache issues of eos2rd8
  • Debug cache issues
  • make Internship Update presentation

Today, I worked on making my Internship update presentation, collecting all of the models I have worked on and finding the challenges I have faced so I can share them in the meeting tomorrow. I have also looked into the caching issue of eos2rd8, and started making progress on eos8fth by reading and understanding the source code.

@simrantan

simrantan commented Jul 13, 2023

  • Lab Meeting
  • Internship Meeting
  • Eos5axz - edit service.py to remove checkpoint tracking
  • Eos8fth - install locally and identify necessary files
  • Eos8fth - transfer files to eos repo

I have continued work on eos5axz, and I am still receiving an error about NoneType objects that I find confusing. I have also worked on eos8fth today, downloading and testing the model and checking for necessary files. I have some questions that I would love some guidance about before I move forward with incorporation here.

@GemmaTuron
Member Author

Thanks for the update @simrantan

Let's focus on completing eos8fth, I've asked you a few questions about it in the issue

@simrantan

simrantan commented Jul 15, 2023

  • Test eos238c on CLI, Colab, and Docker
  • Collect information for eos8fth
  • Create fork of eos8fth and add files from redial-2020
  • Edit paths in code to reflect new folders
  • sketch out code for input and output adapters from original source code format to ersilia format for eos8fth

I worked on eos8fth today, collecting information about the two different run_scripts and starting to incorporate the model by moving over the necessary files (including the model pkls). I have also changed the paths from the original source code to reflect the new folders where the files now live. I am working on an input adapter, and sketched out some code for what an output adapter could look like. I also tested eos238c on CLI, Colab, and Docker.
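
As a rough illustration of what such adapters can look like, here is a minimal sketch. It assumes the input is a CSV file with a header row and SMILES in the first column, and that the original model returns a mapping from SMILES to named values. Both of those shapes, and all function names, are assumptions made for illustration rather than the actual eos8fth code.

```python
import csv


def read_smiles(input_path):
    """Input adapter: read SMILES from the first column, skipping the header."""
    with open(input_path, newline="", encoding="utf-8") as handle:
        reader = csv.reader(handle)
        next(reader, None)  # skip the header row
        return [row[0] for row in reader if row]


def adapt_output(smiles_list, predictions, columns):
    """Turn per-molecule prediction dicts into rows that follow the input order.

    `predictions` maps SMILES -> {column: value}; this shape is hypothetical.
    Molecules without a prediction get empty cells instead of being dropped,
    so the output keeps one row per input molecule.
    """
    rows = []
    for smi in smiles_list:
        result = predictions.get(smi, {})
        rows.append([result.get(col, "") for col in columns])
    return rows


def write_predictions(output_path, columns, rows):
    """Output adapter: write a header plus one row of values per molecule."""
    with open(output_path, "w", newline="", encoding="utf-8") as handle:
        writer = csv.writer(handle)
        writer.writerow(columns)
        writer.writerows(rows)
```

Keeping one output row per input molecule, in the same order, makes it easy to spot which inputs the original code silently skipped.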

@simrantan

simrantan commented Jul 18, 2023

  • Meeting with Miquel and Zakia about eos2rd8
  • Clean up github - remove codespaces, forks, etc.
  • Implement input and output adapters for eos8fth
  • Test eos1af5 on CLI, Colab, and Docker
  • Make sure eos5axz works on run, not predict and submit PR

I had a meeting with Miquel and Zakia where they started helping with getting eos2rd8 set up locally so that I could build the image on my device and upload it to work around the Actions size issues. I also tested eos1af5 on all platforms. I took some time to clean up forks of models I am no longer working on and remove codespaces I am not using. I started fresh on eos5axz, removing legacy files and making sure it works with "run", then submitting a PR. Finally, I worked on writing an input and output adapter for eos8fth.

@simrantan

simrantan commented Jul 19, 2023

  • Test eos6tg8
  • Finish output adapter for eos8fth
  • Finish main.py and run.sh for eos8fth
  • Test eos8fth
  • Continue fixing dependencies on local version of eos2rd8 on docker

I tested eos6tg8 on CLI, Colab, and Docker and it worked great on all platforms. I finished writing the output adapter for eos8fth, and also wrapped up the last steps for incorporating the model but I ran into some bugs I am working on resolving (and will continue working on tomorrow). I have finished most of the steps for eos8fth, and once I get through these last bugs I should be able to finish testing it and submit it.

@simrantan

simrantan commented Jul 20, 2023

  • Investigate eos5axz testing issue
  • Debug eos8fth
  • Helped Riley look into module import error on eos5smc

Zakia reported an issue with eos5axz, which I looked into today and reported my findings in the issues thread as I wait for more details from Zakia. I also spent most of my time debugging eos8fth. I am working through some bugs but have gotten stuck on some that I have written about. Additionally, I spent some time testing a model for Riley since there was a potential Mac issue, then we worked on looking into possible solutions together that we can test further.

@simrantan

simrantan commented Jul 21, 2023

  • Test eos1579 on CLI, Colab, Docker
  • Debug eos8fth
  • Look into workflow error for eos8451
  • Refactor eos8451
  • Lab Meeting

I tested eos1579 on CLI, Colab, and Docker. I continued debugging eos8fth, working on getting the remaining files into lfs. Zakia helped me look into the bug I was stuck on for eos8fth with the File error, and while I waited for her to try it on her CLI, I also started refactoring eos8451. I found a workflow error that I wrote up, and worked on debugging it. I fixed the issue, and I have refactored the model, but I have a question about the placement of main.py that I'd like clarification on before I consider it fully finished! Finally, I spent more time debugging eos8fth. With Zakia's help, I have fixed the missing File bug, but I am still working on lfs files and a new error that has occurred. I plan to dig into eos8fth again tomorrow and hopefully resolve this issue.

@simrantan

simrantan commented Jul 22, 2023

  • Finish Refactoring eos8451
  • Debug eos8fth
  • Clean up conda environments, forks, cloned repos

I spent most of today working on bug fixing for eos8451 and eos8fth. I managed to find a solution for my previous bug in eos8fth, and am working on testing to find the root of the error in eos8451. I unfortunately ran into a setback when I faced some laptop issues today (my computer shut down unexpectedly and would not turn back on), but managed to resolve it after calling the Microsoft help desk. After the shutdown, I was advised to clean up storage, so I decided to do that for Ersilia as well, cleaning up any unnecessary resources.

@GemmaTuron
Member Author

Hi @simrantan

Please update the eos8451 and eos8fth before I assign you new tasks. Also, could you update us on the status of building docker images in docker to bypass the git actions size limit?

@simrantan

simrantan commented Jul 25, 2023

Hi,

Sorry! I forgot to hit comment on my internship update for today. Also apologies for the lack of update on building the docker images - I have been working on creating an environment in docker but I haven't had success so I haven't had much progress to update on besides finding different missing libraries. I ran into some issues and wanted to possibly talk to Miquel and Zakia again to work through these problems, but since last week was busy I ended up focusing on eos8fth and other models in the meantime.

(update below)

  • Debug eos8fth
  • Debug eos8451

I spent some time working on eos8fth. Fixing the dependencies worked and I am able to start running the model. I've been working out some path issues as they pop up, but I think there are only a few remaining. I also had to get rid of the many print statements throughout the original code that were printing to the terminal, so I spent some time hunting those down and removing them. However, I am still having issues with lfs and tracking files on that model, which I have been working on for a long time. I am hoping to resolve it Tuesday, but may need some input from anyone who has dealt with similar issues. I am also still trying to solve the path issue for eos8451 (the model takes a very long time to fetch, so I have to wait a while to see the results of any new path I try). I found Riley is dealing with similar issues and we will tackle it together tomorrow.
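
An alternative to deleting every print statement by hand is to capture standard output around the noisy call. The sketch below uses `contextlib.redirect_stdout` from the standard library; `run_quietly` and `noisy_predict` are hypothetical names, and `noisy_predict` only stands in for the third-party code. Note that this only catches Python-level writes to `sys.stdout`, not output from subprocesses or compiled extensions.

```python
import contextlib
import io


def run_quietly(func, *args, **kwargs):
    """Call func while capturing anything it prints to standard output.

    Returns a (result, captured_text) pair so the messages are still
    available for debugging instead of being lost.
    """
    buffer = io.StringIO()
    with contextlib.redirect_stdout(buffer):
        result = func(*args, **kwargs)
    return result, buffer.getvalue()


def noisy_predict(x):
    # Stand-in for third-party code that prints progress messages.
    print("loading model...")
    print("predicting...")
    return x * 2


if __name__ == "__main__":
    value, captured = run_quietly(noisy_predict, 21)
    print(value)
```

This keeps the original source files untouched, which can make it easier to pull later upstream changes.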

@simrantan

simrantan commented Jul 26, 2023

  • Finish debugging eos8fth
  • Debug eos8451
  • Lab Meeting
  • Internship Meeting
  • Begin writeup of Docker Local Image installation

I have made a lot of progress on eos8fth and will be able to do final tests and wrap it up tonight or tomorrow depending on how long it takes to review file sizes for the lfs issue which I am still working on. I have also been working on debugging eos8451, and I have been using Febie's model eos85a3 as a guide since it is also a grover model. I am currently testing it, it just takes some time to fetch and will update with the success when it completes. I set up a meeting time with Miquel and Zakia to work on the local docker installation and started recording the process of commands I have been using to try and set up the environment (though it is not working, I am writing it down so that I can either refer back to these commands as a debugging tool or to provide alternative options)

@simrantan

simrantan commented Jul 27, 2023

@GemmaTuron @miquelduranfrigola

  • Test fetch eos8fth
  • debug eos8fth
  • Test fetch eos8451
  • debug eos8451

As we discussed in the meeting, I am working on wrapping the models I have been working on and will be done with them soon! I would love to take on new models/testing soon so I can continue contributing.

(pre-meeting update) I have almost finished eos8fth, but there is a dependency issue that I am unsure how to tackle since it deals with the version of docker ersilia uses and I'm unsure what should or shouldn't be changed for it to work. I also have fixed my prior issue with eos8451

@simrantan

simrantan commented Jul 28, 2023

  • Fix eos8fth dependency issue
  • Fix eos8451 fetch issue
  • Meet with Miquel about eos2rd8

I have been working on both of these issues today and reached points where I would love some outside input if available, since I am dealing with things outside of the scope of the models themselves and do not want to mess up the Ersilia Model Hub in any way. Eos8451 is dealing with an input issue, and eos8fth is having an empty output error (that is likely a package issue, since this module produces output just fine in a separate conda environment using bash).

@simrantan

simrantan commented Aug 1, 2023

  • Re-start eos8451
  • Test eos8451
  • Submit PR for eos8451
  • Fix eos8fth
  • Finish draft of manual upload to dockerhub documentation
  • Help Riley fetch and run eos5smc

Last Friday, I met with Miquel and discussed the DockerHub issue. We went through the process and found that the upload itself can only be completed by specific members of the Ersilia organization due to the permissions needed. I wrote out the steps we took to upload the file to DockerHub and simplified them into a general process that can be followed when encountering models that are too large for GitHub Actions to upload. I completed the draft of the writeup, and plan to add pictures and more details tomorrow. I also fixed eos8451 and submitted a PR. In between, I helped Riley fetch and test eos5smc. Then, I worked on fixing eos8fth.

@GemmaTuron
Member Author

thanks for the update @simrantan !
Please share the documentation when ready on the DockerHub issue, and I'll upload the models there

@miquelduranfrigola
Member

A comment on eos2rd8: This model is big and we need to build it locally. The script to build Docker images locally does not work at the moment, and it does not provide all the functionality we have in the GitHub Action (https://github.com/ersilia-os/ersilia/blob/master/.github/scripts/build_model_container_and_update_to_dockerhub.py). I am opening a bug report: #760

@simrantan

simrantan commented Aug 2, 2023

  • Lab Meeting
  • Internship Meeting
  • Make Presentation for Wednesday Meeting
  • Look for correct rdkit version for eos8fth
  • Test eos8a5g on platforms

Worked on making the presentation for Wednesday's meeting. Tested eos8a5g on all platforms. Messaged Zakia and Hellen about rdkit, and tried looking for versions online. Have been testing different rdkit versions throughout the day (the model takes a while to fetch, so I have to wait some time after trying each version).

@simrantan

simrantan commented Aug 3, 2023

8/2 and 8/3

  • Lab Meeting
  • Internship Meeting
  • Test eos6o0z
  • Test eos5smc
  • Test eos96ia
  • Test eos7jlv
  • Fix eos8451 issue

Tested all the models I was assigned and addressed an issue in one of the models I was testing. I also communicated with Zakia on Slack about the eos8fth rdkit issue.

@simrantan

simrantan commented Aug 7, 2023

8/4-8/6

  • Clean and Dockerize eos1vms
  • Find source of workflow issue in eos6afz
  • Clean and Dockerize eos6afz
  • Check the length of the descriptor list in different rdkit versions to see which has the same number of descriptors as eos8fth requires

I cleaned and refactored the assigned models (and also fixed the workflow error in eos6afz, which turned out to be an rdkit version issue). I have also been downloading different versions of rdkit and checking the number of descriptors. I have checked 5 versions but haven't found an exact match yet and am planning to test more tomorrow so I can get on to the next steps of checking if the descriptors are a match.

@GemmaTuron
Member Author

Hi @simrantan

Is model eos8fth ready? Please remember to move the tasks to In progress when you pick up new models.
Regarding the rdkit descriptors, the safest is to:

  • Find out the exact list of descriptors used (in the original model) - there are names for them
  • Establish those in the newer version you are using, so that if more descriptors are calculated only those that were originally used will be taken into account.

You can get inspiration from this piece of code under the section #RDKIT Descriptors. Here, we are selecting 200 specific descriptors
https://github.com/ersilia-os/lazy-qsar/blob/main/lazyqsar/descriptors/descriptors.py
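
The two steps above can be sketched in a library-agnostic way. This is not the code from the linked file; it is a minimal illustration that assumes the available descriptors come as a list of `(name, function)` pairs, and `make_featurizer` is a hypothetical name. To my knowledge, RDKit exposes its descriptors in that shape as `rdkit.Chem.Descriptors._descList`, which could be passed in as `available`.

```python
import math


def make_featurizer(available, wanted_names):
    """Build a function that computes only `wanted_names`, in that order.

    `available` is a list of (name, function) pairs from the installed
    library version. A KeyError is raised if any wanted descriptor is
    missing, so a version mismatch fails loudly instead of silently
    shifting columns.
    """
    lookup = dict(available)
    missing = [name for name in wanted_names if name not in lookup]
    if missing:
        raise KeyError(f"descriptors not available: {missing}")
    selected = [(name, lookup[name]) for name in wanted_names]

    def featurize(mol):
        values = []
        for _name, func in selected:
            try:
                values.append(float(func(mol)))
            except Exception:
                # Keep the column in place even if one descriptor fails.
                values.append(math.nan)
        return values

    return featurize
```

Because the order comes from `wanted_names`, extra descriptors added in newer library versions are ignored and the feature vector keeps the layout the original model was trained on.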

@simrantan

Sorry! I will make sure to move the tasks to In progress in the future. Thank you for the files! I was trying Zakia's method of finding a version of rdkit that does work, but it appears that will not be possible, so I wrote some code to get the descriptors from the version the source code uses and have written a descriptors.py file based on the example provided! I had a question about how and where to use descriptors.py so that it prevents the issue from happening. Should I use it in place of importing rdkit?

Also, I moved the finetune folder to checkpoints, re-tested eos6fza, and resubmitted a PR!

@GemmaTuron
Member Author

For the descriptors, let's talk about this in today's meeting, I'll guide you through it!
I'll merge the PR

@GemmaTuron
Member Author

Hi @simrantan

In addition to your tasks, please try out the test module developed by Riley and Febie and provide feedback in the Slack thread, specifying which model you tested and whether you have any further suggestions for improvement. Thanks!

@simrantan

simrantan commented Aug 9, 2023

8/8-8/9

  • Add choosing the 200 specific descriptors needed for eos8fth to get_features.py
  • Find a version of rdkit that has all 200 specific descriptors *
  • *(new task) write a script to compare the original descriptors from eos8fth source code to descriptors in other versions of rdkit
  • Test on different rdkit versions until a match is found
  • Fix eos6fza
  • Fix eos1vms
  • Read source code for eos8fma

I have been working on adding the descriptors to eos8fth. It took me a while to find the right place to add the code and then to debug it to make sure it was functional. I managed to get the code to work, and am now searching for a version of rdkit with matching descriptors to complete it. I had to write a script for this, which I managed to complete and test, and I began testing versions of rdkit. I also fixed eos6fza and re-tested it to make sure it works. I have been adding print statements to figure out the eos1vms issue, but I am still looking for the root cause.
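
A comparison script of this kind can be quite small. The sketch below is not the script mentioned above; it is a hypothetical version that takes two lists of descriptor names (one from the original source code's environment, one from a candidate rdkit version) and reports what is missing, what is extra, and whether the shared names appear in the same relative order.

```python
def compare_descriptor_names(original, candidate):
    """Compare two descriptor name lists.

    Returns a dict with:
      missing    - names in `original` that `candidate` lacks
      extra      - names in `candidate` that `original` lacks
      same_order - True if the shared names appear in the same relative order
    """
    original_set, candidate_set = set(original), set(candidate)
    shared_in_original_order = [n for n in original if n in candidate_set]
    shared_in_candidate_order = [n for n in candidate if n in original_set]
    return {
        "missing": sorted(original_set - candidate_set),
        "extra": sorted(candidate_set - original_set),
        "same_order": shared_in_original_order == shared_in_candidate_order,
    }


if __name__ == "__main__":
    # Toy names for illustration only; real lists would be dumped from each
    # environment, for example to a text file with one name per line.
    report = compare_descriptor_names(["A", "B", "C"], ["A", "C", "D"])
    print(report)
```

A candidate version is a usable match when `missing` is empty; `extra` names can then be dropped by selecting descriptors by name.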

@simrantan

@GemmaTuron

I had a flight scheduled for tonight that got cancelled last minute and I was moved to the next available one, which conflicts with tomorrow's meeting, so I will be unable to attend - I'll look at the presentations in the folder to catch up on the recent updates!

@GemmaTuron
Member Author

Thanks for letting me know, safe flights.

@GemmaTuron
Member Author

Hi @simrantan

Let's focus on closing the open models eos1vms and eos8fth this week, thanks!

@GemmaTuron
Member Author

Hi @simrantan !

It was great to work with you, thanks so much for your contributions to Ersilia, we hope you learnt and enjoyed as much as we did! Please remain engaged with the community and feel free to open any issues or contribute to open ones :)
I'll now close this issue as completed !
