Add CLOOB and KELIP #14

apolinario · 2022-04-27T11:09:54Z

Adding CLOOB and KELIP loaders

apolinario · 2022-04-27T11:11:34Z

Forgot register_model. Will add to the PR

dmarx · 2022-04-27T15:47:43Z

src/mmc/loaders/cloobloader.py

+        """
+        Returns the MMC associated with this loader.
+        """
+        from cloob_training import model_pt, pretrained


I think we need to push a separate PR to kat's repo to make cloob_training installable first, yeah? Then we can just add the cloob_training installation to the pyproject.toml

We can do that from Kat's repo, but do you feel this is a scalable approach?

I just wonder whether it should be a MMC requirement that every CLIP-like module we support is installable or if we should have support a custom way way to load it, maybe with cloning repos.

that's a really good point. maybe we could even set it up in such a way that it could check to see if something we want is already "installed" on the PATH variable? feels sorta like an accident waiting happen. how about this: the logic for cloning the repo and all that has to be in a loading class like this.

here's how I want this thing ultimately to work. imagine I have never even touched a pytorch model in my life. I just heard about CLIP and I want to play with it.

I install this repo. right now, I think this looks something like

git clone https://github.com/dmarx/Multi-Modal-Comparators/;
cd Multi-Modal-Comparators
pip install poetry
poetry build
pip install dist/mmc_*.whl

I import a generic api interface and blindly play with it

import mmc
avilable = mmc.available_models()
perceptor = mmc.load(available[0])
x,y = perceptor.encode_text('foo bar'), perceptor.encode_image('path/to/file.png')
score = perceptor.compare(x,y)

my goal here is for CLOOB and KELIP to be available to the user immediately following the setup of this library.

I'm totally open to having some sort of standard pattern we can wrap for dependencies that aren't trivially installable. But I want abstracting away the setup logic and abstracting away esoteric usage api's to be the value this library offers.

Here's an idea: I've already been using a hidden .cache folder for checkpoints, let's designate a subfolder for dependencies that need to be cloned but can't be installed. there's a tool called poe that integrates with poetry as a task runner which could be useful to look at.

does that make sense? I hope I'm not trying to boil the ocean here, am I? This would definitely be more scalable but also feels unstable.

at the very least, we should add like a shell script or something the user can execute as part of the installation that does the git clone, PATH modification, etc. steps

I created this repo sort of as a joke to myself, but maybe we should turn it into a real thing? all that business above with taking care of cloning and all that into a subfolder then playing with the path variables etc: abstract all of that away into its own piece. easy way to maintain and leverage the "best practices" for ... this particular brand of "duct tape and bubblegum"

You raised very good points and I think those make sense. Although at the end we may want to go back to the original issue and see if what we are doing is more work per package than making a fork/PR for each tool we want to support we might go back to that at the end. But I don't think it is.

I checked https://github.com/nat-n/poethepoet and I feel it makes sense to me. We could script each import with git clone and path and add that to the poetry pipeline; I love the not-a-package-manager joke/idea, fully. Maybe its something to develop in parallel and eventually integrate but not sure if makes sense to make a dependency right at the start?

yeah works for me. let's start with scripting the setup using poe and add some sort of poe run_last_setup_step entry to the setup instructions (... whenever that becomes a thing lol. I need to add a README). We can use poe as a stop-gap and port the setup to napm whenever that's ready. I'll add you to that repo as well. the dependency management gods forgive us.

allright, god help us: napm is officially published on PyPI (pip install napm). take her for a spin and lemme know if it works ok.

dmarx · 2022-04-30T04:29:47Z

allright, looks like I got CLOOB working well enough. wanna take a stab at finishing off KELIP?

dmarx · 2022-05-01T03:29:56Z

...lol nevermind, kelip has a setup.py, can be installed via pyproject.toml. we good.

Add CLOOB and KELIP

175705a

apolinario requested a review from dmarx April 27, 2022 11:09

Fix the class names and model registration

25f0dd7

apolinario marked this pull request as draft April 27, 2022 11:40

dmarx reviewed Apr 27, 2022

View reviewed changes

dmarx added 9 commits April 29, 2022 20:25

added napm to build dependencies

e3b6213

added napm cloob installation as poe task

e153ed9

modified cloob loader for napm

99a2c52

added install instructions

0df44a7

added test changed cloob loader name

be6e120

cloob test passes

2e7e9dc

added cloob specific tests, image encoder fails

da9a881

copyedit

f2155b4

cloob tests passing

f84c1c4

dmarx added 4 commits April 30, 2022 20:19

fixed up kelip some

ef3d420

added kelip install to pyproject.toml

8f27c45

kelip added to loader init for registration

af6c15b

added kelip loader test

8805584

syntax bug

ac0110a

dmarx marked this pull request as ready for review May 1, 2022 03:42

dmarx merged commit 633b6af into main May 1, 2022

dmarx mentioned this pull request May 13, 2022

improved packaging #26

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add CLOOB and KELIP #14

Add CLOOB and KELIP #14

apolinario commented Apr 27, 2022

apolinario commented Apr 27, 2022

dmarx Apr 27, 2022

apolinario Apr 27, 2022

dmarx Apr 28, 2022 •

edited

Loading

dmarx Apr 28, 2022

apolinario Apr 28, 2022

dmarx Apr 28, 2022

dmarx Apr 29, 2022

dmarx commented Apr 30, 2022

dmarx commented May 1, 2022

Add CLOOB and KELIP #14

Add CLOOB and KELIP #14

Conversation

apolinario commented Apr 27, 2022

apolinario commented Apr 27, 2022

dmarx Apr 27, 2022

Choose a reason for hiding this comment

apolinario Apr 27, 2022

Choose a reason for hiding this comment

dmarx Apr 28, 2022 • edited Loading

Choose a reason for hiding this comment

dmarx Apr 28, 2022

Choose a reason for hiding this comment

apolinario Apr 28, 2022

Choose a reason for hiding this comment

dmarx Apr 28, 2022

Choose a reason for hiding this comment

dmarx Apr 29, 2022

Choose a reason for hiding this comment

dmarx commented Apr 30, 2022

dmarx commented May 1, 2022

dmarx Apr 28, 2022 •

edited

Loading