-
Notifications
You must be signed in to change notification settings - Fork 11
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Some ideas on outfit #49
Comments
Indeed, what is missing is just some contrastive learning for outfits. This should be in principle the same as ccip for characters and the dataset is already there (ask Narugo). It is just no one that I know really gets time to work on it. More generally speaking, I would like to see more fundamental groundwork for anime images, including fine tuning all major ssl model (dino, Mae, clip, aim etc.), vlms such as llava, and other vision models such as yolo-world and sam for anime. This will make our life much easier. This being said, I am too busy to further work on this project or anime stuff at the moment. |
Thinking that newer research in general are good, but not sure if Open Model Initiative would shake the monopoly. VLMs tho will need even more data to operate besides finding open LLM backends (and the Transformer vs Mamba-esque debate is fun too) |
There are a few scenarios regarding outfit (can be co-occurring):
Since full-body segmentation exists, it would be a prime focus for creating a clustering technique for outfit embeddings
See also the notes made for a Segmentation library SkyTNT/anime-segmentation#13
The text was updated successfully, but these errors were encountered: