ViT + CLIP #143

gboduljak · 2023-12-18T19:52:08Z

Would it be worth implementing ViT and CLIP example?

awni · 2023-12-18T20:28:41Z

Yea that ones on our list of examples to add! Are you interested in contributing it? If so which model would you use?

gboduljak · 2023-12-18T21:42:23Z

Yea that ones on our list of examples to add! Are you interested in contributing it? If so which model would you use?

I would like to contribute :) However, I would like to complete the implementation of norm first (ml-explore/mlx#187). I would use models from the official CLIP repository: https://github.com/openai/CLIP. If you have an alternative idea, please let me know.

nkasmanoff · 2024-01-15T19:51:36Z

@gboduljak I submitted a PR to your existing PR, which creates a local implementation of the CLIPImageProcessor. gboduljak#1

This should eliminate the dependency on transformers, aside from using it for downloading the model & tokenizer.

gboduljak · 2024-01-15T23:41:43Z

@nkasmanoff Thanks for the help. I will take a look at your work now.

gboduljak · 2024-01-16T01:57:49Z

@nkasmanoff I merged your PR, corrected the nits and I refactored your implementation so that everything is in preprocessing folder. Many thanks for the help. In future, we might drop this 'copy-paste' implementation from HuggingFace. Ideally, we should use mlx-data. If you have time, it would be awesome to have mlx-data implementation of CLIPImageProcessor.

awni added the enhancement New feature or request label Dec 18, 2023

This was referenced Jan 11, 2024

[Feature Request] Example of MLLM using MLX #207

Closed

CLIP (ViT) #315

Merged

awni closed this as completed in #315 Jan 31, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ViT + CLIP #143

ViT + CLIP #143

gboduljak commented Dec 18, 2023

awni commented Dec 18, 2023 •

edited

Loading

gboduljak commented Dec 18, 2023 •

edited

Loading

nkasmanoff commented Jan 15, 2024

gboduljak commented Jan 15, 2024

gboduljak commented Jan 16, 2024

ViT + CLIP #143

ViT + CLIP #143

Comments

gboduljak commented Dec 18, 2023

awni commented Dec 18, 2023 • edited Loading

gboduljak commented Dec 18, 2023 • edited Loading

nkasmanoff commented Jan 15, 2024

gboduljak commented Jan 15, 2024

gboduljak commented Jan 16, 2024

awni commented Dec 18, 2023 •

edited

Loading

gboduljak commented Dec 18, 2023 •

edited

Loading