- Add multilingual ViT-H/14 with XLM-RoBERTa-large text tower
- Fix _read_reqs in setup.py (sketch of such a helper after this list)
- Make OpenCLIP training usable from PyPI
- Add XLM-RoBERTa-large ViT-H/14 config
- Pretrained ViT-B/32 with XLM-RoBERTa-base: the first multilingual CLIP trained on LAION-5B (usage sketch after this list)
- Pretrained ViT-B/32 with RoBERTa-base: the first CLIP trained with a HuggingFace text encoder
- Add missing hf_tokenizer_name in CLIPTextCfg.
- Fix #211: add missing RN50x64 config. Fix type of dropout param for ResNet models
- Bring back the LayerNorm impl that casts back to the input dtype for non-bf16/fp16 runs (sketch after this list)
- zero_shot.py: set correct tokenizer based on args
- training/params.py: remove HF params and read them from the model config
- Implement gradient checkpointing for HF models (sketch after this list)
- Set custom_text to True when hf_model_name is set (example config after this list)
- Disable HF tokenizer parallelism (snippet after this list)
- Generalizable Text Transformer with HuggingFace Models (@iejMac)
- Support for custom text tower
- Add checksum verification for pretrained model weights (sketch after this list)
- Lots of changes, including SOTA models, a bfloat16 option, better loading, and better metrics
- ViT-B/32 trained on LAION-2B-en
- Add missing OpenAI RN50x64 model
- ViT-B/16+
- Add gradient checkpointing support
- More robust data loader
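
For the multilingual checkpoints above, a minimal usage sketch, assuming a recent open_clip release. The model/pretrained tags are assumptions, so confirm them with open_clip.list_pretrained(); open_clip.get_tokenizer may not exist in older versions.

```python
import torch
import open_clip

# Tags below are assumptions; confirm with open_clip.list_pretrained().
model, _, preprocess = open_clip.create_model_and_transforms(
    'xlm-roberta-base-ViT-B-32', pretrained='laion5b_s13b_b90k')
tokenizer = open_clip.get_tokenizer('xlm-roberta-base-ViT-B-32')

# The XLM-RoBERTa text tower accepts non-English prompts directly.
texts = tokenizer(["a photo of a dog", "ein Foto eines Hundes"])
with torch.no_grad():
    text_features = model.encode_text(texts)
    text_features = text_features / text_features.norm(dim=-1, keepdim=True)
```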
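The _read_reqs fix concerns reading requirements files for setup.py. A hypothetical sketch of such a helper (the actual implementation may differ): it skips blank lines and comments so install_requires only sees valid requirement specifiers.

```python
from pathlib import Path

def _read_reqs(relpath):
    # Resolve relative to setup.py so builds work from any working directory.
    fullpath = Path(__file__).resolve().parent / relpath
    with open(fullpath) as f:
        return [
            line.strip() for line in f
            if line.strip() and not line.strip().startswith('#')
        ]
```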
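The LayerNorm entry refers to an implementation that remembers the input dtype and casts the normalized output back to it, so fp16/bf16 activations are not silently promoted downstream. A minimal sketch of that idea:

```python
import torch
from torch import nn
from torch.nn import functional as F

class LayerNorm(nn.LayerNorm):
    """LayerNorm that casts its output back to the input dtype."""

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        orig_type = x.dtype  # e.g. fp16/bf16 under mixed precision
        x = F.layer_norm(x, self.normalized_shape, self.weight, self.bias, self.eps)
        return x.to(orig_type)
```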
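For gradient checkpointing on HF models, HuggingFace's PreTrainedModel already exposes gradient_checkpointing_enable()/disable(), so a wrapper text tower only needs to forward the request. The HFTextEncoder class and set_grad_checkpointing name below are illustrative, not necessarily the repo's exact structure.

```python
from torch import nn
from transformers import AutoModel

class HFTextEncoder(nn.Module):
    """Illustrative wrapper around a HuggingFace text model."""

    def __init__(self, model_name: str):
        super().__init__()
        self.transformer = AutoModel.from_pretrained(model_name)

    def set_grad_checkpointing(self, enable: bool = True):
        # Delegate to the real HF PreTrainedModel API.
        if enable:
            self.transformer.gradient_checkpointing_enable()
        else:
            self.transformer.gradient_checkpointing_disable()
```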
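The custom text tower is selected through the model config: when hf_model_name is present in text_cfg, the HF encoder (custom_text path) is used. open_clip stores these configs as JSON files; the example is shown as a Python dict for brevity, and the field values are assumptions loosely modeled on a RoBERTa-base ViT-B/32 setup.

```python
# Illustrative model config; field values are assumptions.
model_cfg = {
    "embed_dim": 512,
    "vision_cfg": {
        "image_size": 224,
        "layers": 12,
        "width": 768,
        "patch_size": 32,
    },
    "text_cfg": {
        "hf_model_name": "roberta-base",      # switches on the HF text tower
        "hf_tokenizer_name": "roberta-base",  # the field added in CLIPTextCfg
        "proj": "mlp",
        "pooler_type": "mean_pooler",
    },
}
```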
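Disabling HF tokenizer parallelism is done via an environment variable. Set before any fast tokenizer is used, it avoids the fork-related warnings and potential deadlocks when DataLoader workers start:

```python
import os

# Must be set before tokenizers are instantiated/used.
os.environ["TOKENIZERS_PARALLELISM"] = "false"
```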
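Checksum verification for downloaded weights can be as simple as hashing the file and comparing it against an expected digest. In this sketch the function name and prefix-matching scheme are assumptions; the file is streamed in chunks so large checkpoints need not fit in memory.

```python
import hashlib

def verify_checksum(path: str, expected_sha256: str) -> bool:
    h = hashlib.sha256()
    with open(path, 'rb') as f:
        # Read in 1 MiB chunks to keep memory use flat.
        for chunk in iter(lambda: f.read(1 << 20), b''):
            h.update(chunk)
    return h.hexdigest().startswith(expected_sha256)
```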