KODI is a diffusion model that generates high-quality Korean cultural images from Korean text prompts. It uses a Korean CLIP-based text encoder to better understand Korean prompts and generate culturally appropriate images.
git clone https://github.com/TeamLab/kodi.git
cd kodi
pip install -r requirements.txtOur Korean cultural training dataset is located at:
korean-cultural-dataset/
This dataset contains Korean cultural images with corresponding Korean text descriptions.
The Korean cultural evaluation benchmark MC-K is available at:
evaluation/benchmark/
This benchmark is used for evaluating cultural appropriateness and Korean language understanding.
# Train KODI with Korean Cultural Dataset (KCD)
python training/train_kodi.py --config training/configs/kodi.py# Generate images with evaluation dataset
python evaluation/generate_eval_images.py# Evaluate with MC-K benchmark using KC-CLIP
python evaluation/evaluate_by_kcclip.py| Model | Type | Location | Description |
|---|---|---|---|
| KODI | LoRA Weights | model-weights/kodi/ |
Korean cultural diffusion model (included in repository) |
| KC-CLIP KO | Evaluator | letgoofthepizza/kc-clip-ko | Korean cultural CLIP model (Korean) |
| KC-CLIP EN | Evaluator | letgoofthepizza/kc-clip-en | Korean cultural CLIP model (English) |