
Support Yi-Coder #208

Closed · ryankert01 opened this issue Sep 4, 2024 · 7 comments

ryankert01 (Contributor) commented Sep 4, 2024

🚀 The feature, motivation and pitch

To-dos:

  1. Implement an API for Yi-Coder
  2. Test Yi-Coder out with the Llama lce_forward

Alternatives

No response

Additional context

From a Discord discussion.

ryankert01 (Contributor, Author) commented:
#take @ByronHsu

ByronHsu (Collaborator) commented Sep 6, 2024

Any progress?

ryankert01 (Contributor, Author) commented:

I'll open a PR by the weekend.

ryankert01 (Contributor, Author) commented:

Hi @ByronHsu, I just noticed that the Hugging Face Llama mapping is keyed by base model type, and Yi-Coder has its base model configured accordingly. So I think we may not need a code change at all. I'll test shortly whether it works. (Not sure if I'm wrong.)

ref:

  1. https://github.com/huggingface/transformers/blob/main/src/transformers/models/auto/modeling_auto.py#L34
  2. https://huggingface.co/01-ai/Yi-Coder-9B-Chat/blob/main/config.json#L3
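The reasoning behind the two references above can be sketched mechanically: Yi-Coder's config.json declares the Llama architecture, so a kernel patch keyed on `model_type` should already cover it without a new mapping. A minimal sketch — the `needs_new_mapping` helper and the patched-type set are hypothetical illustrations, not Liger API; the config fields are taken from the linked file:

```python
# Excerpt of 01-ai/Yi-Coder-9B-Chat config.json (fields from the link above):
yi_coder_config = {
    "architectures": ["LlamaForCausalLM"],
    "model_type": "llama",
}

# Hypothetical helper: does this model need a new kernel mapping, or is its
# base architecture already covered by an existing patch keyed on model_type?
def needs_new_mapping(config, patched_model_types=frozenset({"llama"})):
    return config["model_type"] not in patched_model_types

print(needs_new_mapping(yi_coder_config))  # prints False: reuses the Llama patch
```

Under this assumption, any Liger patch dispatched on `model_type == "llama"` would apply to Yi-Coder automatically.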

ryankert01 (Contributor, Author) commented Sep 8, 2024

UPDATE: Got it — looks like this will soon be solved by #199.

Hi @ByronHsu, I just did the research and found something odd: when I only configure SFTConfig with use_liger=True, GPU usage is the same as without Liger, but if I use

model = AutoLigerKernelForCausalLM.from_pretrained(model_name)

it's significantly better. That doesn't align with the SFTTrainer docs on Hugging Face.

Could you help me look into it? research notebook

shimizust (Collaborator) commented:
@ryankert01 Thanks for the comment. #199 is ready and should be incorporated soon. Right now, SFTConfig doesn't actually do anything with the use_liger flag unless you pass in a model path (in which case the model is loaded via AutoLigerKernelForCausalLM) rather than an already instantiated model. After this change, SFTTrainer will need to be updated to call the new API.
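The dispatch described above can be sketched in a few lines — this is an illustrative stand-in, not the actual trl/SFTTrainer code, and `resolve_model` is a hypothetical name. It shows why passing an already instantiated model made use_liger a silent no-op in the experiment:

```python
# Illustrative sketch of the dispatch described above (not real trl code):
# use_liger only takes effect on the string (model path) branch.
def resolve_model(model, use_liger=False):
    if isinstance(model, str):
        # Model given as a path/name: the flag selects the loader class.
        loader = "AutoLigerKernelForCausalLM" if use_liger else "AutoModelForCausalLM"
        return f"{loader}.from_pretrained({model!r})"
    # Already-instantiated model: returned as-is, so use_liger silently no-ops.
    return model

print(resolve_model("01-ai/Yi-Coder-9B-Chat", use_liger=True))
```

This matches the observation in the previous comment: only the `from_pretrained`-by-path route ever sees the flag, so pre-built models show no memory difference.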

ryankert01 (Contributor, Author) commented:

Closed by huggingface/transformers#33502.

4 participants