-
Notifications
You must be signed in to change notification settings - Fork 184
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
What're the modifications in llava/train/transformers_replace
?
#24
Comments
We have manually changed some original implementations to better support grouping strategy and flash attn, and recommend every VILA user to do so. Though our codebase should work with higher version transformer, we haven't tested throughfully thus cannot promise anything. Please use v4.36.2 for reproducement. |
I found |
I noticed that you mainly manually implemented |
remove tf utils to avoid import error
@Lyken17 : Could you please share some details for this ? |
please use 4.36.2 for now, we will upgrade to 4.37 in next release :) |
Also why manually replace the files? |
Also interested in understanding this better, as I'm trying to combine a few things in VILA and LLaVA-NeXt and this makes me concerned that something might break in an unexpected way (e.g. run but give poor results). In environment_setup.sh, if we use transformers v4.37.2 but skip the following, what are the repercussions? # What happens if we skip this?
cp -rv ./llava/train/transformers_replace/* $site_pkg_path/transformers/
cp -rv ./llava/train/deepspeed_replace/* $site_pkg_path/deepspeed/ Are there PR's for merging these into the respective libraries or do they break other things and thus can't be merged? |
…tible fix bug for compatible issues.
Hi, thanks for the nice work! I wonder what are the main modifications in
llava/train/transformers_replace
compared to the original implementation intransformers==4.31.0
, as specified in the pyproject.toml. Also, in environment_setup.sh,transformers==4.36.2
is installed:I wonder why we want to install different versions of transformers?
If I want to use a higher version of transformers, e.g. 4.38, are there changes needed for the files in this folder? Many thanks!
The text was updated successfully, but these errors were encountered: