Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

When will the trained model be released? #3

Open
chenxshuo opened this issue Aug 19, 2023 · 3 comments
Open

When will the trained model be released? #3

chenxshuo opened this issue Aug 19, 2023 · 3 comments

Comments

@chenxshuo
Copy link

Hi there,

thank you very much for this awesome project! I wonder whether you are going to release the model that is trained on this dataset in the near future. If yes, when will it be?

Best regards

@vishaal27
Copy link

I have this same question too. I see that some of these model links are public: https://huggingface.co/HuggingFaceM4, however upon clicking them it shows a 404. Is there a planned release date for making them public?

@VictorSanh
Copy link

Hey @chenxshuo & @vishaal27 , thanks for your interest!
We will be announcing officially around mid-week.
The link will all become public at this time :)

@vishaal27
Copy link

@VictorSanh Thanks for the model release, looks super exciting. I just was wondering if you had a profiling table of the load times, inference times for a single moderately sized sequence, and necessary GPU memory for both the 9B and 80B models. On how many GPUs (and their specs) did you run the 80B evals? And were all the evals done in fp16/bf16?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants