When will the trained model be released? #3

chenxshuo · 2023-08-19T07:23:15Z

Hi there,

Thank you very much for this awesome project! I wonder whether you are going to release the model trained on this dataset in the near future. If so, when will it be?

Best regards

vishaal27 · 2023-08-20T23:40:56Z

I have the same question. I see that some of these model links are public: https://huggingface.co/HuggingFaceM4. However, clicking on them returns a 404. Is there a planned date for making them public?

VictorSanh · 2023-08-21T02:43:32Z

Hey @chenxshuo & @vishaal27, thanks for your interest!
We will be making the official announcement around mid-week.
The links will all become public at that time :)

vishaal27 · 2023-08-23T07:39:08Z

@VictorSanh Thanks for the model release, it looks super exciting. I was just wondering if you have a profiling table of the load times, the inference times for a single moderately sized sequence, and the GPU memory required for both the 9B and 80B models. How many GPUs (and with what specs) did you run the 80B evals on? And were all the evals done in fp16/bf16?
