Add BenCzechMark blogpost #2382
- user: Lakoc
  guest: true
  org: BUT-FIT
- user: popelucha
Doesn't seem to be in the MU-NLPC org (not sure it matters)
Yeah I know, but I guess (hope) it's fine
- *Qwen-72B* shone in Math and Information Retrieval but lagged behind similarly-sized models in other categories.
- *Aya-21-35B* model excels in Sentiment and Language Modeling, but similarly lags behind in different categories.
- *Gemma-2 9B* delivers excellent results in Czech reading comprehension, outperforming much larger models.
I suppose all these models were not fine-tuned for Czech, or were they? Were there any Czech-specific models you evaluated?
Only 2 models were Czech-specific: CSTinyLlama-1.2B and csmpt7b. Despite the lack of official Czech support, some multilingual models perform very well in Czech.
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
…feat/benczechmark
Adds a blog post about a new Czech evaluation suite called BenCzechMark.
Notes:
Just for review, I will merge it myself