Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Mistral NeMo | Mistral AI | Frontier AI in your hands #851

Open
1 task
ShellLM opened this issue Aug 1, 2024 · 1 comment
Open
1 task

Mistral NeMo | Mistral AI | Frontier AI in your hands #851

ShellLM opened this issue Aug 1, 2024 · 1 comment
Labels
AI-Chatbots Topics related to advanced chatbot platforms integrating multiple AI models finetuning Tools for finetuning of LLMs e.g. SFT or RLHF llm Large Language Models Models LLM and ML model repos and links New-Label Choose this option if the existing labels are insufficient to describe the content accurately

Comments

@ShellLM
Copy link
Collaborator

ShellLM commented Aug 1, 2024

Mistral NeMo | Mistral AI | Frontier AI in your hands

"Today, we are excited to release Mistral NeMo, a 12B model built in collaboration with NVIDIA. Mistral NeMo offers a large context window of up to 128k tokens. Its reasoning, world knowledge, and coding accuracy are state-of-the-art in its size category. As it relies on standard architecture, Mistral NeMo is easy to use and a drop-in replacement in any system using Mistral 7B.

We have released pre-trained base and instruction-tuned checkpoints checkpoints under the Apache 2.0 license to promote adoption for researchers and enterprises. Mistral NeMo was trained with quantisation awareness, enabling FP8 inference without any performance loss.

The following table compares the accuracy of the Mistral NeMo base model with two recent open-source pre-trained models, Gemma 2 9B, and Llama 3 8B."

Suggested labels

{'label-name': 'Large AI Model', 'label-description': 'Refers to state-of-the-art large AI models like Mistral NeMo with up to 128k tokens context window.', 'gh-repo': 'AI-Chatbots', 'confidence': 63.31}

@ShellLM ShellLM added AI-Chatbots Topics related to advanced chatbot platforms integrating multiple AI models finetuning Tools for finetuning of LLMs e.g. SFT or RLHF llm Large Language Models Models LLM and ML model repos and links New-Label Choose this option if the existing labels are insufficient to describe the content accurately labels Aug 1, 2024
@ShellLM
Copy link
Collaborator Author

ShellLM commented Aug 1, 2024

Related content

#460 similarity score: 0.89
#311 similarity score: 0.87
#389 similarity score: 0.86
#431 similarity score: 0.86
#628 similarity score: 0.86
#647 similarity score: 0.86

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
AI-Chatbots Topics related to advanced chatbot platforms integrating multiple AI models finetuning Tools for finetuning of LLMs e.g. SFT or RLHF llm Large Language Models Models LLM and ML model repos and links New-Label Choose this option if the existing labels are insufficient to describe the content accurately
Projects
None yet
Development

No branches or pull requests

1 participant