Nomic Embed Text V2 with Mixture-of-Experts (MoE) architecture #12466

Draft
wants to merge 5 commits into
base: master
Choose a base branch
from

Conversation

manyoso (Contributor) commented Mar 19, 2025:

  • Adds an MoE-based embedding model supporting multilingual embeddings.
  • Selects the architecture variant based on hyperparameter detection (MoE layers); see the sketch below.
  • Removes unnecessary subclass initialization checks for clarity.

https://www.nomic.ai/blog/posts/nomic-embed-text-v2
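The variant selection presumably boils down to checking the Hugging Face config for MoE-related hyperparameters, along the lines of the sketch below. The key names moe_every_n_layers and num_experts and the returned architecture strings are assumptions for illustration, not quoted from this PR.

```python
import json

def pick_nomic_arch(config_path: str) -> str:
    """Pick a GGUF architecture variant from an HF config.json.

    Illustrative sketch only: the keys checked here ("moe_every_n_layers",
    "num_experts") and the returned strings are assumptions, not the PR's code.
    """
    with open(config_path) as f:
        hparams = json.load(f)
    has_moe_layers = hparams.get("moe_every_n_layers", 0) > 0 and hparams.get("num_experts", 0) > 1
    return "nomic-bert-moe" if has_moe_layers else "nomic-bert"
```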


github-actions bot added the python (python script changes) label on Mar 19, 2025
manyoso marked this pull request as draft on March 19, 2025 13:46
manyoso added 3 commits March 19, 2025 09:52

manyoso marked this pull request as ready for review on March 19, 2025 15:54
@@ -702,6 +695,8 @@ def get_vocab_base_pre(self, tokenizer) -> str:
         if chkhsh == "ccc2ef013c104be7bae2965776d611e1d7a8a2a9c547dd93a682c9a9fc80352e":
             # ref: https://huggingface.co/Xenova/gpt-4o
             res = "gpt-4o"
+        if chkhsh == "a81863d07e75497e2194eb1a1574d5e5cd4d5f85a87a0728b922bf2bed6fb327":
+            res = "bert"
A collaborator commented on this hunk:
The newly added tokenizer is nomic-embed-text-v2-moe, not bert; is this expected?

Also, this list is auto-generated, so please make sure not to modify it manually.
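For context on why the list should not be edited by hand: each chkhsh entry hashes the token IDs the tokenizer produces for a fixed probe string, and convert_hf_to_gguf_update.py regenerates the whole block. The snippet below only illustrates the hashing idea; the real script uses its own, much longer probe text, so it will not reproduce the hashes above.

```python
from hashlib import sha256
from transformers import AutoTokenizer

def tokenizer_fingerprint(model_id: str, probe_text: str) -> str:
    """Hash the token IDs a tokenizer emits for a fixed probe string.

    Mirrors the idea behind get_vocab_base_pre(); the real probe text (chktxt)
    differs, so this will not match the hashes in convert_hf_to_gguf.py.
    """
    tok = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
    ids = tok.encode(probe_text)
    return sha256(str(ids).encode()).hexdigest()

# Example (model id assumed):
# tokenizer_fingerprint("nomic-ai/nomic-embed-text-v2-moe", "Hello 🚀 world")
```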


if "mlp.experts.mlp.w1" in name:
data_torch = data_torch.view(self.hparams["num_experts"], self.hparams["n_inner"], self.hparams["n_embd"])
return [(self.map_tensor_name(name) + ".weight", data_torch)]
ngxson (Collaborator) commented Mar 19, 2025:
Will this work? (There is no need to return here.)

map_tensor_name will append .weight if the given original name also has it.

Suggested change:
-            return [(self.map_tensor_name(name) + ".weight", data_torch)]
+            name += ".weight"

ngxson (Collaborator) left a comment:

Maybe I missed something, but llm_build_bert does not seem to support MoE, right? Should we also update the compute graph?

manyoso marked this pull request as draft on March 19, 2025 17:58

manyoso (Contributor, Author) commented Mar 19, 2025:

Working on tests to verify the accuracy of the model.
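One plausible shape for such a check (not necessarily what this PR will end up using): embed the same text with the upstream Hugging Face checkpoint and with the converted GGUF through llama-cpp-python, then compare cosine similarity. The GGUF file name, the task prefix, and the use of sentence-transformers here are assumptions.

```python
import numpy as np
from sentence_transformers import SentenceTransformer
from llama_cpp import Llama

def cosine(a, b) -> float:
    a, b = np.asarray(a, dtype=np.float32), np.asarray(b, dtype=np.float32)
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

# Nomic embedding models expect a task prefix; "search_document:" is assumed here.
text = "search_document: GGUF is the file format used by llama.cpp."

# Reference embedding from the upstream checkpoint (custom modeling code on the Hub).
ref_model = SentenceTransformer("nomic-ai/nomic-embed-text-v2-moe", trust_remote_code=True)
ref = ref_model.encode([text])[0]

# Embedding from the converted model; the GGUF path is a placeholder.
gguf_model = Llama(model_path="nomic-embed-text-v2-moe.f16.gguf", embedding=True)
got = gguf_model.embed(text)

print("cosine similarity:", cosine(ref, got))  # should be close to 1.0 if the graph matches
```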
