Mutox classifier #44

David-OC17 · 2024-10-26T02:54:21Z

Elements to include in PR

Migrate Mutox code from Seamless Communication
Add unit tests for the pipeline
(Optional) add softmax computation on top of classifier

Note: A follow-up PR will be needed in the Seamless Communication repository to account for the migration being made here.

examples/mutox_example.ipynb

mutox/__init__.py

mutox/speech_pipeline.py

pyproject.toml

mutox/cli/README.md

tests/unit_tests/test_mutox.py

sonar/models/mutox/classifier.py

sonar/models/mutox/builder.py

sonar/models/mutox/classifier.py

avidale · 2024-10-29T11:18:56Z

sonar/inference_pipelines/mutox_speech.py

+    ) -> None:
+        super().__init__(encoder)
+        self.model.to(device).eval()
+        self.mutox_classifier = mutox_classifier.to(device).eval()


According to the type annotation, mutox_classifier can be either a str or a MutoxClassifier. The latter would work fine here, but for the former, we'll have to load the classifier using the string name.

The same holds for the model part one line above.

@David-OC17 you marked this conversation as resolved, but the type annotation for mutox_classifier is still not totally consistent with the method body: either remove str from the annotation or add a load_mutox_model to the body if the input is a string.

sonar/inference_pipelines/mutox_speech.py

README.md

avidale · 2024-10-29T13:00:00Z

examples/mutox_example.ipynb

+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "/tmp/tmpqasvhgx6/commonvoice_example_en_clocks.wav\t-42.40079116821289\n",


These scores, -42 and -47, are not straightforward to interpret.
We should either mention in the top of the notebook that the model has been trained with a "Binary Cross Entropy loss with logits" objective (according) to the paper), and thus, to turn its output into a probability, we need to feed it to a sigmoid layer.

The same thing: you resolved the comment, but the notebook still doesn't feature a code change or an explanation of what kind of output the current code generates.

README.md

…repo decoupling, other

README.md

avidale · 2024-11-13T02:14:48Z

README.md

+    x = classifier(emb.to(device).to(dtype)) # tensor([[-19.7812]], device='cuda:0', dtype=torch.float16)
+
+with torch.inference_mode():
+    emb = t2vec_model.predict(["She worked hard and made a significant contribution to the team."], source_lang='fra_Latn')


source_lang='eng_Latn'?

avidale · 2024-11-13T02:16:31Z

README.md

@@ -142,6 +142,49 @@ print(blaser_qe(src=src_embs, mt=mt_embs).item())  # 4.708
 Detailed model cards with more examples: [facebook/blaser-2.0-ref](https://huggingface.co/facebook/blaser-2.0-ref), 
 [facebook/blaser-2.0-qe](https://huggingface.co/facebook/blaser-2.0-qe). 

+### Classifying the toxicity of sentences with MuTox
+
+[MuTox](https://github.com/facebookresearch/seamless_communication/tree/main/src/seamless_communication/cli/toxicity/mutox), the first highly multilingual audio-based classifier (binary) and dataset with toxicity labels. The dataset consists of 20k audio utterances for English and Spanish, and 4k for the other 19 languages, and uses the multi-model and multilingual encoders from SONAR. The output of the MuTox classifier is a probability of the evaluated being _"toxic"_, according to the definition adopted in the corresponding dataset.


This paragraph says that "The output of the MuTox classifier is a probability", but the outputs in your examples are tensors like -58.0625 which are clearly not probabilities.
We should either adjust the description (by saying that the output is a logit) or the output (by feeding it to a sigmoid function to turn it into a probability).

avidale · 2024-11-13T02:17:59Z

sonar/cards/sonar_mutox.yaml

+model_type: mutox_classifier
+model_arch: mutox
+checkpoint: "https://dl.fbaipublicfiles.com/seamless/models/mutox.pt"
+input_size: 1024


You may explicitly mention in a comment here that it's a copy of the card in https://github.com/facebookresearch/seamless_communication/blob/main/src/seamless_communication/cards/mutox.yaml.

avidale · 2024-11-13T02:21:02Z

sonar/models/mutox/classifier.py

+    input_size: int
+
+    # add sigmoid as last layer to output probability
+    output_prob: bool = False


I would probably make output_prob not a property of the config but an optional argument in the MutoxClassifier.forward method (which defaults to False, which is the current behavior).

avidale · 2024-11-13T02:23:25Z

sonar/inference_pipelines/mutox_speech.py

+    ) -> None:
+        super().__init__(encoder)
+        self.model.to(device).eval()
+        self.mutox_classifier = mutox_classifier.to(device).eval()


@David-OC17 you marked this conversation as resolved, but the type annotation for mutox_classifier is still not totally consistent with the method body: either remove str from the annotation or add a load_mutox_model to the body if the input is a string.

David-OC17 added 2 commits October 25, 2024 19:24

Main code transfer of Mutox classifier from SeamlessM4T

eff7989

Minor changes, import inside mutox broken

822673a

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Oct 26, 2024

David-OC17 commented Oct 26, 2024

View reviewed changes

examples/mutox_example.ipynb Outdated Show resolved Hide resolved

avidale reviewed Oct 28, 2024

View reviewed changes

mutox/__init__.py Outdated Show resolved Hide resolved

mutox/speech_pipeline.py Outdated Show resolved Hide resolved

pyproject.toml Outdated Show resolved Hide resolved

mutox/cli/README.md Outdated Show resolved Hide resolved

mutox/cli/README.md Outdated Show resolved Hide resolved

David-OC17 added 2 commits October 28, 2024 19:02

Corrections from PR facebookresearch#44 comments

35c9175

Added unit tests for mutox builder, classifier

60f1816

David-OC17 requested a review from avidale October 29, 2024 02:00

artemru reviewed Oct 29, 2024

View reviewed changes

tests/unit_tests/test_mutox.py Show resolved Hide resolved

artemru reviewed Oct 29, 2024

View reviewed changes

sonar/models/mutox/classifier.py Show resolved Hide resolved

artemru reviewed Oct 29, 2024

View reviewed changes

sonar/models/mutox/builder.py Outdated Show resolved Hide resolved

avidale reviewed Oct 29, 2024

View reviewed changes

README.md Outdated Show resolved Hide resolved

Resolved comments PR#44: Added MutoxConfig opt layer, style changes, …

55a342d

…repo decoupling, other

David-OC17 force-pushed the mutox_classifier branch from ed55f21 to 55a342d Compare November 9, 2024 00:13

avidale reviewed Nov 13, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Mutox classifier #44

Mutox classifier #44

David-OC17 commented Oct 26, 2024

avidale Oct 29, 2024

avidale Nov 13, 2024

avidale Oct 29, 2024

avidale Nov 13, 2024 •

edited

Loading

avidale Nov 13, 2024

avidale Nov 13, 2024

avidale Nov 13, 2024

avidale Nov 13, 2024

avidale Nov 13, 2024

Mutox classifier #44

Are you sure you want to change the base?

Mutox classifier #44

Conversation

David-OC17 commented Oct 26, 2024

Elements to include in PR

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

avidale Nov 13, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

avidale Nov 13, 2024 •

edited

Loading