Multiple beams translate & evaluation with bleu #6

Open · trynusnick13 wants to merge 9 commits into base: main

Conversation

trynusnick13

No description provided.

with open('flores-eng-devtest.csv', 'w') as csvfile:
    writer = csv.DictWriter(csvfile, fieldnames=["eng_Latn-ukr_Cyrl"])
    writer.writeheader()
    for domain in list_of_emails:
Collaborator

list_of_emails?

eng = devtest["sentence_eng_Latn"]

def write_to_csv(list_of_emails):
    with open('flores-eng-devtest.csv', 'w') as csvfile:
        writer = csv.DictWriter(csvfile, fieldnames=["eng_Latn-ukr_Cyrl"])
Collaborator

fieldnames looks wrong.
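
A minimal sketch of what the reviewer likely means, assuming each row pairs an English sentence with its Ukrainian translation (the column names here are hypothetical): fieldnames must list one name per CSV column, matching the keys of every dict passed to writerow(), whereas "eng_Latn-ukr_Cyrl" reads like a single language-pair label.

import csv

def write_to_csv(sentence_pairs):
    with open('flores-eng-devtest.csv', 'w', newline='') as csvfile:
        # one fieldname per column, matching the row-dict keys below
        writer = csv.DictWriter(csvfile, fieldnames=["eng_Latn", "ukr_Cyrl"])
        writer.writeheader()
        for eng, ukr in sentence_pairs:
            writer.writerow({"eng_Latn": eng, "ukr_Cyrl": ukr})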


@app.command()
def eval_model_multpl_beams_ready_prep(
    source_file_path: Annotated[str, typer.Option()],
Collaborator

Gosh, some docstrings are needed.

Author

ahahhaah, agree
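
A sketch of the kind of docstring the reviewer is asking for (the wording is illustrative, inferred from the PR title); typer also reuses a command's docstring as its --help text, so it documents the CLI at the same time:

@app.command()
def eval_model_multpl_beams_ready_prep(
    source_file_path: Annotated[str, typer.Option()],
):
    """Prepare source sentences and evaluate multi-beam translations.

    Reads sentences from source_file_path, translates each one with
    several beams, and reports BLEU per beam.
    """
    ...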

)
source_sentences = []
with open(preprocessed_file_path) as f:
    source_sentences = f.readlines()
Collaborator

In [1]: with open("/tmp/foo", "r") as fp_on:
   ...:     lines = fp_on.readlines()
   ...:

In [2]: lines
Out[2]: ['1\n', '2\n', '3\n', '4\n']

You might want to strip newlines.
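
A minimal sketch of the suggested fix, keeping the variable names from the diff above: read() plus splitlines() yields the same lines without the trailing "\n" that readlines() keeps on every element.

with open(preprocessed_file_path) as f:
    # splitlines() drops the line terminators that readlines() preserves
    source_sentences = f.read().splitlines()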

all_prompts.append(translation_prompt)
print(f"Max tokens = {max(all_token_counts)}")
inputs = tokenizer(all_prompts, return_tensors="pt", padding=True)
model.to("cuda")
Collaborator
@proger Feb 23, 2024

Loading the model onto the CPU first and moving it to CUDA later is slower than loading it directly onto CUDA. Check out this patch: 33b3774
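
A hedged sketch of loading directly onto the GPU with transformers (the checkpoint name is a placeholder, and the referenced patch 33b3774 may do this differently):

import torch
from transformers import AutoModelForCausalLM

# device_map places the weights on the GPU as they are loaded, avoiding
# the CPU round-trip of from_pretrained(...) followed by model.to("cuda")
model = AutoModelForCausalLM.from_pretrained(
    "some/translation-model",  # hypothetical checkpoint name
    torch_dtype=torch.float16,
    device_map="cuda",
)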
