Folders and files Name Name Last commit message
Last commit date
parent directory
View all files
dataset: opus
model: transformer
source language(s): deu
target language(s): afb apc ara arq arz
model: transformer
pre-processing: normalization + SentencePiece (spm32k,spm32k)
a sentence initial language token is required in the form of >>id<<
(id = valid target language ID)
download: opus-2020-09-10.zip
test set translations: opus-2020-09-10.test.txt
test set scores: opus-2020-09-10.eval.txt
testset
BLEU
chr-F
Tatoeba-test.deu.ara
17.8
0.486
dataset: opus
model: transformer
source language(s): deu
target language(s): afb apc ara arq arz
model: transformer
pre-processing: normalization + SentencePiece (spm32k,spm32k)
a sentence initial language token is required in the form of >>id<<
(id = valid target language ID)
valid language labels: >>ara<< >>ara_Latn<< >>arq<< >>arz<<
download: opus-2021-02-23.zip
test set translations: opus-2021-02-23.test.txt
test set scores: opus-2021-02-23.eval.txt
testset
BLEU
chr-F
#sent
#words
BP
Tatoeba-test.deu-afb
7.5
0.357
5
24
0.867
Tatoeba-test.deu-apc
11.0
0.225
2
6
1.000
Tatoeba-test.deu-ara
17.8
0.486
1209
6324
0.967
Tatoeba-test.deu-arq
0.0
0.514
1
4
0.717
Tatoeba-test.deu-arz
12.8
0.360
2
6
1.000
opusTCv20210807-2021-10-22.zip
dataset: opusTCv20210807
model: transformer-align
source language(s): deu
target language(s): afb apc ara arq arz
raw source language(s): deu
raw target language(s): afb apc ara arq arz
model: transformer-align
pre-processing: normalization + SentencePiece (spm32k,spm32k)
a sentence initial language token is required in the form of >>id<<
(id = valid target language ID)
valid language labels:
download: opusTCv20210807-2021-10-22.zip
test set translations: opusTCv20210807-2021-10-22.test.txt
test set scores: opusTCv20210807-2021-10-22.eval.txt
testset
BLEU
chr-F
#sent
#words
BP
Tatoeba-test-v2021-08-07.deu-ara
22.9
0.519
1208
6323
1.000
You can’t perform that action at this time.