Automated translation metrics summarized for 334 language directions.
Sources: Source segments come from curated, public parallel corpora and benchmark-style collections, mixed for broad coverage. Neither the originals nor the references are model-generated; both are assembled upstream as parallel text. This dashboard does not redistribute the raw corpora.
Mixed parallel corpora: Parallel segments spanning diverse genres and corpus scales. Each item includes a certified, human-verified reference translation that was not produced by the models being benchmarked.
| Language pair | Ranking | Fluency | COMET | BLEU | sacreBLEU | Len ratio |
|---|---|---|---|---|---|---|
| Modern Standard Arabic → Japanese (AR · العربية الفصحى → JA · 日本語) | 74 | Median 7.03 · Mean 7.05 · 25th pct. 6.71 · 75th pct. 7.43 | Mean 0.827 · 25th pct. 0.821 · Median 0.830 · 75th pct. 0.835 | Mean 0.1 · 25th pct. 0.0 · Median 0.0 · 75th pct. 0.0 | Mean 1.0 · 25th pct. 0.0 · Median 0.0 · 75th pct. 0.7 | Mean 0.124 · 25th pct. 0.118 · Median 0.118 · 75th pct. 0.131 |
| Modern Standard Arabic → Yue Chinese (Cantonese) (AR · العربية الفصحى → YUE · 粵語) | 17 | Median 7.64 · Mean 7.36 · 25th pct. 7.19 · 75th pct. 7.76 | Mean 0.299 · 25th pct. 0.295 · Median 0.301 · 75th pct. 0.303 | Mean 0.1 · 25th pct. 0.0 · Median 0.0 · 75th pct. 0.1 | Mean 0.4 · 25th pct. 0.0 · Median 0.0 · 75th pct. 0.3 | Mean 0.188 · 25th pct. 0.170 · Median 0.178 · 75th pct. 0.210 |
| Modern Standard Arabic → Mandarin Chinese (AR · العربية الفصحى → ZH · 普通话) | 125 | Median 6.45 · Mean 6.23 · 25th pct. 6.07 · 75th pct. 6.58 | Mean 0.709 · 25th pct. 0.701 · Median 0.714 · 75th pct. 0.720 | Mean 0.0 · 25th pct. 0.0 · Median 0.0 · 75th pct. 0.0 | Mean 0.8 · 25th pct. 0.7 · Median 0.7 · 75th pct. 0.7 | Mean 0.036 · 25th pct. 0.033 · Median 0.033 · 75th pct. 0.035 |
| English → Thai (EN → TH · ไทย) | 300 | Median 2.86 · Mean 2.84 · 25th pct. 2.86 · 75th pct. 2.86 | Mean 0.937 · 25th pct. 0.922 · Median 0.925 · 75th pct. 0.956 | Mean 0.0 · 25th pct. 0.0 · Median 0.0 · 75th pct. 0.0 | Mean 0.0 · 25th pct. 0.0 · Median 0.0 · 75th pct. 0.0 | Mean 0.198 · 25th pct. 0.170 · Median 0.170 · 75th pct. 0.198 |
| English → Yue Chinese (Cantonese) (EN → YUE · 粵語) | 281 | Median 4.92 · Mean 4.85 · 25th pct. 4.04 · 75th pct. 5.50 | Mean 0.622 · 25th pct. 0.609 · Median 0.624 · 75th pct. 0.635 | Mean 0.0 · 25th pct. 0.0 · Median 0.0 · 75th pct. 0.0 | Mean 1.2 · 25th pct. 0.3 · Median 1.4 · 75th pct. 1.6 | Mean 0.264 · 25th pct. 0.231 · Median 0.245 · 75th pct. 0.271 |
This release reports translation quality by language pair: medians and spread of automated scores (fluency, COMET, BLEU, sacreBLEU, length ratio) aggregated across evaluated directions. Run metadata describes recipes, metrics, and segment counts.
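For readers who want to reproduce this kind of summary, the sketch below shows one way to reduce a list of scores to the four statistics shown in each table cell (mean, median, 25th and 75th percentiles). It is illustrative only: the dashboard does not state which percentile interpolation it uses or what unit the scores are aggregated over, so the `summarize` helper, the "inclusive" quantile method, and the toy input values are all assumptions rather than the release's actual pipeline.

```python
from statistics import mean, median, quantiles


def summarize(scores):
    """Return mean, median, and quartiles for a list of numeric scores.

    Hypothetical helper: the interpolation method is an assumption,
    not something documented by the dashboard.
    """
    if len(scores) < 2:
        raise ValueError("need at least two scores to estimate quartiles")
    # quantiles(..., n=4) returns the three cut points Q1, Q2, Q3.
    q1, _, q3 = quantiles(scores, n=4, method="inclusive")
    return {
        "mean": mean(scores),
        "median": median(scores),
        "p25": q1,
        "p75": q3,
    }


# Toy scores for illustration; these are not values from the table.
example = summarize([1.0, 2.0, 3.0, 4.0, 5.0])
print(example)
```

Reporting the median and quartiles alongside the mean makes skew visible: when a few scores are very low or very high, the mean drifts away from the median while the quartiles stay put.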