Automated translation metrics summarized for 334 language directions.
Sources — Source segments come from curated, public-facing parallel corpora and benchmark-style collections mixed for broad coverage. They are not model-generated: originals and references are assembled upstream as parallel text only; this dashboard does not redistribute raw corpora.
Mixed parallel corpora — Parallel segments spanning diverse genres and corpus scales. Each item includes a reference translation that is certified and human-verified—not produced by the models being benchmarked.
Language pairi | Rankingi | Fluencyi | COMETi | BLEUi | sacreBLEUi | Len ratioi |
|---|---|---|---|---|---|---|
| Mandarin Chinese → VietnameseZH · Mandarin Chinese · 普通话→VI · Vietnamese · Tiếng Việt | 125 | Mean 2.41 25th pct. 2.26 · Median 2.34 · 75th pct. 2.64 | Mean 0.875 25th pct. 0.869 · Median 0.876 · 75th pct. 0.881 | Mean 37.5 25th pct. 35.6 · Median 37.5 · 75th pct. 39.5 | Mean 39.5 25th pct. 37.7 · Median 39.8 · 75th pct. 41.4 | Mean 50.561 25th pct. 50.194 · Median 50.833 · 75th pct. 51.000 |
| Mandarin Chinese → UzbekZH · Mandarin Chinese · 普通话→UZ · Uzbek · Oʻzbek | 79 | Mean 5.66 25th pct. 5.21 · Median 5.70 · 75th pct. 6.06 | Mean 0.837 25th pct. 0.839 · Median 0.841 · 75th pct. 0.845 | Mean 5.6 25th pct. 4.6 · Median 5.6 · 75th pct. 6.7 | Mean 8.4 25th pct. 7.6 · Median 9.1 · 75th pct. 9.4 | Mean 5.551 25th pct. 5.455 · Median 5.550 · 75th pct. 5.694 |
| Mandarin Chinese → UrduZH · Mandarin Chinese · 普通话→UR · Urdu · اردو | 125 | Mean 5.22 25th pct. 4.28 · Median 5.21 · 75th pct. 6.04 | Mean 0.744 25th pct. 0.728 · Median 0.749 · 75th pct. 0.761 | Mean 10.1 25th pct. 8.8 · Median 10.6 · 75th pct. 12.3 | Mean 12.3 25th pct. 10.7 · Median 13.0 · 75th pct. 14.8 | Mean 6.579 25th pct. 6.425 · Median 6.529 · 75th pct. 6.599 |
| Mandarin Chinese → UkrainianZH · Mandarin Chinese · 普通话→UK · Ukrainian · Українська | 79 | Mean 5.60 25th pct. 5.14 · Median 5.56 · 75th pct. 6.17 | Mean 0.862 25th pct. 0.860 · Median 0.865 · 75th pct. 0.867 | Mean 5.9 25th pct. 5.1 · Median 5.8 · 75th pct. 6.8 | Mean 9.6 25th pct. 8.8 · Median 9.8 · 75th pct. 10.5 | Mean 5.654 25th pct. 5.569 · Median 5.649 · 75th pct. 5.691 |
| Mandarin Chinese → TurkishZH · Mandarin Chinese · 普通话→TR · Turkish · Türkçe | 125 | Mean 2.49 25th pct. 2.38 · Median 2.51 · 75th pct. 2.62 | Mean 0.850 25th pct. 0.846 · Median 0.852 · 75th pct. 0.854 | Mean 8.5 25th pct. 6.9 · Median 8.5 · 75th pct. 9.5 | Mean 10.6 25th pct. 8.8 · Median 11.3 · 75th pct. 12.1 | Mean 27.689 25th pct. 27.306 · Median 27.778 · 75th pct. 28.333 |
This release reports translation quality by language pair: medians and spread of automated scores (fluency, COMET, BLEU, sacreBLEU, length ratio) aggregated across evaluated directions. Run metadata describes recipes, metrics, and segment counts.