Automated translation metrics summarized for 334 language directions.
Sources — Source segments come from curated, public-facing parallel corpora and benchmark-style collections mixed for broad coverage. They are not model-generated: originals and references are assembled upstream as parallel text only; this dashboard does not redistribute raw corpora.
Mixed parallel corpora — Parallel segments spanning diverse genres and corpus scales. Each item includes a reference translation that is certified and human-verified—not produced by the models being benchmarked.
| Language pair | Ranking | Fluency | COMET | BLEU | sacreBLEU | Len ratio |
|---|---|---|---|---|---|---|
| HI · Hindi · हिन्दी → YUE · Yue Chinese (Cantonese) · 粵語 | 256 | Median 5.22 · Mean 5.35 · 25th pct. 4.78 · 75th pct. 6.58 | Mean 0.206 · 25th pct. 0.203 · Median 0.206 · 75th pct. 0.209 | Mean 0.0 · 25th pct. 0.0 · Median 0.0 · 75th pct. 0.0 | Mean 0.2 · 25th pct. 0.0 · Median 0.0 · 75th pct. 0.0 | Mean 0.377 · 25th pct. 0.203 · Median 0.225 · 75th pct. 0.256 |
| AR · Modern Standard Arabic · العربية الفصحى → YUE · Yue Chinese (Cantonese) · 粵語 | 17 | Median 7.64 · Mean 7.36 · 25th pct. 7.19 · 75th pct. 7.76 | Mean 0.299 · 25th pct. 0.295 · Median 0.301 · 75th pct. 0.303 | Mean 0.1 · 25th pct. 0.0 · Median 0.0 · 75th pct. 0.1 | Mean 0.4 · 25th pct. 0.0 · Median 0.0 · 75th pct. 0.3 | Mean 0.188 · 25th pct. 0.170 · Median 0.178 · 75th pct. 0.210 |
| PT · Portuguese · Português → YUE · Yue Chinese (Cantonese) · 粵語 | 123 | Median 6.48 · Mean 6.68 · 25th pct. 6.03 · 75th pct. 7.50 | Mean 0.351 · 25th pct. 0.341 · Median 0.352 · 75th pct. 0.364 | Mean 0.0 · 25th pct. 0.0 · Median 0.0 · 75th pct. 0.0 | Mean 0.0 · 25th pct. 0.0 · Median 0.0 · 75th pct. 0.0 | Mean 0.208 · 25th pct. 0.182 · Median 0.199 · 75th pct. 0.222 |
| RU · Russian · Русский → ARZ · Egyptian Arabic · مصري | 15 | Median 7.66 · Mean 7.46 · 25th pct. 7.12 · 75th pct. 7.84 | Mean 0.460 · 25th pct. 0.446 · Median 0.459 · 75th pct. 0.473 | Mean 0.2 · 25th pct. 0.1 · Median 0.1 · 75th pct. 0.2 | Mean 2.7 · 25th pct. 2.7 · Median 2.7 · 75th pct. 2.8 | Mean 0.892 · 25th pct. 0.859 · Median 0.879 · 75th pct. 0.910 |
| PT · Portuguese · Português → ARZ · Egyptian Arabic · مصري | 56 | Median 7.17 · Mean 7.06 · 25th pct. 6.74 · 75th pct. 7.59 | Mean 0.507 · 25th pct. 0.498 · Median 0.511 · 75th pct. 0.517 | Mean 4.4 · 25th pct. 4.4 · Median 4.5 · 75th pct. 4.6 | Mean 8.6 · 25th pct. 8.1 · Median 8.5 · 75th pct. 8.8 | Mean 0.768 · 25th pct. 0.751 · Median 0.778 · 75th pct. 0.796 |
This release reports translation quality by language pair: the mean, median, and 25th and 75th percentiles of each automated score (fluency, COMET, BLEU, sacreBLEU, length ratio), aggregated across evaluated directions. Run metadata describes recipes, metrics, and segment counts.
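The summary statistics shown for each metric (mean, median, 25th and 75th percentiles) can be reproduced from a list of raw scores with a few lines of Python. This is a minimal sketch, not the dashboard's actual pipeline. The function name `summarize`, the flat-list input, and the choice of percentile interpolation are assumptions, and the dashboard does not state whether the underlying items are segments, runs, or models.

```python
from statistics import mean, median, quantiles


def summarize(scores):
    """Summarize a list of numeric scores as mean, median, and quartiles.

    Hypothetical helper: the dashboard does not document how its
    percentiles are interpolated, so "inclusive" is an assumption.
    """
    if len(scores) < 2:
        raise ValueError("need at least two scores to compute quartiles")
    # quantiles(..., n=4) returns the three quartile cut points;
    # the middle one is the median, so only the outer two are kept.
    p25, _, p75 = quantiles(scores, n=4, method="inclusive")
    return {
        "mean": mean(scores),
        "p25": p25,
        "median": median(scores),
        "p75": p75,
    }


# Values invented for illustration only; they are not taken from the table.
example_scores = [0.20, 0.21, 0.19, 0.22, 0.18]
print(summarize(example_scores))
```

A mean far from the median, as in several length-ratio cells, usually indicates a few outlying items, which is why both are worth reporting alongside the quartiles.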