| Main Authors: | Gladkoff, Serge, Han, Lifeng, Gasova, Katerina |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2511.13467 |
Similar Items
Meta-Evaluation of Translation Evaluation Methods: a systematic up-to-date overview
by: Han, Lifeng, et al.
Published: (2016)
The Multi-Range Theory of Translation Quality Measurement: MQM scoring models and Statistical Quality Control
by: Lommel, Arle, et al.
Published: (2024)
MTUncertainty: Assessing the Need for Post-editing of Machine Translation Outputs by Fine-tuning OpenAI LLMs
by: Gladkoff, Serge, et al.
Published: (2023)
Neural Machine Translation of Clinical Text: An Empirical Investigation into Multilingual Pre-Trained Language Models and Transfer-Learning
by: Han, Lifeng, et al.
Published: (2023)
Ara-HOPE: Human-Centric Post-Editing Evaluation for Dialectal Arabic to Modern Standard Arabic Translation
by: Alabdullah, Abdullah, et al.
Published: (2025)
Advancing Dialectal Arabic to Modern Standard Arabic Machine Translation
by: Alabdullah, Abdullah, et al.
Published: (2025)
CANTONMT: Investigating Back-Translation and Model-Switch Mechanisms for Cantonese-English Neural Machine Translation
by: Hong, Kung Yin, et al.
Published: (2024)
An Empirical Study on Chinese Character Decomposition in Multiword Expression-Aware Neural Machine Translation
by: Han, Lifeng, et al.
Published: (2025)
MMTE: Corpus and Metrics for Evaluating Machine Translation Quality of Metaphorical Language
by: Wang, Shun, et al.
Published: (2024)
PEAR: Pairwise Evaluation for Automatic Relative Scoring in Machine Translation
by: Proietti, Lorenzo, et al.
Published: (2026)
CantonMT: Cantonese to English NMT Platform with Fine-Tuned Models Using Synthetic Back-Translation Data
by: Hong, Kung Yin, et al.
Published: (2024)
Beyond Scalar Scores: Reinforcement Learning for Error-Aware Quality Estimation of Machine Translation
by: Sindhujan, Archchana, et al.
Published: (2026)
Translation or Recitation? Calibrating Evaluation Scores for Machine Translation of Extremely Low-Resource Languages
by: Chen, Danlu, et al.
Published: (2026)
ROC Analysis for Evaluating Translation Quality Estimation Systems
by: Garland, Evelyn Y., et al.
Published: (2026)
SpeechQE: Estimating the Quality of Direct Speech Translation
by: Han, HyoJung, et al.
Published: (2024)
Audio-Based Crowd-Sourced Evaluation of Machine Translation Quality
by: Haq, Sami Ul, et al.
Published: (2025)
Improving Non-autoregressive Translation Quality with Pretrained Language Model, Embedding Distillation and Upsampling Strategy for CTC
by: Syu, Shen-sian, et al.
Published: (2023)
Filtered Reasoning Score: Evaluating Reasoning Quality on a Model's Most-Confident Traces
by: Pathak, Manas, et al.
Published: (2026)
Should I Share this Translation? Evaluating Quality Feedback for User Reliance on Machine Translation
by: Ki, Dayeon, et al.
Published: (2025)
On Non-interactive Evaluation of Animal Communication Translators
by: Paradise, Orr, et al.
Published: (2025)
AutoLLM-CARD: Towards a Description and Landscape of Large Language Models
by: Tian, Shengwei, et al.
Published: (2024)
The Paradox of Poetic Intent in Back-Translation: Evaluating the Quality of Large Language Models in Chinese Translation
by: Weigang, Li, et al.
Published: (2025)
Hypencoder Revisited: Reproducibility and Analysis of Non-Linear Scoring for First-Stage Retrieval
by: Eichholtz, Arne, et al.
Published: (2026)
INSIGHTBUDDY-AI: Medication Extraction and Entity Linking using Large Language Models and Ensemble Learning
by: Romero, Pablo, et al.
Published: (2024)
On Temperature-Constrained Non-Deterministic Machine Translation: Potential and Evaluation
by: Wang, Weichuan, et al.
Published: (2026)
ContrastScore: Towards Higher Quality, Less Biased, More Efficient Evaluation Metrics with Contrastive Evaluation
by: Wang, Xiao, et al.
Published: (2025)
Lost in the Source Language: How Large Language Models Evaluate the Quality of Machine Translation
by: Huang, Xu, et al.
Published: (2024)
GPT-4 vs. Human Translators: A Comprehensive Evaluation of Translation Quality Across Languages, Domains, and Expertise Levels
by: Yan, Jianhao, et al.
Published: (2024)
Beyond Literal Mapping: Benchmarking and Improving Non-Literal Translation Evaluation
by: Tian, Yanzhi, et al.
Published: (2026)
Exploring Fact Memorization and Style Imitation in LLMs Using QLoRA: An Experimental Study and Quality Assessment Methods
by: Vyborov, Eugene, et al.
Published: (2024)
Large Language Models as Annotators for Machine Translation Quality Estimation
by: Wang, Sidi, et al.
Published: (2026)
Beyond Holistic Scores: Automatic Trait-Based Quality Scoring of Argumentative Essays
by: Favero, Lucile, et al.
Published: (2026)
Quality-Aware Translation Models: Efficient Generation and Quality Estimation in a Single Model
by: Tomani, Christian, et al.
Published: (2023)
Exploration of Masked and Causal Language Modelling for Text Generation
by: Micheletti, Nicolo, et al.
Published: (2024)
Translating Step-by-Step: Decomposing the Translation Process for Improved Translation Quality of Long-Form Texts
by: Briakou, Eleftheria, et al.
Published: (2024)
Life Cycle-Aware Evaluation of Knowledge Distillation for Machine Translation: Environmental Impact and Translation Quality Trade-offs
by: Attieh, Joseph, et al.
Published: (2026)
Evaluating Language Translation Models by Playing Telephone
by: Saba, Syeda Jannatus, et al.
Published: (2025)
GAMBIT+: A Challenge Set for Evaluating Gender Bias in Machine Translation Quality Estimation Metrics
by: Filandrianos, Giorgos, et al.
Published: (2025)
Beyond Human-Only: Evaluating Human-Machine Collaboration for Collecting High-Quality Translation Data
by: Liu, Zhongtao, et al.
Published: (2024)
MaLei at MultiClinSUM: Summarisation of Clinical Documents using Perspective-Aware Iterative Self-Prompting with LLMs
by: Ren, Libo, et al.
Published: (2025)