Saved in:
| Main Authors: | Chen, Yanran, Zhao, Wei, Breitbarth, Anne, Stoeckel, Manuel, Mehler, Alexander, Eger, Steffen |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2402.11549 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
How Good Are LLMs for Literary Translation, Really? Literary Translation Evaluation with Humans and LLMs
by: Zhang, Ran, et al.
Published: (2024)
by: Zhang, Ran, et al.
Published: (2024)
Do Emotions Really Affect Argument Convincingness? A Dynamic Approach with LLM-based Manipulation Checks
by: Chen, Yanran, et al.
Published: (2025)
by: Chen, Yanran, et al.
Published: (2025)
LiTransProQA: an LLM-based Literary Translation evaluation metric with Professional Question Answering
by: Zhang, Ran, et al.
Published: (2025)
by: Zhang, Ran, et al.
Published: (2025)
Is there really a Citation Age Bias in NLP?
by: Nguyen, Hoa, et al.
Published: (2024)
by: Nguyen, Hoa, et al.
Published: (2024)
TikZilla: Scaling Text-to-TikZ with High-Quality Data and Reinforcement Learning
by: Greisinger, Christian, et al.
Published: (2026)
by: Greisinger, Christian, et al.
Published: (2026)
Evaluating Large Language Models for Structured Science Summarization in the Open Research Knowledge Graph
by: Nechakhin, Vladyslav, et al.
Published: (2024)
by: Nechakhin, Vladyslav, et al.
Published: (2024)
NLLG Quarterly arXiv Report 09/24: What are the most influential current AI Papers?
by: Leiter, Christoph, et al.
Published: (2024)
by: Leiter, Christoph, et al.
Published: (2024)
GLoRIA: Gated Low-Rank Interpretable Adaptation for Dialectal ASR
by: Mehralian, Pouya, et al.
Published: (2026)
by: Mehralian, Pouya, et al.
Published: (2026)
Beyond Reproduction: A Paired-Task Framework for Assessing LLM Comprehension and Creativity in Literary Translation
by: Zhang, Ran, et al.
Published: (2026)
by: Zhang, Ran, et al.
Published: (2026)
Syntactic Evolution in Language Usage
by: Kumar, Surbhit
Published: (2025)
by: Kumar, Surbhit
Published: (2025)
A Systematic Study of Compositional Syntactic Transformer Language Models
by: Zhao, Yida, et al.
Published: (2025)
by: Zhao, Yida, et al.
Published: (2025)
LLLMs: A Data-Driven Survey of Evolving Research on Limitations of Large Language Models
by: Kostikova, Aida, et al.
Published: (2025)
by: Kostikova, Aida, et al.
Published: (2025)
Evaluating Diversity in Automatic Poetry Generation
by: Chen, Yanran, et al.
Published: (2024)
by: Chen, Yanran, et al.
Published: (2024)
Who Annotates in NLP? A Large-scale Assessment of Human Annotation Reporting between 2018 and 2025
by: Kunilovskaya, Maria, et al.
Published: (2026)
by: Kunilovskaya, Maria, et al.
Published: (2026)
Generative Pretrained Structured Transformers: Unsupervised Syntactic Language Models at Scale
by: Hu, Xiang, et al.
Published: (2024)
by: Hu, Xiang, et al.
Published: (2024)
Understanding or Memorizing? A Case Study of German Definite Articles in Language Models
by: Drechsel, Jonathan, et al.
Published: (2026)
by: Drechsel, Jonathan, et al.
Published: (2026)
Prototypicality Bias Reveals Blindspots in Multimodal Evaluation Metrics
by: Roy, Subhadeep, et al.
Published: (2026)
by: Roy, Subhadeep, et al.
Published: (2026)
BMX: Boosting Natural Language Generation Metrics with Explainability
by: Leiter, Christoph, et al.
Published: (2022)
by: Leiter, Christoph, et al.
Published: (2022)
Same Meaning, Different Scores: Lexical and Syntactic Sensitivity in LLM Evaluation
by: Kostić, Bogdan, et al.
Published: (2026)
by: Kostić, Bogdan, et al.
Published: (2026)
Syntactic Control of Language Models by Posterior Inference
by: Xefteri, Vicky, et al.
Published: (2025)
by: Xefteri, Vicky, et al.
Published: (2025)
S$^4$C: Speculative Sampling with Syntactic and Semantic Coherence for Efficient Inference of Large Language Models
by: He, Tao, et al.
Published: (2025)
by: He, Tao, et al.
Published: (2025)
Evaluating Semantic and Syntactic Understanding in Large Language Models for Payroll Systems
by: Maclean, Hendrika, et al.
Published: (2026)
by: Maclean, Hendrika, et al.
Published: (2026)
ScImage: How Good Are Multimodal Large Language Models at Scientific Text-to-Image Generation?
by: Zhang, Leixin, et al.
Published: (2024)
by: Zhang, Leixin, et al.
Published: (2024)
USCORE: An Effective Approach to Fully Unsupervised Evaluation Metrics for Machine Translation
by: Belouadi, Jonas, et al.
Published: (2022)
by: Belouadi, Jonas, et al.
Published: (2022)
Llama-GENBA-10B: A Trilingual Large Language Model for German, English and Bavarian
by: Hoffmann, Michael, et al.
Published: (2025)
by: Hoffmann, Michael, et al.
Published: (2025)
Automatic Prediction of the Performance of Every Parser
by: Biçici, Ergun
Published: (2024)
by: Biçici, Ergun
Published: (2024)
Emotionally Charged, Logically Blurred: AI-driven Emotional Framing Impairs Human Fallacy Detection
by: Chen, Yanran, et al.
Published: (2025)
by: Chen, Yanran, et al.
Published: (2025)
PromptOptMe: Error-Aware Prompt Compression for LLM-based MT Evaluation Metrics
by: Larionov, Daniil, et al.
Published: (2024)
by: Larionov, Daniil, et al.
Published: (2024)
LLMs Underperform Graph-Based Parsers on Supervised Relation Extraction for Complex Graphs
by: Gajo, Paolo, et al.
Published: (2026)
by: Gajo, Paolo, et al.
Published: (2026)
KELPS: A Framework for Verified Multi-Language Autoformalization via Semantic-Syntactic Alignment
by: Zhang, Jiyao, et al.
Published: (2025)
by: Zhang, Jiyao, et al.
Published: (2025)
The Grammar of Transformers: A Systematic Review of Interpretability Research on Syntactic Knowledge in Language Models
by: Graichen, Nora, et al.
Published: (2026)
by: Graichen, Nora, et al.
Published: (2026)
Syntactic and Semantic Control of Large Language Models via Sequential Monte Carlo
by: Loula, João, et al.
Published: (2025)
by: Loula, João, et al.
Published: (2025)
Classification of Human- and AI-Generated Texts for English, French, German, and Spanish
by: Schaaff, Kristina, et al.
Published: (2023)
by: Schaaff, Kristina, et al.
Published: (2023)
TreePrompt: Leveraging Hierarchical Few-Shot Example Selection for Improved English-Persian and English-German Translation
by: Kakavand, Ramtin, et al.
Published: (2025)
by: Kakavand, Ramtin, et al.
Published: (2025)
SparkUI-Parser: Enhancing GUI Perception with Robust Grounding and Parsing
by: Jing, Hongyi, et al.
Published: (2025)
by: Jing, Hongyi, et al.
Published: (2025)
An Analysis on Automated Metrics for Evaluating Japanese-English Chat Translation
by: Rusli, Andre, et al.
Published: (2024)
by: Rusli, Andre, et al.
Published: (2024)
Negation Triplet Extraction with Syntactic Dependency and Semantic Consistency
by: Shi, Yuchen, et al.
Published: (2024)
by: Shi, Yuchen, et al.
Published: (2024)
Infinity Parser: Layout Aware Reinforcement Learning for Scanned Document Parsing
by: Wang, Baode, et al.
Published: (2025)
by: Wang, Baode, et al.
Published: (2025)
OTESGN: Optimal Transport-Enhanced Syntactic-Semantic Graph Networks for Aspect-Based Sentiment Analysis
by: Liao, Xinfeng, et al.
Published: (2025)
by: Liao, Xinfeng, et al.
Published: (2025)
The Role of Syntactic Span Preferences in Post-Hoc Explanation Disagreement
by: Kamp, Jonathan, et al.
Published: (2024)
by: Kamp, Jonathan, et al.
Published: (2024)
Similar Items
-
How Good Are LLMs for Literary Translation, Really? Literary Translation Evaluation with Humans and LLMs
by: Zhang, Ran, et al.
Published: (2024) -
Do Emotions Really Affect Argument Convincingness? A Dynamic Approach with LLM-based Manipulation Checks
by: Chen, Yanran, et al.
Published: (2025) -
LiTransProQA: an LLM-based Literary Translation evaluation metric with Professional Question Answering
by: Zhang, Ran, et al.
Published: (2025) -
Is there really a Citation Age Bias in NLP?
by: Nguyen, Hoa, et al.
Published: (2024) -
TikZilla: Scaling Text-to-TikZ with High-Quality Data and Reinforcement Learning
by: Greisinger, Christian, et al.
Published: (2026)