| Main Authors: | Holtermann, Carolin, Bui, Minh Duc, Zhou, Kaitlyn, Hofmann, Valentin, von der Wense, Katharina, Lauscher, Anne |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Online Access: | https://arxiv.org/abs/2603.22260 |
Similar Items
Large Language Models Discriminate Against Speakers of German Dialects
by: Bui, Minh Duc, et al.
Published: (2025)
Multi3Hate: Multimodal, Multilingual, and Multicultural Hate Speech Detection with Vision-Language Models
by: Bui, Minh Duc, et al.
Published: (2024)
The Trade-off between Performance, Efficiency, and Fairness in Adapter Modules for Text Classification
by: Bui, Minh Duc, et al.
Published: (2024)
Around the World in 24 Hours: Probing LLM Knowledge of Time and Place
by: Holtermann, Carolin, et al.
Published: (2025)
Mind the Gap: A Closer Look at Tokenization for Multiple-Choice Question Answering with LLMs
by: Sanz-Guerrero, Mario, et al.
Published: (2025)
SoS: Analysis of Surface over Semantics in Multilingual Text-To-Image Generation
by: Holtermann, Carolin, et al.
Published: (2026)
TempViz: On the Evaluation of Temporal Knowledge in Text-to-Image Models
by: Holtermann, Carolin, et al.
Published: (2026)
GIMMICK -- Globally Inclusive Multimodal Multitask Cultural Knowledge Benchmarking
by: Schneider, Florian, et al.
Published: (2025)
Meenz bleibt Meenz, but Large Language Models Do Not Speak Its Dialect
by: Bui, Minh Duc, et al.
Published: (2026)
Knowledge Distillation vs. Pretraining from Scratch under a Fixed (Computation) Budget
by: Bui, Minh Duc, et al.
Published: (2024)
What the Weight?! A Unified Framework for Zero-Shot Knowledge Composition
by: Holtermann, Carolin, et al.
Published: (2024)
Evaluating the Elementary Multilingual Capabilities of Large Language Models with MultiQ
by: Holtermann, Carolin, et al.
Published: (2024)
JGU Mainz's Submission to the WMT25 Shared Task on LLMs with Limited Resources for Slavic Languages: MT and QA
by: Saadi, Hossain Shaikh, et al.
Published: (2025)
From If-Statements to ML Pipelines: Revisiting Bias in Code-Generation
by: Bui, Minh Duc, et al.
Published: (2026)
On Generalization across Measurement Systems: LLMs Entail More Test-Time Compute for Underrepresented Cultures
by: Bui, Minh Duc, et al.
Published: (2025)
ScaLearn: Simple and Highly Parameter-Efficient Task Transfer by Learning to Scale
by: Frohmann, Markus, et al.
Published: (2023)
NALA_MAINZ at BLP-2025 Task 2: A Multi-agent Approach for Bangla Instruction to Python Code Generation
by: Saadi, Hossain Shaikh, et al.
Published: (2025)
Improving Low-Resource Morphological Inflection via Self-Supervised Objectives
by: Wiemerslage, Adam, et al.
Published: (2025)
Centurio: On Drivers of Multilingual Ability of Large Vision-Language Model
by: Geigle, Gregor, et al.
Published: (2025)
Mitigating Label Length Bias in Large Language Models
by: Sanz-Guerrero, Mario, et al.
Published: (2025)
Corrective In-Context Learning: Evaluating Self-Correction in Large Language Models
by: Sanz-Guerrero, Mario, et al.
Published: (2025)
Interdisciplinary Research in Conversation: A Case Study in Computational Morphology for Language Documentation
by: Rice, Enora, et al.
Published: (2025)
Model-Based Ranking of Source Languages for Zero-Shot Cross-Lingual Transfer
by: Ebrahimi, Abteen, et al.
Published: (2025)
Desiderata for the Context Use of Question Answering Systems
by: Shaier, Sagi, et al.
Published: (2024)
It Is Not About What You Say, It Is About How You Say It: A Surprisingly Simple Approach for Improving Reading Comprehension
by: Shaier, Sagi, et al.
Published: (2024)
Asking Again and Again: Exploring LLM Robustness to Repeated Questions
by: Shaier, Sagi, et al.
Published: (2024)
CLIX: Cross-Lingual Explanations of Idiomatic Expressions
by: Gluck, Aaron, et al.
Published: (2025)
Who Are All The Stochastic Parrots Imitating? They Should Tell Us!
by: Shaier, Sagi, et al.
Published: (2023)
Why do LLaVA Vision-Language Models Reply to Images in English?
by: Hinck, Musashi, et al.
Published: (2024)
Comparing Template-based and Template-free Language Model Probing
by: Shaier, Sagi, et al.
Published: (2024)
Untangling the Influence of Typology, Data and Model Architecture on Ranking Transfer Languages for Cross-Lingual POS Tagging
by: Rice, Enora, et al.
Published: (2025)
TAMS: Translation-Assisted Morphological Segmentation
by: Rice, Enora, et al.
Published: (2024)
From Priest to Doctor: Domain Adaptation for Low-Resource Neural Machine Translation
by: Marashian, Ali, et al.
Published: (2024)
Measuring Contextual Informativeness in Child-Directed Text
by: Valentini, Maria, et al.
Published: (2024)
Lost in the Middle, and In-Between: Enhancing Language Models' Ability to Reason Over Long Contexts in Multi-Hop QA
by: Baker, George Arthur, et al.
Published: (2024)
MALAMUTE: A Multilingual, Highly-granular, Template-free, Education-based Probing Dataset
by: Shaier, Sagi, et al.
Published: (2024)
GRUFF: LLM Pronoun Fidelity, Reasoning, and Biases in German
by: Mewes, Fabian, et al.
Published: (2026)
Detecting Hallucinations in Authentic LLM-Human Interactions
by: Ren, Yujie, et al.
Published: (2025)
Building Bridges: A Dataset for Evaluating Gender-Fair Machine Translation into German
by: Lardelli, Manuel, et al.
Published: (2024)
The Echoes of Multilinguality: Tracing Cultural Value Shifts during LM Fine-tuning
by: Choenni, Rochelle, et al.
Published: (2024)