| Main Authors: | Lafargue, Valentin, Guerra-Adames, Ariel, Claeys, Emmanuelle, Vuichard, Elouan, Loubes, Jean-Michel |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Online Access: | https://arxiv.org/abs/2603.16749 |
Similar Items
Exposing the Illusion of Fairness: Auditing Vulnerabilities to Distributional Manipulation Attacks
by: Lafargue, Valentin, et al.
Published: (2025)
Fairness is in the details: Face Dataset Auditing
by: Lafargue, Valentin, et al.
Published: (2025)
Buzz, Choose, Forget: A Meta-Bandit Framework for Bee-Like Decision Making
by: Claeys, Emmanuelle, et al.
Published: (2025)
A Graph Signal Processing Framework for Hallucination Detection in Large Language Models
by: Noël, Valentin
Published: (2025)
Latent Performance Profiling of Large Language Models
by: Chakraborty, Tanmoy, et al.
Published: (2026)
Temporally Consistent Factuality Probing for Large Language Models
by: Bajpai, Ashutosh, et al.
Published: (2024)
Signal and Noise: A Framework for Reducing Uncertainty in Language Model Evaluation
by: Heineman, David, et al.
Published: (2025)
CultureLLM: Incorporating Cultural Differences into Large Language Models
by: Li, Cheng, et al.
Published: (2024)
Cultural Alignment in Large Language Models: An Explanatory Analysis Based on Hofstede's Cultural Dimensions
by: Masoud, Reem I., et al.
Published: (2023)
Model-Agnostic Fairness Regularization for GNNs with Incomplete Sensitive Information
by: Kejani, Mahdi Tavassoli, et al.
Published: (2025)
Attention Sinks as Internal Signals for Hallucination Detection in Large Language Models
by: Binkowski, Jakub, et al.
Published: (2026)
Efficient Knowledge Probing of Large Language Models by Adapting Pre-trained Embeddings
by: Sharma, Kartik, et al.
Published: (2025)
Language Models Entangle Language and Culture
by: Jain, Shourya, et al.
Published: (2026)
Can Large Language Models Generalize Procedures Across Representations?
by: Lin, Fangru, et al.
Published: (2026)
Derivational Morphology Reveals Analogical Generalization in Large Language Models
by: Hofmann, Valentin, et al.
Published: (2024)
Not All Denoising Steps Are Equal: Model Scheduling for Faster Masked Diffusion Language Models
by: Sedykh, Ivan, et al.
Published: (2026)
Dynamic Evaluation of Large Language Models by Meta Probing Agents
by: Zhu, Kaijie, et al.
Published: (2024)
Probing the Decision Boundaries of In-context Learning in Large Language Models
by: Zhao, Siyan, et al.
Published: (2024)
Exploring Multilingual Probing in Large Language Models: A Cross-Language Analysis
by: Li, Daoyang, et al.
Published: (2024)
Trust in One Round: Confidence Estimation for Large Language Models via Structural Signals
by: Yang, Pengyue, et al.
Published: (2026)
Language over Content: Tracing Cultural Understanding in Multilingual Large Language Models
by: Cho, Seungho, et al.
Published: (2025)
Who Endorsed It? Measuring Authority Bias Across Expertise Levels in Language Models
by: Mammen, Priyanka Mary, et al.
Published: (2026)
Assessing Dialect Fairness and Robustness of Large Language Models in Reasoning Tasks
by: Lin, Fangru, et al.
Published: (2024)
DiJiang: Efficient Large Language Models through Compact Kernelization
by: Chen, Hanting, et al.
Published: (2024)
Rethinking Interpretability in the Era of Large Language Models
by: Singh, Chandan, et al.
Published: (2024)
FacLens: Transferable Probe for Foreseeing Non-Factuality in Fact-Seeking Question Answering of Large Language Models
by: Wang, Yanling, et al.
Published: (2024)
Synthetic Data Generation in Low-Resource Settings via Fine-Tuning of Large Language Models
by: Kaddour, Jean, et al.
Published: (2023)
Talking Nonsense: Probing Large Language Models' Understanding of Adversarial Gibberish Inputs
by: Cherepanova, Valeriia, et al.
Published: (2024)
Graph-enhanced Large Language Models in Asynchronous Plan Reasoning
by: Lin, Fangru, et al.
Published: (2024)
P^3SUM: Preserving Author's Perspective in News Summarization with Diffusion Language Models
by: Liu, Yuhan, et al.
Published: (2023)
Markov Constraint as Large Language Model Surrogate
by: Bonlarron, Alexandre, et al.
Published: (2024)
Probe Pruning: Accelerating LLMs through Dynamic Pruning via Model-Probing
by: Le, Qi, et al.
Published: (2025)
Evaluating Black-Box Vulnerabilities with Wasserstein-Constrained Data Perturbations
by: Monteiro, Adriana Laurindo, et al.
Published: (2026)
Having Beer after Prayer? Measuring Cultural Bias in Large Language Models
by: Naous, Tarek, et al.
Published: (2023)
Comparing Template-based and Template-free Language Model Probing
by: Shaier, Sagi, et al.
Published: (2024)
Monitoring Latent World States in Language Models with Propositional Probes
by: Feng, Jiahai, et al.
Published: (2024)
CALM: Culturally Self-Aware Language Models
by: Shen, Lingzhi, et al.
Published: (2026)
KGLens: Towards Efficient and Effective Knowledge Probing of Large Language Models with Knowledge Graphs
by: Zheng, Shangshang, et al.
Published: (2023)
Probing the Robustness of Large Language Models Safety to Latent Perturbations
by: Gu, Tianle, et al.
Published: (2025)
A Nurse is Blue and Elephant is Rugby: Cross Domain Alignment in Large Language Models Reveal Human-like Patterns
by: Yehudai, Asaf, et al.
Published: (2024)