| Main Authors: | Zhang, Yuhan, Wang, Erxiao, Shain, Cory |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2511.14642 |
Similar Items
Perturbation: A simple and efficient adversarial tracer for representation learning in language models
by: Rozner, Joshua, et al.
Published: (2026)
BabyLM's First Constructions: Causal probing provides a signal of learning
by: Rozner, Joshua, et al.
Published: (2025)
Constructions are Revealed in Word Distributions
by: Rozner, Joshua, et al.
Published: (2025)
Artificial Aphasias in Lesioned Language Models
by: Roll, Nathan, et al.
Published: (2026)
Independent-Component-Based Encoding Models of Brain Activity During Story Comprehension
by: Hari, Kamya, et al.
Published: (2026)
The Text Aphasia Battery (TAB): A Clinically-Grounded Benchmark for Aphasia-Like Deficits in Language Models
by: Roll, Nathan, et al.
Published: (2025)
Controllable and explainable personality sliders for LLMs at inference time
by: Hoppe, Florian, et al.
Published: (2026)
"They parted illusions -- they parted disclaim marinade": Misalignment as structural fidelity in LLMs
by: Costa, Mariana Lins
Published: (2025)
EmoDebt: Bayesian-Optimized Emotional Intelligence for Strategic Agent-to-Agent Debt Recovery
by: Long, Yunbo, et al.
Published: (2025)
The illusion of a perfect metric: Why evaluating AI's words is harder than it looks
by: Oliva, Maria Paz, et al.
Published: (2025)
"Be My Cheese?": Assessing Cultural Nuance in Multilingual LLM Translations
by: Van Doren, Madison, et al.
Published: (2025)
Grammaticality illusion or ambiguous interpretation? Event-related potentials reveal the nature of the missing-NP effect in Mandarin centre-embedded structures
by: Yang, Qihang, et al.
Published: (2024)
Translationese-index: Using Likelihood Ratios for Graded and Generalizable Measurement of Translationese
by: Liu, Yikang, et al.
Published: (2025)
Quantifying and Predicting Disagreement in Graded Human Ratings
by: Zhang, Leixin, et al.
Published: (2026)
What can LLMs tell us about the mechanisms behind polarity illusions in humans? Experiments across model scales and training steps
by: Paape, Dario
Published: (2026)
A comparative study of zero-shot inference with large language models and supervised modeling in breast cancer pathology classification
by: Sushil, Madhumita, et al.
Published: (2024)
Application of integrated gradients explainability to sociopsychological semantic markers
by: Aghababaei, Ali, et al.
Published: (2025)
Can Language Models Be Tricked by Language Illusions? Easier with Syntax, Harder with Semantics
by: Zhang, Yuhan, et al.
Published: (2023)
The Mask of Civility: Benchmarking Chinese Mock Politeness Comprehension in Large Language Models
by: Zhang, Yitong, et al.
Published: (2026)
Verbalizing LLMs' assumptions to explain and control sycophancy
by: Cheng, Myra, et al.
Published: (2026)
Estimating LLM Grading Ability and Response Difficulty in Automatic Short Answer Grading via Item Response Theory
by: Cong, Longwei, et al.
Published: (2026)
An explainable transformer circuit for compositional generalization
by: Tang, Cheng, et al.
Published: (2025)
Dovetail: A CPU/GPU Heterogeneous Speculative Decoding for LLM inference
by: Zhang, Libo, et al.
Published: (2024)
Less is more: Probabilistic reduction is best explained by small-scale predictability measures
by: Jacobs, Cassandra L., et al.
Published: (2025)
Hey AI Can You Grade My Essay?: Automatic Essay Grading
by: Maliha, Maisha, et al.
Published: (2024)
Grading Scale Impact on LLM-as-a-Judge: Human-LLM Alignment Is Highest on 0-5 Grading Scale
by: Li, Weiyue, et al.
Published: (2026)
User Feedback in Human-LLM Dialogues: A Lens to Understand Users But Noisy as a Learning Signal
by: Liu, Yuhan, et al.
Published: (2025)
FIRP: Faster LLM inference via future intermediate representation prediction
by: Wu, Pengfei, et al.
Published: (2024)
Pragmatic inference of scalar implicature by LLMs
by: Cho, Ye-eun, et al.
Published: (2024)
MetaGreen: Meta-Learning Inspired Transformer Selection for Green Semantic Communication
by: Mukherjee, Shubhabrata, et al.
Published: (2024)
LLM-based Automated Grading with Human-in-the-Loop
by: Chu, Yucheng, et al.
Published: (2025)
Grade Guard: A Smart System for Short Answer Automated Grading
by: Dadu, Niharika, et al.
Published: (2025)
Can human clinical rationales improve the performance and explainability of clinical text classification models?
by: Metzner, Christoph, et al.
Published: (2025)
Cultural evolution via iterated learning and communication explains efficient color naming systems
by: Carlsson, Emil, et al.
Published: (2023)
Archimedes-AUEB at SemEval-2024 Task 5: LLM explains Civil Procedure
by: Chlapanis, Odysseas S., et al.
Published: (2024)
EssayCBM: Rubric-Aligned Concept Bottleneck Models for Transparent Essay Grading
by: Chaudhary, Kumar Satvik, et al.
Published: (2025)
Classroom AI: Large Language Models as Grade-Specific Teachers
by: Oh, Jio, et al.
Published: (2026)
Clickbait detection: quick inference with maximum impact
by: Kuntur, Soveatin, et al.
Published: (2026)
Confidence Estimation in Automatic Short Answer Grading with LLMs
by: Cong, Longwei, et al.
Published: (2026)
Automated Long Answer Grading with RiceChem Dataset
by: Sonkar, Shashank, et al.
Published: (2024)