:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Zhang, Yuhan, Wang, Erxiao, Shain, Cory
Format:	Preprint
Published:	2025
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2511.14642
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Perturbation: A simple and efficient adversarial tracer for representation learning in language models
by: Rozner, Joshua, et al.
Published: (2026)

BabyLM's First Constructions: Causal probing provides a signal of learning
by: Rozner, Joshua, et al.
Published: (2025)

Constructions are Revealed in Word Distributions
by: Rozner, Joshua, et al.
Published: (2025)

Artificial Aphasias in Lesioned Language Models
by: Roll, Nathan, et al.
Published: (2026)

Independent-Component-Based Encoding Models of Brain Activity During Story Comprehension
by: Hari, Kamya, et al.
Published: (2026)

The Text Aphasia Battery (TAB): A Clinically-Grounded Benchmark for Aphasia-Like Deficits in Language Models
by: Roll, Nathan, et al.
Published: (2025)

Controllable and explainable personality sliders for LLMs at inference time
by: Hoppe, Florian, et al.
Published: (2026)

"They parted illusions -- they parted disclaim marinade": Misalignment as structural fidelity in LLMs
by: Costa, Mariana Lins
Published: (2025)

EmoDebt: Bayesian-Optimized Emotional Intelligence for Strategic Agent-to-Agent Debt Recovery
by: Long, Yunbo, et al.
Published: (2025)

The illusion of a perfect metric: Why evaluating AI's words is harder than it looks
by: Oliva, Maria Paz, et al.
Published: (2025)

"Be My Cheese?": Assessing Cultural Nuance in Multilingual LLM Translations
by: Van Doren, Madison, et al.
Published: (2025)

Grammaticality illusion or ambiguous interpretation? Event-related potentials reveal the nature of the missing-NP effect in Mandarin centre-embedded structures
by: Yang, Qihang, et al.
Published: (2024)

Translationese-index: Using Likelihood Ratios for Graded and Generalizable Measurement of Translationese
by: Liu, Yikang, et al.
Published: (2025)

Quantifying and Predicting Disagreement in Graded Human Ratings
by: Zhang, Leixin, et al.
Published: (2026)

What can LLMs tell us about the mechanisms behind polarity illusions in humans? Experiments across model scales and training steps
by: Paape, Dario
Published: (2026)

A comparative study of zero-shot inference with large language models and supervised modeling in breast cancer pathology classification
by: Sushil, Madhumita, et al.
Published: (2024)

Application of integrated gradients explainability to sociopsychological semantic markers
by: Aghababaei, Ali, et al.
Published: (2025)

Can Language Models Be Tricked by Language Illusions? Easier with Syntax, Harder with Semantics
by: Zhang, Yuhan, et al.
Published: (2023)

The Mask of Civility: Benchmarking Chinese Mock Politeness Comprehension in Large Language Models
by: Zhang, Yitong, et al.
Published: (2026)

Verbalizing LLMs' assumptions to explain and control sycophancy
by: Cheng, Myra, et al.
Published: (2026)

Estimating LLM Grading Ability and Response Difficulty in Automatic Short Answer Grading via Item Response Theory
by: Cong, Longwei, et al.
Published: (2026)

An explainable transformer circuit for compositional generalization
by: Tang, Cheng, et al.
Published: (2025)

Dovetail: A CPU/GPU Heterogeneous Speculative Decoding for LLM inference
by: Zhang, Libo, et al.
Published: (2024)

Less is more: Probabilistic reduction is best explained by small-scale predictability measures
by: Jacobs, Cassandra L., et al.
Published: (2025)

Hey AI Can You Grade My Essay?: Automatic Essay Grading
by: Maliha, Maisha, et al.
Published: (2024)

Grading Scale Impact on LLM-as-a-Judge: Human-LLM Alignment Is Highest on 0-5 Grading Scale
by: Li, Weiyue, et al.
Published: (2026)

User Feedback in Human-LLM Dialogues: A Lens to Understand Users But Noisy as a Learning Signal
by: Liu, Yuhan, et al.
Published: (2025)

FIRP: Faster LLM inference via future intermediate representation prediction
by: Wu, Pengfei, et al.
Published: (2024)

Pragmatic inference of scalar implicature by LLMs
by: Cho, Ye-eun, et al.
Published: (2024)

MetaGreen: Meta-Learning Inspired Transformer Selection for Green Semantic Communication
by: Mukherjee, Shubhabrata, et al.
Published: (2024)

LLM-based Automated Grading with Human-in-the-Loop
by: Chu, Yucheng, et al.
Published: (2025)

Grade Guard: A Smart System for Short Answer Automated Grading
by: Dadu, Niharika, et al.
Published: (2025)

Can human clinical rationales improve the performance and explainability of clinical text classification models?
by: Metzner, Christoph, et al.
Published: (2025)

Cultural evolution via iterated learning and communication explains efficient color naming systems
by: Carlsson, Emil, et al.
Published: (2023)

Archimedes-AUEB at SemEval-2024 Task 5: LLM explains Civil Procedure
by: Chlapanis, Odysseas S., et al.
Published: (2024)

EssayCBM: Rubric-Aligned Concept Bottleneck Models for Transparent Essay Grading
by: Chaudhary, Kumar Satvik, et al.
Published: (2025)

Classroom AI: Large Language Models as Grade-Specific Teachers
by: Oh, Jio, et al.
Published: (2026)

Clickbait detection: quick inference with maximum impact
by: Kuntur, Soveatin, et al.
Published: (2026)

Confidence Estimation in Automatic Short Answer Grading with LLMs
by: Cong, Longwei, et al.
Published: (2026)

Automated Long Answer Grading with RiceChem Dataset
by: Sonkar, Shashank, et al.
Published: (2024)