| Main Authors: | Niketan, Nripesh; Batatia, Hadj |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2507.08034 |
Similar Items
Multi-Model Synthetic Training for Mission-Critical Small Language Models
by: Platt, Nolan, et al.
Published: (2025)
The Geometry of Persona: Disentangling Personality from Reasoning in Large Language Models
by: Wang, Zhixiang
Published: (2025)
ProbeScale: Probing Analysis to Optimize Neural Scaling Laws for Efficient Small Language Model Inference
by: Das, Sourav
Published: (2026)
Towards Ontology-Enhanced Representation Learning for Large Language Models
by: Ronzano, Francesco, et al.
Published: (2024)
Hopscotch: Discovering and Skipping Redundancies in Language Models
by: Eyceoz, Mustafa, et al.
Published: (2025)
Word Overuse and Alignment in Large Language Models: The Influence of Learning from Human Feedback
by: Juzek, Tom S., et al.
Published: (2025)
How Pruning Reshapes Features: Sparse Autoencoder Analysis of Weight-Pruned Language Models
by: Borobia, Hector, et al.
Published: (2026)
TSDS: Data Selection for Task-Specific Model Finetuning
by: Liu, Zifan, et al.
Published: (2024)
Beyond Accuracy: Decomposing the Reasoning Efficiency of LLMs
by: Kaiser, Daniel, et al.
Published: (2026)
DRO-InstructZero: Distributionally Robust Prompt Optimization for Large Language Models
by: Li, Yangyang
Published: (2025)
HEFT: A Coarse-to-Fine Hierarchy for Enhancing the Efficiency and Accuracy of Language Model Reasoning
by: Hill, Brennen
Published: (2025)
Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models
by: Fu, Tianyu, et al.
Published: (2025)
mHC-SSM: Manifold-Constrained Hyper-Connections for State Space Language Models with Stream-Specialized Adapters
by: Mutlu, Abdulvahap, et al.
Published: (2026)
A Confidence-Diversity Framework for Calibrating AI Judgement in Accessible Qualitative Coding Tasks
by: Zhao, Zhilong, et al.
Published: (2025)
Improving Commonsense Bias Classification by Mitigating the Influence of Demographic Terms
by: Lee, JinKyu, et al.
Published: (2024)
Hybrid Gated Flow (HGF): Stabilizing 1.58-bit LLMs via Selective Low-Rank Correction
by: Pizzo, David Alejandro Trejo
Published: (2026)
Unsolvability Ceiling in Multi-LLM Routing: An Empirical Study of Evaluation Artifacts
by: Garg, Saloni, et al.
Published: (2026)
Measuring and curing reasoning rigidity: from decorative chain-of-thought to genuine faithfulness
by: Basu, Abhinaba, et al.
Published: (2026)
CLMN: Concept based Language Models via Neural Symbolic Reasoning
by: Yang, Yibo
Published: (2025)
Sliced-Wasserstein Distribution Alignment Loss Improves the Ultra-Low-Bit Quantization of Large Language Models
by: Cao, Deyu, et al.
Published: (2026)
Monotonicity as an Architectural Bias for Robust Language Models
by: Cooper, Patrick, et al.
Published: (2026)
SafeAnchor: Preventing Cumulative Safety Erosion in Continual Domain Adaptation of Large Language Models
by: Guo, Dongxin, et al.
Published: (2026)
Causally Grounded Mechanistic Interpretability for LLMs with Faithful Natural-Language Explanations
by: Mahale, Ajay Pravin
Published: (2026)
No Memorization, No Detection: Output Distribution-Based Contamination Detection in Small Language Models
by: Sela, Omer
Published: (2026)
Waking Up Blind: Cold-Start Optimization of Supervision-Free Agentic Trajectories for Grounded Visual Perception
by: Bajpai, Ashutosh, et al.
Published: (2026)
The Concept Allocation Zone: Tracking How Concepts Form Across Transformer Depth
by: Henry, James
Published: (2026)
ReFactor GNNs: Revisiting Factorisation-based Models from a Message-Passing Perspective
by: Chen, Yihong, et al.
Published: (2022)
AIPsy-Affect: A Keyword-Free Clinical Stimulus Battery for Mechanistic Interpretability of Emotion in Language Models
by: Keeman, Michael
Published: (2026)
Challenges and Applications of Large Language Models: A Comparison of GPT and DeepSeek family of models
by: Sharma, Shubham, et al.
Published: (2025)
Learning What Matters: Probabilistic Task Selection via Mutual Information for Model Finetuning
by: Chanda, Prateek, et al.
Published: (2025)
The Trilemma of Truth in Large Language Models
by: Savcisens, Germans, et al.
Published: (2025)
Product-of-Experts Training Reduces Dataset Artifacts in Natural Language Inference
by: Mathew, Aby Mammen
Published: (2026)
Mechanistic Analysis of Catastrophic Forgetting in Large Language Models During Continual Fine-tuning
by: Imanov, Olaf Yunus Laitinen
Published: (2026)
Measuring Intent Comprehension in LLMs
by: Kunievsky, Nadav, et al.
Published: (2025)
lmfaoooo at SemEval-2026 Task 1: Humor Is an Audience. Preference Modeling for Constrained Humor Generation
by: Tikhonov, Alexey, et al.
Published: (2026)
Text Clustering with Large Language Model Embeddings
by: Petukhova, Alina, et al.
Published: (2024)
Linguistic Collapse: Neural Collapse in (Large) Language Models
by: Wu, Robert, et al.
Published: (2024)
Causal Dimensionality of Transformer Representations: Measurement, Scaling, and Layer Structure
by: Sarkar, Nilesh, et al.
Published: (2026)
Fine-tuning of Large Language Models for Constituency Parsing Using a Sequence to Sequence Approach
by: Delgado, Francisco Jose Cortes, et al.
Published: (2025)
How Human-Like Are Large Language Models? A Register-Aware Linguistic Evaluation Framework
by: Nieth, Björn, et al.
Published: (2026)