| Main Authors: | Cheng, Emily, Doimo, Diego, Kervadec, Corentin, Macocco, Iuri, Yu, Jade, Laio, Alessandro, Baroni, Marco |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2405.15471 |
Similar Items
Tracing Computation Density in LLMs
by: Kervadec, Corentin, et al.
Published: (2026)
Prediction hubs are context-informed frequent tokens in LLMs
by: Nielsen, Beatrix M. G., et al.
Published: (2025)
Not a nuisance but a useful heuristic: Outlier dimensions favor frequent tokens in language models
by: Macocco, Iuri, et al.
Published: (2025)
Sparse or Dense? A Mechanistic Estimation of Computation Density in Transformer-based LLMs
by: Kervadec, Corentin, et al.
Published: (2026)
Evil twins are not that evil: Qualitative insights into machine-generated prompts
by: Rakotonirina, Nathanaël Carraz, et al.
Published: (2024)
Scale-adaptive and robust intrinsic dimension estimation via optimal neighbourhood identification
by: Di Noia, Antonio, et al.
Published: (2024)
A quantitative analysis of semantic information in deep representations of text and images
by: Acevedo, Santiago, et al.
Published: (2025)
The representation landscape of few-shot learning and fine-tuning in large language models
by: Doimo, Diego, et al.
Published: (2024)
Evidence from fMRI Supports a Two-Phase Abstraction Process in Language Models
by: Cheng, Emily, et al.
Published: (2024)
Head Pursuit: Probing Attention Specialization in Multimodal Transformers
by: Basile, Lorenzo, et al.
Published: (2025)
Abstraction Induces the Brain Alignment of Language and Speech Models
by: Cheng, Emily, et al.
Published: (2026)
Competition of Mechanisms: Tracing How Language Models Handle Facts and Counterfactuals
by: Ortu, Francesco, et al.
Published: (2024)
Tracing the complexity profiles of different linguistic phenomena through the intrinsic dimension of LLM representations
by: Baroni, Marco, et al.
Published: (2026)
A distributional simplicity bias in the learning dynamics of transformers
by: Rende, Riccardo, et al.
Published: (2024)
Stereotypical gender actions can be extracted from Web text
by: Herdağdelen, Amaç, et al.
Published: (2025)
MemoryPrompt: A Light Wrapper to Improve Context Tracking in Pre-trained Language Models
by: Rakotonirina, Nathanaël Carraz, et al.
Published: (2024)
Mapping of attention mechanisms to a generalized Potts model
by: Rende, Riccardo, et al.
Published: (2023)
Abstraction-of-Thought Makes Language Models Better Reasoners
by: Hong, Ruixin, et al.
Published: (2024)
An unsupervised tour through the hidden pathways of deep neural networks
by: Doimo, Diego
Published: (2025)
Improving Chain-of-Thought Reasoning via Quasi-Symbolic Abstractions
by: Ranaldi, Leonardo, et al.
Published: (2025)
Certifying Phase Abstraction
by: Froleyks, Nils, et al.
Published: (2024)
Functional Abstraction of Knowledge Recall in Large Language Models
by: Wang, Zijian, et al.
Published: (2025)
The Abstraction Gap in Vision-Language Causal Reasoning
by: Hoang, Chinh, et al.
Published: (2026)
Efficient Tool Use with Chain-of-Abstraction Reasoning
by: Gao, Silin, et al.
Published: (2024)
When Seeing Overrides Knowing: Disentangling Knowledge Conflicts in Vision-Language Models
by: Ortu, Francesco, et al.
Published: (2025)
Take a Step Back: Evoking Reasoning via Abstraction in Large Language Models
by: Zheng, Huaixiu Steven, et al.
Published: (2023)
Controllable Abstraction in Summary Generation for Large Language Models via Prompt Engineering
by: Song, Xiangchen, et al.
Published: (2025)
High-Dimensional Interlingual Representations of Large Language Models
by: Wilie, Bryan, et al.
Published: (2025)
TRACE for Tracking the Emergence of Semantic Representations in Transformers
by: Aljaafari, Nura, et al.
Published: (2025)
PACE: Procedural Abstractions for Communicating Efficiently
by: Thomas, Jonathan D., et al.
Published: (2024)
Curse of High Dimensionality Issue in Transformer for Long-context Modeling
by: Zhang, Shuhai, et al.
Published: (2025)
Structural Abstraction as an Inductive Bias for Non-Stationary Language Model Training
by: Rahmati, Elnaz, et al.
Published: (2026)
AbsPyramid: Benchmarking the Abstraction Ability of Language Models with a Unified Entailment Graph
by: Wang, Zhaowei, et al.
Published: (2023)
Linearly Controlled Language Generation with Performative Guarantees
by: Cheng, Emily, et al.
Published: (2024)
Heterogeneous Encoders Scaling In The Transformer For Neural Machine Translation
by: Hu, Jia Cheng, et al.
Published: (2023)
Human Evaluation of Procedural Knowledge Graph Extraction from Text with Large Language Models
by: Carriero, Valentina Anita, et al.
Published: (2024)
Optimizing FDTD Solvers for Electromagnetics: A Compiler-Guided Approach with High-Level Tensor Abstractions
by: He, Yifei, et al.
Published: (2025)
ChatGPT for automated grading of short answer questions in mechanical ventilation
by: Jade, Tejas, et al.
Published: (2025)
Reasoning Capabilities and Invariability of Large Language Models
by: Raganato, Alessandro, et al.
Published: (2025)
Detecting Conceptual Abstraction in LLMs
by: Regneri, Michaela, et al.
Published: (2024)