Saved in:
| Main Authors: | Lepori, Michael A., Mozer, Michael C., Ghandeharioun, Asma |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2410.02102 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Who's asking? User personas and the mechanics of latent misalignment
by: Ghandeharioun, Asma, et al.
Published: (2024)
by: Ghandeharioun, Asma, et al.
Published: (2024)
Patchscopes: A Unifying Framework for Inspecting Hidden Representations of Language Models
by: Ghandeharioun, Asma, et al.
Published: (2024)
by: Ghandeharioun, Asma, et al.
Published: (2024)
Interpretability Illusions in the Generalization of Simplified Models
by: Friedman, Dan, et al.
Published: (2023)
by: Friedman, Dan, et al.
Published: (2023)
Analysis of Optimality of Large Language Models on Planning Problems
by: Bohnet, Bernd, et al.
Published: (2026)
by: Bohnet, Bernd, et al.
Published: (2026)
Can Large Language Models Detect Errors in Long Chain-of-Thought Reasoning?
by: He, Yancheng, et al.
Published: (2025)
by: He, Yancheng, et al.
Published: (2025)
Language Models Struggle to Use Representations Learned In-Context
by: Lepori, Michael A., et al.
Published: (2026)
by: Lepori, Michael A., et al.
Published: (2026)
How Ready Are Generative Pre-trained Large Language Models for Explaining Bengali Grammatical Errors?
by: Maity, Subhankar, et al.
Published: (2024)
by: Maity, Subhankar, et al.
Published: (2024)
When Does Context Help? Error Dynamics of Contextual Information in Large Language Models
by: Wang, Dingzirui, et al.
Published: (2026)
by: Wang, Dingzirui, et al.
Published: (2026)
When Can Transformers Count to n?
by: Yehudai, Gilad, et al.
Published: (2024)
by: Yehudai, Gilad, et al.
Published: (2024)
Boosting of Thoughts: Trial-and-Error Problem Solving with Large Language Models
by: Chen, Sijia, et al.
Published: (2024)
by: Chen, Sijia, et al.
Published: (2024)
Signatures of human-like processing in Transformer forward passes
by: Hu, Jennifer, et al.
Published: (2025)
by: Hu, Jennifer, et al.
Published: (2025)
Evaluating Robustness of Large Language Models Against Multilingual Typographical Errors
by: Zhao, Raoyuan, et al.
Published: (2025)
by: Zhao, Raoyuan, et al.
Published: (2025)
Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models
by: Yang, Ling, et al.
Published: (2024)
by: Yang, Ling, et al.
Published: (2024)
Reawakening knowledge: Anticipatory recovery from catastrophic interference via structured training
by: Yang, Yanlai, et al.
Published: (2024)
by: Yang, Yanlai, et al.
Published: (2024)
Uncovering Intermediate Variables in Transformers using Circuit Probing
by: Lepori, Michael A., et al.
Published: (2023)
by: Lepori, Michael A., et al.
Published: (2023)
The Emergence of Abstract Thought in Large Language Models Beyond Any Language
by: Chen, Yuxin, et al.
Published: (2025)
by: Chen, Yuxin, et al.
Published: (2025)
Large Language Models Decide Early and Explain Later
by: Datta, Ayan, et al.
Published: (2026)
by: Datta, Ayan, et al.
Published: (2026)
Is This Just Fantasy? Language Model Representations Reflect Human Judgments of Event Plausibility
by: Lepori, Michael A., et al.
Published: (2025)
by: Lepori, Michael A., et al.
Published: (2025)
Decoupling the "What" and "Where" With Polar Coordinate Positional Embeddings
by: Gopalakrishnan, Anand, et al.
Published: (2025)
by: Gopalakrishnan, Anand, et al.
Published: (2025)
Race, Ethnicity and Their Implication on Bias in Large Language Models
by: Hu, Shiyue, et al.
Published: (2026)
by: Hu, Shiyue, et al.
Published: (2026)
CoDAE: Adapting Large Language Models for Education via Chain-of-Thought Data Augmentation
by: Yuan, Shuzhou, et al.
Published: (2025)
by: Yuan, Shuzhou, et al.
Published: (2025)
Explain the Flag: Contextualizing Hate Speech Beyond Censorship
by: Liartis, Jason, et al.
Published: (2026)
by: Liartis, Jason, et al.
Published: (2026)
Are LLMs Models of Distributional Semantics? A Case Study on Quantifiers
by: Enyan, Zhang, et al.
Published: (2024)
by: Enyan, Zhang, et al.
Published: (2024)
Language of Thought Shapes Output Diversity in Large Language Models
by: Xu, Shaoyang, et al.
Published: (2026)
by: Xu, Shaoyang, et al.
Published: (2026)
Pretraining Exposure Explains Popularity Judgments in Large Language Models
by: Mozafari, Jamshid, et al.
Published: (2026)
by: Mozafari, Jamshid, et al.
Published: (2026)
Think Before You Lie: How Reasoning Leads to Honesty
by: Yuan, Ann, et al.
Published: (2026)
by: Yuan, Ann, et al.
Published: (2026)
Active Prompting with Chain-of-Thought for Large Language Models
by: Diao, Shizhe, et al.
Published: (2023)
by: Diao, Shizhe, et al.
Published: (2023)
Large Language Models Are Latent Variable Models: Explaining and Finding Good Demonstrations for In-Context Learning
by: Wang, Xinyi, et al.
Published: (2023)
by: Wang, Xinyi, et al.
Published: (2023)
Infusing Knowledge into Large Language Models with Contextual Prompts
by: Vasisht, Kinshuk, et al.
Published: (2024)
by: Vasisht, Kinshuk, et al.
Published: (2024)
In-Contextual Gender Bias Suppression for Large Language Models
by: Oba, Daisuke, et al.
Published: (2023)
by: Oba, Daisuke, et al.
Published: (2023)
Is it Possible to Modify Text to a Target Readability Level? An Initial Investigation Using Zero-Shot Large Language Models
by: Farajidizaji, Asma, et al.
Published: (2023)
by: Farajidizaji, Asma, et al.
Published: (2023)
Explaining Large Language Models with gSMILE
by: Dehghani, Zeinab, et al.
Published: (2025)
by: Dehghani, Zeinab, et al.
Published: (2025)
Quantum-Like Contextuality in Large Language Models
by: Lo, Kin Ian, et al.
Published: (2024)
by: Lo, Kin Ian, et al.
Published: (2024)
On the Influence of Gender and Race in Romantic Relationship Prediction from Large Language Models
by: Sancheti, Abhilasha, et al.
Published: (2024)
by: Sancheti, Abhilasha, et al.
Published: (2024)
Marathon: A Race Through the Realm of Long Context with Large Language Models
by: Zhang, Lei, et al.
Published: (2023)
by: Zhang, Lei, et al.
Published: (2023)
Benchmarking Large Language Models on Answering and Explaining Challenging Medical Questions
by: Chen, Hanjie, et al.
Published: (2024)
by: Chen, Hanjie, et al.
Published: (2024)
On the Hardness of Faithful Chain-of-Thought Reasoning in Large Language Models
by: Tanneru, Sree Harsha, et al.
Published: (2024)
by: Tanneru, Sree Harsha, et al.
Published: (2024)
The Homogenizing Effect of Large Language Models on Human Expression and Thought
by: Sourati, Zhivar, et al.
Published: (2025)
by: Sourati, Zhivar, et al.
Published: (2025)
Structural Embedding Projection for Contextual Large Language Model Inference
by: Enoasmo, Vincent, et al.
Published: (2025)
by: Enoasmo, Vincent, et al.
Published: (2025)
Contextually Structured Token Dependency Encoding for Large Language Models
by: Blades, James, et al.
Published: (2025)
by: Blades, James, et al.
Published: (2025)
Similar Items
-
Who's asking? User personas and the mechanics of latent misalignment
by: Ghandeharioun, Asma, et al.
Published: (2024) -
Patchscopes: A Unifying Framework for Inspecting Hidden Representations of Language Models
by: Ghandeharioun, Asma, et al.
Published: (2024) -
Interpretability Illusions in the Generalization of Simplified Models
by: Friedman, Dan, et al.
Published: (2023) -
Analysis of Optimality of Large Language Models on Planning Problems
by: Bohnet, Bernd, et al.
Published: (2026) -
Can Large Language Models Detect Errors in Long Chain-of-Thought Reasoning?
by: He, Yancheng, et al.
Published: (2025)