:: Library Catalog

תמונות העטיפה

שמור ב:

מידע ביבליוגרפי
Main Authors:	Conklin, Henry, Smith, Kenny
פורמט:	Preprint
יצא לאור:	2024
נושאים:	Computation and Language Artificial Intelligence
גישה מקוונת:	https://arxiv.org/abs/2406.02449
תגים:	הוספת תג אין תגיות, היה/י הראשונ/ה לתייג את הרשומה!

פריטים דומים

Information Structure in Mappings: An Approach to Learning, Representation, and Generalisation
מאת: Conklin, Henry
יצא לאור: (2025)

Emergent Hierarchical Structure in Large Language Models: An Information-Theoretic Framework for Multi-Scale Representation
מאת: Zhang, Yukin, et al.
יצא לאור: (2025)

Compositional Generalization Across Distributional Shifts with Sparse Tree Operations
מאת: Soulos, Paul, et al.
יצא לאור: (2024)

Serendipity by Design: Evaluating the Impact of Cross-domain Mappings on Human and LLM Creativity
מאת: Liu, Qiawen Ella, et al.
יצא לאור: (2026)

An Information-Theoretic Framework for Robust Large Language Model Editing
מאת: Chen, Qizhou, et al.
יצא לאור: (2025)

On the Efficacy of Eviction Policy for Key-Value Constrained Generative Language Model Inference
מאת: Ren, Siyu, et al.
יצא לאור: (2024)

Pelican Soup Framework: A Theoretical Framework for Language Model Capabilities
מאת: Chiang, Ting-Rui, et al.
יצא לאור: (2024)

Multi-Scale Manifold Alignment for Interpreting Large Language Models: A Unified Information-Geometric Framework
מאת: Zhang, Yukun, et al.
יצא לאור: (2025)

How Many Features Can a Language Model Store Under the Linear Representation Hypothesis?
מאת: Garg, Nikhil, et al.
יצא לאור: (2026)

On Linear Representations and Pretraining Data Frequency in Language Models
מאת: Merullo, Jack, et al.
יצא לאור: (2025)

Information Flow Routes: Automatically Interpreting Language Models at Scale
מאת: Ferrando, Javier, et al.
יצא לאור: (2024)

Comparing Human and Large Language Model Interpretation of Implicit Information
מאת: De Santis, Antonio, et al.
יצא לאור: (2026)

Superscopes: Amplifying Internal Feature Representations for Language Model Interpretation
מאת: Jacobi, Jonathan, et al.
יצא לאור: (2025)

On Theoretical Interpretations of Concept-Based In-Context Learning
מאת: Tang, Huaze, et al.
יצא לאור: (2025)

LLMD: A Large Language Model for Interpreting Longitudinal Medical Records
מאת: Porter, Robert, et al.
יצא לאור: (2024)

Evaluating the relationship between regularity and learnability in recursive numeral systems using Reinforcement Learning
מאת: Silvi, Andrea, et al.
יצא לאור: (2026)

Cross-Lingual Transfer and Parameter-Efficient Adaptation in the Turkic Language Family: A Theoretical Framework for Low-Resource Language Models
מאת: Ibrahimzade, O., et al.
יצא לאור: (2026)

CogniDual Framework: Self-Training Large Language Models within a Dual-System Theoretical Framework for Improving Cognitive Tasks
מאת: Deng, Yongxin, et al.
יצא לאור: (2024)

A Monosemantic Attribution Framework for Stable Interpretability in Clinical Neuroscience Transformer-Based Language Models
מאת: Mamalakis, Michail, et al.
יצא לאור: (2026)

Correlated Errors in Large Language Models
מאת: Kim, Elliot, et al.
יצא לאור: (2025)

Building Models of Neurological Language
מאת: Watkins, Henry
יצא לאור: (2025)

Information-Theoretic Distillation for Reference-less Summarization
מאת: Jung, Jaehun, et al.
יצא לאור: (2024)

Unveiling Language Routing Isolation in Multilingual MoE Models for Interpretable Subnetwork Adaptation
מאת: Zheng, Kening, et al.
יצא לאור: (2026)

Unified Lexical Representation for Interpretable Visual-Language Alignment
מאת: Li, Yifan, et al.
יצא לאור: (2024)

CrashSage: A Large Language Model-Centered Framework for Contextual and Interpretable Traffic Crash Analysis
מאת: Zhen, Hao, et al.
יצא לאור: (2025)

Probing Ethical Framework Representations in Large Language Models: Structure, Entanglement, and Methodological Challenges
מאת: Xu, Weilun, et al.
יצא לאור: (2026)

Interpretability Framework for LLMs in Undergraduate Calculus
מאת: Dakshit, Sagnik, et al.
יצא לאור: (2025)

HatePrototypes: Interpretable and Transferable Representations for Implicit and Explicit Hate Speech Detection
מאת: Proskurina, Irina, et al.
יצא לאור: (2025)

Theoretical Foundations and Mitigation of Hallucination in Large Language Models
מאת: Gumaan, Esmail
יצא לאור: (2025)

Information Representation Fairness in Long-Document Embeddings: The Peculiar Interaction of Positional and Language Bias
מאת: Schuhmacher, Elias, et al.
יצא לאור: (2026)

An Information-Theoretic Framework for Comparing Voice and Text Explainability
מאת: Rajhans, Mona, et al.
יצא לאור: (2026)

Interpretability of Language Models via Task Spaces
מאת: Weber, Lucas, et al.
יצא לאור: (2024)

Activation Scaling for Steering and Interpreting Language Models
מאת: Stoehr, Niklas, et al.
יצא לאור: (2024)

Semantic Substrate Theory: An Operator-Theoretic Framework for Geometric Semantic Drift
מאת: Russell, Stephen
יצא לאור: (2026)

Interpreting Public Sentiment in Diplomacy Events: A Counterfactual Analysis Framework Using Large Language Models
מאת: Ouyang, Leyi
יצא לאור: (2025)

ARCANE: A Multi-Agent Framework for Interpretable and Configurable Alignment
מאת: Masters, Charlie, et al.
יצא לאור: (2025)

Decomposing Representation Space into Interpretable Subspaces with Unsupervised Learning
מאת: Huang, Xinting, et al.
יצא לאור: (2025)

Using Large Language Models for the Interpretation of Building Regulations
מאת: Fuchs, Stefan, et al.
יצא לאור: (2024)

LLMs Judge Themselves: A Game-Theoretic Framework for Human-Aligned Evaluation
מאת: Yang, Gao, et al.
יצא לאור: (2025)

Cognitive BASIC: An In-Model Interpreted Reasoning Language for LLMs
מאת: Kramer, Oliver
יצא לאור: (2025)