Saved in:
| Main Author: | Kapelko, Eduard |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2509.25220 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Active Inference Agency Formalization, Metrics, and Convergence Assessments
by: Kapelko, Eduard
Published: (2026)
by: Kapelko, Eduard
Published: (2026)
Steering Out-of-Distribution Generalization with Concept Ablation Fine-Tuning
by: Casademunt, Helena, et al.
Published: (2025)
by: Casademunt, Helena, et al.
Published: (2025)
Causality $\neq$ Invariance: Function and Concept Vectors in LLMs
by: Opiełka, Gustaw, et al.
Published: (2026)
by: Opiełka, Gustaw, et al.
Published: (2026)
Functional Component Ablation Reveals Specialization Patterns in Hybrid Language Model Architectures
by: Borobia, Hector, et al.
Published: (2026)
by: Borobia, Hector, et al.
Published: (2026)
Circuit Breaking: Removing Model Behaviors with Targeted Ablation
by: Li, Maximilian, et al.
Published: (2023)
by: Li, Maximilian, et al.
Published: (2023)
Do LLMs Understand Romanian Driving Laws? A Study on Multimodal and Fine-Tuned Question Answering
by: Barbu, Eduard, et al.
Published: (2025)
by: Barbu, Eduard, et al.
Published: (2025)
Scalable Data Ablation Approximations for Language Models through Modular Training and Merging
by: Na, Clara, et al.
Published: (2024)
by: Na, Clara, et al.
Published: (2024)
MedConceptsQA: Open Source Medical Concepts QA Benchmark
by: Shoham, Ofir Ben, et al.
Published: (2024)
by: Shoham, Ofir Ben, et al.
Published: (2024)
Kernelized Concept Erasure
by: Ravfogel, Shauli, et al.
Published: (2022)
by: Ravfogel, Shauli, et al.
Published: (2022)
Data Alignment for Zero-Shot Concept Generation in Dermatology AI
by: Gadgil, Soham, et al.
Published: (2024)
by: Gadgil, Soham, et al.
Published: (2024)
Vector Quantized Latent Concepts: A Scalable Alternative to Clustering-Based Concept Discovery
by: Yu, Xuemin, et al.
Published: (2026)
by: Yu, Xuemin, et al.
Published: (2026)
LLM Pretraining with Continuous Concepts
by: Tack, Jihoon, et al.
Published: (2025)
by: Tack, Jihoon, et al.
Published: (2025)
Towards Compositionality in Concept Learning
by: Stein, Adam, et al.
Published: (2024)
by: Stein, Adam, et al.
Published: (2024)
Linear Adversarial Concept Erasure
by: Ravfogel, Shauli, et al.
Published: (2022)
by: Ravfogel, Shauli, et al.
Published: (2022)
LCSB: Layer-Cyclic Selective Backpropagation for Memory-Efficient On-Device LLM Fine-Tuning
by: Park, Juneyoung, et al.
Published: (2026)
by: Park, Juneyoung, et al.
Published: (2026)
Beyond Individual Facts: Investigating Categorical Knowledge Locality of Taxonomy and Meronomy Concepts in GPT Models
by: Burger, Christopher, et al.
Published: (2024)
by: Burger, Christopher, et al.
Published: (2024)
Concept Bottleneck Large Language Models
by: Sun, Chung-En, et al.
Published: (2024)
by: Sun, Chung-En, et al.
Published: (2024)
AlignSAE: Concept-Aligned Sparse Autoencoders
by: Yang, Minglai, et al.
Published: (2025)
by: Yang, Minglai, et al.
Published: (2025)
Simple Mechanisms for Representing, Indexing and Manipulating Concepts
by: Li, Yuanzhi, et al.
Published: (2023)
by: Li, Yuanzhi, et al.
Published: (2023)
Leveraging AI Graders for Missing Score Imputation to Achieve Accurate Ability Estimation in Constructed-Response Tests
by: Uto, Masaki, et al.
Published: (2025)
by: Uto, Masaki, et al.
Published: (2025)
TempTest: Local Normalization Distortion and the Detection of Machine-generated Text
by: Kempton, Tom, et al.
Published: (2025)
by: Kempton, Tom, et al.
Published: (2025)
MesaNet: Sequence Modeling by Locally Optimal Test-Time Training
by: von Oswald, Johannes, et al.
Published: (2025)
by: von Oswald, Johannes, et al.
Published: (2025)
Nonlinear Concept Erasure: a Density Matching Approach
by: Saillenfest, Antoine, et al.
Published: (2025)
by: Saillenfest, Antoine, et al.
Published: (2025)
Evaluating Sparse Autoencoders on Targeted Concept Erasure Tasks
by: Karvonen, Adam, et al.
Published: (2024)
by: Karvonen, Adam, et al.
Published: (2024)
Khattat: Enhancing Readability and Concept Representation of Semantic Typography
by: Hussein, Ahmed, et al.
Published: (2024)
by: Hussein, Ahmed, et al.
Published: (2024)
Medical Concept Normalization in a Low-Resource Setting
by: Patzelt, Tim
Published: (2024)
by: Patzelt, Tim
Published: (2024)
Learning Machines: In Search of a Concept Oriented Language
by: Gunes, Veyis
Published: (2024)
by: Gunes, Veyis
Published: (2024)
Concept Tokens: Learning Behavioral Embeddings Through Concept Definitions
by: Sastre, Ignacio, et al.
Published: (2026)
by: Sastre, Ignacio, et al.
Published: (2026)
The more polypersonal the better -- a short look on space geometry of fine-tuned layers
by: Kudriashov, Sergei, et al.
Published: (2025)
by: Kudriashov, Sergei, et al.
Published: (2025)
Interactive Dialogue Agents via Reinforcement Learning on Hindsight Regenerations
by: Hong, Joey, et al.
Published: (2024)
by: Hong, Joey, et al.
Published: (2024)
Evaluating Defences against Unsafe Feedback in RLHF
by: Rosati, Domenic, et al.
Published: (2024)
by: Rosati, Domenic, et al.
Published: (2024)
From Concepts to Components: Concept-Agnostic Attention Module Discovery in Transformers
by: Su, Jingtong, et al.
Published: (2025)
by: Su, Jingtong, et al.
Published: (2025)
Trained on Tokens, Calibrated on Concepts: The Emergence of Semantic Calibration in LLMs
by: Nakkiran, Preetum, et al.
Published: (2025)
by: Nakkiran, Preetum, et al.
Published: (2025)
Visual Exploration of Feature Relationships in Sparse Autoencoders with Curated Concepts
by: Yan, Xinyuan, et al.
Published: (2025)
by: Yan, Xinyuan, et al.
Published: (2025)
Concept Algebra for (Score-Based) Text-Controlled Generative Models
by: Wang, Zihao, et al.
Published: (2023)
by: Wang, Zihao, et al.
Published: (2023)
Can LLMs Learn New Concepts Incrementally without Forgetting?
by: Zheng, Junhao, et al.
Published: (2024)
by: Zheng, Junhao, et al.
Published: (2024)
CLUE: Concept-Level Uncertainty Estimation for Large Language Models
by: Wang, Yu-Hsiang, et al.
Published: (2024)
by: Wang, Yu-Hsiang, et al.
Published: (2024)
Beyond Single Concept Vector: Modeling Concept Subspace in LLMs with Gaussian Distribution
by: Zhao, Haiyan, et al.
Published: (2024)
by: Zhao, Haiyan, et al.
Published: (2024)
Automated Knowledge Concept Annotation and Question Representation Learning for Knowledge Tracing
by: Ozyurt, Yilmazcan, et al.
Published: (2024)
by: Ozyurt, Yilmazcan, et al.
Published: (2024)
Low-Resource Machine Translation through the Lens of Personalized Federated Learning
by: Moskvoretskii, Viktor, et al.
Published: (2024)
by: Moskvoretskii, Viktor, et al.
Published: (2024)
Similar Items
-
Active Inference Agency Formalization, Metrics, and Convergence Assessments
by: Kapelko, Eduard
Published: (2026) -
Steering Out-of-Distribution Generalization with Concept Ablation Fine-Tuning
by: Casademunt, Helena, et al.
Published: (2025) -
Causality $\neq$ Invariance: Function and Concept Vectors in LLMs
by: Opiełka, Gustaw, et al.
Published: (2026) -
Functional Component Ablation Reveals Specialization Patterns in Hybrid Language Model Architectures
by: Borobia, Hector, et al.
Published: (2026) -
Circuit Breaking: Removing Model Behaviors with Targeted Ablation
by: Li, Maximilian, et al.
Published: (2023)