:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Author:	Kapelko, Eduard
Format:	Preprint
Published:	2025
Subjects:	Computation and Language Machine Learning
Online Access:	https://arxiv.org/abs/2509.25220
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Active Inference Agency Formalization, Metrics, and Convergence Assessments
by: Kapelko, Eduard
Published: (2026)

Steering Out-of-Distribution Generalization with Concept Ablation Fine-Tuning
by: Casademunt, Helena, et al.
Published: (2025)

Causality $\neq$ Invariance: Function and Concept Vectors in LLMs
by: Opiełka, Gustaw, et al.
Published: (2026)

Functional Component Ablation Reveals Specialization Patterns in Hybrid Language Model Architectures
by: Borobia, Hector, et al.
Published: (2026)

Circuit Breaking: Removing Model Behaviors with Targeted Ablation
by: Li, Maximilian, et al.
Published: (2023)

Do LLMs Understand Romanian Driving Laws? A Study on Multimodal and Fine-Tuned Question Answering
by: Barbu, Eduard, et al.
Published: (2025)

Scalable Data Ablation Approximations for Language Models through Modular Training and Merging
by: Na, Clara, et al.
Published: (2024)

MedConceptsQA: Open Source Medical Concepts QA Benchmark
by: Shoham, Ofir Ben, et al.
Published: (2024)

Kernelized Concept Erasure
by: Ravfogel, Shauli, et al.
Published: (2022)

Data Alignment for Zero-Shot Concept Generation in Dermatology AI
by: Gadgil, Soham, et al.
Published: (2024)

Vector Quantized Latent Concepts: A Scalable Alternative to Clustering-Based Concept Discovery
by: Yu, Xuemin, et al.
Published: (2026)

LLM Pretraining with Continuous Concepts
by: Tack, Jihoon, et al.
Published: (2025)

Towards Compositionality in Concept Learning
by: Stein, Adam, et al.
Published: (2024)

Linear Adversarial Concept Erasure
by: Ravfogel, Shauli, et al.
Published: (2022)

LCSB: Layer-Cyclic Selective Backpropagation for Memory-Efficient On-Device LLM Fine-Tuning
by: Park, Juneyoung, et al.
Published: (2026)

Beyond Individual Facts: Investigating Categorical Knowledge Locality of Taxonomy and Meronomy Concepts in GPT Models
by: Burger, Christopher, et al.
Published: (2024)

Concept Bottleneck Large Language Models
by: Sun, Chung-En, et al.
Published: (2024)

AlignSAE: Concept-Aligned Sparse Autoencoders
by: Yang, Minglai, et al.
Published: (2025)

Simple Mechanisms for Representing, Indexing and Manipulating Concepts
by: Li, Yuanzhi, et al.
Published: (2023)

Leveraging AI Graders for Missing Score Imputation to Achieve Accurate Ability Estimation in Constructed-Response Tests
by: Uto, Masaki, et al.
Published: (2025)

TempTest: Local Normalization Distortion and the Detection of Machine-generated Text
by: Kempton, Tom, et al.
Published: (2025)

MesaNet: Sequence Modeling by Locally Optimal Test-Time Training
by: von Oswald, Johannes, et al.
Published: (2025)

Nonlinear Concept Erasure: a Density Matching Approach
by: Saillenfest, Antoine, et al.
Published: (2025)

Evaluating Sparse Autoencoders on Targeted Concept Erasure Tasks
by: Karvonen, Adam, et al.
Published: (2024)

Khattat: Enhancing Readability and Concept Representation of Semantic Typography
by: Hussein, Ahmed, et al.
Published: (2024)

Medical Concept Normalization in a Low-Resource Setting
by: Patzelt, Tim
Published: (2024)

Learning Machines: In Search of a Concept Oriented Language
by: Gunes, Veyis
Published: (2024)

Concept Tokens: Learning Behavioral Embeddings Through Concept Definitions
by: Sastre, Ignacio, et al.
Published: (2026)

The more polypersonal the better -- a short look on space geometry of fine-tuned layers
by: Kudriashov, Sergei, et al.
Published: (2025)

Interactive Dialogue Agents via Reinforcement Learning on Hindsight Regenerations
by: Hong, Joey, et al.
Published: (2024)

Evaluating Defences against Unsafe Feedback in RLHF
by: Rosati, Domenic, et al.
Published: (2024)

From Concepts to Components: Concept-Agnostic Attention Module Discovery in Transformers
by: Su, Jingtong, et al.
Published: (2025)

Trained on Tokens, Calibrated on Concepts: The Emergence of Semantic Calibration in LLMs
by: Nakkiran, Preetum, et al.
Published: (2025)

Visual Exploration of Feature Relationships in Sparse Autoencoders with Curated Concepts
by: Yan, Xinyuan, et al.
Published: (2025)

Concept Algebra for (Score-Based) Text-Controlled Generative Models
by: Wang, Zihao, et al.
Published: (2023)

Can LLMs Learn New Concepts Incrementally without Forgetting?
by: Zheng, Junhao, et al.
Published: (2024)

CLUE: Concept-Level Uncertainty Estimation for Large Language Models
by: Wang, Yu-Hsiang, et al.
Published: (2024)

Beyond Single Concept Vector: Modeling Concept Subspace in LLMs with Gaussian Distribution
by: Zhao, Haiyan, et al.
Published: (2024)

Automated Knowledge Concept Annotation and Question Representation Learning for Knowledge Tracing
by: Ozyurt, Yilmazcan, et al.
Published: (2024)

Low-Resource Machine Translation through the Lens of Personalized Federated Learning
by: Moskvoretskii, Viktor, et al.
Published: (2024)