Saved in:
| Main Authors: | Nguyen, Trung, Leng, Yan |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2502.16385 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Offline Preference Optimization via Maximum Marginal Likelihood Estimation
by: Najafi, Saeed, et al.
Published: (2025)
by: Najafi, Saeed, et al.
Published: (2025)
The Linear Representation Hypothesis and the Geometry of Large Language Models
by: Park, Kiho, et al.
Published: (2023)
by: Park, Kiho, et al.
Published: (2023)
Lizard: An Efficient Linearization Framework for Large Language Models
by: Van Nguyen, Chien, et al.
Published: (2025)
by: Van Nguyen, Chien, et al.
Published: (2025)
Linear Representation Transferability Hypothesis: Leveraging Small Models to Steer Large Models
by: Bello, Femi, et al.
Published: (2025)
by: Bello, Femi, et al.
Published: (2025)
Structured Pruning for Diverse Best-of-N Reasoning Optimization
by: Nguyen, Hieu Trung, et al.
Published: (2025)
by: Nguyen, Hieu Trung, et al.
Published: (2025)
Task-driven Layerwise Additive Activation Intervention
by: Nguyen, Hieu Trung, et al.
Published: (2025)
by: Nguyen, Hieu Trung, et al.
Published: (2025)
Exploring the Linear Subspace Hypothesis in Gender Bias Mitigation
by: Vargas, Francisco, et al.
Published: (2020)
by: Vargas, Francisco, et al.
Published: (2020)
How Many Features Can a Language Model Store Under the Linear Representation Hypothesis?
by: Garg, Nikhil, et al.
Published: (2026)
by: Garg, Nikhil, et al.
Published: (2026)
Towards Understanding Multi-Task Learning (Generalization) of LLMs via Detecting and Exploring Task-Specific Neurons
by: Leng, Yongqi, et al.
Published: (2024)
by: Leng, Yongqi, et al.
Published: (2024)
Explainable Disentangled Representation Learning for Generalizable Authorship Attribution in the Era of Generative AI
by: Man, Hieu, et al.
Published: (2026)
by: Man, Hieu, et al.
Published: (2026)
Optimizing Multi-Stage Language Models for Effective Text Retrieval
by: Trung, Quang Hoang, et al.
Published: (2024)
by: Trung, Quang Hoang, et al.
Published: (2024)
Adaptive Two-Phase Finetuning LLMs for Japanese Legal Text Retrieval
by: Trung, Quang Hoang, et al.
Published: (2024)
by: Trung, Quang Hoang, et al.
Published: (2024)
On the Origins of Linear Representations in Large Language Models
by: Jiang, Yibo, et al.
Published: (2024)
by: Jiang, Yibo, et al.
Published: (2024)
Reasoning Planning for Language Models
by: Nguyen, Bao, et al.
Published: (2025)
by: Nguyen, Bao, et al.
Published: (2025)
HypoBench: Towards Systematic and Principled Benchmarking for Hypothesis Generation
by: Liu, Haokun, et al.
Published: (2025)
by: Liu, Haokun, et al.
Published: (2025)
Momentum SVGD-EM for Accelerated Maximum Marginal Likelihood Estimation
by: Rozzio, Adam, et al.
Published: (2026)
by: Rozzio, Adam, et al.
Published: (2026)
Large Language Model-Enhanced Algorithm Selection: Towards Comprehensive Algorithm Representation
by: Wu, Xingyu, et al.
Published: (2023)
by: Wu, Xingyu, et al.
Published: (2023)
Don't Read Everything: A Curvature-Conditioned Query for Linear Attention
by: Le, Dong, et al.
Published: (2026)
by: Le, Dong, et al.
Published: (2026)
Mixture-of-Personas Language Models for Population Simulation
by: Bui, Ngoc, et al.
Published: (2025)
by: Bui, Ngoc, et al.
Published: (2025)
Adaptive Rollout Allocation for Online Reinforcement Learning with Verifiable Rewards
by: Nguyen, Hieu Trung, et al.
Published: (2026)
by: Nguyen, Hieu Trung, et al.
Published: (2026)
Towards Generalising Neural Topical Representations
by: Yang, Xiaohao, et al.
Published: (2023)
by: Yang, Xiaohao, et al.
Published: (2023)
NoveltyRank: A Retrieval-Augmented Framework for Conceptual Novelty Estimation in AI Research
by: Yan, Zhengxu, et al.
Published: (2025)
by: Yan, Zhengxu, et al.
Published: (2025)
Interacting Particle Langevin Algorithm for Maximum Marginal Likelihood Estimation
by: Akyildiz, Ö. Deniz, et al.
Published: (2023)
by: Akyildiz, Ö. Deniz, et al.
Published: (2023)
SEA: Sparse Linear Attention with Estimated Attention Mask
by: Lee, Heejun, et al.
Published: (2023)
by: Lee, Heejun, et al.
Published: (2023)
MIC: Maximizing Informational Capacity in Adaptive Representations via Isotropic Subspace Alignment
by: Hong, Dang Nguyen, et al.
Published: (2026)
by: Hong, Dang Nguyen, et al.
Published: (2026)
Large Language Models Encode Semantics and Alignment in Linearly Separable Representations
by: Saglam, Baturay, et al.
Published: (2025)
by: Saglam, Baturay, et al.
Published: (2025)
Cross-Tokenizer Likelihood Scoring Algorithms for Language Model Distillation
by: Phan, Buu, et al.
Published: (2025)
by: Phan, Buu, et al.
Published: (2025)
VITRO: Vocabulary Inversion for Time-series Representation Optimization
by: Bellos, Filippos, et al.
Published: (2024)
by: Bellos, Filippos, et al.
Published: (2024)
Chameleon: A Flexible Data-mixing Framework for Language Model Pretraining and Finetuning
by: Xie, Wanyun, et al.
Published: (2025)
by: Xie, Wanyun, et al.
Published: (2025)
Scientific Hypothesis Generation and Validation: Methods, Datasets, and Future Directions
by: Kulkarni, Adithya, et al.
Published: (2025)
by: Kulkarni, Adithya, et al.
Published: (2025)
Exploiting LLMs for Automatic Hypothesis Assessment via a Logit-Based Calibrated Prior
by: Gong, Yue, et al.
Published: (2025)
by: Gong, Yue, et al.
Published: (2025)
Towards Efficient Active Learning in NLP via Pretrained Representations
by: Vysogorets, Artem, et al.
Published: (2024)
by: Vysogorets, Artem, et al.
Published: (2024)
ReCaLL: Membership Inference via Relative Conditional Log-Likelihoods
by: Xie, Roy, et al.
Published: (2024)
by: Xie, Roy, et al.
Published: (2024)
Revisiting the Superficial Alignment Hypothesis
by: Raghavendra, Mohit, et al.
Published: (2024)
by: Raghavendra, Mohit, et al.
Published: (2024)
Maximum Score Routing For Mixture-of-Experts
by: Dong, Bowen, et al.
Published: (2025)
by: Dong, Bowen, et al.
Published: (2025)
Failure Modes of Maximum Entropy RLHF
by: Çağatan, Ömer Veysel, et al.
Published: (2025)
by: Çağatan, Ömer Veysel, et al.
Published: (2025)
Disentangling the Roles of Representation and Selection in Data Pruning
by: Du, Yupei, et al.
Published: (2025)
by: Du, Yupei, et al.
Published: (2025)
Memories Retrieved from Many Paths: A Multi-Prefix Framework for Robust Detection of Training Data Leakage in Large Language Models
by: Dang, Trung Cuong, et al.
Published: (2025)
by: Dang, Trung Cuong, et al.
Published: (2025)
What Do Language Models Learn in Context? The Structured Task Hypothesis
by: Li, Jiaoda, et al.
Published: (2024)
by: Li, Jiaoda, et al.
Published: (2024)
Learning State-Tracking from Code Using Linear RNNs
by: Siems, Julien, et al.
Published: (2026)
by: Siems, Julien, et al.
Published: (2026)
Similar Items
-
Offline Preference Optimization via Maximum Marginal Likelihood Estimation
by: Najafi, Saeed, et al.
Published: (2025) -
The Linear Representation Hypothesis and the Geometry of Large Language Models
by: Park, Kiho, et al.
Published: (2023) -
Lizard: An Efficient Linearization Framework for Large Language Models
by: Van Nguyen, Chien, et al.
Published: (2025) -
Linear Representation Transferability Hypothesis: Leveraging Small Models to Steer Large Models
by: Bello, Femi, et al.
Published: (2025) -
Structured Pruning for Diverse Best-of-N Reasoning Optimization
by: Nguyen, Hieu Trung, et al.
Published: (2025)