:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Nguyen, Trung, Leng, Yan
Format:	Preprint
Published:	2025
Subjects:	Machine Learning Computation and Language
Online Access:	https://arxiv.org/abs/2502.16385
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Offline Preference Optimization via Maximum Marginal Likelihood Estimation
by: Najafi, Saeed, et al.
Published: (2025)

The Linear Representation Hypothesis and the Geometry of Large Language Models
by: Park, Kiho, et al.
Published: (2023)

Lizard: An Efficient Linearization Framework for Large Language Models
by: Van Nguyen, Chien, et al.
Published: (2025)

Linear Representation Transferability Hypothesis: Leveraging Small Models to Steer Large Models
by: Bello, Femi, et al.
Published: (2025)

Structured Pruning for Diverse Best-of-N Reasoning Optimization
by: Nguyen, Hieu Trung, et al.
Published: (2025)

Task-driven Layerwise Additive Activation Intervention
by: Nguyen, Hieu Trung, et al.
Published: (2025)

Exploring the Linear Subspace Hypothesis in Gender Bias Mitigation
by: Vargas, Francisco, et al.
Published: (2020)

How Many Features Can a Language Model Store Under the Linear Representation Hypothesis?
by: Garg, Nikhil, et al.
Published: (2026)

Towards Understanding Multi-Task Learning (Generalization) of LLMs via Detecting and Exploring Task-Specific Neurons
by: Leng, Yongqi, et al.
Published: (2024)

Explainable Disentangled Representation Learning for Generalizable Authorship Attribution in the Era of Generative AI
by: Man, Hieu, et al.
Published: (2026)

Optimizing Multi-Stage Language Models for Effective Text Retrieval
by: Trung, Quang Hoang, et al.
Published: (2024)

Adaptive Two-Phase Finetuning LLMs for Japanese Legal Text Retrieval
by: Trung, Quang Hoang, et al.
Published: (2024)

On the Origins of Linear Representations in Large Language Models
by: Jiang, Yibo, et al.
Published: (2024)

Reasoning Planning for Language Models
by: Nguyen, Bao, et al.
Published: (2025)

HypoBench: Towards Systematic and Principled Benchmarking for Hypothesis Generation
by: Liu, Haokun, et al.
Published: (2025)

Momentum SVGD-EM for Accelerated Maximum Marginal Likelihood Estimation
by: Rozzio, Adam, et al.
Published: (2026)

Large Language Model-Enhanced Algorithm Selection: Towards Comprehensive Algorithm Representation
by: Wu, Xingyu, et al.
Published: (2023)

Don't Read Everything: A Curvature-Conditioned Query for Linear Attention
by: Le, Dong, et al.
Published: (2026)

Mixture-of-Personas Language Models for Population Simulation
by: Bui, Ngoc, et al.
Published: (2025)

Adaptive Rollout Allocation for Online Reinforcement Learning with Verifiable Rewards
by: Nguyen, Hieu Trung, et al.
Published: (2026)

Towards Generalising Neural Topical Representations
by: Yang, Xiaohao, et al.
Published: (2023)

NoveltyRank: A Retrieval-Augmented Framework for Conceptual Novelty Estimation in AI Research
by: Yan, Zhengxu, et al.
Published: (2025)

Interacting Particle Langevin Algorithm for Maximum Marginal Likelihood Estimation
by: Akyildiz, Ö. Deniz, et al.
Published: (2023)

SEA: Sparse Linear Attention with Estimated Attention Mask
by: Lee, Heejun, et al.
Published: (2023)

MIC: Maximizing Informational Capacity in Adaptive Representations via Isotropic Subspace Alignment
by: Hong, Dang Nguyen, et al.
Published: (2026)

Large Language Models Encode Semantics and Alignment in Linearly Separable Representations
by: Saglam, Baturay, et al.
Published: (2025)

Cross-Tokenizer Likelihood Scoring Algorithms for Language Model Distillation
by: Phan, Buu, et al.
Published: (2025)

VITRO: Vocabulary Inversion for Time-series Representation Optimization
by: Bellos, Filippos, et al.
Published: (2024)

Chameleon: A Flexible Data-mixing Framework for Language Model Pretraining and Finetuning
by: Xie, Wanyun, et al.
Published: (2025)

Scientific Hypothesis Generation and Validation: Methods, Datasets, and Future Directions
by: Kulkarni, Adithya, et al.
Published: (2025)

Exploiting LLMs for Automatic Hypothesis Assessment via a Logit-Based Calibrated Prior
by: Gong, Yue, et al.
Published: (2025)

Towards Efficient Active Learning in NLP via Pretrained Representations
by: Vysogorets, Artem, et al.
Published: (2024)

ReCaLL: Membership Inference via Relative Conditional Log-Likelihoods
by: Xie, Roy, et al.
Published: (2024)

Revisiting the Superficial Alignment Hypothesis
by: Raghavendra, Mohit, et al.
Published: (2024)

Maximum Score Routing For Mixture-of-Experts
by: Dong, Bowen, et al.
Published: (2025)

Failure Modes of Maximum Entropy RLHF
by: Çağatan, Ömer Veysel, et al.
Published: (2025)

Disentangling the Roles of Representation and Selection in Data Pruning
by: Du, Yupei, et al.
Published: (2025)

Memories Retrieved from Many Paths: A Multi-Prefix Framework for Robust Detection of Training Data Leakage in Large Language Models
by: Dang, Trung Cuong, et al.
Published: (2025)

What Do Language Models Learn in Context? The Structured Task Hypothesis
by: Li, Jiaoda, et al.
Published: (2024)

Learning State-Tracking from Code Using Linear RNNs
by: Siems, Julien, et al.
Published: (2026)