:: Library Catalog

Bibliographic Details
Main Authors:	Subramaniam, Vighnesh; Conwell, Colin; Katz, Boris; Barbu, Andrei; Cheung, Brian
Format:	Preprint
Published:	2025
Subjects:	Machine Learning; Artificial Intelligence; Computation and Language
Online Access:	https://arxiv.org/abs/2512.04198

Similar Items

Training the Untrainable: Introducing Inductive Bias via Representational Alignment
by: Subramaniam, Vighnesh, et al.
Published: (2024)

Revealing Vision-Language Integration in the Brain with Multimodal Networks
by: Subramaniam, Vighnesh, et al.
Published: (2024)

Multiagent Finetuning: Self Improvement with Diverse Reasoning Chains
by: Subramaniam, Vighnesh, et al.
Published: (2025)

Population Transformer: Learning Population-level Representations of Neural Activity
by: Chau, Geeling, et al.
Published: (2024)

Fine-Tuning a Time Series Foundation Model with Wasserstein Loss
by: Chernov, Andrei
Published: (2024)

Backward Lens: Projecting Language Model Gradients into the Vocabulary Space
by: Katz, Shahar, et al.
Published: (2024)

Fairness Aware Reward Optimization
by: Choi, Ching Lam, et al.
Published: (2026)

Bridging the Semantic Gap for Categorical Data Clustering via Large Language Models
by: Yang, Zihua, et al.
Published: (2026)

Supernova: Achieving More with Less in Transformer Architectures
by: Tanase, Andrei-Valentin, et al.
Published: (2025)

SupraTok: Cross-Boundary Tokenization for Enhanced Language Model Performance
by: Tănase, Andrei-Valentin, et al.
Published: (2025)

Post-training makes large language models less human-like
by: Binz, Marcel, et al.
Published: (2026)

Parameter Efficient Multimodal Instruction Tuning for Romanian Vision Language Models
by: Dima, George-Andrei, et al.
Published: (2025)

HAL: Inducing Human-likeness in LLMs with Alignment
by: Hasan, Masum, et al.
Published: (2026)

SciBench: Evaluating College-Level Scientific Problem-Solving Abilities of Large Language Models
by: Wang, Xiaoxuan, et al.
Published: (2023)

Ensemble Distillation for Unsupervised Constituency Parsing
by: Shayegh, Behzad, et al.
Published: (2023)

Mechanistic Interpretability of GPT-like Models on Summarization Tasks
by: Mishra, Anurag
Published: (2025)

Position: The Most Expensive Part of an LLM should be its Training Data
by: Kandpal, Nikhil, et al.
Published: (2025)

Task-Informed Anti-Curriculum by Masking Improves Downstream Performance on Text
by: Jarca, Andrei, et al.
Published: (2025)

Building Large-Scale English-Romanian Literary Translation Resources with Open Models
by: Nadas, Mihai, et al.
Published: (2025)

Exploring Major Transitions in the Evolution of Biological Cognition With Artificial Neural Networks
by: Voudouris, Konstantinos, et al.
Published: (2025)

ComPEFT: Compression for Communicating Parameter Efficient Updates via Sparsification and Quantization
by: Yadav, Prateek, et al.
Published: (2023)

Unsupervised Learning of Disentangled Representations from Video
by: Denton, Remi, et al.
Published: (2017)

Star Attention: Efficient LLM Inference over Long Sequences
by: Acharya, Shantanu, et al.
Published: (2024)

Error Diversity Matters: An Error-Resistant Ensemble Method for Unsupervised Dependency Parsing
by: Shayegh, Behzad, et al.
Published: (2024)

Online Speculative Decoding
by: Liu, Xiaoxuan, et al.
Published: (2023)

Language models show human-like content effects on reasoning tasks
by: Dasgupta, Ishita, et al.
Published: (2022)

Beyond Labels: Aligning Large Language Models with Human-like Reasoning
by: Kabir, Muhammad Rafsan, et al.
Published: (2024)

UniMaia: Steering Chess Policies with Language for Human-like Play
by: Siu, Sherman, et al.
Published: (2026)

Reduction of Supervision for Biomedical Knowledge Discovery
by: Theodoropoulos, Christos, et al.
Published: (2025)

Fast Context-Biasing for CTC and Transducer ASR models with CTC-based Word Spotter
by: Andrusenko, Andrei, et al.
Published: (2024)

Evolutionary Contrastive Distillation for Language Model Alignment
by: Katz-Samuels, Julian, et al.
Published: (2024)

Contextually Entangled Gradient Mapping for Optimized LLM Comprehension
by: Sisate, Colin, et al.
Published: (2025)

Alphazero-like Tree-Search can Guide Large Language Model Decoding and Training
by: Feng, Xidong, et al.
Published: (2023)

Blockwise Advantage Estimation for Multi-Objective RL with Verifiable Rewards
by: Pavlenko, Kirill, et al.
Published: (2026)

Unlocking Continual Learning Abilities in Language Models
by: Du, Wenyu, et al.
Published: (2024)

MENLO: From Preferences to Proficiency -- Evaluating and Modeling Native-like Quality Across 47 Languages
by: Whitehouse, Chenxi, et al.
Published: (2025)

Scalable Bayesian Low-Rank Adaptation of Large Language Models via Stochastic Variational Subspace Inference
by: Samplawski, Colin, et al.
Published: (2025)

Evaluation of LLMs on Syntax-Aware Code Fill-in-the-Middle Tasks
by: Gong, Linyuan, et al.
Published: (2024)

Do Robot Snakes Dream like Electric Sheep? Investigating the Effects of Architectural Inductive Biases on Hallucination
by: Huang, Jerry, et al.
Published: (2024)

Smaug: Fixing Failure Modes of Preference Optimisation with DPO-Positive
by: Pal, Arka, et al.
Published: (2024)