Library Catalog

Bibliographic Details
Main Authors:	Zweiger, Adam; Pari, Jyothish; Guo, Han; Akyürek, Ekin; Kim, Yoon; Agrawal, Pulkit
Format:	Preprint
Published:	2025
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2506.10943

Similar Items

The Surprising Effectiveness of Test-Time Training for Few-Shot Learning
by: Akyürek, Ekin, et al.
Published: (2024)

Collective Model Intelligence Requires Compatible Specialization
by: Pari, Jyothish, et al.
Published: (2024)

RL's Razor: Why Online Reinforcement Learning Forgets Less
by: Shenfeld, Idan, et al.
Published: (2025)

General Intelligence Requires Reward-based Pretraining
by: Han, Seungwook, et al.
Published: (2025)

Efficient Diffusion Transformer Policies with Mixture of Expert Denoisers for Multitask Learning
by: Reuss, Moritz, et al.
Published: (2024)

In-Context Language Learning: Architectures and Algorithms
by: Akyürek, Ekin, et al.
Published: (2024)

Fast KV Compaction via Attention Matching
by: Zweiger, Adam, et al.
Published: (2026)

Few-Shot Task Learning through Inverse Generative Modeling
by: Netanyahu, Aviv, et al.
Published: (2024)

Is Active Persona Inference Necessary for Aligning Small Models to Personal Preferences?
by: Tang, Zilu, et al.
Published: (2025)

Value Augmented Sampling for Language Model Alignment and Personalization
by: Han, Seungwook, et al.
Published: (2024)

Training Language Models via Neural Cellular Automata
by: Lee, Dan, et al.
Published: (2026)

Language Model Personalization via Reward Factorization
by: Shenfeld, Idan, et al.
Published: (2025)

Self-Distillation Enables Continual Learning
by: Shenfeld, Idan, et al.
Published: (2026)

H2LooP Spark Preview: Continual Pretraining of Large Language Models for Low-Level Embedded Systems Code
by: Singh, Amit, et al.
Published: (2026)

Learning Linear Attention in Polynomial Time
by: Yau, Morris, et al.
Published: (2024)

Emergence and Effectiveness of Task Vectors in In-Context Learning: An Encoder Decoder Perspective
by: Han, Seungwook, et al.
Published: (2024)

Leveraging Manifold Embeddings for Enhanced Graph Transformer Representations and Learning
by: Jyothish, Ankit, et al.
Published: (2025)

Deductive Closure Training of Language Models for Coherence, Accuracy, and Updatability
by: Akyürek, Afra Feyza, et al.
Published: (2024)

Automatic Environment Shaping is the Next Frontier in RL
by: Park, Younghyo, et al.
Published: (2024)

JUICER: Data-Efficient Imitation Learning for Robotic Assembly
by: Ankile, Lars, et al.
Published: (2024)

Mechanistic Interpretability of LoRA-Adapted Language Models for Nuclear Reactor Safety Applications
by: Lee, Yoon Pyo
Published: (2025)

TGRL: An Algorithm for Teacher Guided Reinforcement Learning
by: Shenfeld, Idan, et al.
Published: (2023)

FAST-Q: Fast-track Exploration with Adversarially Balanced State Representations for Counterfactual Action Estimation in Offline Reinforcement Learning
by: Agrawal, Pulkit, et al.
Published: (2025)

Going Beyond Heuristics by Imposing Policy Improvement as a Constraint
by: Lee, Chi-Chang, et al.
Published: (2025)

Separating Intrinsic Ambiguity from Estimation Uncertainty in Deep Generative Models for Linear Inverse Problems
by: Guo, Yuxin, et al.
Published: (2026)

Fast MoE Inference via Predictive Prefetching and Expert Replication
by: Jyothish, Ankit, et al.
Published: (2026)

From Imitation to Refinement -- Residual RL for Precise Assembly
by: Ankile, Lars, et al.
Published: (2024)

LQ-LoRA: Low-rank Plus Quantized Matrix Decomposition for Efficient Language Model Finetuning
by: Guo, Han, et al.
Published: (2023)

Vegetable Peeling: A Case Study in Constrained Dexterous Manipulation
by: Chen, Tao, et al.
Published: (2024)

Bridging the Sim-to-Real Gap for Athletic Loco-Manipulation
by: Fey, Nolan, et al.
Published: (2025)

SoftMimic: Learning Compliant Whole-body Control from Examples
by: Margolis, Gabriel B., et al.
Published: (2025)

Random Latent Exploration for Deep Reinforcement Learning
by: Mahankali, Srinath, et al.
Published: (2024)

Explainable and Interpretable Forecasts on Non-Smooth Multivariate Time Series for Responsible Gameplay
by: Jagirdar, Hussain, et al.
Published: (2025)

SelfReplay: Adapting Self-Supervised Sensory Models via Adaptive Meta-Task Replay
by: Yoon, Hyungjun, et al.
Published: (2024)

Learning Multimodal Behaviors from Scratch with Diffusion Policy Gradient
by: Li, Zechu, et al.
Published: (2024)

Learning Force Control for Legged Manipulation
by: Portela, Tifanny, et al.
Published: (2024)

Curiosity-driven Red-teaming for Large Language Models
by: Hong, Zhang-Wei, et al.
Published: (2024)

Aligning Robot and Human Representations
by: Bobu, Andreea, et al.
Published: (2023)

ORSO: Accelerating Reward Design via Online Reward Selection and Policy Optimization
by: Zhang, Chen Bo Calvin, et al.
Published: (2024)

ROER: Regularized Optimal Experience Replay
by: Li, Changling, et al.
Published: (2024)