Saved in:
| Main Authors: | Zweiger, Adam, Pari, Jyothish, Guo, Han, Akyürek, Ekin, Kim, Yoon, Agrawal, Pulkit |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2506.10943 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
The Surprising Effectiveness of Test-Time Training for Few-Shot Learning
by: Akyürek, Ekin, et al.
Published: (2024)
by: Akyürek, Ekin, et al.
Published: (2024)
Collective Model Intelligence Requires Compatible Specialization
by: Pari, Jyothish, et al.
Published: (2024)
by: Pari, Jyothish, et al.
Published: (2024)
RL's Razor: Why Online Reinforcement Learning Forgets Less
by: Shenfeld, Idan, et al.
Published: (2025)
by: Shenfeld, Idan, et al.
Published: (2025)
General Intelligence Requires Reward-based Pretraining
by: Han, Seungwook, et al.
Published: (2025)
by: Han, Seungwook, et al.
Published: (2025)
Efficient Diffusion Transformer Policies with Mixture of Expert Denoisers for Multitask Learning
by: Reuss, Moritz, et al.
Published: (2024)
by: Reuss, Moritz, et al.
Published: (2024)
In-Context Language Learning: Architectures and Algorithms
by: Akyürek, Ekin, et al.
Published: (2024)
by: Akyürek, Ekin, et al.
Published: (2024)
Fast KV Compaction via Attention Matching
by: Zweiger, Adam, et al.
Published: (2026)
by: Zweiger, Adam, et al.
Published: (2026)
Few-Shot Task Learning through Inverse Generative Modeling
by: Netanyahu, Aviv, et al.
Published: (2024)
by: Netanyahu, Aviv, et al.
Published: (2024)
Is Active Persona Inference Necessary for Aligning Small Models to Personal Preferences?
by: Tang, Zilu, et al.
Published: (2025)
by: Tang, Zilu, et al.
Published: (2025)
Value Augmented Sampling for Language Model Alignment and Personalization
by: Han, Seungwook, et al.
Published: (2024)
by: Han, Seungwook, et al.
Published: (2024)
Training Language Models via Neural Cellular Automata
by: Lee, Dan, et al.
Published: (2026)
by: Lee, Dan, et al.
Published: (2026)
Language Model Personalization via Reward Factorization
by: Shenfeld, Idan, et al.
Published: (2025)
by: Shenfeld, Idan, et al.
Published: (2025)
Self-Distillation Enables Continual Learning
by: Shenfeld, Idan, et al.
Published: (2026)
by: Shenfeld, Idan, et al.
Published: (2026)
H2LooP Spark Preview: Continual Pretraining of Large Language Models for Low-Level Embedded Systems Code
by: Singh, Amit, et al.
Published: (2026)
by: Singh, Amit, et al.
Published: (2026)
Learning Linear Attention in Polynomial Time
by: Yau, Morris, et al.
Published: (2024)
by: Yau, Morris, et al.
Published: (2024)
Emergence and Effectiveness of Task Vectors in In-Context Learning: An Encoder Decoder Perspective
by: Han, Seungwook, et al.
Published: (2024)
by: Han, Seungwook, et al.
Published: (2024)
Leveraging Manifold Embeddings for Enhanced Graph Transformer Representations and Learning
by: Jyothish, Ankit, et al.
Published: (2025)
by: Jyothish, Ankit, et al.
Published: (2025)
Deductive Closure Training of Language Models for Coherence, Accuracy, and Updatability
by: Akyürek, Afra Feyza, et al.
Published: (2024)
by: Akyürek, Afra Feyza, et al.
Published: (2024)
Automatic Environment Shaping is the Next Frontier in RL
by: Park, Younghyo, et al.
Published: (2024)
by: Park, Younghyo, et al.
Published: (2024)
JUICER: Data-Efficient Imitation Learning for Robotic Assembly
by: Ankile, Lars, et al.
Published: (2024)
by: Ankile, Lars, et al.
Published: (2024)
Mechanistic Interpretability of LoRA-Adapted Language Models for Nuclear Reactor Safety Applications
by: Lee, Yoon Pyo
Published: (2025)
by: Lee, Yoon Pyo
Published: (2025)
TGRL: An Algorithm for Teacher Guided Reinforcement Learning
by: Shenfeld, Idan, et al.
Published: (2023)
by: Shenfeld, Idan, et al.
Published: (2023)
FAST-Q: Fast-track Exploration with Adversarially Balanced State Representations for Counterfactual Action Estimation in Offline Reinforcement Learning
by: Agrawal, Pulkit, et al.
Published: (2025)
by: Agrawal, Pulkit, et al.
Published: (2025)
Going Beyond Heuristics by Imposing Policy Improvement as a Constraint
by: Lee, Chi-Chang, et al.
Published: (2025)
by: Lee, Chi-Chang, et al.
Published: (2025)
Separating Intrinsic Ambiguity from Estimation Uncertainty in Deep Generative Models for Linear Inverse Problems
by: Guo, Yuxin, et al.
Published: (2026)
by: Guo, Yuxin, et al.
Published: (2026)
Fast MoE Inference via Predictive Prefetching and Expert Replication
by: Jyothish, Ankit, et al.
Published: (2026)
by: Jyothish, Ankit, et al.
Published: (2026)
From Imitation to Refinement -- Residual RL for Precise Assembly
by: Ankile, Lars, et al.
Published: (2024)
by: Ankile, Lars, et al.
Published: (2024)
LQ-LoRA: Low-rank Plus Quantized Matrix Decomposition for Efficient Language Model Finetuning
by: Guo, Han, et al.
Published: (2023)
by: Guo, Han, et al.
Published: (2023)
Vegetable Peeling: A Case Study in Constrained Dexterous Manipulation
by: Chen, Tao, et al.
Published: (2024)
by: Chen, Tao, et al.
Published: (2024)
Bridging the Sim-to-Real Gap for Athletic Loco-Manipulation
by: Fey, Nolan, et al.
Published: (2025)
by: Fey, Nolan, et al.
Published: (2025)
SoftMimic: Learning Compliant Whole-body Control from Examples
by: Margolis, Gabriel B., et al.
Published: (2025)
by: Margolis, Gabriel B., et al.
Published: (2025)
Random Latent Exploration for Deep Reinforcement Learning
by: Mahankali, Srinath, et al.
Published: (2024)
by: Mahankali, Srinath, et al.
Published: (2024)
Explainable and Interpretable Forecasts on Non-Smooth Multivariate Time Series for Responsible Gameplay
by: Jagirdar, Hussain, et al.
Published: (2025)
by: Jagirdar, Hussain, et al.
Published: (2025)
SelfReplay: Adapting Self-Supervised Sensory Models via Adaptive Meta-Task Replay
by: Yoon, Hyungjun, et al.
Published: (2024)
by: Yoon, Hyungjun, et al.
Published: (2024)
Learning Multimodal Behaviors from Scratch with Diffusion Policy Gradient
by: Li, Zechu, et al.
Published: (2024)
by: Li, Zechu, et al.
Published: (2024)
Learning Force Control for Legged Manipulation
by: Portela, Tifanny, et al.
Published: (2024)
by: Portela, Tifanny, et al.
Published: (2024)
Curiosity-driven Red-teaming for Large Language Models
by: Hong, Zhang-Wei, et al.
Published: (2024)
by: Hong, Zhang-Wei, et al.
Published: (2024)
Aligning Robot and Human Representations
by: Bobu, Andreea, et al.
Published: (2023)
by: Bobu, Andreea, et al.
Published: (2023)
ORSO: Accelerating Reward Design via Online Reward Selection and Policy Optimization
by: Zhang, Chen Bo Calvin, et al.
Published: (2024)
by: Zhang, Chen Bo Calvin, et al.
Published: (2024)
ROER: Regularized Optimal Experience Replay
by: Li, Changling, et al.
Published: (2024)
by: Li, Changling, et al.
Published: (2024)
Similar Items
-
The Surprising Effectiveness of Test-Time Training for Few-Shot Learning
by: Akyürek, Ekin, et al.
Published: (2024) -
Collective Model Intelligence Requires Compatible Specialization
by: Pari, Jyothish, et al.
Published: (2024) -
RL's Razor: Why Online Reinforcement Learning Forgets Less
by: Shenfeld, Idan, et al.
Published: (2025) -
General Intelligence Requires Reward-based Pretraining
by: Han, Seungwook, et al.
Published: (2025) -
Efficient Diffusion Transformer Policies with Mixture of Expert Denoisers for Multitask Learning
by: Reuss, Moritz, et al.
Published: (2024)