| Main Authors: | Kumar, Ramnath, Ritscher, Kyle, Judy, Junmin, Liu, Lawrence, Hsieh, Cho-Jui |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Online Access: | https://arxiv.org/abs/2601.06441 |
Similar Items
FastLane: Efficient Routed Systems for Late-Interaction Retrieval
by: Kumar, Ramnath, et al.
Published: (2026)
Why Line Search when you can Plane Search? SO-Friendly Neural Networks allow Per-Iteration Optimization of Learning and Momentum Rates for Every Layer
by: Shea, Betty, et al.
Published: (2024)
Provably Robust Training of Quantum Circuit Classifiers Against Parameter Noise
by: Tecot, Lucas, et al.
Published: (2025)
Data Attribution for Diffusion Models: Timestep-induced Bias in Influence Estimation
by: Xie, Tong, et al.
Published: (2024)
Online Continuous Hyperparameter Optimization for Generalized Linear Contextual Bandits
by: Kang, Yue, et al.
Published: (2023)
Efficient Frameworks for Generalized Low-Rank Matrix Bandit Problems
by: Kang, Yue, et al.
Published: (2024)
Low-rank Matrix Bandits with Heavy-tailed Rewards
by: Kang, Yue, et al.
Published: (2024)
Solving for X and Beyond: Can Large Language Models Solve Complex Math Problems with More-Than-Two Unknowns?
by: Kao, Kuei-Chun, et al.
Published: (2024)
An Efficient Rehearsal Scheme for Catastrophic Forgetting Mitigation during Multi-stage Fine-tuning
by: Bai, Andrew, et al.
Published: (2024)
On the Loss of Context-awareness in General Instruction Fine-tuning
by: Wang, Yihan, et al.
Published: (2024)
Developing a Multi-Modal Machine Learning Model For Predicting Performance of Automotive Hood Frames
by: Indupally, Abhishek, et al.
Published: (2025)
How (and when) can you fit examples to logic-based hypothesis classes over infinite structures?
by: Benedikt, Michael, et al.
Published: (2026)
Do Synthetic Trajectories Reflect Real Reward Hacking? A Systematic Study on Monitoring In-the-Wild Hacking in Code Generation
by: Li, Lichen, et al.
Published: (2026)
Concepts or Skills? Rethinking Instruction Selection for Multi-modal Models
by: Bai, Andrew, et al.
Published: (2025)
Embedding Space Selection for Detecting Memorization and Fingerprinting in Generative Models
by: He, Jack, et al.
Published: (2024)
Learning label-label correlations in Extreme Multi-label Classification via Label Features
by: Kharbanda, Siddhant, et al.
Published: (2024)
If you can distinguish, you can express: Galois theory, Stone--Weierstrass, machine learning, and linguistics
by: Blum-Smith, Ben, et al.
Published: (2025)
Certified Training with Branch-and-Bound for Lyapunov-stable Neural Control
by: Shi, Zhouxing, et al.
Published: (2024)
On Discrete Prompt Optimization for Diffusion Models
by: Wang, Ruochen, et al.
Published: (2024)
Mitigating Bias in Dataset Distillation
by: Cui, Justin, et al.
Published: (2024)
CLUE: Concept-Level Uncertainty Estimation for Large Language Models
by: Wang, Yu-Hsiang, et al.
Published: (2024)
Recent Advances in Traffic Accident Analysis and Prediction: A Comprehensive Review of Machine Learning Techniques
by: Behboudi, Noushin, et al.
Published: (2024)
Efficient and Interpretable Information Retrieval for Product Question Answering with Heterogeneous Data
by: Biswas, Biplob, et al.
Published: (2024)
Expert Proximity as Surrogate Rewards for Single Demonstration Imitation Learning
by: Chiang, Chia-Cheng, et al.
Published: (2024)
POME: Post Optimization Model Edit via Muon-style Projection
by: Liu, Yong, et al.
Published: (2025)
LoRA Done RITE: Robust Invariant Transformation Equilibration for LoRA Optimization
by: Yen, Jui-Nan, et al.
Published: (2024)
IRIS: Intrinsic Reward Image Synthesis
by: Chen, Yihang, et al.
Published: (2025)
Why you don't overfit, and don't need Bayes if you only train for one epoch
by: Aitchison, Laurence
Published: (2024)
LLM-guided Hierarchical Search for End-to-end Reasoning Intensive Retrieval
by: Gupta, Nilesh, et al.
Published: (2025)
AutoRubric-T2I: Robust Rule-Based Reward Model for Text-to-Image Alignment
by: Kao, Kuei-Chun, et al.
Published: (2026)
Adversarial Examples Detection with Bayesian Neural Network
by: Li, Yao, et al.
Published: (2021)
Sparse MeZO: Less Parameters for Better Performance in Zeroth-Order LLM Fine-Tuning
by: Liu, Yong, et al.
Published: (2024)
Concept Gradient: Concept-based Interpretation Without Linear Assumption
by: Bai, Andrew, et al.
Published: (2022)
Scalable Deep Metric Learning on Attributed Graphs
by: Li, Xiang, et al.
Published: (2024)
Federated Contrastive Learning of Graph-Level Representations
by: Li, Xiang, et al.
Published: (2024)
Neural Network Verification with Branch-and-Bound for General Nonlinearities
by: Shi, Zhouxing, et al.
Published: (2024)
Rethinking RL Evaluation: Can Benchmarks Truly Reveal Failures of RL Methods?
by: Chen, Zihan, et al.
Published: (2025)
Two-stage LLM Fine-tuning with Less Specialization and More Generalization
by: Wang, Yihan, et al.
Published: (2022)
Rethinking Neural-based Matrix Inversion: Why can't, and Where can
by: Ji, Yuliang, et al.
Published: (2025)
GreenPhase: A Green Learning Approach for Earthquake Phase Picking
by: Wu, Yixing, et al.
Published: (2026)