Saved in:
| Main Authors: | Lu, Kai, Zhao, Siqi, Wan, Jiguang |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2411.14759 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Squeezing Lemons with Hammers: An Evaluation of AutoML and Tabular Deep Learning for Data-Scarce Classification Applications
by: Knauer, Ricardo, et al.
Published: (2024)
by: Knauer, Ricardo, et al.
Published: (2024)
Towards Efficient Online Tuning of VLM Agents via Counterfactual Soft Reinforcement Learning
by: Feng, Lang, et al.
Published: (2025)
by: Feng, Lang, et al.
Published: (2025)
Hammer: Robust Function-Calling for On-Device Language Models via Function Masking
by: Lin, Qiqiang, et al.
Published: (2024)
by: Lin, Qiqiang, et al.
Published: (2024)
Balanced Online Class-Incremental Learning via Dual Classifiers
by: Wen, Shunjie, et al.
Published: (2025)
by: Wen, Shunjie, et al.
Published: (2025)
Meta-Learning for Cold-Start Personalization in Prompt-Tuned LLMs
by: Zhao, Yushang, et al.
Published: (2025)
by: Zhao, Yushang, et al.
Published: (2025)
Toward Fair Federated Learning under Demographic Disparities and Data Imbalance
by: Wu, Qiming, et al.
Published: (2025)
by: Wu, Qiming, et al.
Published: (2025)
Premise Selection for a Lean Hammer
by: Zhu, Thomas, et al.
Published: (2025)
by: Zhu, Thomas, et al.
Published: (2025)
ROAD: Adaptive Data Mixing for Offline-to-Online Reinforcement Learning via Bi-Level Optimization
by: Yang, Letian, et al.
Published: (2026)
by: Yang, Letian, et al.
Published: (2026)
Towards Fast Safe Online Reinforcement Learning via Policy Finetuning
by: Chen, Keru, et al.
Published: (2024)
by: Chen, Keru, et al.
Published: (2024)
Learning-Zone Energy: Online Data Selection for Efficient RL Post-Training
by: Cui, Peng, et al.
Published: (2026)
by: Cui, Peng, et al.
Published: (2026)
Towards Multimodal Active Learning: Efficient Learning with Limited Paired Data
by: Zhang, Jiancheng, et al.
Published: (2025)
by: Zhang, Jiancheng, et al.
Published: (2025)
SAMG: Offline-to-Online Reinforcement Learning via State-Action-Conditional Offline Model Guidance
by: Zhang, Liyu, et al.
Published: (2024)
by: Zhang, Liyu, et al.
Published: (2024)
Beyond the Lower Bound: Bridging Regret Minimization and Best Arm Identification in Lexicographic Bandits
by: Xue, Bo, et al.
Published: (2025)
by: Xue, Bo, et al.
Published: (2025)
Provably Efficient Algorithm for Best Scoring Rule Identification in Online Principal-Agent Information Acquisition
by: Wang, Zichen, et al.
Published: (2025)
by: Wang, Zichen, et al.
Published: (2025)
FHBench: Towards Efficient and Personalized Federated Learning for Multimodal Healthcare
by: Wang, Penghao, et al.
Published: (2025)
by: Wang, Penghao, et al.
Published: (2025)
Efficient User Sequence Learning for Online Services via Compressed Graph Neural Networks
by: Wu, Yucheng, et al.
Published: (2024)
by: Wu, Yucheng, et al.
Published: (2024)
Scalpel vs. Hammer: GRPO Amplifies Existing Capabilities, SFT Replaces Them
by: Rajani, Neel, et al.
Published: (2025)
by: Rajani, Neel, et al.
Published: (2025)
Towards Achieving Near-optimal Utility for Privacy-Preserving Federated Learning via Data Generation and Parameter Distortion
by: Zhang, Xiaojin, et al.
Published: (2023)
by: Zhang, Xiaojin, et al.
Published: (2023)
ENOTO: Improving Offline-to-Online Reinforcement Learning with Q-Ensembles
by: Zhao, Kai, et al.
Published: (2023)
by: Zhao, Kai, et al.
Published: (2023)
Metis-SPECS: Decoupling Multimodal Learning via Self-distilled Preference-based Cold Start
by: Chen, Kun, et al.
Published: (2025)
by: Chen, Kun, et al.
Published: (2025)
Hierarchically Gated Experts for Efficient Online Continual Learning
by: Luong, Kevin, et al.
Published: (2024)
by: Luong, Kevin, et al.
Published: (2024)
FADE: Towards Fairness-aware Generation for Domain Generalization via Classifier-Guided Score-based Diffusion Models
by: Lin, Yujie, et al.
Published: (2024)
by: Lin, Yujie, et al.
Published: (2024)
Stackelberg Coupling of Online Representation Learning and Reinforcement Learning
by: Martinez, Fernando, et al.
Published: (2025)
by: Martinez, Fernando, et al.
Published: (2025)
Towards Efficient Online Exploration for Reinforcement Learning with Human Feedback
by: Li, Gen, et al.
Published: (2025)
by: Li, Gen, et al.
Published: (2025)
Categorical Data Clustering via Value Order Estimated Distance Metric Learning
by: Zhang, Yiqun, et al.
Published: (2024)
by: Zhang, Yiqun, et al.
Published: (2024)
Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration
by: Wilcoxson, Max, et al.
Published: (2024)
by: Wilcoxson, Max, et al.
Published: (2024)
Communication-Efficient Diffusion Denoising Parallelization via Reuse-then-Predict Mechanism
by: Wang, Kunyun, et al.
Published: (2025)
by: Wang, Kunyun, et al.
Published: (2025)
Reinforcement Learning Gradients as Vitamin for Online Finetuning Decision Transformers
by: Yan, Kai, et al.
Published: (2024)
by: Yan, Kai, et al.
Published: (2024)
Gradients as an Action: Towards Communication-Efficient Federated Recommender Systems via Adaptive Action Sharing
by: Lu, Zhufeng, et al.
Published: (2025)
by: Lu, Zhufeng, et al.
Published: (2025)
Efficient Data Selection for Multimodal Models via Incremental Optimization Utility
by: Jing, Jinhao, et al.
Published: (2026)
by: Jing, Jinhao, et al.
Published: (2026)
EDGE: Efficient Data Selection for LLM Agents via Guideline Effectiveness
by: Zhang, Yunxiao, et al.
Published: (2025)
by: Zhang, Yunxiao, et al.
Published: (2025)
Complementary Learning System Empowers Online Continual Learning of Vehicle Motion Forecasting in Smart Cities
by: Li, Zirui, et al.
Published: (2025)
by: Li, Zirui, et al.
Published: (2025)
The Three Regimes of Offline-to-Online Reinforcement Learning
by: Li, Lu, et al.
Published: (2025)
by: Li, Lu, et al.
Published: (2025)
Online Drift Detection with Maximum Concept Discrepancy
by: Wan, Ke, et al.
Published: (2024)
by: Wan, Ke, et al.
Published: (2024)
Online Gradient Boosting Decision Tree: In-Place Updates for Efficient Adding/Deleting Data
by: Lin, Huawei, et al.
Published: (2025)
by: Lin, Huawei, et al.
Published: (2025)
Efficient Beamforming Optimization for STAR-RIS-Assisted Communications: A Gradient-Based Meta Learning Approach
by: Yang, Dongdong, et al.
Published: (2025)
by: Yang, Dongdong, et al.
Published: (2025)
A Toolbox, Not a Hammer -- Multi-TAG: Scaling Math Reasoning with Multi-Tool Aggregation
by: Yao, Bohan, et al.
Published: (2025)
by: Yao, Bohan, et al.
Published: (2025)
Dynamic Environment Responsive Online Meta-Learning with Fairness Awareness
by: Zhao, Chen, et al.
Published: (2024)
by: Zhao, Chen, et al.
Published: (2024)
Online Multi-modal Root Cause Identification in Microservice Systems
by: Zheng, Lecheng, et al.
Published: (2024)
by: Zheng, Lecheng, et al.
Published: (2024)
PoiCGAN: A Targeted Poisoning Based on Feature-Label Joint Perturbation in Federated Learning
by: Liu, Tao, et al.
Published: (2026)
by: Liu, Tao, et al.
Published: (2026)
Similar Items
-
Squeezing Lemons with Hammers: An Evaluation of AutoML and Tabular Deep Learning for Data-Scarce Classification Applications
by: Knauer, Ricardo, et al.
Published: (2024) -
Towards Efficient Online Tuning of VLM Agents via Counterfactual Soft Reinforcement Learning
by: Feng, Lang, et al.
Published: (2025) -
Hammer: Robust Function-Calling for On-Device Language Models via Function Masking
by: Lin, Qiqiang, et al.
Published: (2024) -
Balanced Online Class-Incremental Learning via Dual Classifiers
by: Wen, Shunjie, et al.
Published: (2025) -
Meta-Learning for Cold-Start Personalization in Prompt-Tuned LLMs
by: Zhao, Yushang, et al.
Published: (2025)