:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Lu, Kai, Zhao, Siqi, Wan, Jiguang
Format:	Preprint
Published:	2024
Subjects:	Machine Learning Artificial Intelligence
Online Access:	https://arxiv.org/abs/2411.14759
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Squeezing Lemons with Hammers: An Evaluation of AutoML and Tabular Deep Learning for Data-Scarce Classification Applications
by: Knauer, Ricardo, et al.
Published: (2024)

Towards Efficient Online Tuning of VLM Agents via Counterfactual Soft Reinforcement Learning
by: Feng, Lang, et al.
Published: (2025)

Hammer: Robust Function-Calling for On-Device Language Models via Function Masking
by: Lin, Qiqiang, et al.
Published: (2024)

Balanced Online Class-Incremental Learning via Dual Classifiers
by: Wen, Shunjie, et al.
Published: (2025)

Meta-Learning for Cold-Start Personalization in Prompt-Tuned LLMs
by: Zhao, Yushang, et al.
Published: (2025)

Toward Fair Federated Learning under Demographic Disparities and Data Imbalance
by: Wu, Qiming, et al.
Published: (2025)

Premise Selection for a Lean Hammer
by: Zhu, Thomas, et al.
Published: (2025)

ROAD: Adaptive Data Mixing for Offline-to-Online Reinforcement Learning via Bi-Level Optimization
by: Yang, Letian, et al.
Published: (2026)

Towards Fast Safe Online Reinforcement Learning via Policy Finetuning
by: Chen, Keru, et al.
Published: (2024)

Learning-Zone Energy: Online Data Selection for Efficient RL Post-Training
by: Cui, Peng, et al.
Published: (2026)

Towards Multimodal Active Learning: Efficient Learning with Limited Paired Data
by: Zhang, Jiancheng, et al.
Published: (2025)

SAMG: Offline-to-Online Reinforcement Learning via State-Action-Conditional Offline Model Guidance
by: Zhang, Liyu, et al.
Published: (2024)

Beyond the Lower Bound: Bridging Regret Minimization and Best Arm Identification in Lexicographic Bandits
by: Xue, Bo, et al.
Published: (2025)

Provably Efficient Algorithm for Best Scoring Rule Identification in Online Principal-Agent Information Acquisition
by: Wang, Zichen, et al.
Published: (2025)

FHBench: Towards Efficient and Personalized Federated Learning for Multimodal Healthcare
by: Wang, Penghao, et al.
Published: (2025)

Efficient User Sequence Learning for Online Services via Compressed Graph Neural Networks
by: Wu, Yucheng, et al.
Published: (2024)

Scalpel vs. Hammer: GRPO Amplifies Existing Capabilities, SFT Replaces Them
by: Rajani, Neel, et al.
Published: (2025)

Towards Achieving Near-optimal Utility for Privacy-Preserving Federated Learning via Data Generation and Parameter Distortion
by: Zhang, Xiaojin, et al.
Published: (2023)

ENOTO: Improving Offline-to-Online Reinforcement Learning with Q-Ensembles
by: Zhao, Kai, et al.
Published: (2023)

Metis-SPECS: Decoupling Multimodal Learning via Self-distilled Preference-based Cold Start
by: Chen, Kun, et al.
Published: (2025)

Hierarchically Gated Experts for Efficient Online Continual Learning
by: Luong, Kevin, et al.
Published: (2024)

FADE: Towards Fairness-aware Generation for Domain Generalization via Classifier-Guided Score-based Diffusion Models
by: Lin, Yujie, et al.
Published: (2024)

Stackelberg Coupling of Online Representation Learning and Reinforcement Learning
by: Martinez, Fernando, et al.
Published: (2025)

Towards Efficient Online Exploration for Reinforcement Learning with Human Feedback
by: Li, Gen, et al.
Published: (2025)

Categorical Data Clustering via Value Order Estimated Distance Metric Learning
by: Zhang, Yiqun, et al.
Published: (2024)

Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration
by: Wilcoxson, Max, et al.
Published: (2024)

Communication-Efficient Diffusion Denoising Parallelization via Reuse-then-Predict Mechanism
by: Wang, Kunyun, et al.
Published: (2025)

Reinforcement Learning Gradients as Vitamin for Online Finetuning Decision Transformers
by: Yan, Kai, et al.
Published: (2024)

Gradients as an Action: Towards Communication-Efficient Federated Recommender Systems via Adaptive Action Sharing
by: Lu, Zhufeng, et al.
Published: (2025)

Efficient Data Selection for Multimodal Models via Incremental Optimization Utility
by: Jing, Jinhao, et al.
Published: (2026)

EDGE: Efficient Data Selection for LLM Agents via Guideline Effectiveness
by: Zhang, Yunxiao, et al.
Published: (2025)

Complementary Learning System Empowers Online Continual Learning of Vehicle Motion Forecasting in Smart Cities
by: Li, Zirui, et al.
Published: (2025)

The Three Regimes of Offline-to-Online Reinforcement Learning
by: Li, Lu, et al.
Published: (2025)

Online Drift Detection with Maximum Concept Discrepancy
by: Wan, Ke, et al.
Published: (2024)

Online Gradient Boosting Decision Tree: In-Place Updates for Efficient Adding/Deleting Data
by: Lin, Huawei, et al.
Published: (2025)

Efficient Beamforming Optimization for STAR-RIS-Assisted Communications: A Gradient-Based Meta Learning Approach
by: Yang, Dongdong, et al.
Published: (2025)

A Toolbox, Not a Hammer -- Multi-TAG: Scaling Math Reasoning with Multi-Tool Aggregation
by: Yao, Bohan, et al.
Published: (2025)

Dynamic Environment Responsive Online Meta-Learning with Fairness Awareness
by: Zhao, Chen, et al.
Published: (2024)

Online Multi-modal Root Cause Identification in Microservice Systems
by: Zheng, Lecheng, et al.
Published: (2024)

PoiCGAN: A Targeted Poisoning Based on Feature-Label Joint Perturbation in Federated Learning
by: Liu, Tao, et al.
Published: (2026)