:: Library Catalog

Bibliographic Details
Main Authors:	Li, Rui; Wang, Guoyin; Li, Jiwei
Format:	Preprint
Published:	2023
Subjects:	Machine Learning; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2309.14681

Similar Items

Similarity-based Neighbor Selection for Graph LLMs
by: Li, Rui, et al.
Published: (2024)

Packing Analysis: Packing Is More Appropriate for Large Models or Datasets in Supervised Fine-tuning
by: Wang, Shuhe, et al.
Published: (2024)

Reinforcement Learning Enhanced LLMs: A Survey
by: Wang, Shuhe, et al.
Published: (2024)

Turn That Frown Upside Down: FaceID Customization via Cross-Training Data
by: Wang, Shuhe, et al.
Published: (2025)

Cloud Model Characteristic Function Auto-Encoder: Integrating Cloud Model Theory with MMD Regularization for Enhanced Generative Modeling
by: Hu, Biao, et al.
Published: (2025)

Picky LLMs and Unreliable RMs: An Empirical Study on Safety Alignment after Instruction Tuning
by: Li, Guanlin, et al.
Published: (2025)

Causally Sufficient and Necessary Feature Expansion for Class-Incremental Learning
by: Zhang, Zhen, et al.
Published: (2026)

Are Expressive Models Truly Necessary for Offline RL?
by: Wang, Guan, et al.
Published: (2024)

Instruction Tuning for Large Language Models: A Survey
by: Zhang, Shengyu, et al.
Published: (2023)

Template-assisted Contrastive Learning of Task-oriented Dialogue Sentence Embeddings
by: Oh, Minsik, et al.
Published: (2023)

Is Optimal Transport Necessary for Inverse Reinforcement Learning?
by: Dong, Zixuan, et al.
Published: (2025)

CUDA-L2: Surpassing cuBLAS Performance for Matrix Multiplication through Reinforcement Learning
by: Su, Songqiao, et al.
Published: (2025)

Granular-Ball-Induced Multiple Kernel K-Means
by: Xia, Shuyin, et al.
Published: (2025)

Grounding Computer Use Agents on Human Demonstrations
by: Feizi, Aarash, et al.
Published: (2025)

Learning to Answer from Correct Demonstrations
by: Joshi, Nirmit, et al.
Published: (2025)

Granular-ball computing: an efficient, robust, and interpretable adaptive multi-granularity representation and computation method
by: Xia, Shuyin, et al.
Published: (2023)

When a Robot is More Capable than a Human: Learning from Constrained Demonstrators
by: Li, Xinhu, et al.
Published: (2025)

A Theoretical Analysis of Compositional Generalization in Neural Networks: A Necessary and Sufficient Condition
by: Li, Yuanpeng
Published: (2025)

On-Policy RL Meets Off-Policy Experts: Harmonizing Supervised Fine-Tuning and Reinforcement Learning via Dynamic Weighting
by: Zhang, Wenhao, et al.
Published: (2025)

Mastering the Minority: An Uncertainty-guided Multi-Expert Framework for Challenging-tailed Sequence Learning
by: Wang, Ye, et al.
Published: (2026)

Not All Layers of LLMs Are Necessary During Inference
by: Fan, Siqi, et al.
Published: (2024)

Is Prompt Selection Necessary for Task-Free Online Continual Learning?
by: Park, Seoyoung, et al.
Published: (2026)

Learning Quadruped Walking from Seconds of Demonstration
by: Zhang, Ruipeng, et al.
Published: (2026)

Implicit In-context Learning
by: Li, Zhuowei, et al.
Published: (2024)

CRINN: Contrastive Reinforcement Learning for Approximate Nearest Neighbor Search
by: Li, Xiaoya, et al.
Published: (2025)

Implicit Federated In-context Learning For Task-Specific LLM Fine-Tuning
by: Li, Dongcheng, et al.
Published: (2025)

Data-Centric Machine Learning for Earth Observation: Necessary and Sufficient Features
by: Najjar, Hiba, et al.
Published: (2024)

Understanding the Dynamics of Demonstration Conflict in In-Context Learning
by: Jiao, Difan, et al.
Published: (2026)

Is Monotonic Sampling Necessary in Diffusion Models?
by: Khan, Muhammad Haris
Published: (2026)

GBFRS: Robust Fuzzy Rough Sets via Granular-ball Computing
by: Xia, Shuyin, et al.
Published: (2025)

Learning to Select In-Context Demonstration Preferred by Large Language Model
by: Zhang, Zheng, et al.
Published: (2025)

A Unified Linear Programming Framework for Offline Reward Learning from Human Demonstrations and Feedback
by: Kim, Kihyun, et al.
Published: (2024)

LMAct: A Benchmark for In-Context Imitation Learning with Long Multimodal Demonstrations
by: Ruoss, Anian, et al.
Published: (2024)

Unraveling the Mechanics of Learning-Based Demonstration Selection for In-Context Learning
by: Liu, Hui, et al.
Published: (2024)

Retrieval-Augmented Mixture of LoRA Experts for Uploadable Machine Learning
by: Zhao, Ziyu, et al.
Published: (2024)

From Demonstrations to Rewards: Alignment Without Explicit Human Preferences
by: Zeng, Siliang, et al.
Published: (2025)

Sufficient and Necessary Explanations (and What Lies in Between)
by: Bharti, Beepul, et al.
Published: (2024)

Are Expressive Encoders Necessary for Discrete Graph Generation?
by: Revolinsky, Jay, et al.
Published: (2026)

Is PRM Necessary? Problem-Solving RL Implicitly Induces PRM Capability in LLMs
by: Feng, Zhangying, et al.
Published: (2025)

CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning
by: Li, Xiaoya, et al.
Published: (2025)