Saved in:
| Main Authors: | Li, Rui, Wang, Guoyin, Li, Jiwei |
|---|---|
| Format: | Preprint |
| Published: |
2023
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2309.14681 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Similarity-based Neighbor Selection for Graph LLMs
by: Li, Rui, et al.
Published: (2024)
by: Li, Rui, et al.
Published: (2024)
Packing Analysis: Packing Is More Appropriate for Large Models or Datasets in Supervised Fine-tuning
by: Wang, Shuhe, et al.
Published: (2024)
by: Wang, Shuhe, et al.
Published: (2024)
Reinforcement Learning Enhanced LLMs: A Survey
by: Wang, Shuhe, et al.
Published: (2024)
by: Wang, Shuhe, et al.
Published: (2024)
Turn That Frown Upside Down: FaceID Customization via Cross-Training Data
by: Wang, Shuhe, et al.
Published: (2025)
by: Wang, Shuhe, et al.
Published: (2025)
Cloud Model Characteristic Function Auto-Encoder: Integrating Cloud Model Theory with MMD Regularization for Enhanced Generative Modeling
by: Hu, Biao, et al.
Published: (2025)
by: Hu, Biao, et al.
Published: (2025)
Picky LLMs and Unreliable RMs: An Empirical Study on Safety Alignment after Instruction Tuning
by: Li, Guanlin, et al.
Published: (2025)
by: Li, Guanlin, et al.
Published: (2025)
Causally Sufficient and Necessary Feature Expansion for Class-Incremental Learning
by: Zhang, Zhen, et al.
Published: (2026)
by: Zhang, Zhen, et al.
Published: (2026)
Are Expressive Models Truly Necessary for Offline RL?
by: Wang, Guan, et al.
Published: (2024)
by: Wang, Guan, et al.
Published: (2024)
Instruction Tuning for Large Language Models: A Survey
by: Zhang, Shengyu, et al.
Published: (2023)
by: Zhang, Shengyu, et al.
Published: (2023)
Template-assisted Contrastive Learning of Task-oriented Dialogue Sentence Embeddings
by: Oh, Minsik, et al.
Published: (2023)
by: Oh, Minsik, et al.
Published: (2023)
Is Optimal Transport Necessary for Inverse Reinforcement Learning?
by: Dong, Zixuan, et al.
Published: (2025)
by: Dong, Zixuan, et al.
Published: (2025)
CUDA-L2: Surpassing cuBLAS Performance for Matrix Multiplication through Reinforcement Learning
by: Su, Songqiao, et al.
Published: (2025)
by: Su, Songqiao, et al.
Published: (2025)
Granular-Ball-Induced Multiple Kernel K-Means
by: Xia, Shuyin, et al.
Published: (2025)
by: Xia, Shuyin, et al.
Published: (2025)
Grounding Computer Use Agents on Human Demonstrations
by: Feizi, Aarash, et al.
Published: (2025)
by: Feizi, Aarash, et al.
Published: (2025)
Learning to Answer from Correct Demonstrations
by: Joshi, Nirmit, et al.
Published: (2025)
by: Joshi, Nirmit, et al.
Published: (2025)
Granular-ball computing: an efficient, robust, and interpretable adaptive multi-granularity representation and computation method
by: Xia, Shuyin, et al.
Published: (2023)
by: Xia, Shuyin, et al.
Published: (2023)
When a Robot is More Capable than a Human: Learning from Constrained Demonstrators
by: Li, Xinhu, et al.
Published: (2025)
by: Li, Xinhu, et al.
Published: (2025)
A Theoretical Analysis of Compositional Generalization in Neural Networks: A Necessary and Sufficient Condition
by: Li, Yuanpeng
Published: (2025)
by: Li, Yuanpeng
Published: (2025)
On-Policy RL Meets Off-Policy Experts: Harmonizing Supervised Fine-Tuning and Reinforcement Learning via Dynamic Weighting
by: Zhang, Wenhao, et al.
Published: (2025)
by: Zhang, Wenhao, et al.
Published: (2025)
Mastering the Minority: An Uncertainty-guided Multi-Expert Framework for Challenging-tailed Sequence Learning
by: Wang, Ye, et al.
Published: (2026)
by: Wang, Ye, et al.
Published: (2026)
Not All Layers of LLMs Are Necessary During Inference
by: Fan, Siqi, et al.
Published: (2024)
by: Fan, Siqi, et al.
Published: (2024)
Is Prompt Selection Necessary for Task-Free Online Continual Learning?
by: Park, Seoyoung, et al.
Published: (2026)
by: Park, Seoyoung, et al.
Published: (2026)
Learning Quadruped Walking from Seconds of Demonstration
by: Zhang, Ruipeng, et al.
Published: (2026)
by: Zhang, Ruipeng, et al.
Published: (2026)
Implicit In-context Learning
by: Li, Zhuowei, et al.
Published: (2024)
by: Li, Zhuowei, et al.
Published: (2024)
CRINN: Contrastive Reinforcement Learning for Approximate Nearest Neighbor Search
by: Li, Xiaoya, et al.
Published: (2025)
by: Li, Xiaoya, et al.
Published: (2025)
Implicit Federated In-context Learning For Task-Specific LLM Fine-Tuning
by: Li, Dongcheng, et al.
Published: (2025)
by: Li, Dongcheng, et al.
Published: (2025)
Data-Centric Machine Learning for Earth Observation: Necessary and Sufficient Features
by: Najjar, Hiba, et al.
Published: (2024)
by: Najjar, Hiba, et al.
Published: (2024)
Understanding the Dynamics of Demonstration Conflict in In-Context Learning
by: Jiao, Difan, et al.
Published: (2026)
by: Jiao, Difan, et al.
Published: (2026)
Is Monotonic Sampling Necessary in Diffusion Models?
by: Khan, Muhammad Haris
Published: (2026)
by: Khan, Muhammad Haris
Published: (2026)
GBFRS: Robust Fuzzy Rough Sets via Granular-ball Computing
by: Xia, Shuyin, et al.
Published: (2025)
by: Xia, Shuyin, et al.
Published: (2025)
Learning to Select In-Context Demonstration Preferred by Large Language Model
by: Zhang, Zheng, et al.
Published: (2025)
by: Zhang, Zheng, et al.
Published: (2025)
A Unified Linear Programming Framework for Offline Reward Learning from Human Demonstrations and Feedback
by: Kim, Kihyun, et al.
Published: (2024)
by: Kim, Kihyun, et al.
Published: (2024)
LMAct: A Benchmark for In-Context Imitation Learning with Long Multimodal Demonstrations
by: Ruoss, Anian, et al.
Published: (2024)
by: Ruoss, Anian, et al.
Published: (2024)
Unraveling the Mechanics of Learning-Based Demonstration Selection for In-Context Learning
by: Liu, Hui, et al.
Published: (2024)
by: Liu, Hui, et al.
Published: (2024)
Retrieval-Augmented Mixture of LoRA Experts for Uploadable Machine Learning
by: Zhao, Ziyu, et al.
Published: (2024)
by: Zhao, Ziyu, et al.
Published: (2024)
From Demonstrations to Rewards: Alignment Without Explicit Human Preferences
by: Zeng, Siliang, et al.
Published: (2025)
by: Zeng, Siliang, et al.
Published: (2025)
Sufficient and Necessary Explanations (and What Lies in Between)
by: Bharti, Beepul, et al.
Published: (2024)
by: Bharti, Beepul, et al.
Published: (2024)
Are Expressive Encoders Necessary for Discrete Graph Generation?
by: Revolinsky, Jay, et al.
Published: (2026)
by: Revolinsky, Jay, et al.
Published: (2026)
Is PRM Necessary? Problem-Solving RL Implicitly Induces PRM Capability in LLMs
by: Feng, Zhangying, et al.
Published: (2025)
by: Feng, Zhangying, et al.
Published: (2025)
CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning
by: Li, Xiaoya, et al.
Published: (2025)
by: Li, Xiaoya, et al.
Published: (2025)
Similar Items
-
Similarity-based Neighbor Selection for Graph LLMs
by: Li, Rui, et al.
Published: (2024) -
Packing Analysis: Packing Is More Appropriate for Large Models or Datasets in Supervised Fine-tuning
by: Wang, Shuhe, et al.
Published: (2024) -
Reinforcement Learning Enhanced LLMs: A Survey
by: Wang, Shuhe, et al.
Published: (2024) -
Turn That Frown Upside Down: FaceID Customization via Cross-Training Data
by: Wang, Shuhe, et al.
Published: (2025) -
Cloud Model Characteristic Function Auto-Encoder: Integrating Cloud Model Theory with MMD Regularization for Enhanced Generative Modeling
by: Hu, Biao, et al.
Published: (2025)