Saved in:
| Main Authors: | Zhou, Ziqi, Zhang, Jingyue, Zhang, Jingyuan, He, Yangfan, Wang, Boyue, Shi, Tianyu, Khamis, Alaa |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2405.04135 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Efficient Test-Time Finetuning of LLMs via Convex Reconstruction and Gradient Caching
by: Khamis, Alaa, et al.
Published: (2026)
by: Khamis, Alaa, et al.
Published: (2026)
DDPM-MoCo: Advancing Industrial Surface Defect Generation and Detection with Generative and Contrastive Learning
by: He, Yangfan, et al.
Published: (2024)
by: He, Yangfan, et al.
Published: (2024)
Data-centric Federated Graph Learning with Large Language Models
by: Yan, Bo, et al.
Published: (2025)
by: Yan, Bo, et al.
Published: (2025)
WcDT: World-centric Diffusion Transformer for Traffic Scene Generation
by: Yang, Chen, et al.
Published: (2024)
by: Yang, Chen, et al.
Published: (2024)
Design of Reward Function on Reinforcement Learning for Automated Driving
by: Goto, Takeru, et al.
Published: (2025)
by: Goto, Takeru, et al.
Published: (2025)
TalkToAgent: A Human-centric Explanation of Reinforcement Learning Agents with Large Language Models
by: Kim, Haechang, et al.
Published: (2025)
by: Kim, Haechang, et al.
Published: (2025)
Cooper: Co-Optimizing Policy and Reward Models in Reinforcement Learning for Large Language Models
by: Hong, Haitao, et al.
Published: (2025)
by: Hong, Haitao, et al.
Published: (2025)
Reinforcement Learning with Inverse Rewards for World Model Post-training
by: Ye, Yang, et al.
Published: (2025)
by: Ye, Yang, et al.
Published: (2025)
Using Large Language Models to Automate and Expedite Reinforcement Learning with Reward Machine
by: Alsadat, Shayan Meshkat, et al.
Published: (2024)
by: Alsadat, Shayan Meshkat, et al.
Published: (2024)
MM-FusionNet: Context-Aware Dynamic Fusion for Multi-modal Fake News Detection with Large Vision-Language Models
by: He, Junhao, et al.
Published: (2025)
by: He, Junhao, et al.
Published: (2025)
Dynamic Multiple-Parameter Joint Time-Vertex Fractional Fourier Transform and its Intelligent Filtering Methods
by: Cui, Manjun, et al.
Published: (2025)
by: Cui, Manjun, et al.
Published: (2025)
Ego-centric Learning of Communicative World Models for Autonomous Driving
by: Wang, Hang, et al.
Published: (2025)
by: Wang, Hang, et al.
Published: (2025)
User-centric Subjective Leaderboard by Customizable Reward Modeling
by: Jia, Qi, et al.
Published: (2025)
by: Jia, Qi, et al.
Published: (2025)
UserLM-R1: Modeling Human Reasoning in User Language Models with Multi-Reward Reinforcement Learning
by: Zhang, Feng, et al.
Published: (2026)
by: Zhang, Feng, et al.
Published: (2026)
An Automated Reinforcement Learning Reward Design Framework with Large Language Model for Cooperative Platoon Coordination
by: Wei, Dixiao, et al.
Published: (2025)
by: Wei, Dixiao, et al.
Published: (2025)
Enhancing Commentary Strategies for Imperfect Information Card Games: A Study of Large Language Models in Guandan Commentary
by: Tao, Meiling, et al.
Published: (2024)
by: Tao, Meiling, et al.
Published: (2024)
Reflective Human-Machine Co-adaptation for Enhanced Text-to-Image Generation Dialogue System
by: Feng, Yuheng, et al.
Published: (2024)
by: Feng, Yuheng, et al.
Published: (2024)
RLHFPoison: Reward Poisoning Attack for Reinforcement Learning with Human Feedback in Large Language Models
by: Wang, Jiongxiao, et al.
Published: (2023)
by: Wang, Jiongxiao, et al.
Published: (2023)
Parameter-Efficient Fine-Tuning of Large Language Models for Unit Test Generation: An Empirical Study
by: Storhaug, André, et al.
Published: (2024)
by: Storhaug, André, et al.
Published: (2024)
Semantic Codebook Learning for Dynamic Recommendation Models
by: Lv, Zheqi, et al.
Published: (2024)
by: Lv, Zheqi, et al.
Published: (2024)
MaxCode: A Max-Reward Reinforcement Learning Framework for Automated Code Optimization
by: Ou, Jiefu, et al.
Published: (2026)
by: Ou, Jiefu, et al.
Published: (2026)
Instruct Large Language Models to Drive like Humans
by: Zhang, Ruijun, et al.
Published: (2024)
by: Zhang, Ruijun, et al.
Published: (2024)
PhoGAD: Graph-based Anomaly Behavior Detection with Persistent Homology Optimization
by: Yuan, Ziqi, et al.
Published: (2024)
by: Yuan, Ziqi, et al.
Published: (2024)
PINNsAgent: Automated PDE Surrogation with Large Language Models
by: Wuwu, Qingpo, et al.
Published: (2025)
by: Wuwu, Qingpo, et al.
Published: (2025)
ROAD: Responsibility-Oriented Reward Design for Reinforcement Learning in Autonomous Driving
by: Chen, Yongming, et al.
Published: (2025)
by: Chen, Yongming, et al.
Published: (2025)
Auto MC-Reward: Automated Dense Reward Design with Large Language Models for Minecraft
by: Li, Hao, et al.
Published: (2023)
by: Li, Hao, et al.
Published: (2023)
Chain of Uncertain Rewards with Large Language Models for Reinforcement Learning
by: Mo, Shentong
Published: (2026)
by: Mo, Shentong
Published: (2026)
Robust Reinforcement Learning from Human Feedback for Large Language Models Fine-Tuning
by: Ye, Kai, et al.
Published: (2025)
by: Ye, Kai, et al.
Published: (2025)
DWAFM: Dynamic Weighted Graph Structure Embedding Integrated with Attention and Frequency-Domain MLPs for Traffic Forecasting
by: Shi, Sen, et al.
Published: (2026)
by: Shi, Sen, et al.
Published: (2026)
Reinforcing Few-step Generators via Reward-Tilted Distribution Matching
by: Huang, Yushi, et al.
Published: (2026)
by: Huang, Yushi, et al.
Published: (2026)
EMPOWER: Evolutionary Medical Prompt Optimization With Reinforcement Learning
by: Chen, Yinda, et al.
Published: (2025)
by: Chen, Yinda, et al.
Published: (2025)
ReGraP-LLaVA: Reasoning enabled Graph-based Personalized Large Language and Vision Assistant
by: Xiang, Yifan, et al.
Published: (2025)
by: Xiang, Yifan, et al.
Published: (2025)
Reinforcement Learning-based Recommender Systems with Large Language Models for State Reward and Action Modeling
by: Wang, Jie, et al.
Published: (2024)
by: Wang, Jie, et al.
Published: (2024)
Solving Data-centric Tasks using Large Language Models
by: Barke, Shraddha, et al.
Published: (2024)
by: Barke, Shraddha, et al.
Published: (2024)
Harmony: A Human-Aware, Responsive, Modular Assistant with a Locally Deployed Large Language Model
by: Yin, Ziqi, et al.
Published: (2024)
by: Yin, Ziqi, et al.
Published: (2024)
A Software Engineering Perspective on Testing Large Language Models: Research, Practice, Tools and Benchmarks
by: Hudson, Sinclair, et al.
Published: (2024)
by: Hudson, Sinclair, et al.
Published: (2024)
Systematic Reward Gap Optimization for Mitigating VLM Hallucinations
by: He, Lehan, et al.
Published: (2024)
by: He, Lehan, et al.
Published: (2024)
Prior Constraints-based Reward Model Training for Aligning Large Language Models
by: Zhou, Hang, et al.
Published: (2024)
by: Zhou, Hang, et al.
Published: (2024)
Free-Mask: A Novel Paradigm of Integration Between the Segmentation Diffusion Model and Image Editing
by: Gao, Bo, et al.
Published: (2024)
by: Gao, Bo, et al.
Published: (2024)
REvolve: Reward Evolution with Large Language Models using Human Feedback
by: Hazra, Rishi, et al.
Published: (2024)
by: Hazra, Rishi, et al.
Published: (2024)
Similar Items
-
Efficient Test-Time Finetuning of LLMs via Convex Reconstruction and Gradient Caching
by: Khamis, Alaa, et al.
Published: (2026) -
DDPM-MoCo: Advancing Industrial Surface Defect Generation and Detection with Generative and Contrastive Learning
by: He, Yangfan, et al.
Published: (2024) -
Data-centric Federated Graph Learning with Large Language Models
by: Yan, Bo, et al.
Published: (2025) -
WcDT: World-centric Diffusion Transformer for Traffic Scene Generation
by: Yang, Chen, et al.
Published: (2024) -
Design of Reward Function on Reinforcement Learning for Automated Driving
by: Goto, Takeru, et al.
Published: (2025)