:: Library Catalog

Copertina

Salvato in:

Dettagli Bibliografici
Autore principale:	Ortega, Pedro A.
Natura:	Preprint
Pubblicazione:	2026
Soggetti:	Machine Learning Artificial Intelligence
Accesso online:	https://arxiv.org/abs/2602.02912
Tags:	Aggiungi Tag Nessun Tag, puoi essere il primo ad aggiungerne!!

Documenti analoghi

BaNEL: Exploration Posteriors for Generative Modeling Using Only Negative Rewards
di: Lee, Sangyun, et al.
Pubblicazione: (2025)

Debiasing Reward Models by Representation Learning with Guarantees
di: Ng, Ignavier, et al.
Pubblicazione: (2025)

Auxiliary Reward Generation with Transition Distance Representation Learning
di: Li, Siyuan, et al.
Pubblicazione: (2024)

Representation Without Reward: A JEPA Audit for LLM Fine-Tuning
di: Sengupta, Biswa
Pubblicazione: (2026)

Temporal Representations for Exploration: Learning Complex Exploratory Behavior without Extrinsic Rewards
di: Mohamed, Faisal, et al.
Pubblicazione: (2026)

Knowledge Adaptation as Posterior Correction
di: Khan, Mohammad Emtiyaz
Pubblicazione: (2025)

Can We Really Learn One Representation to Optimize All Rewards?
di: Zheng, Chongyi, et al.
Pubblicazione: (2026)

Amortized In-Context Bayesian Posterior Estimation
di: Mittal, Sarthak, et al.
Pubblicazione: (2025)

Posterior Label Smoothing for Node Classification
di: Heo, Jaeseung, et al.
Pubblicazione: (2024)

SVRG and Beyond via Posterior Correction
di: Daheim, Nico, et al.
Pubblicazione: (2025)

Intrinsic Reward Policy Optimization for Sparse-Reward Environments
di: Cho, Minjae, et al.
Pubblicazione: (2026)

Attention-Based Reward Shaping for Sparse and Delayed Rewards
di: Holmes, Ian, et al.
Pubblicazione: (2025)

Reward Hacking Mitigation using Verifiable Composite Rewards
di: Tarek, Mirza Farhan Bin, et al.
Pubblicazione: (2025)

Repairing Reward Functions with Feedback to Mitigate Reward Hacking
di: Hatgis-Kessell, Stephane, et al.
Pubblicazione: (2025)

Reward Centering
di: Naik, Abhishek, et al.
Pubblicazione: (2024)

Adversarial Reward Auditing for Active Detection and Mitigation of Reward Hacking
di: Beigi, Mohammad, et al.
Pubblicazione: (2026)

Solving Diffusion Inverse Problems with Restart Posterior Sampling
di: Ahmed, Bilal, et al.
Pubblicazione: (2025)

Intentional Updates for Streaming Reinforcement Learning
di: Sharifnassab, Arsalan, et al.
Pubblicazione: (2026)

SemiReward: A General Reward Model for Semi-supervised Learning
di: Li, Siyuan, et al.
Pubblicazione: (2023)

Trust Region Reward Optimization and Proximal Inverse Reward Optimization Algorithm
di: Chen, Yang, et al.
Pubblicazione: (2025)

Beyond Reward Hacking: Causal Rewards for Large Language Model Alignment
di: Wang, Chaoqi, et al.
Pubblicazione: (2025)

Tiered Reward: Designing Rewards for Specification and Fast Learning of Desired Behavior
di: Zhou, Zhiyuan, et al.
Pubblicazione: (2022)

When Distance Distracts: Representation Distance Bias in BT-Loss for Reward Models
di: Xie, Tong, et al.
Pubblicazione: (2025)

Regularized Offline Policy Optimization with Posterior Hybrid Bayesian Belief
di: Lin, Hongqiang, et al.
Pubblicazione: (2026)

Uncertainty-aware Evaluation of Auxiliary Anomalies with the Expected Anomaly Posterior
di: Perini, Lorenzo, et al.
Pubblicazione: (2024)

Efficient Approximate Posterior Sampling with Annealed Langevin Monte Carlo
di: Parulekar, Advait, et al.
Pubblicazione: (2025)

Bootstrapped Reward Shaping
di: Adamczyk, Jacob, et al.
Pubblicazione: (2025)

Neural Reward Machines
di: Umili, Elena, et al.
Pubblicazione: (2024)

Numeric Reward Machines
di: Levina, Kristina, et al.
Pubblicazione: (2024)

What Fundamental Structure in Reward Functions Enables Efficient Sparse-Reward Learning?
di: Shihab, Ibne Farabi, et al.
Pubblicazione: (2025)

Which Rewards Matter? Reward Selection for Reinforcement Learning under Limited Feedback
di: Chaudhari, Shreyas, et al.
Pubblicazione: (2025)

The Update-Equivalence Framework for Decision-Time Planning
di: Sokota, Samuel, et al.
Pubblicazione: (2023)

RationalRewards: Reasoning Rewards Scale Visual Generation Both Training and Test Time
di: Wang, Haozhe, et al.
Pubblicazione: (2026)

MemReward: Graph-Based Experience Memory for LLM Reward Prediction with Limited Labels
di: Luo, Tianyang, et al.
Pubblicazione: (2026)

Towards Improving Reward Design in RL: A Reward Alignment Metric for RL Practitioners
di: Muslimani, Calarina, et al.
Pubblicazione: (2025)

InfoRM: Mitigating Reward Hacking in RLHF via Information-Theoretic Reward Modeling
di: Miao, Yuchun, et al.
Pubblicazione: (2024)

PRDP: Proximal Reward Difference Prediction for Large-Scale Reward Finetuning of Diffusion Models
di: Deng, Fei, et al.
Pubblicazione: (2024)

Efficient LLM Reasoning via Variational Posterior Guidance with Efficiency Awareness
di: Chen, Zizhao, et al.
Pubblicazione: (2026)

Posterior Mean Matching: Generative Modeling through Online Bayesian Inference
di: Salazar, Sebastian, et al.
Pubblicazione: (2024)

Coupled Data and Measurement Space Dynamics for Enhanced Diffusion Posterior Sampling
di: Hamidi, Shayan Mohajer, et al.
Pubblicazione: (2025)