:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Author:	Adamczyk, Jacob
Format:	Preprint
Published:	2025
Subjects:	Machine Learning Artificial Intelligence
Online Access:	https://arxiv.org/abs/2501.09081
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Exploration Behavior of Untrained Policies
by: Adamczyk, Jacob
Published: (2025)

Maximum Entropy Exploration Without the Rollouts
by: Adamczyk, Jacob, et al.
Published: (2026)

Thermodynamics of Reinforcement Learning Curricula
by: Adamczyk, Jacob, et al.
Published: (2026)

Bootstrapped Reward Shaping
by: Adamczyk, Jacob, et al.
Published: (2025)

EVAL: EigenVector-based Average-reward Learning
by: Adamczyk, Jacob, et al.
Published: (2025)

Average-Reward Soft Actor-Critic
by: Adamczyk, Jacob, et al.
Published: (2025)

Boosting Soft Q-Learning by Bounding
by: Adamczyk, Jacob, et al.
Published: (2024)

Evaluating machine learning models for predicting pesticide toxicity to honey bees
by: Adamczyk, Jakub, et al.
Published: (2025)

Benchmarking Pretrained Molecular Embedding Models For Molecular Representation Learning
by: Praski, Mateusz, et al.
Published: (2025)

Active Inference with Reusable State-Dependent Value Profiles
by: Poschl, Jacob
Published: (2025)

DAWM: Diffusion Action World Models for Offline Reinforcement Learning via Action-Inferred Transitions
by: Li, Zongyue, et al.
Published: (2025)

Inferring Reward Machines and Transition Machines from Partially Observable Markov Decision Processes
by: Wu, Yuly, et al.
Published: (2025)

Transitive RL: Value Learning via Divide and Conquer
by: Park, Seohong, et al.
Published: (2025)

Inferring response times of perceptual decisions with Poisson variational autoencoders
by: Johnson, Hayden R., et al.
Published: (2025)

Universal Value-Function Uncertainties
by: Zanger, Moritz A., et al.
Published: (2025)

Positive-Unlabeled Constraint Learning for Inferring Nonlinear Continuous Constraints Functions from Expert Demonstrations
by: Peng, Baiyu, et al.
Published: (2024)

Executable Functional Abstractions: Inferring Generative Programs for Advanced Math Problems
by: Khan, Zaid, et al.
Published: (2025)

Quasimetric Value Functions with Dense Rewards
by: Valieva, Khadichabonu, et al.
Published: (2024)

Learning Exposure Mapping Functions for Inferring Heterogeneous Peer Effects
by: Adhikari, Shishir, et al.
Published: (2025)

Massively Scaling Explicit Policy-conditioned Value Functions
by: Bohlinger, Nico, et al.
Published: (2025)

VIPO: Value Function Inconsistency Penalized Offline Reinforcement Learning
by: Chen, Xuyang, et al.
Published: (2025)

Adaptive Exploration for Data-Efficient General Value Function Evaluations
by: Jain, Arushi, et al.
Published: (2024)

Stable Offline Value Function Learning with Bisimulation-based Representations
by: Pavse, Brahma S., et al.
Published: (2024)

Tensor Low-rank Approximation of Finite-horizon Value Functions
by: Rozada, Sergio, et al.
Published: (2024)

Do Enterprise Systems Need Learned World Models? The Importance of Context to Infer Dynamics
by: Nair, Jishnu Sethumadhavan, et al.
Published: (2026)

Catapult Dynamics and Phase Transitions in Quadratic Nets
by: Meltzer, David, et al.
Published: (2023)

LOCAL: Learning with Orientation Matrix to Infer Causal Structure from Time Series Data
by: Zhang, Jiajun, et al.
Published: (2024)

Bayesian Optimization for Function-Valued Responses under Min-Max Criteria
by: Ahadi, Pouya, et al.
Published: (2025)

Tensor and Matrix Low-Rank Value-Function Approximation in Reinforcement Learning
by: Rozada, Sergio, et al.
Published: (2022)

On the Limited Representational Power of Value Functions and its Links to Statistical (In)Efficiency
by: Cheikhi, David, et al.
Published: (2024)

Is Value Functions Estimation with Classification Plug-and-play for Offline Reinforcement Learning?
by: Tarasov, Denis, et al.
Published: (2024)

A Poisson-Gamma Dynamic Factor Model with Time-Varying Transition Dynamics
by: Wang, Jiahao, et al.
Published: (2024)

HeadInfer: Memory-Efficient LLM Inference by Head-wise Offloading
by: Luo, Cheng, et al.
Published: (2025)

ViVa: Video-Trained Value Functions for Guiding Online RL from Diverse Data
by: Dashora, Nitish, et al.
Published: (2025)

OSIL: Learning Offline Safe Imitation Policies with Safety Inferred from Non-preferred Trajectories
by: Burnwal, Returaj, et al.
Published: (2026)

Stop Regressing: Training Value Functions via Classification for Scalable Deep RL
by: Farebrother, Jesse, et al.
Published: (2024)

On the Curses of Future and History in Future-dependent Value Functions for Off-policy Evaluation
by: Zhang, Yuheng, et al.
Published: (2024)

Sample and Oracle Efficient Reinforcement Learning for MDPs with Linearly-Realizable Value Functions
by: Mhammedi, Zakaria
Published: (2024)

AirRadar: Inferring Nationwide Air Quality in China with Deep Neural Networks
by: Wang, Qiongyan, et al.
Published: (2025)

Inferring Behavior-Specific Context Improves Zero-Shot Generalization in Reinforcement Learning
by: Ndir, Tidiane Camaret, et al.
Published: (2024)