| Main Author: | Adamczyk, Jacob |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2501.09081 |
Similar Items
Exploration Behavior of Untrained Policies
by: Adamczyk, Jacob
Published: (2025)
Maximum Entropy Exploration Without the Rollouts
by: Adamczyk, Jacob, et al.
Published: (2026)
Thermodynamics of Reinforcement Learning Curricula
by: Adamczyk, Jacob, et al.
Published: (2026)
Bootstrapped Reward Shaping
by: Adamczyk, Jacob, et al.
Published: (2025)
EVAL: EigenVector-based Average-reward Learning
by: Adamczyk, Jacob, et al.
Published: (2025)
Average-Reward Soft Actor-Critic
by: Adamczyk, Jacob, et al.
Published: (2025)
Boosting Soft Q-Learning by Bounding
by: Adamczyk, Jacob, et al.
Published: (2024)
Evaluating machine learning models for predicting pesticide toxicity to honey bees
by: Adamczyk, Jakub, et al.
Published: (2025)
Benchmarking Pretrained Molecular Embedding Models For Molecular Representation Learning
by: Praski, Mateusz, et al.
Published: (2025)
Active Inference with Reusable State-Dependent Value Profiles
by: Poschl, Jacob
Published: (2025)
DAWM: Diffusion Action World Models for Offline Reinforcement Learning via Action-Inferred Transitions
by: Li, Zongyue, et al.
Published: (2025)
Inferring Reward Machines and Transition Machines from Partially Observable Markov Decision Processes
by: Wu, Yuly, et al.
Published: (2025)
Transitive RL: Value Learning via Divide and Conquer
by: Park, Seohong, et al.
Published: (2025)
Inferring response times of perceptual decisions with Poisson variational autoencoders
by: Johnson, Hayden R., et al.
Published: (2025)
Universal Value-Function Uncertainties
by: Zanger, Moritz A., et al.
Published: (2025)
Positive-Unlabeled Constraint Learning for Inferring Nonlinear Continuous Constraints Functions from Expert Demonstrations
by: Peng, Baiyu, et al.
Published: (2024)
Executable Functional Abstractions: Inferring Generative Programs for Advanced Math Problems
by: Khan, Zaid, et al.
Published: (2025)
Quasimetric Value Functions with Dense Rewards
by: Valieva, Khadichabonu, et al.
Published: (2024)
Learning Exposure Mapping Functions for Inferring Heterogeneous Peer Effects
by: Adhikari, Shishir, et al.
Published: (2025)
Massively Scaling Explicit Policy-conditioned Value Functions
by: Bohlinger, Nico, et al.
Published: (2025)
VIPO: Value Function Inconsistency Penalized Offline Reinforcement Learning
by: Chen, Xuyang, et al.
Published: (2025)
Adaptive Exploration for Data-Efficient General Value Function Evaluations
by: Jain, Arushi, et al.
Published: (2024)
Stable Offline Value Function Learning with Bisimulation-based Representations
by: Pavse, Brahma S., et al.
Published: (2024)
Tensor Low-rank Approximation of Finite-horizon Value Functions
by: Rozada, Sergio, et al.
Published: (2024)
Do Enterprise Systems Need Learned World Models? The Importance of Context to Infer Dynamics
by: Nair, Jishnu Sethumadhavan, et al.
Published: (2026)
Catapult Dynamics and Phase Transitions in Quadratic Nets
by: Meltzer, David, et al.
Published: (2023)
LOCAL: Learning with Orientation Matrix to Infer Causal Structure from Time Series Data
by: Zhang, Jiajun, et al.
Published: (2024)
Bayesian Optimization for Function-Valued Responses under Min-Max Criteria
by: Ahadi, Pouya, et al.
Published: (2025)
Tensor and Matrix Low-Rank Value-Function Approximation in Reinforcement Learning
by: Rozada, Sergio, et al.
Published: (2022)
On the Limited Representational Power of Value Functions and its Links to Statistical (In)Efficiency
by: Cheikhi, David, et al.
Published: (2024)
Is Value Functions Estimation with Classification Plug-and-play for Offline Reinforcement Learning?
by: Tarasov, Denis, et al.
Published: (2024)
A Poisson-Gamma Dynamic Factor Model with Time-Varying Transition Dynamics
by: Wang, Jiahao, et al.
Published: (2024)
HeadInfer: Memory-Efficient LLM Inference by Head-wise Offloading
by: Luo, Cheng, et al.
Published: (2025)
ViVa: Video-Trained Value Functions for Guiding Online RL from Diverse Data
by: Dashora, Nitish, et al.
Published: (2025)
OSIL: Learning Offline Safe Imitation Policies with Safety Inferred from Non-preferred Trajectories
by: Burnwal, Returaj, et al.
Published: (2026)
Stop Regressing: Training Value Functions via Classification for Scalable Deep RL
by: Farebrother, Jesse, et al.
Published: (2024)
On the Curses of Future and History in Future-dependent Value Functions for Off-policy Evaluation
by: Zhang, Yuheng, et al.
Published: (2024)
Sample and Oracle Efficient Reinforcement Learning for MDPs with Linearly-Realizable Value Functions
by: Mhammedi, Zakaria
Published: (2024)
AirRadar: Inferring Nationwide Air Quality in China with Deep Neural Networks
by: Wang, Qiongyan, et al.
Published: (2025)
Inferring Behavior-Specific Context Improves Zero-Shot Generalization in Reinforcement Learning
by: Ndir, Tidiane Camaret, et al.
Published: (2024)