:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Rafiee, Banafsheh, Ghiassian, Sina, Jin, Jun, Sutton, Richard, Luo, Jun, White, Adam
Format:	Preprint
Published:	2022
Subjects:	Machine Learning Artificial Intelligence
Online Access:	https://arxiv.org/abs/2210.14361
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

The Minimax Rate of Second-Order Calibration
by: Ciosek, Kamil, et al.
Published: (2026)

In-context Exploration-Exploitation for Reinforcement Learning
by: Dai, Zhenwen, et al.
Published: (2024)

Learning in complex action spaces without policy gradients
by: Tavakoli, Arash, et al.
Published: (2024)

Soft Preference Optimization: Aligning Language Models to Expert Distributions
by: Sharifnassab, Arsalan, et al.
Published: (2024)

Dynamic Reward Scaling for Multivariate Time Series Anomaly Detection: A VAE-Enhanced Reinforcement Learning Approach
by: Golchin, Bahareh, et al.
Published: (2025)

Accelerating scientific discovery with the common task framework
by: Kutz, J. Nathan, et al.
Published: (2025)

DRTA: Dynamic Reward Scaling for Reinforcement Learning in Time Series Anomaly Detection
by: Golchin, Bahareh, et al.
Published: (2025)

DNABERT-2: Fine-Tuning a Genomic Language Model for Colorectal Gene Enhancer Classification
by: King, Darren, et al.
Published: (2025)

Swift-Sarsa: Fast and Robust Linear Control
by: Javed, Khurram, et al.
Published: (2025)

ACEGEN: Reinforcement learning of generative chemical agents for drug discovery
by: Bou, Albert, et al.
Published: (2024)

Fine-Tuning without Performance Degradation
by: Wang, Han, et al.
Published: (2025)

A Generalized Projected Bellman Error for Off-policy Value Estimation in Reinforcement Learning
by: Patterson, Andrew, et al.
Published: (2021)

Rethinking Multimodal Fusion for Time Series: Auxiliary Modalities Need Constrained Fusion
by: Lee, Seunghan, et al.
Published: (2026)

Hallucination Detection on a Budget: Efficient Bayesian Estimation of Semantic Entropy
by: Ciosek, Kamil, et al.
Published: (2025)

A Parameter Update Balancing Algorithm for Multi-task Ranking Models in Recommendation Systems
by: Yuan, Jun, et al.
Published: (2024)

Real-Time Recurrent Learning using Trace Units in Reinforcement Learning
by: Elelimy, Esraa, et al.
Published: (2024)

Empirical Design in Reinforcement Learning
by: Patterson, Andrew, et al.
Published: (2023)

Why Do Neural Networks Forget: A Study of Collapse in Continual Learning
by: Zhu, Yunqin, et al.
Published: (2026)

E(3)-invariant diffusion model for pocket-aware peptide generation
by: Liang, Po-Yu, et al.
Published: (2024)

Investigating the Interplay of Prioritized Replay and Generalization
by: Panahi, Parham Mohammad, et al.
Published: (2024)

Augmenting generative models with biomedical knowledge graphs improves targeted drug discovery
by: Malusare, Aditya, et al.
Published: (2025)

GeoPro-Net: Learning Interpretable Spatiotemporal Prediction Models through Statistically-Guided Geo-Prototyping
by: An, Bang, et al.
Published: (2024)

Dynamic and Adaptive Feature Generation with LLM
by: Zhang, Xinhao, et al.
Published: (2024)

Conflict-Averse Gradient Descent for Multi-task Learning
by: Liu, Bo, et al.
Published: (2021)

Revisiting Mixture Policies in Entropy-Regularized Actor-Critic
by: He, Jiamin, et al.
Published: (2026)

Flow Matching with Arbitrary Auxiliary Paths
by: Peng, Xin, et al.
Published: (2026)

Disentangling Representations through Multi-task Learning
by: Vafidis, Pantelis, et al.
Published: (2024)

A Method for Evaluating Hyperparameter Sensitivity in Reinforcement Learning
by: Adkins, Jacob, et al.
Published: (2024)

Reward Centering
by: Naik, Abhishek, et al.
Published: (2024)

Score matching through the roof: linear, nonlinear, and latent variables causal discovery
by: Montagna, Francesco, et al.
Published: (2024)

Decoupled Split Learning via Auxiliary Loss
by: Zihad, Anower, et al.
Published: (2026)

Neon: Negative Extrapolation From Self-Training Improves Image Generation
by: Alemohammad, Sina, et al.
Published: (2025)

A New View on Planning in Online Reinforcement Learning
by: Roice, Kevin, et al.
Published: (2024)

Harnessing Discrete Representations For Continual Reinforcement Learning
by: Meyer, Edan, et al.
Published: (2023)

Aux-NAS: Exploiting Auxiliary Labels with Negligibly Extra Inference Cost
by: Gao, Yuan, et al.
Published: (2024)

MetaOptimize: A Framework for Optimizing Step Sizes and Other Meta-parameters
by: Sharifnassab, Arsalan, et al.
Published: (2024)

Personalized Subgraph Federated Learning with Differentiable Auxiliary Projections
by: Zhuo, Wei, et al.
Published: (2025)

Why and How Auxiliary Tasks Improve JEPA Representations
by: Yu, Jiacan, et al.
Published: (2025)

Improving Continual Learning Performance and Efficiency with Auxiliary Classifiers
by: Szatkowski, Filip, et al.
Published: (2024)

Auxiliary Reward Generation with Transition Distance Representation Learning
by: Li, Siyuan, et al.
Published: (2024)