:: Library Catalog

Bibliographic Details
Main Authors:	Mandal, Saptarshi; Murthy, Yashaswini; Srikant, R.
Format:	Preprint
Published:	2025
Subjects:	Machine Learning; I.2.6
Online Access:	https://arxiv.org/abs/2510.01721

Similar Items

Fusing Rewards and Preferences in Reinforcement Learning
by: Khorasani, Sadegh, et al.
Published: (2025)

Universal Approximation of Continuous Functionals on Compact Subsets via Linear Measurements and Scalar Nonlinearities
by: Krylov, Andrey, et al.
Published: (2026)

Learning Can Converge Stably to the Wrong Belief under Latent Reliability
by: Zhang, Zhipeng, et al.
Published: (2026)

Hierarchical Universal Value Function Approximators
by: Arora, Rushiv
Published: (2024)

Implicit Counterfactual Data Augmentation for Robust Learning
by: Zhou, Xiaoling, et al.
Published: (2023)

2Mamba2Furious: Linear in Complexity, Competitive in Accuracy
by: Mongaras, Gabriel, et al.
Published: (2026)

Evaluating and Learning Robust Bandit Policies Under Uncertain Causal Mechanisms
by: Avery, Katherine, et al.
Published: (2025)

Symbol-Temporal Consistency Self-supervised Learning for Robust Time Series Classification
by: Garcia, Kevin, et al.
Published: (2025)

Composing Linear Layers from Irreducibles
by: Pence, Travis, et al.
Published: (2025)

Distributional Reinforcement Learning for Condition-Based Maintenance of Multi-Pump Equipment
by: Yasuno, Takato
Published: (2026)

Understanding Boolean Function Learnability on Deep Neural Networks: PAC Learning Meets Neurosymbolic Models
by: Nicolau, Marcio, et al.
Published: (2020)

AI and Machine Learning Approaches for Predicting Nanoparticles Toxicity The Critical Role of Physiochemical Properties
by: Yousaf, Iqra
Published: (2024)

ES-C51: Expected Sarsa Based C51 Distributional Reinforcement Learning Algorithm
by: Tandon, Rijul, et al.
Published: (2025)

How Does the Pretraining Distribution Shape In-Context Learning? Task Selection, Generalization, and Robustness
by: Azizian, Waïss, et al.
Published: (2025)

SQARL: A Size-Agnostic Reinforcement Learning approach for Circuit Allocation in Distributed Quantum Architectures
by: Carballo, Víctor, et al.
Published: (2026)

Extrinsicaly Rewarded Soft Q Imitation Learning with Discriminator
by: Furuyama, Ryoma, et al.
Published: (2024)

New Paradigm of Adversarial Training: Releasing Accuracy-Robustness Trade-Off via Dummy Class
by: Wang, Yanyun, et al.
Published: (2024)

RobustModelMaker: Coupling Bootstrap Stability Selection with Leakage-Safe Nested Cross-Validation for Scientific Machine Learning
by: Barnard, Amanda S
Published: (2026)

Learning Transferable Predictability Representations
by: Goswami, Diyali, et al.
Published: (2026)

Deep Memory Search: A Metaheuristic Approach for Optimizing Heuristic Search
by: Hedar, Abdel-Rahman, et al.
Published: (2024)

Beyond Monotonicity: Revisiting Factorization Principles in Multi-Agent Q-Learning
by: Hu, Tianmeng, et al.
Published: (2025)

How to Boost Any Loss Function
by: Nock, Richard, et al.
Published: (2024)

Strengthening the Internal Adversarial Robustness in Lifted Neural Networks
by: Zach, Christopher
Published: (2025)

Pruning Spurious Subgraphs for Graph Out-of-Distribution Generalization
by: Yao, Tianjun, et al.
Published: (2025)

Expressivity of Graph Neural Networks Through the Lens of Adversarial Robustness
by: Campi, Francesco, et al.
Published: (2023)

BOND: License to Train with Black-Box Functions
by: Clark, Andrew, et al.
Published: (2025)

Enhancing Classifier Evaluation: A Fairer Benchmarking Strategy Based on Ability and Robustness
by: Cardoso, Lucas, et al.
Published: (2025)

RobustBlack: Challenging Black-Box Adversarial Attacks on State-of-the-Art Defenses
by: Djilani, Mohamed, et al.
Published: (2024)

ZClassifier: Temperature Tuning and Manifold Approximation via KL Divergence on Logit Space
by: Yong, Shim Soon
Published: (2025)

Adaptive Epsilon Adversarial Training for Robust Gravitational Wave Parameter Estimation Using Normalizing Flows
by: Yang, Yiqian, et al.
Published: (2024)

Hard Samples, Bad Labels: Robust Loss Functions That Know When to Back Off
by: Pellegrino, Nicholas, et al.
Published: (2025)

Perfecting Aircraft Maneuvers with Reinforcement Learning
by: Cilan, Atahan, et al.
Published: (2026)

Why LoRA Resists Label Noise: A Theoretical Framework for Noise-Robust Parameter-Efficient Fine-Tuning
by: Steele, Brady
Published: (2026)

Robustness of Spatio-temporal Graph Neural Networks for Fault Location in Partially Observable Distribution Grids
by: Karabulut, Burak, et al.
Published: (2026)

KerZOO: Kernel Function Informed Zeroth-Order Optimization for Accurate and Accelerated LLM Fine-Tuning
by: Mi, Zhendong, et al.
Published: (2025)

KL-Regularised Q-Learning: A Token-level Action-Value perspective on Online RLHF
by: Brown, Jason R, et al.
Published: (2025)

MEReQ: Max-Ent Residual-Q Inverse RL for Sample-Efficient Alignment from Intervention
by: Chen, Yuxin, et al.
Published: (2024)

Symmetric Equilibrium Learning of VAEs
by: Flach, Boris, et al.
Published: (2023)

A Comparative Analysis of Reinforcement Learning and Conventional Deep Learning Approaches for Bearing Fault Diagnosis
by: Çakır, Efe, et al.
Published: (2025)

Path-Coupled Bellman Flows for Distributional Reinforcement Learning
by: Xu, Boyang, et al.
Published: (2026)