| Main Authors: | Mandal, Saptarshi, Murthy, Yashaswini, Srikant, R. |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2510.01721 |
Similar Items
Fusing Rewards and Preferences in Reinforcement Learning
by: Khorasani, Sadegh, et al.
Published: (2025)
Universal Approximation of Continuous Functionals on Compact Subsets via Linear Measurements and Scalar Nonlinearities
by: Krylov, Andrey, et al.
Published: (2026)
Learning Can Converge Stably to the Wrong Belief under Latent Reliability
by: Zhang, Zhipeng, et al.
Published: (2026)
Hierarchical Universal Value Function Approximators
by: Arora, Rushiv
Published: (2024)
Implicit Counterfactual Data Augmentation for Robust Learning
by: Zhou, Xiaoling, et al.
Published: (2023)
2Mamba2Furious: Linear in Complexity, Competitive in Accuracy
by: Mongaras, Gabriel, et al.
Published: (2026)
Evaluating and Learning Robust Bandit Policies Under Uncertain Causal Mechanisms
by: Avery, Katherine, et al.
Published: (2025)
Symbol-Temporal Consistency Self-supervised Learning for Robust Time Series Classification
by: Garcia, Kevin, et al.
Published: (2025)
Composing Linear Layers from Irreducibles
by: Pence, Travis, et al.
Published: (2025)
Distributional Reinforcement Learning for Condition-Based Maintenance of Multi-Pump Equipment
by: Yasuno, Takato
Published: (2026)
Understanding Boolean Function Learnability on Deep Neural Networks: PAC Learning Meets Neurosymbolic Models
by: Nicolau, Marcio, et al.
Published: (2020)
AI and Machine Learning Approaches for Predicting Nanoparticles Toxicity The Critical Role of Physiochemical Properties
by: Yousaf, Iqra
Published: (2024)
ES-C51: Expected Sarsa Based C51 Distributional Reinforcement Learning Algorithm
by: Tandon, Rijul, et al.
Published: (2025)
How Does the Pretraining Distribution Shape In-Context Learning? Task Selection, Generalization, and Robustness
by: Azizian, Waïss, et al.
Published: (2025)
SQARL: A Size-Agnostic Reinforcement Learning approach for Circuit Allocation in Distributed Quantum Architectures
by: Carballo, Víctor, et al.
Published: (2026)
Extrinsicaly Rewarded Soft Q Imitation Learning with Discriminator
by: Furuyama, Ryoma, et al.
Published: (2024)
New Paradigm of Adversarial Training: Releasing Accuracy-Robustness Trade-Off via Dummy Class
by: Wang, Yanyun, et al.
Published: (2024)
RobustModelMaker: Coupling Bootstrap Stability Selection with Leakage-Safe Nested Cross-Validation for Scientific Machine Learning
by: Barnard, Amanda S
Published: (2026)
Learning Transferable Predictability Representations
by: Goswami, Diyali, et al.
Published: (2026)
Deep Memory Search: A Metaheuristic Approach for Optimizing Heuristic Search
by: Hedar, Abdel-Rahman, et al.
Published: (2024)
Beyond Monotonicity: Revisiting Factorization Principles in Multi-Agent Q-Learning
by: Hu, Tianmeng, et al.
Published: (2025)
How to Boost Any Loss Function
by: Nock, Richard, et al.
Published: (2024)
Strengthening the Internal Adversarial Robustness in Lifted Neural Networks
by: Zach, Christopher
Published: (2025)
Pruning Spurious Subgraphs for Graph Out-of-Distribution Generalization
by: Yao, Tianjun, et al.
Published: (2025)
Expressivity of Graph Neural Networks Through the Lens of Adversarial Robustness
by: Campi, Francesco, et al.
Published: (2023)
BOND: License to Train with Black-Box Functions
by: Clark, Andrew, et al.
Published: (2025)
Enhancing Classifier Evaluation: A Fairer Benchmarking Strategy Based on Ability and Robustness
by: Cardoso, Lucas, et al.
Published: (2025)
RobustBlack: Challenging Black-Box Adversarial Attacks on State-of-the-Art Defenses
by: Djilani, Mohamed, et al.
Published: (2024)
ZClassifier: Temperature Tuning and Manifold Approximation via KL Divergence on Logit Space
by: Yong, Shim Soon
Published: (2025)
Adaptive Epsilon Adversarial Training for Robust Gravitational Wave Parameter Estimation Using Normalizing Flows
by: Yang, Yiqian, et al.
Published: (2024)
Hard Samples, Bad Labels: Robust Loss Functions That Know When to Back Off
by: Pellegrino, Nicholas, et al.
Published: (2025)
Perfecting Aircraft Maneuvers with Reinforcement Learning
by: Cilan, Atahan, et al.
Published: (2026)
Why LoRA Resists Label Noise: A Theoretical Framework for Noise-Robust Parameter-Efficient Fine-Tuning
by: Steele, Brady
Published: (2026)
Robustness of Spatio-temporal Graph Neural Networks for Fault Location in Partially Observable Distribution Grids
by: Karabulut, Burak, et al.
Published: (2026)
KerZOO: Kernel Function Informed Zeroth-Order Optimization for Accurate and Accelerated LLM Fine-Tuning
by: Mi, Zhendong, et al.
Published: (2025)
KL-Regularised Q-Learning: A Token-level Action-Value perspective on Online RLHF
by: Brown, Jason R, et al.
Published: (2025)
MEReQ: Max-Ent Residual-Q Inverse RL for Sample-Efficient Alignment from Intervention
by: Chen, Yuxin, et al.
Published: (2024)
Symmetric Equilibrium Learning of VAEs
by: Flach, Boris, et al.
Published: (2023)
A Comparative Analysis of Reinforcement Learning and Conventional Deep Learning Approaches for Bearing Fault Diagnosis
by: Çakır, Efe, et al.
Published: (2025)
Path-Coupled Bellman Flows for Distributional Reinforcement Learning
by: Xu, Boyang, et al.
Published: (2026)