:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Silva, Ricardo Pedro Querido Andrade, Bouarour, Nassim, Fettache, Dina, Boussouar, Sarab, Ibrahim, Noha, Amer-Yahia, Sihem
Format:	Preprint
Published:	2026
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2603.27695
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Lever: Inference-Time Policy Reuse under Support Constraints
by: Vitenko, Ihor, et al.
Published: (2026)

Producer-Fairness in Sequential Bundle Recommendation
by: Rio, Alexandre, et al.
Published: (2025)

A Sampling-based Framework for Hypothesis Testing on Large Attributed Graphs
by: Wang, Yun, et al.
Published: (2024)

Personalized Top-k Set Queries Over Predicted Scores
by: Nia, Sohrab Namazi, et al.
Published: (2025)

Distilling Tabular Foundation Models for Structured Health Data
by: Tanna, Aditya, et al.
Published: (2026)

Pocket Foundation Models: Distilling TFMs into CPU-Ready Gradient-Boosted Trees
by: Tanna, Aditya, et al.
Published: (2026)

Ensembling Tabular Foundation Models - A Diversity Ceiling And A Calibration Trap
by: Tanna, Aditya, et al.
Published: (2026)

Shaping the Prior: How Synthetic Task Distributions Determine Tabular Foundation Model Quality
by: Bouadi, Mohamed, et al.
Published: (2026)

Data Presentation Over Architecture: Resampling Strategies for Credit Risk Prediction with Tabular Foundation Models
by: Tanna, Aditya, et al.
Published: (2026)

A Reinforcement Learning Environment for Automatic Code Optimization in the MLIR Compiler
by: Tirichine, Mohammed, et al.
Published: (2024)

Microeconomic Foundations of Multi-Agent Learning
by: Helou, Nassim
Published: (2026)

DARE: Difficulty-Adaptive Reinforcement Learning with Co-Evolved Difficulty Estimation
by: Zhou, Yang, et al.
Published: (2026)

Terracorder: Sense Long and Prosper
by: Millar, Josh, et al.
Published: (2024)

Path-Based Quantum Meta-Learning for Adaptive Optimization of Reconfigurable Intelligent Surfaces
by: Hassan, Noha, et al.
Published: (2026)

Balancing the Reasoning Load: Difficulty-Differentiated Policy Optimization with Length Redistribution for Efficient and Robust Reinforcement Learning
by: Xia, Yinan, et al.
Published: (2026)

On Efficient Approximate Aggregate Nearest Neighbor Queries over Learned Representations
by: Wang, Carrie, et al.
Published: (2025)

ADHint: Adaptive Hints with Difficulty Priors for Reinforcement Learning
by: Zhang, Feng, et al.
Published: (2025)

PAC Guarantees for Reinforcement Learning: Sample Complexity, Coverage, and Structure
by: Steier, Joshua
Published: (2026)

Offline Constrained Reinforcement Learning under Partial Data Coverage
by: Ko, Seokmin, et al.
Published: (2025)

Benchmarking Ultra-Low-Power $μ$NPUs
by: Millar, Josh, et al.
Published: (2025)

Reinforcement Learning for Compositional Generalization with Outcome-Level Optimization
by: Fu, Xiyan, et al.
Published: (2026)

Learning Coverage Paths in Unknown Environments with Deep Reinforcement Learning
by: Jonnarth, Arvi, et al.
Published: (2023)

Mitigating Overthinking in Large Reasoning Models via Difficulty-aware Reinforcement Learning
by: Wan, Qian, et al.
Published: (2026)

Learning to Recharge: UAV Coverage Path Planning through Deep Reinforcement Learning
by: Theile, Mirco, et al.
Published: (2023)

Dense and Diverse Goal Coverage in Multi Goal Reinforcement Learning
by: Singh, Sagalpreet, et al.
Published: (2025)

LOOPer: A Learned Automatic Code Optimizer For Polyhedral Compilers
by: Merouani, Massinissa, et al.
Published: (2024)

Optimizing FPGA and Wafer Test Coverage with Spatial Sampling and Machine Learning
by: WeiQuan, Wang, et al.
Published: (2025)

Federated Offline Reinforcement Learning: Collaborative Single-Policy Coverage Suffices
by: Woo, Jiin, et al.
Published: (2024)

On the Complexity of Offline Reinforcement Learning with $Q^\star$-Approximation and Partial Coverage
by: Liu, Haolin, et al.
Published: (2026)

KoopAGRU: A Koopman-based Anomaly Detection in Time-Series using Gated Recurrent Units
by: Yahia, Issam Ait, et al.
Published: (2025)

Rethinking the Sampling Criteria in Reinforcement Learning for LLM Reasoning: A Competence-Difficulty Alignment Perspective
by: Kong, Deyang, et al.
Published: (2025)

HeaPA: Difficulty-Aware Heap Sampling and On-Policy Query Augmentation for LLM Reinforcement Learning
by: Wang, Weiqi, et al.
Published: (2026)

Optimizing Reasoning Efficiency through Prompt Difficulty Prediction
by: Zhao, Bo, et al.
Published: (2025)

Meta Reinforcement Learning for Strategic IoT Deployments Coverage in Disaster-Response UAV Swarms
by: Dhuheir, Marwan, et al.
Published: (2024)

DISCO Balances the Scales: Adaptive Domain- and Difficulty-Aware Reinforcement Learning on Imbalanced Data
by: Zhou, Yuhang, et al.
Published: (2025)

An Introduction to Deep Reinforcement and Imitation Learning
by: Santana, Pedro
Published: (2025)

Structured Learning of Compositional Sequential Interventions
by: Yu, Jialin, et al.
Published: (2024)

SIR-RL: Reinforcement Learning for Optimized Policy Control during Epidemiological Outbreaks in Emerging Market and Developing Economies
by: Jain, Maeghal, et al.
Published: (2024)

D$^2$Evo: Dual Difficulty-Aware Self-Evolution for Data-Efficient Reinforcement Learning
by: Zhang, Ru, et al.
Published: (2026)

Scaling Data Difficulty: Improving Coding Models via Reinforcement Learning on Fresh and Challenging Problems
by: Li, Zongqian, et al.
Published: (2026)