| Main Authors: | Silva, Ricardo Pedro Querido Andrade, Bouarour, Nassim, Fettache, Dina, Boussouar, Sarab, Ibrahim, Noha, Amer-Yahia, Sihem |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2603.27695 |
Similar Items
Lever: Inference-Time Policy Reuse under Support Constraints
by: Vitenko, Ihor, et al.
Published: (2026)
Producer-Fairness in Sequential Bundle Recommendation
by: Rio, Alexandre, et al.
Published: (2025)
A Sampling-based Framework for Hypothesis Testing on Large Attributed Graphs
by: Wang, Yun, et al.
Published: (2024)
Personalized Top-k Set Queries Over Predicted Scores
by: Nia, Sohrab Namazi, et al.
Published: (2025)
Distilling Tabular Foundation Models for Structured Health Data
by: Tanna, Aditya, et al.
Published: (2026)
Pocket Foundation Models: Distilling TFMs into CPU-Ready Gradient-Boosted Trees
by: Tanna, Aditya, et al.
Published: (2026)
Ensembling Tabular Foundation Models - A Diversity Ceiling And A Calibration Trap
by: Tanna, Aditya, et al.
Published: (2026)
Shaping the Prior: How Synthetic Task Distributions Determine Tabular Foundation Model Quality
by: Bouadi, Mohamed, et al.
Published: (2026)
Data Presentation Over Architecture: Resampling Strategies for Credit Risk Prediction with Tabular Foundation Models
by: Tanna, Aditya, et al.
Published: (2026)
A Reinforcement Learning Environment for Automatic Code Optimization in the MLIR Compiler
by: Tirichine, Mohammed, et al.
Published: (2024)
Microeconomic Foundations of Multi-Agent Learning
by: Helou, Nassim
Published: (2026)
DARE: Difficulty-Adaptive Reinforcement Learning with Co-Evolved Difficulty Estimation
by: Zhou, Yang, et al.
Published: (2026)
Terracorder: Sense Long and Prosper
by: Millar, Josh, et al.
Published: (2024)
Path-Based Quantum Meta-Learning for Adaptive Optimization of Reconfigurable Intelligent Surfaces
by: Hassan, Noha, et al.
Published: (2026)
Balancing the Reasoning Load: Difficulty-Differentiated Policy Optimization with Length Redistribution for Efficient and Robust Reinforcement Learning
by: Xia, Yinan, et al.
Published: (2026)
On Efficient Approximate Aggregate Nearest Neighbor Queries over Learned Representations
by: Wang, Carrie, et al.
Published: (2025)
ADHint: Adaptive Hints with Difficulty Priors for Reinforcement Learning
by: Zhang, Feng, et al.
Published: (2025)
PAC Guarantees for Reinforcement Learning: Sample Complexity, Coverage, and Structure
by: Steier, Joshua
Published: (2026)
Offline Constrained Reinforcement Learning under Partial Data Coverage
by: Ko, Seokmin, et al.
Published: (2025)
Benchmarking Ultra-Low-Power $μ$NPUs
by: Millar, Josh, et al.
Published: (2025)
Reinforcement Learning for Compositional Generalization with Outcome-Level Optimization
by: Fu, Xiyan, et al.
Published: (2026)
Learning Coverage Paths in Unknown Environments with Deep Reinforcement Learning
by: Jonnarth, Arvi, et al.
Published: (2023)
Mitigating Overthinking in Large Reasoning Models via Difficulty-aware Reinforcement Learning
by: Wan, Qian, et al.
Published: (2026)
Learning to Recharge: UAV Coverage Path Planning through Deep Reinforcement Learning
by: Theile, Mirco, et al.
Published: (2023)
Dense and Diverse Goal Coverage in Multi Goal Reinforcement Learning
by: Singh, Sagalpreet, et al.
Published: (2025)
LOOPer: A Learned Automatic Code Optimizer For Polyhedral Compilers
by: Merouani, Massinissa, et al.
Published: (2024)
Optimizing FPGA and Wafer Test Coverage with Spatial Sampling and Machine Learning
by: WeiQuan, Wang, et al.
Published: (2025)
Federated Offline Reinforcement Learning: Collaborative Single-Policy Coverage Suffices
by: Woo, Jiin, et al.
Published: (2024)
On the Complexity of Offline Reinforcement Learning with $Q^\star$-Approximation and Partial Coverage
by: Liu, Haolin, et al.
Published: (2026)
KoopAGRU: A Koopman-based Anomaly Detection in Time-Series using Gated Recurrent Units
by: Yahia, Issam Ait, et al.
Published: (2025)
Rethinking the Sampling Criteria in Reinforcement Learning for LLM Reasoning: A Competence-Difficulty Alignment Perspective
by: Kong, Deyang, et al.
Published: (2025)
HeaPA: Difficulty-Aware Heap Sampling and On-Policy Query Augmentation for LLM Reinforcement Learning
by: Wang, Weiqi, et al.
Published: (2026)
Optimizing Reasoning Efficiency through Prompt Difficulty Prediction
by: Zhao, Bo, et al.
Published: (2025)
Meta Reinforcement Learning for Strategic IoT Deployments Coverage in Disaster-Response UAV Swarms
by: Dhuheir, Marwan, et al.
Published: (2024)
DISCO Balances the Scales: Adaptive Domain- and Difficulty-Aware Reinforcement Learning on Imbalanced Data
by: Zhou, Yuhang, et al.
Published: (2025)
An Introduction to Deep Reinforcement and Imitation Learning
by: Santana, Pedro
Published: (2025)
Structured Learning of Compositional Sequential Interventions
by: Yu, Jialin, et al.
Published: (2024)
SIR-RL: Reinforcement Learning for Optimized Policy Control during Epidemiological Outbreaks in Emerging Market and Developing Economies
by: Jain, Maeghal, et al.
Published: (2024)
D$^2$Evo: Dual Difficulty-Aware Self-Evolution for Data-Efficient Reinforcement Learning
by: Zhang, Ru, et al.
Published: (2026)
Scaling Data Difficulty: Improving Coding Models via Reinforcement Learning on Fresh and Challenging Problems
by: Li, Zongqian, et al.
Published: (2026)