:: Library Catalog

Copertina

Salvato in:

Dettagli Bibliografici
Autori principali:	Liu, Kweiguu, Maghsudi, Setareh
Natura:	Preprint
Pubblicazione:	2024
Soggetti:	Machine Learning
Accesso online:	https://arxiv.org/abs/2405.11417
Tags:	Aggiungi Tag Nessun Tag, puoi essere il primo ad aggiungerne!!

Documenti analoghi

Robust Optimization Approach and Learning Based Hide-and-Seek Game for Resilient Network Design
di: Khosravi, Mohammad, et al.
Pubblicazione: (2026)

Anomaly Detection in Networked Bandits
di: Cheng, Xiaotong, et al.
Pubblicazione: (2025)

Stochastic Multi-Objective Multi-Armed Bandits: Regret Definition and Algorithm
di: Davoodi, Mansoor, et al.
Pubblicazione: (2025)

Emergence of Fair Leaders via Mediators in Multi-Agent Reinforcement Learning
di: Dodwadmath, Akshay, et al.
Pubblicazione: (2025)

Pareto Multi-Objective Alignment for Language Models
di: He, Qiang, et al.
Pubblicazione: (2025)

Meta Learning in Bandits within Shared Affine Subspaces
di: Bilaj, Steven, et al.
Pubblicazione: (2024)

Distributed Management of Fluctuating Energy Resources in Dynamic Networked Systems
di: Cheng, Xiaotong, et al.
Pubblicazione: (2024)

Decentralized Task Offloading and Load-Balancing for Mobile Edge Computing in Dense Networks
di: Yahya, Mariam, et al.
Pubblicazione: (2024)

Service Placement in Small Cell Networks Using Distributed Best Arm Identification in Linear Bandits
di: Yahya, Mariam, et al.
Pubblicazione: (2025)

Adaptive Regularization of Representation Rank as an Implicit Constraint of Bellman Equation
di: He, Qiang, et al.
Pubblicazione: (2024)

Unveiling the Decision-Making Process in Reinforcement Learning with Genetic Programming
di: Eberhardinger, Manuel, et al.
Pubblicazione: (2024)

Meta-Learning Multi-armed Bandits for Beam Tracking in 5G and 6G Networks
di: Mattick, Alexander, et al.
Pubblicazione: (2025)

Quantum-Inspired Reinforcement Learning in the Presence of Epistemic Ambivalence
di: Habibi, Alireza, et al.
Pubblicazione: (2025)

One Model for All: Multi-Objective Controllable Language Models
di: He, Qiang, et al.
Pubblicazione: (2026)

Online Influence Maximization with Semi-Bandit Feedback under Corruptions
di: Cheng, Xiaotong, et al.
Pubblicazione: (2024)

A Robust Optimization Approach for Regenerator Placement in Fault-Tolerant Networks Under Discrete Cost Uncertainty
di: Khosravi, Mohammad, et al.
Pubblicazione: (2026)

Efficient Resource Allocation under Adversary Attacks: A Decomposition-Based Approach
di: Davoodi, Mansoor, et al.
Pubblicazione: (2025)

Lipschitz Bandits with Stochastic Delayed Feedback
di: Liu, Zhongxuan, et al.
Pubblicazione: (2025)

Safe and Efficient Online Convex Optimization with Linear Budget Constraints and Partial Feedback
di: Liu, Shanqi, et al.
Pubblicazione: (2024)

Feedback Control for Small Budget Pacing
di: Apparaju, Sreeja, et al.
Pubblicazione: (2025)

Biased Dueling Bandits with Stochastic Delayed Feedback
di: Yi, Bongsoo, et al.
Pubblicazione: (2024)

Bandit and Delayed Feedback in Online Structured Prediction
di: Shibukawa, Yuki, et al.
Pubblicazione: (2025)

A Best-of-both-worlds Algorithm for Bandits with Delayed Feedback with Robustness to Excessive Delays
di: Masoudian, Saeed, et al.
Pubblicazione: (2023)

A Reduction-based Framework for Sequential Decision Making with Delayed Feedback
di: Yang, Yunchang, et al.
Pubblicazione: (2023)

Differentiable Attenuation Filters for Feedback Delay Networks
di: Ibnyahya, Ilias, et al.
Pubblicazione: (2025)

Improved Regret for Bandit Convex Optimization with Delayed Feedback
di: Wan, Yuanyu, et al.
Pubblicazione: (2024)

Exploiting Curvature in Online Convex Optimization with Delayed Feedback
di: Qiu, Hao, et al.
Pubblicazione: (2025)

Online Nonsubmodular Optimization with Delayed Feedback in the Bandit Setting
di: Yang, Sifan, et al.
Pubblicazione: (2025)

Neural Contextual Bandits Under Delayed Feedback Constraints
di: Moghimi, Mohammadali, et al.
Pubblicazione: (2025)

Online Budget Allocation with Censored Semi-Bandit Feedback
di: Bachoc, François, et al.
Pubblicazione: (2025)

Linear and Neural Dueling Bandits with Delayed Feedback
di: Wang, Xiangyi, et al.
Pubblicazione: (2026)

Debiased Recommendation with Noisy Feedback
di: Li, Haoxuan, et al.
Pubblicazione: (2024)

A Repeated Auction Model for Load-Aware Dynamic Resource Allocation in Multi-Access Edge Computing
di: Habiba, Ummy, et al.
Pubblicazione: (2024)

Hierarchical Functionality Prioritization in Multicast ISAC: Optimal Admission Control and Discrete-Phase Beamforming
di: Abanto-Leon, Luis F., et al.
Pubblicazione: (2024)

Delayed Feedback Modeling with Influence Functions
di: Ding, Chenlu, et al.
Pubblicazione: (2025)

Resilient Full-Duplex ISAC in the Face of Imperfect SI Cancellation: Globally Optimal Timeslot Allocation and Beam Selection
di: Abanto-Leon, Luis F., et al.
Pubblicazione: (2025)

Optimal Radio Resource Management for ISAC Under Imperfect Information: A Resource Economy-Driven Perspective
di: Abanto-Leon, Luis F., et al.
Pubblicazione: (2026)

Optimal User and Target Scheduling, User-Target Pairing, and Low-Resolution Phase-Only Beamforming for ISAC Systems
di: Abanto-Leon, Luis F., et al.
Pubblicazione: (2025)

Improved Best-of-Both-Worlds Regret for Bandits with Delayed Feedback
di: Schlisselberg, Ofir, et al.
Pubblicazione: (2025)

Adversarial Bandits with Multi-User Delayed Feedback: Theory and Application
di: Li, Yandi, et al.
Pubblicazione: (2023)