:: Library Catalog

Bibliographic Details
Main authors:	Xu, Lily; Wilder, Bryan; Khalil, Elias B.; Tambe, Milind
Format:	Preprint
Published:	2025
Subjects:	Machine Learning; Artificial Intelligence
Online access:	https://arxiv.org/abs/2503.01919

Similar Items

Leaving the Nest: Going Beyond Local Loss Functions for Predict-Then-Optimize
by: Shah, Sanket, et al.
Published: (2023)

Composite Flow Matching for Reinforcement Learning with Shifted-Dynamics Data
by: Kong, Lingkai, et al.
Published: (2025)

Rule-Bottleneck Reinforcement Learning: Joint Explanation and Decision Optimization for Resource Allocation with Language Agents
by: Tec, Mauricio, et al.
Published: (2025)

Generative AI Against Poaching: Latent Composite Flow Matching for Wildlife Conservation
by: Kong, Lingkai, et al.
Published: (2025)

Combining Diverse Information for Coordinated Action: Stochastic Bandit Algorithms for Heterogeneous Agents
by: Gordon, Lucia, et al.
Published: (2024)

Efficient Public Health Intervention Planning Using Decomposition-Based Decision-Focused Learning
by: Shah, Sanket, et al.
Published: (2024)

Contrasting local and global modeling with machine learning and satellite data: A case study estimating tree canopy height in African savannas
by: Rolf, Esther, et al.
Published: (2024)

Balancing Act: Prioritization Strategies for LLM-Designed Restless Bandit Rewards
by: Verma, Shresth, et al.
Published: (2024)

IRL for Restless Multi-Armed Bandits with Applications in Maternal and Child Health
by: Jain, Gauri, et al.
Published: (2024)

Decisions and Deployment: The Five-Year SAHELI Project (2020-2025) on Restless Multi-Armed Bandits for Improving Maternal and Child Health
by: Verma, Shresth, et al.
Published: (2026)

Spectral bandits
by: Kocák, Tomáš, et al.
Published: (2026)

Reward Shaping for Inference-Time Alignment: A Stackelberg Game Perspective
by: Wang, Haichuan, et al.
Published: (2026)

What is the Right Notion of Distance between Predict-then-Optimize Tasks?
by: Rodriguez-Diaz, Paula, et al.
Published: (2024)

Can LLMs Reconcile Knowledge Conflicts in Counterfactual Reasoning
by: Yamin, Khurram, et al.
Published: (2025)

Beyond Majority Voting: LLM Aggregation by Leveraging Higher-Order Information
by: Ai, Rui, et al.
Published: (2025)

Evaluating the Effectiveness of Index-Based Treatment Allocation
by: Boehmer, Niclas, et al.
Published: (2024)

A Decision-Language Model (DLM) for Dynamic Restless Multi-Armed Bandit Tasks in Public Health
by: Behari, Nikhil, et al.
Published: (2024)

Fairness for Workers Who Pull the Arms: An Index Based Policy for Allocation of Restless Bandit Tasks
by: Biswas, Arpita, et al.
Published: (2023)

Linear bandits with polylogarithmic minimax regret
by: Lumbreras, Josep, et al.
Published: (2024)

Towards a Pretrained Model for Restless Bandits via Multi-arm Generalization
by: Zhao, Yunfan, et al.
Published: (2023)

A single algorithm for both restless and rested rotting bandits
by: Seznec, Julien, et al.
Published: (2026)

Adversarial bandit optimization for approximately linear functions
by: Cheng, Zhuoyu, et al.
Published: (2025)

Transcendence: Generative Models Can Outperform The Experts That Train Them
by: Zhang, Edwin, et al.
Published: (2024)

Deep reinforcement learning for weakly coupled MDP's with continuous actions
by: Robledo, Francisco, et al.
Published: (2024)

Efficient Ensemble Selection from Binary and Pairwise Feedback
by: Neoh, Tzeh Yuan, et al.
Published: (2026)

Use Your INSTINCT: INSTruction optimization for LLMs usIng Neural bandits Coupled with Transformers
by: Lin, Xiaoqiang, et al.
Published: (2023)

Multilinguality in LLM-Designed Reward Functions for Restless Bandits: Effects on Task Performance and Fairness
by: Parthasarathy, Ambreesh, et al.
Published: (2025)

Adaptive Frontier Exploration on Graphs with Applications to Network-Based Disease Testing
by: Choo, Davin, et al.
Published: (2025)

Valid Inference with Imperfect Synthetic Data
by: Byun, Yewon, et al.
Published: (2025)

Context in Public Health for Underserved Communities: A Bayesian Approach to Online Restless Bandits
by: Liang, Biyonka, et al.
Published: (2024)

VORTEX: Aligning Task Utility and Human Preferences through LLM-Guided Reward Shaping
by: Xiong, Guojun, et al.
Published: (2025)

Preliminary Study of the Impact of AI-Based Interventions on Health and Behavioral Outcomes in Maternal Health Programs
by: Dasgupta, Arpan, et al.
Published: (2024)

Neur2BiLO: Neural Bilevel Optimization
by: Dumouchelle, Justin, et al.
Published: (2024)

Deep Learning for Two-Stage Robust Integer Optimization
by: Dumouchelle, Justin, et al.
Published: (2023)

Application-Driven Innovation in Machine Learning
by: Rolnick, David, et al.
Published: (2024)

Functional multi-armed bandit and the best function identification problems
by: Dorn, Yuriy, et al.
Published: (2025)

Auditing Fairness by Betting
by: Chugg, Ben, et al.
Published: (2023)

Healthcare LLM Benchmarks Are Only as Good as Their Explicit Assumptions
by: Raman, Naveen, et al.
Published: (2026)

Social Environment Design
by: Zhang, Edwin, et al.
Published: (2024)

Self-Supervised Transformers as Iterative Solution Improvers for Constraint Satisfaction
by: Xu, Yudong W., et al.
Published: (2025)