Saved in:
| Main Authors: | Ban, Yikun, Yan, Yuchen, Banerjee, Arindam, He, Jingrui |
|---|---|
| Format: | Preprint |
| Published: |
2023
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2305.03784 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Meta Clustering of Neural Bandits
by: Ban, Yikun, et al.
Published: (2024)
by: Ban, Yikun, et al.
Published: (2024)
Neural Active Learning Beyond Bandits
by: Ban, Yikun, et al.
Published: (2024)
by: Ban, Yikun, et al.
Published: (2024)
Conservative Contextual Bandits: Beyond Linear Representations
by: Deb, Rohan, et al.
Published: (2024)
by: Deb, Rohan, et al.
Published: (2024)
Replicable Bandits with UCB based Exploration
by: Deb, Rohan, et al.
Published: (2026)
by: Deb, Rohan, et al.
Published: (2026)
PageRank Bandits for Link Prediction
by: Ban, Yikun, et al.
Published: (2024)
by: Ban, Yikun, et al.
Published: (2024)
Contextual Rollout Bandits for Reinforcement Learning with Verifiable Rewards
by: Lu, Xiaodong, et al.
Published: (2026)
by: Lu, Xiaodong, et al.
Published: (2026)
LLM-Forest: Ensemble Learning of LLMs with Graph-Augmented Prompts for Data Imputation
by: He, Xinrui, et al.
Published: (2024)
by: He, Xinrui, et al.
Published: (2024)
Exploration via Feature Perturbation in Contextual Bandits
by: Yi, Seouh-won, et al.
Published: (2025)
by: Yi, Seouh-won, et al.
Published: (2025)
Can Graph Neural Networks Learn Language with Extremely Weak Text Supervision?
by: Li, Zihao, et al.
Published: (2024)
by: Li, Zihao, et al.
Published: (2024)
RIE-Greedy: Regularization-Induced Exploration for Contextual Bandits
by: Li, Tong, et al.
Published: (2026)
by: Li, Tong, et al.
Published: (2026)
Transformer Copilot: Learning from The Mistake Log in LLM Fine-tuning
by: Zou, Jiaru, et al.
Published: (2025)
by: Zou, Jiaru, et al.
Published: (2025)
Co-Exploration and Co-Exploitation via Shared Structure in Multi-Task Bandits
by: Mukherjee, Sumantrak, et al.
Published: (2025)
by: Mukherjee, Sumantrak, et al.
Published: (2025)
Exploitation Over Exploration: Unmasking the Bias in Linear Bandit Recommender Offline Evaluation
by: Pires, Pedro R., et al.
Published: (2025)
by: Pires, Pedro R., et al.
Published: (2025)
Uncertainty of Joint Neural Contextual Bandit
by: Guo, Hongbo, et al.
Published: (2024)
by: Guo, Hongbo, et al.
Published: (2024)
Offline Oracle-Efficient Learning for Contextual MDPs via Layerwise Exploration-Exploitation Tradeoff
by: Qian, Jian, et al.
Published: (2024)
by: Qian, Jian, et al.
Published: (2024)
Neural Risk-sensitive Satisficing in Contextual Bandits
by: Ito, Shogo, et al.
Published: (2025)
by: Ito, Shogo, et al.
Published: (2025)
HD-CB: The First Exploration of Hyperdimensional Computing for Contextual Bandits Problems
by: Angioli, Marco, et al.
Published: (2025)
by: Angioli, Marco, et al.
Published: (2025)
Observation-Augmented Contextual Multi-Armed Bandits for Robotic Search and Exploration
by: Wakayama, Shohei, et al.
Published: (2023)
by: Wakayama, Shohei, et al.
Published: (2023)
The Exploration-Exploitation Dilemma Revisited: An Entropy Perspective
by: Yan, Renye, et al.
Published: (2024)
by: Yan, Renye, et al.
Published: (2024)
Contextual Bandit Optimization with Pre-Trained Neural Networks
by: Terekhov, Mikhail
Published: (2025)
by: Terekhov, Mikhail
Published: (2025)
Neural Contextual Bandits Under Delayed Feedback Constraints
by: Moghimi, Mohammadali, et al.
Published: (2025)
by: Moghimi, Mohammadali, et al.
Published: (2025)
Contextual Bandits in Payment Processing: Non-uniform Exploration and Supervised Learning
by: Vangara, Akhila, et al.
Published: (2024)
by: Vangara, Akhila, et al.
Published: (2024)
GCL-OT: Graph Contrastive Learning with Optimal Transport for Heterophilic Text-Attributed Graphs
by: Ren, Yating, et al.
Published: (2025)
by: Ren, Yating, et al.
Published: (2025)
Gradual Fine-Tuning for Flow Matching Models
by: Thorkelsdottir, Gudrun, et al.
Published: (2026)
by: Thorkelsdottir, Gudrun, et al.
Published: (2026)
Optimization for Neural Operators can Benefit from Width
by: Cisneros-Velarde, Pedro, et al.
Published: (2025)
by: Cisneros-Velarde, Pedro, et al.
Published: (2025)
Quantum-Enhanced Neural Contextual Bandit Algorithms
by: Huang, Yuqi, et al.
Published: (2026)
by: Huang, Yuqi, et al.
Published: (2026)
Risk-Aware Continuous Control with Neural Contextual Bandits
by: Ayala-Romero, Jose A., et al.
Published: (2023)
by: Ayala-Romero, Jose A., et al.
Published: (2023)
FedPOB: Sample-Efficient Federated Prompt Optimization via Bandits
by: Lu, Pingchen, et al.
Published: (2025)
by: Lu, Pingchen, et al.
Published: (2025)
Variance-Dependent Regret Lower Bounds for Contextual Bandits
by: He, Jiafan, et al.
Published: (2025)
by: He, Jiafan, et al.
Published: (2025)
Sparse Nonparametric Contextual Bandits
by: Flynn, Hamish, et al.
Published: (2025)
by: Flynn, Hamish, et al.
Published: (2025)
Learning with Incomplete Context: Linear Contextual Bandits with Pretrained Imputation
by: Yan, Hao, et al.
Published: (2025)
by: Yan, Hao, et al.
Published: (2025)
Variance-Aware Linear UCB with Deep Representation for Neural Contextual Bandits
by: Bui, Ha Manh, et al.
Published: (2024)
by: Bui, Ha Manh, et al.
Published: (2024)
Active Human Feedback Collection via Neural Contextual Dueling Bandits
by: Verma, Arun, et al.
Published: (2025)
by: Verma, Arun, et al.
Published: (2025)
Efficient Algorithms for Logistic Contextual Slate Bandits with Bandit Feedback
by: Goyal, Tanmay, et al.
Published: (2025)
by: Goyal, Tanmay, et al.
Published: (2025)
Linear Contextual Bandits with Interference
by: Xu, Yang, et al.
Published: (2024)
by: Xu, Yang, et al.
Published: (2024)
Semantic-Space Exploration and Exploitation in RLVR for LLM Reasoning
by: Huang, Fanding, et al.
Published: (2025)
by: Huang, Fanding, et al.
Published: (2025)
In-context Exploration-Exploitation for Reinforcement Learning
by: Dai, Zhenwen, et al.
Published: (2024)
by: Dai, Zhenwen, et al.
Published: (2024)
Exploitation Is All You Need... for Exploration
by: Rentschler, Micah, et al.
Published: (2025)
by: Rentschler, Micah, et al.
Published: (2025)
Nearly Optimal Algorithms for Contextual Dueling Bandits from Adversarial Feedback
by: Di, Qiwei, et al.
Published: (2024)
by: Di, Qiwei, et al.
Published: (2024)
Architectural Exploration of Hybrid Neural Decoders for Neuromorphic Implantable BMI
by: Mohan, Vivek, et al.
Published: (2025)
by: Mohan, Vivek, et al.
Published: (2025)
Similar Items
-
Meta Clustering of Neural Bandits
by: Ban, Yikun, et al.
Published: (2024) -
Neural Active Learning Beyond Bandits
by: Ban, Yikun, et al.
Published: (2024) -
Conservative Contextual Bandits: Beyond Linear Representations
by: Deb, Rohan, et al.
Published: (2024) -
Replicable Bandits with UCB based Exploration
by: Deb, Rohan, et al.
Published: (2026) -
PageRank Bandits for Link Prediction
by: Ban, Yikun, et al.
Published: (2024)