:: Library Catalog

Bibliographic Details
Main Authors:	Ban, Yikun; Yan, Yuchen; Banerjee, Arindam; He, Jingrui
Format:	Preprint
Published:	2023
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2305.03784

Similar Items

Meta Clustering of Neural Bandits
by: Ban, Yikun, et al.
Published: (2024)

Neural Active Learning Beyond Bandits
by: Ban, Yikun, et al.
Published: (2024)

Conservative Contextual Bandits: Beyond Linear Representations
by: Deb, Rohan, et al.
Published: (2024)

Replicable Bandits with UCB based Exploration
by: Deb, Rohan, et al.
Published: (2026)

PageRank Bandits for Link Prediction
by: Ban, Yikun, et al.
Published: (2024)

Contextual Rollout Bandits for Reinforcement Learning with Verifiable Rewards
by: Lu, Xiaodong, et al.
Published: (2026)

LLM-Forest: Ensemble Learning of LLMs with Graph-Augmented Prompts for Data Imputation
by: He, Xinrui, et al.
Published: (2024)

Exploration via Feature Perturbation in Contextual Bandits
by: Yi, Seouh-won, et al.
Published: (2025)

Can Graph Neural Networks Learn Language with Extremely Weak Text Supervision?
by: Li, Zihao, et al.
Published: (2024)

RIE-Greedy: Regularization-Induced Exploration for Contextual Bandits
by: Li, Tong, et al.
Published: (2026)

Transformer Copilot: Learning from The Mistake Log in LLM Fine-tuning
by: Zou, Jiaru, et al.
Published: (2025)

Co-Exploration and Co-Exploitation via Shared Structure in Multi-Task Bandits
by: Mukherjee, Sumantrak, et al.
Published: (2025)

Exploitation Over Exploration: Unmasking the Bias in Linear Bandit Recommender Offline Evaluation
by: Pires, Pedro R., et al.
Published: (2025)

Uncertainty of Joint Neural Contextual Bandit
by: Guo, Hongbo, et al.
Published: (2024)

Offline Oracle-Efficient Learning for Contextual MDPs via Layerwise Exploration-Exploitation Tradeoff
by: Qian, Jian, et al.
Published: (2024)

Neural Risk-sensitive Satisficing in Contextual Bandits
by: Ito, Shogo, et al.
Published: (2025)

HD-CB: The First Exploration of Hyperdimensional Computing for Contextual Bandits Problems
by: Angioli, Marco, et al.
Published: (2025)

Observation-Augmented Contextual Multi-Armed Bandits for Robotic Search and Exploration
by: Wakayama, Shohei, et al.
Published: (2023)

The Exploration-Exploitation Dilemma Revisited: An Entropy Perspective
by: Yan, Renye, et al.
Published: (2024)

Contextual Bandit Optimization with Pre-Trained Neural Networks
by: Terekhov, Mikhail
Published: (2025)

Neural Contextual Bandits Under Delayed Feedback Constraints
by: Moghimi, Mohammadali, et al.
Published: (2025)

Contextual Bandits in Payment Processing: Non-uniform Exploration and Supervised Learning
by: Vangara, Akhila, et al.
Published: (2024)

GCL-OT: Graph Contrastive Learning with Optimal Transport for Heterophilic Text-Attributed Graphs
by: Ren, Yating, et al.
Published: (2025)

Gradual Fine-Tuning for Flow Matching Models
by: Thorkelsdottir, Gudrun, et al.
Published: (2026)

Optimization for Neural Operators can Benefit from Width
by: Cisneros-Velarde, Pedro, et al.
Published: (2025)

Quantum-Enhanced Neural Contextual Bandit Algorithms
by: Huang, Yuqi, et al.
Published: (2026)

Risk-Aware Continuous Control with Neural Contextual Bandits
by: Ayala-Romero, Jose A., et al.
Published: (2023)

FedPOB: Sample-Efficient Federated Prompt Optimization via Bandits
by: Lu, Pingchen, et al.
Published: (2025)

Variance-Dependent Regret Lower Bounds for Contextual Bandits
by: He, Jiafan, et al.
Published: (2025)

Sparse Nonparametric Contextual Bandits
by: Flynn, Hamish, et al.
Published: (2025)

Learning with Incomplete Context: Linear Contextual Bandits with Pretrained Imputation
by: Yan, Hao, et al.
Published: (2025)

Variance-Aware Linear UCB with Deep Representation for Neural Contextual Bandits
by: Bui, Ha Manh, et al.
Published: (2024)

Active Human Feedback Collection via Neural Contextual Dueling Bandits
by: Verma, Arun, et al.
Published: (2025)

Efficient Algorithms for Logistic Contextual Slate Bandits with Bandit Feedback
by: Goyal, Tanmay, et al.
Published: (2025)

Linear Contextual Bandits with Interference
by: Xu, Yang, et al.
Published: (2024)

Semantic-Space Exploration and Exploitation in RLVR for LLM Reasoning
by: Huang, Fanding, et al.
Published: (2025)

In-context Exploration-Exploitation for Reinforcement Learning
by: Dai, Zhenwen, et al.
Published: (2024)

Exploitation Is All You Need... for Exploration
by: Rentschler, Micah, et al.
Published: (2025)

Nearly Optimal Algorithms for Contextual Dueling Bandits from Adversarial Feedback
by: Di, Qiwei, et al.
Published: (2024)

Architectural Exploration of Hybrid Neural Decoders for Neuromorphic Implantable BMI
by: Mohan, Vivek, et al.
Published: (2025)