Saved in:
| Main Authors: | Wei, Ziyi, Zhong, Huaiyang, Li, Xiaocheng |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2510.10374 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Towards Better Statistical Understanding of Watermarking LLMs
by: Cai, Zhongze, et al.
Published: (2024)
by: Cai, Zhongze, et al.
Published: (2024)
The Enhanced Physics-Informed Kolmogorov-Arnold Networks: Applications of Newton's Laws in Financial Deep Reinforcement Learning (RL) Algorithms
by: Thoi, Trang, et al.
Published: (2026)
by: Thoi, Trang, et al.
Published: (2026)
What Matters in Data for DPO?
by: Pan, Yu, et al.
Published: (2025)
by: Pan, Yu, et al.
Published: (2025)
Optimistic Reinforcement Learning with Quantile Objectives
by: Alipour-Vaezi, Mohammad, et al.
Published: (2025)
by: Alipour-Vaezi, Mohammad, et al.
Published: (2025)
Quantile Markov Decision Process
by: Li, Xiaocheng, et al.
Published: (2017)
by: Li, Xiaocheng, et al.
Published: (2017)
Dimension-free Private Mean Estimation for Anisotropic Distributions
by: Dagan, Yuval, et al.
Published: (2024)
by: Dagan, Yuval, et al.
Published: (2024)
Improving the stability of the covariance-controlled adaptive Langevin thermostat for large-scale Bayesian sampling
by: Wei, Jiani, et al.
Published: (2025)
by: Wei, Jiani, et al.
Published: (2025)
Understanding Uncertainty Sampling via Equivalent Loss
by: Liu, Shang, et al.
Published: (2023)
by: Liu, Shang, et al.
Published: (2023)
K*-Means: A Parameter-free Clustering Algorithm
by: Mahon, Louis, et al.
Published: (2025)
by: Mahon, Louis, et al.
Published: (2025)
Dual-Directed Algorithm Design for Efficient Pure Exploration
by: Qin, Chao, et al.
Published: (2023)
by: Qin, Chao, et al.
Published: (2023)
Policy Improvement Reinforcement Learning
by: Wang, Huaiyang, et al.
Published: (2026)
by: Wang, Huaiyang, et al.
Published: (2026)
Robust Distributed Estimation: Extending Gossip Algorithms to Ranking and Trimmed Means
by: Van Elst, Anna, et al.
Published: (2025)
by: Van Elst, Anna, et al.
Published: (2025)
Scalable Decentralized Algorithms for Online Personalized Mean Estimation
by: Galante, Franco, et al.
Published: (2024)
by: Galante, Franco, et al.
Published: (2024)
Uncertainty Estimation and Quantification for LLMs: A Simple Supervised Approach
by: Liu, Linyu, et al.
Published: (2024)
by: Liu, Linyu, et al.
Published: (2024)
Mean Estimation from Coarse Data: Characterizations and Efficient Algorithms
by: Kalavasis, Alkis, et al.
Published: (2026)
by: Kalavasis, Alkis, et al.
Published: (2026)
Risk Profiling and Modulation for LLMs
by: Wang, Yikai, et al.
Published: (2025)
by: Wang, Yikai, et al.
Published: (2025)
Principled Algorithms for Optimizing Generalized Metrics in Multi-Label Learning
by: Mohri, Mehryar, et al.
Published: (2026)
by: Mohri, Mehryar, et al.
Published: (2026)
Sparse Mean Estimation in Adversarial Settings via Incremental Learning
by: Ma, Jianhao, et al.
Published: (2023)
by: Ma, Jianhao, et al.
Published: (2023)
OM2P: Offline Multi-Agent Mean-Flow Policy
by: Li, Zhuoran, et al.
Published: (2025)
by: Li, Zhuoran, et al.
Published: (2025)
A Fast Algorithm for the Real-Valued Combinatorial Pure Exploration of Multi-Armed Bandit
by: Nakamura, Shintaro, et al.
Published: (2023)
by: Nakamura, Shintaro, et al.
Published: (2023)
Bayesian Bandit Algorithms with Approximate Inference in Stochastic Linear Bandits
by: Huang, Ziyi, et al.
Published: (2024)
by: Huang, Ziyi, et al.
Published: (2024)
Calibrating conditional risk
by: Vasilyev, Andrey, et al.
Published: (2026)
by: Vasilyev, Andrey, et al.
Published: (2026)
When No-Rejection Learning is Consistent for Regression with Rejection
by: Li, Xiaocheng, et al.
Published: (2023)
by: Li, Xiaocheng, et al.
Published: (2023)
Learning to Make Adherence-Aware Advice
by: Chen, Guanting, et al.
Published: (2023)
by: Chen, Guanting, et al.
Published: (2023)
A Sub-Quadratic Time Algorithm for Robust Sparse Mean Estimation
by: Pensia, Ankit
Published: (2024)
by: Pensia, Ankit
Published: (2024)
Uniform Mean Estimation for Heavy-Tailed Distributions via Median-of-Means
by: Høgsgaard, Mikael Møller, et al.
Published: (2025)
by: Høgsgaard, Mikael Møller, et al.
Published: (2025)
Predictor-Rejector Multi-Class Abstention: Theoretical Analysis and Algorithms
by: Mao, Anqi, et al.
Published: (2023)
by: Mao, Anqi, et al.
Published: (2023)
Conditional Mean and Variance Estimation via \textit{k}-NN Algorithm with Automated Variance Selection
by: Matabuena, Marcos, et al.
Published: (2024)
by: Matabuena, Marcos, et al.
Published: (2024)
Differentially-Private Collaborative Online Personalized Mean Estimation
by: Yakimenka, Yauhen, et al.
Published: (2024)
by: Yakimenka, Yauhen, et al.
Published: (2024)
Optimal Survey Design for Private Mean Estimation
by: Chen, Yu-Wei, et al.
Published: (2025)
by: Chen, Yu-Wei, et al.
Published: (2025)
Learning-Based Sparsification of Dynamic Graphs in Robotic Exploration Algorithms
by: Sastry, Adithya V., et al.
Published: (2026)
by: Sastry, Adithya V., et al.
Published: (2026)
Collaborative Prediction: To Join or To Disjoin Datasets
by: Kim, Kyung Rok, et al.
Published: (2025)
by: Kim, Kyung Rok, et al.
Published: (2025)
On the Benefits of Free Exploration for Regret Minimization in Multi-Armed Bandits
by: Hou, Yunlong, et al.
Published: (2026)
by: Hou, Yunlong, et al.
Published: (2026)
Theoretically Grounded Loss Functions and Algorithms for Score-Based Multi-Class Abstention
by: Mao, Anqi, et al.
Published: (2023)
by: Mao, Anqi, et al.
Published: (2023)
An Observation on Lloyd's k-Means Algorithm in High Dimensions
by: Silva-Sánchez, David, et al.
Published: (2025)
by: Silva-Sánchez, David, et al.
Published: (2025)
Covariance-Aware Private Mean Estimation Without Private Covariance Estimation
by: Brown, Gavin, et al.
Published: (2021)
by: Brown, Gavin, et al.
Published: (2021)
ExAL: An Exploration Enhanced Adversarial Learning Algorithm
by: Vinil, A, et al.
Published: (2024)
by: Vinil, A, et al.
Published: (2024)
Provably Efficient Algorithms for S- and Non-Rectangular Robust MDPs with General Parameterization
by: Satheesh, Anirudh, et al.
Published: (2026)
by: Satheesh, Anirudh, et al.
Published: (2026)
High-Accuracy List-Decodable Mean Estimation
by: Chen, Ziyun, et al.
Published: (2025)
by: Chen, Ziyun, et al.
Published: (2025)
Dropout-Based Rashomon Set Exploration for Efficient Predictive Multiplicity Estimation
by: Hsu, Hsiang, et al.
Published: (2024)
by: Hsu, Hsiang, et al.
Published: (2024)
Similar Items
-
Towards Better Statistical Understanding of Watermarking LLMs
by: Cai, Zhongze, et al.
Published: (2024) -
The Enhanced Physics-Informed Kolmogorov-Arnold Networks: Applications of Newton's Laws in Financial Deep Reinforcement Learning (RL) Algorithms
by: Thoi, Trang, et al.
Published: (2026) -
What Matters in Data for DPO?
by: Pan, Yu, et al.
Published: (2025) -
Optimistic Reinforcement Learning with Quantile Objectives
by: Alipour-Vaezi, Mohammad, et al.
Published: (2025) -
Quantile Markov Decision Process
by: Li, Xiaocheng, et al.
Published: (2017)