:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Wei, Ziyi, Zhong, Huaiyang, Li, Xiaocheng
Format:	Preprint
Published:	2025
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2510.10374
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Towards Better Statistical Understanding of Watermarking LLMs
by: Cai, Zhongze, et al.
Published: (2024)

The Enhanced Physics-Informed Kolmogorov-Arnold Networks: Applications of Newton's Laws in Financial Deep Reinforcement Learning (RL) Algorithms
by: Thoi, Trang, et al.
Published: (2026)

What Matters in Data for DPO?
by: Pan, Yu, et al.
Published: (2025)

Optimistic Reinforcement Learning with Quantile Objectives
by: Alipour-Vaezi, Mohammad, et al.
Published: (2025)

Quantile Markov Decision Process
by: Li, Xiaocheng, et al.
Published: (2017)

Dimension-free Private Mean Estimation for Anisotropic Distributions
by: Dagan, Yuval, et al.
Published: (2024)

Improving the stability of the covariance-controlled adaptive Langevin thermostat for large-scale Bayesian sampling
by: Wei, Jiani, et al.
Published: (2025)

Understanding Uncertainty Sampling via Equivalent Loss
by: Liu, Shang, et al.
Published: (2023)

K*-Means: A Parameter-free Clustering Algorithm
by: Mahon, Louis, et al.
Published: (2025)

Dual-Directed Algorithm Design for Efficient Pure Exploration
by: Qin, Chao, et al.
Published: (2023)

Policy Improvement Reinforcement Learning
by: Wang, Huaiyang, et al.
Published: (2026)

Robust Distributed Estimation: Extending Gossip Algorithms to Ranking and Trimmed Means
by: Van Elst, Anna, et al.
Published: (2025)

Scalable Decentralized Algorithms for Online Personalized Mean Estimation
by: Galante, Franco, et al.
Published: (2024)

Uncertainty Estimation and Quantification for LLMs: A Simple Supervised Approach
by: Liu, Linyu, et al.
Published: (2024)

Mean Estimation from Coarse Data: Characterizations and Efficient Algorithms
by: Kalavasis, Alkis, et al.
Published: (2026)

Risk Profiling and Modulation for LLMs
by: Wang, Yikai, et al.
Published: (2025)

Principled Algorithms for Optimizing Generalized Metrics in Multi-Label Learning
by: Mohri, Mehryar, et al.
Published: (2026)

Sparse Mean Estimation in Adversarial Settings via Incremental Learning
by: Ma, Jianhao, et al.
Published: (2023)

OM2P: Offline Multi-Agent Mean-Flow Policy
by: Li, Zhuoran, et al.
Published: (2025)

A Fast Algorithm for the Real-Valued Combinatorial Pure Exploration of Multi-Armed Bandit
by: Nakamura, Shintaro, et al.
Published: (2023)

Bayesian Bandit Algorithms with Approximate Inference in Stochastic Linear Bandits
by: Huang, Ziyi, et al.
Published: (2024)

Calibrating conditional risk
by: Vasilyev, Andrey, et al.
Published: (2026)

When No-Rejection Learning is Consistent for Regression with Rejection
by: Li, Xiaocheng, et al.
Published: (2023)

Learning to Make Adherence-Aware Advice
by: Chen, Guanting, et al.
Published: (2023)

A Sub-Quadratic Time Algorithm for Robust Sparse Mean Estimation
by: Pensia, Ankit
Published: (2024)

Uniform Mean Estimation for Heavy-Tailed Distributions via Median-of-Means
by: Høgsgaard, Mikael Møller, et al.
Published: (2025)

Predictor-Rejector Multi-Class Abstention: Theoretical Analysis and Algorithms
by: Mao, Anqi, et al.
Published: (2023)

Conditional Mean and Variance Estimation via \textit{k}-NN Algorithm with Automated Variance Selection
by: Matabuena, Marcos, et al.
Published: (2024)

Differentially-Private Collaborative Online Personalized Mean Estimation
by: Yakimenka, Yauhen, et al.
Published: (2024)

Optimal Survey Design for Private Mean Estimation
by: Chen, Yu-Wei, et al.
Published: (2025)

Learning-Based Sparsification of Dynamic Graphs in Robotic Exploration Algorithms
by: Sastry, Adithya V., et al.
Published: (2026)

Collaborative Prediction: To Join or To Disjoin Datasets
by: Kim, Kyung Rok, et al.
Published: (2025)

On the Benefits of Free Exploration for Regret Minimization in Multi-Armed Bandits
by: Hou, Yunlong, et al.
Published: (2026)

Theoretically Grounded Loss Functions and Algorithms for Score-Based Multi-Class Abstention
by: Mao, Anqi, et al.
Published: (2023)

An Observation on Lloyd's k-Means Algorithm in High Dimensions
by: Silva-Sánchez, David, et al.
Published: (2025)

Covariance-Aware Private Mean Estimation Without Private Covariance Estimation
by: Brown, Gavin, et al.
Published: (2021)

ExAL: An Exploration Enhanced Adversarial Learning Algorithm
by: Vinil, A, et al.
Published: (2024)

Provably Efficient Algorithms for S- and Non-Rectangular Robust MDPs with General Parameterization
by: Satheesh, Anirudh, et al.
Published: (2026)

High-Accuracy List-Decodable Mean Estimation
by: Chen, Ziyun, et al.
Published: (2025)

Dropout-Based Rashomon Set Exploration for Efficient Predictive Multiplicity Estimation
by: Hsu, Hsiang, et al.
Published: (2024)