Search results - Lin, F.
- Showing 1 - 20 of 73 results
1
Winning Gold at IMO 2025 with a Model-Agnostic Verification-and-Refinement Pipeline by Huang, Yichen, Yang, Lin F.
Published / Created 2025
Preprint
2
Confident Natural Policy Gradient for Local Planning in $q_π$-realizable Constrained MDPs by Tian, Tian, Yang, Lin F., Szepesvári, Csaba
Published / Created 2024
Preprint
3
Sample Complexity Bounds for Linear Constrained MDPs with a Generative Model by Liu, Xingtu, Yang, Lin F., Vaswani, Sharan
Published / Created 2025
Preprint
4
Near-Optimal Sample Complexity Bounds for Constrained Average-Reward MDPs by Wei, Yukuan, Li, Xudong, Yang, Lin F.
Published / Created 2025
Preprint
5
Near-Optimal Sample Complexity for Online Constrained MDPs by Liu, Chang, Li, Yunfan, Yang, Lin F.
Published / Created 2026
Preprint
6
Misspecified $Q$-Learning with Sparse Linear Function Approximation: Tight Bounds on Approximation Error by Du, Ally Yalei, Yang, Lin F., Wang, Ruosong
Published / Created 2024
Preprint
7
Tackling Heavy-Tailed Rewards in Reinforcement Learning with Function Approximation: Minimax Optimal and Instance-Dependent Regret Bounds by Huang, Jiayi, Zhong, Han, Wang, Liwei, Yang, Lin F.
Published / Created 2023
Preprint
8
Uniform Last-Iterate Guarantee for Bandits and Reinforcement Learning by Liu, Junyan, Li, Yunfan, Wang, Ruosong, Yang, Lin F.
Published / Created 2024
Preprint
9
Learning for Bandits under Action Erasures by Hanna, Osama, Karakas, Merve, Yang, Lin F., Fragouli, Christina
Published / Created 2024
Preprint
10
Does Feedback Help in Bandits with Arm Erasures? by Karakas, Merve, Hanna, Osama, Yang, Lin F., Fragouli, Christina
Published / Created 2025
Preprint
11
On the optimal regret of collaborative personalized linear bandits by Huang, Bruce, Zhou, Ruida, Yang, Lin F., Diggavi, Suhas
Published / Created 2025
Preprint
12
Best-Arm Identification with Noisy Actuation by Karakas, Merve, Hanna, Osama, Yang, Lin F., Fragouli, Christina
Published / Created 2026
Preprint
13
Multi-Agent Bandit Learning through Heterogeneous Action Erasure Channels by Hanna, Osama A., Karakas, Merve, Yang, Lin F., Fragouli, Christina
Published / Created 2023
Preprint
14
Don't Forget to Connect! Improving RAG with Graph-based Reranking by Dong, Jialin, Fatemi, Bahare, Perozzi, Bryan, Yang, Lin F., Tsitsulin, Anton
Published / Created 2024
Preprint
15
ARMOR: High-Performance Semi-Structured Pruning via Adaptive Matrix Factorization by Liu, Lawrence, Liu, Alexander, Wang, Mengdi, Zhao, Tuo, Yang, Lin F.
Published / Created 2025
Preprint
16
A geometric distortion solution specifically for historical observations and its implementation by Lin, F. R., Peng, Q. Y., Zheng, Z. J., Guo, B. F.
Published / Created 2024
Preprint
17
Precision premium transformation -- a high-precision astrometric solution based on the precision premium curve by Zheng, Z. J., Peng, Q. Y., Lin, F. R., Li, D., Zheng, Y.
Published / Created 2024
Preprint
18
Hyper: Hyperparameter Robust Efficient Exploration in Reinforcement Learning by Wang, Yiran, Liu, Chenshu, Li, Yunfan, Amani, Sanae, Zhou, Bolei, Yang, Lin F.
Published / Created 2024
Preprint
19
NoWag: A Unified Framework for Shape Preserving Compression of Large Language Models by Liu, Lawrence, Chakrabarti, Inesh, Li, Yixiao, Wang, Mengdi, Zhao, Tuo, Yang, Lin F.
Published / Created 2025
Preprint
20
LACONIC: Length-Aware Constrained Reinforcement Learning for LLM by Liu, Chang, Zhao, Yiran, Liu, Lawrence, Ye, Yaoqi, Szepesvári, Csaba, Yang, Lin F.
Published / Created 2026
Preprint