:: Library Catalog

Bibliographic Details
Main Authors:	Yiyan, Liang, Liu, Sifei, Zhang, Weitong
Format:	Preprint
Published:	2026
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2605.29033

Similar Items

Energy-Weighted Flow Matching for Offline Reinforcement Learning
by: Zhang, Shiyuan, et al.
Published: (2025)

Return-to-Go Is More Than a Number: Q-Guided Alignment for Return-Conditioned Supervised Learning
by: Yang, Yuxiao, et al.
Published: (2026)

Q-MMR: Off-Policy Evaluation via Recursive Reweighting and Moment Matching
by: Li, Xiang, et al.
Published: (2026)

A Few Moments Please: Scalable Graphon Learning via Moment Matching
by: Ramezanpour, Reza, et al.
Published: (2025)

Moment Matching Denoising Gibbs Sampling
by: Zhang, Mingtian, et al.
Published: (2023)

Inductive Moment Matching
by: Zhou, Linqi, et al.
Published: (2025)

Near-Optimal Second-Order Guarantees for Model-Based Adversarial Imitation Learning
by: Li, Shangzhe, et al.
Published: (2025)

Provably Efficient Offline-to-Online Value Adaptation with General Function Approximation
by: Li, Shangzhe, et al.
Published: (2026)

Moment Alignment: Unifying Gradient and Hessian Matching for Domain Generalization
by: Chen, Yuen, et al.
Published: (2025)

Your Self-Play Algorithm is Secretly an Adversarial Imitator: Understanding LLM Self-Play through the Lens of Imitation Learning
by: Li, Shangzhe, et al.
Published: (2026)

Neural Tractability via Structure: Learning-Augmented Algorithms for Graph Combinatorial Optimization
by: Li, Jialiang, et al.
Published: (2025)

Imitation from Observations with Trajectory-Level Generative Embeddings
by: Qu, Yongtao, et al.
Published: (2026)

Evaluating Uplift Modeling under Structural Biases: Insights into Metric Stability and Model Robustness
by: Yang, Yuxuan, et al.
Published: (2026)

Janus-Q: End-to-End Event-Driven Trading via Hierarchical-Gated Reward Modeling
by: Li, Xiang, et al.
Published: (2026)

Optimal Horizon-Free Reward-Free Exploration for Linear Mixture MDPs
by: Zhang, Junkai, et al.
Published: (2023)

Sharper Bounds for Chebyshev Moment Matching, with Applications
by: Musco, Cameron, et al.
Published: (2024)

Stiefel Flow Matching for Moment-Constrained Structure Elucidation
by: Cheng, Austin, et al.
Published: (2024)

Asymptotically Unbiased Synthetic Control Methods by Moment Matching
by: Kato, Masahiro, et al.
Published: (2023)

Adversarial Moment-Matching Distillation of Large Language Models
by: Jia, Chen
Published: (2024)

OGLS-SD: On-Policy Self-Distillation with Outcome-Guided Logit Steering for LLM Reasoning
by: Yang, Yuxiao, et al.
Published: (2026)

Q-learning with Adjoint Matching
by: Li, Qiyang, et al.
Published: (2026)

Enhancing Financial Market Predictions: Causality-Driven Feature Selection
by: Liang, Wenhao, et al.
Published: (2024)

Unlearning Evaluation through Subset Statistical Independence
by: Zhang, Chenhao, et al.
Published: (2026)

Nonlinearity and Uncertainty Informed Moment-Matching Gaussian Mixture Splitting
by: Kulik, Jackson, et al.
Published: (2024)

Toward Efficient Data-Free Unlearning
by: Zhang, Chenhao, et al.
Published: (2024)

Achieving Constant Regret in Linear Markov Decision Processes
by: Zhang, Weitong, et al.
Published: (2024)

Guided Learning: Lubricating End-to-End Modeling for Multi-stage Decision-making
by: Guo, Jian, et al.
Published: (2024)

Calibrating Deep Neural Network using Euclidean Distance
by: Liang, Wenhao, et al.
Published: (2024)

Trust Region Q Adjoint Matching
by: Dong, Yonghoon, et al.
Published: (2026)

Training-Free Generative Sampling via Moment-Matched Score Smoothing
by: Yao, Zhenyu, et al.
Published: (2026)

Asymptotic FDR Control with Model-X Knockoffs: Is Moments Matching Sufficient?
by: Fan, Yingying, et al.
Published: (2025)

Sketchy Moment Matching: Toward Fast and Provable Data Selection for Finetuning
by: Dong, Yijun, et al.
Published: (2024)

Uncertainty-Aware Reward-Free Exploration with General Function Approximation
by: Zhang, Junkai, et al.
Published: (2024)

Lifting Manifolds to Mitigate Pseudo-Alignment in LLM4TS
by: Zheng, Liangwei Nathan, et al.
Published: (2025)

NUM2EVENT: Interpretable Event Reasoning from Numerical time-series
by: Feng, Ninghui, et al.
Published: (2025)

MasRouter: Learning to Route LLMs for Multi-Agent Systems
by: Yue, Yanwei, et al.
Published: (2025)

Exact Gaussian Moment Matching for Residual Networks: a Second-Order Method
by: Kuang, Simon, et al.
Published: (2026)

Distributionally Robust Policy Evaluation and Learning for Continuous Treatment with Observational Data
by: Leung, Cheuk Hang, et al.
Published: (2025)

Learning a Diffusion Model Policy from Rewards via Q-Score Matching
by: Psenka, Michael, et al.
Published: (2023)

Continuous Q-Score Matching: Diffusion Guided Reinforcement Learning for Continuous-Time Control
by: Hua, Chengxiu, et al.
Published: (2025)