:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Wang, Hao, Chen, Zhichao, Liu, Zhaoran, Li, Haozhe, Yang, Degui, Liu, Xinggao, Li, Haoxuan
Format:	Preprint
Published:	2022
Subjects:	Machine Learning Artificial Intelligence
Online Access:	https://arxiv.org/abs/2210.11039
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

FreDF: Learning to Forecast in the Frequency Domain
by: Wang, Hao, et al.
Published: (2024)

Proximity Matters: Local Proximity Enhanced Balancing for Treatment Effect Estimation
by: Wang, Hao, et al.
Published: (2024)

An Accurate and Interpretable Framework for Trustworthy Process Monitoring
by: Wang, Hao, et al.
Published: (2023)

Mixture of Low Rank Adaptation with Partial Parameter Sharing for Time Series Forecasting
by: Pan, Licheng, et al.
Published: (2025)

CausalRM: Causal-Theoretic Reward Modeling for RLHF from Observational User Feedbacks
by: Wang, Hao, et al.
Published: (2026)

DeepFilter: A Transformer-style Framework for Accurate and Efficient Process Monitoring
by: Wang, Hao, et al.
Published: (2025)

From Data-Centric to Sample-Centric: Enhancing LLM Reasoning via Progressive Optimization
by: Chen, Xinjie, et al.
Published: (2025)

Enabling Agents to Communicate Entirely in Latent Space
by: Du, Zhuoyun, et al.
Published: (2025)

ECAT: A Entire space Continual and Adaptive Transfer Learning Framework for Cross-Domain Recommendation
by: Hou, Chaoqun, et al.
Published: (2024)

Deep Time-series Forecasting Needs Kernelized Moment Balancing
by: Pan, Licheng, et al.
Published: (2026)

Quadratic Direct Forecast for Training Multi-Step Time-Series Forecast Models
by: Wang, Hao, et al.
Published: (2025)

Time-o1: Time-Series Forecasting Needs Transformed Label Alignment
by: Wang, Hao, et al.
Published: (2025)

Estimating the Effects of Sample Training Orders for Large Language Models without Retraining
by: Yang, Hao, et al.
Published: (2025)

CoLD: Counterfactually-Guided Length Debiasing for Process Reward Models in Mathematical Reasoning
by: Zheng, Congmin, et al.
Published: (2025)

DistDF: Time-Series Forecasting Needs Joint-Distribution Wasserstein Alignment
by: Wang, Hao, et al.
Published: (2025)

Task Priors: Enhancing Model Evaluation by Considering the Entire Space of Downstream Tasks
by: Patel, Niket, et al.
Published: (2025)

Reason for Future, Act for Now: A Principled Framework for Autonomous LLM Agents with Provable Sample Efficiency
by: Liu, Zhihan, et al.
Published: (2023)

Embed to Control Partially Observed Systems: Representation Learning with Provable Sample Efficiency
by: Wang, Lingxiao, et al.
Published: (2022)

Entire Chain Uplift Modeling with Context-Enhanced Learning for Intelligent Marketing
by: Huang, Yinqiu, et al.
Published: (2024)

DDTime: Dataset Distillation with Spectral Alignment and Information Bottleneck for Time-Series Forecasting
by: Li, Yuqi, et al.
Published: (2025)

Are Transformers Able to Reason by Connecting Separated Knowledge in Training Data?
by: Yin, Yutong, et al.
Published: (2025)

Reward-Augmented Data Enhances Direct Preference Alignment of LLMs
by: Zhang, Shenao, et al.
Published: (2024)

Contextual Dynamic Pricing with Strategic Buyers
by: Liu, Pangpang, et al.
Published: (2023)

FDRMFL:Multi-modal Federated Feature Extraction Model Based on Information Maximization and Contrastive Learning
by: Wu, Haozhe
Published: (2025)

HiPER: Hierarchical Reinforcement Learning with Explicit Credit Assignment for Large Language Model Agents
by: Peng, Jiangweizhi, et al.
Published: (2026)

IB-GRPO: Aligning LLM-based Learning Path Recommendation with Educational Objectives via Indicator-Based Group Relative Policy Optimization
by: Wang, Shuai, et al.
Published: (2026)

DKINet: Medication Recommendation via Domain Knowledge Informed Deep Learning
by: Liu, Sicen, et al.
Published: (2023)

Self-Distilled Disentangled Learning for Counterfactual Prediction
by: Li, Xinshu, et al.
Published: (2024)

Pessimism in the Face of Confounders: Provably Efficient Offline Reinforcement Learning in Partially Observable Markov Decision Processes
by: Lu, Miao, et al.
Published: (2022)

Revisiting Counterfactual Regression through the Lens of Gromov-Wasserstein Information Bottleneck
by: Yang, Hao, et al.
Published: (2024)

Language Models for Controllable DNA Sequence Design
by: Su, Xingyu, et al.
Published: (2025)

Provably Mitigating Overoptimization in RLHF: Your SFT Loss is Implicitly an Adversarial Regularizer
by: Liu, Zhihan, et al.
Published: (2024)

Paying Less Generalization Tax: A Cross-Domain Generalization Study of RL Training for LLM Agents
by: Liu, Zhihan, et al.
Published: (2026)

Transformer-Based Spatial-Temporal Counterfactual Outcomes Estimation
by: Li, He, et al.
Published: (2025)

Latent-Space Contrastive Reinforcement Learning for Stable and Efficient LLM Reasoning
by: Shan, Lianlei, et al.
Published: (2026)

Can a Single Tree Outperform an Entire Forest?
by: Mao, Qiangqiang, et al.
Published: (2024)

Benchmarking Counterfactual Interpretability in Deep Learning Models for Time Series Classification
by: Kan, Ziwen, et al.
Published: (2024)

LeapFactual: Reliable Visual Counterfactual Explanation Using Conditional Flow Matching
by: Cao, Zhuo, et al.
Published: (2025)

Decentralized Autoregressive Generation
by: Maschan, Stepan, et al.
Published: (2026)

Budgeting Counterfactual for Offline RL
by: Liu, Yao, et al.
Published: (2023)