Saved in:
| Main Authors: | Wang, Hao, Chen, Zhichao, Liu, Zhaoran, Li, Haozhe, Yang, Degui, Liu, Xinggao, Li, Haoxuan |
|---|---|
| Format: | Preprint |
| Published: |
2022
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2210.11039 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
FreDF: Learning to Forecast in the Frequency Domain
by: Wang, Hao, et al.
Published: (2024)
by: Wang, Hao, et al.
Published: (2024)
Proximity Matters: Local Proximity Enhanced Balancing for Treatment Effect Estimation
by: Wang, Hao, et al.
Published: (2024)
by: Wang, Hao, et al.
Published: (2024)
An Accurate and Interpretable Framework for Trustworthy Process Monitoring
by: Wang, Hao, et al.
Published: (2023)
by: Wang, Hao, et al.
Published: (2023)
Mixture of Low Rank Adaptation with Partial Parameter Sharing for Time Series Forecasting
by: Pan, Licheng, et al.
Published: (2025)
by: Pan, Licheng, et al.
Published: (2025)
CausalRM: Causal-Theoretic Reward Modeling for RLHF from Observational User Feedbacks
by: Wang, Hao, et al.
Published: (2026)
by: Wang, Hao, et al.
Published: (2026)
DeepFilter: A Transformer-style Framework for Accurate and Efficient Process Monitoring
by: Wang, Hao, et al.
Published: (2025)
by: Wang, Hao, et al.
Published: (2025)
From Data-Centric to Sample-Centric: Enhancing LLM Reasoning via Progressive Optimization
by: Chen, Xinjie, et al.
Published: (2025)
by: Chen, Xinjie, et al.
Published: (2025)
Enabling Agents to Communicate Entirely in Latent Space
by: Du, Zhuoyun, et al.
Published: (2025)
by: Du, Zhuoyun, et al.
Published: (2025)
ECAT: A Entire space Continual and Adaptive Transfer Learning Framework for Cross-Domain Recommendation
by: Hou, Chaoqun, et al.
Published: (2024)
by: Hou, Chaoqun, et al.
Published: (2024)
Deep Time-series Forecasting Needs Kernelized Moment Balancing
by: Pan, Licheng, et al.
Published: (2026)
by: Pan, Licheng, et al.
Published: (2026)
Quadratic Direct Forecast for Training Multi-Step Time-Series Forecast Models
by: Wang, Hao, et al.
Published: (2025)
by: Wang, Hao, et al.
Published: (2025)
Time-o1: Time-Series Forecasting Needs Transformed Label Alignment
by: Wang, Hao, et al.
Published: (2025)
by: Wang, Hao, et al.
Published: (2025)
Estimating the Effects of Sample Training Orders for Large Language Models without Retraining
by: Yang, Hao, et al.
Published: (2025)
by: Yang, Hao, et al.
Published: (2025)
CoLD: Counterfactually-Guided Length Debiasing for Process Reward Models in Mathematical Reasoning
by: Zheng, Congmin, et al.
Published: (2025)
by: Zheng, Congmin, et al.
Published: (2025)
DistDF: Time-Series Forecasting Needs Joint-Distribution Wasserstein Alignment
by: Wang, Hao, et al.
Published: (2025)
by: Wang, Hao, et al.
Published: (2025)
Task Priors: Enhancing Model Evaluation by Considering the Entire Space of Downstream Tasks
by: Patel, Niket, et al.
Published: (2025)
by: Patel, Niket, et al.
Published: (2025)
Reason for Future, Act for Now: A Principled Framework for Autonomous LLM Agents with Provable Sample Efficiency
by: Liu, Zhihan, et al.
Published: (2023)
by: Liu, Zhihan, et al.
Published: (2023)
Embed to Control Partially Observed Systems: Representation Learning with Provable Sample Efficiency
by: Wang, Lingxiao, et al.
Published: (2022)
by: Wang, Lingxiao, et al.
Published: (2022)
Entire Chain Uplift Modeling with Context-Enhanced Learning for Intelligent Marketing
by: Huang, Yinqiu, et al.
Published: (2024)
by: Huang, Yinqiu, et al.
Published: (2024)
DDTime: Dataset Distillation with Spectral Alignment and Information Bottleneck for Time-Series Forecasting
by: Li, Yuqi, et al.
Published: (2025)
by: Li, Yuqi, et al.
Published: (2025)
Are Transformers Able to Reason by Connecting Separated Knowledge in Training Data?
by: Yin, Yutong, et al.
Published: (2025)
by: Yin, Yutong, et al.
Published: (2025)
Reward-Augmented Data Enhances Direct Preference Alignment of LLMs
by: Zhang, Shenao, et al.
Published: (2024)
by: Zhang, Shenao, et al.
Published: (2024)
Contextual Dynamic Pricing with Strategic Buyers
by: Liu, Pangpang, et al.
Published: (2023)
by: Liu, Pangpang, et al.
Published: (2023)
FDRMFL:Multi-modal Federated Feature Extraction Model Based on Information Maximization and Contrastive Learning
by: Wu, Haozhe
Published: (2025)
by: Wu, Haozhe
Published: (2025)
HiPER: Hierarchical Reinforcement Learning with Explicit Credit Assignment for Large Language Model Agents
by: Peng, Jiangweizhi, et al.
Published: (2026)
by: Peng, Jiangweizhi, et al.
Published: (2026)
IB-GRPO: Aligning LLM-based Learning Path Recommendation with Educational Objectives via Indicator-Based Group Relative Policy Optimization
by: Wang, Shuai, et al.
Published: (2026)
by: Wang, Shuai, et al.
Published: (2026)
DKINet: Medication Recommendation via Domain Knowledge Informed Deep Learning
by: Liu, Sicen, et al.
Published: (2023)
by: Liu, Sicen, et al.
Published: (2023)
Self-Distilled Disentangled Learning for Counterfactual Prediction
by: Li, Xinshu, et al.
Published: (2024)
by: Li, Xinshu, et al.
Published: (2024)
Pessimism in the Face of Confounders: Provably Efficient Offline Reinforcement Learning in Partially Observable Markov Decision Processes
by: Lu, Miao, et al.
Published: (2022)
by: Lu, Miao, et al.
Published: (2022)
Revisiting Counterfactual Regression through the Lens of Gromov-Wasserstein Information Bottleneck
by: Yang, Hao, et al.
Published: (2024)
by: Yang, Hao, et al.
Published: (2024)
Language Models for Controllable DNA Sequence Design
by: Su, Xingyu, et al.
Published: (2025)
by: Su, Xingyu, et al.
Published: (2025)
Provably Mitigating Overoptimization in RLHF: Your SFT Loss is Implicitly an Adversarial Regularizer
by: Liu, Zhihan, et al.
Published: (2024)
by: Liu, Zhihan, et al.
Published: (2024)
Paying Less Generalization Tax: A Cross-Domain Generalization Study of RL Training for LLM Agents
by: Liu, Zhihan, et al.
Published: (2026)
by: Liu, Zhihan, et al.
Published: (2026)
Transformer-Based Spatial-Temporal Counterfactual Outcomes Estimation
by: Li, He, et al.
Published: (2025)
by: Li, He, et al.
Published: (2025)
Latent-Space Contrastive Reinforcement Learning for Stable and Efficient LLM Reasoning
by: Shan, Lianlei, et al.
Published: (2026)
by: Shan, Lianlei, et al.
Published: (2026)
Can a Single Tree Outperform an Entire Forest?
by: Mao, Qiangqiang, et al.
Published: (2024)
by: Mao, Qiangqiang, et al.
Published: (2024)
Benchmarking Counterfactual Interpretability in Deep Learning Models for Time Series Classification
by: Kan, Ziwen, et al.
Published: (2024)
by: Kan, Ziwen, et al.
Published: (2024)
LeapFactual: Reliable Visual Counterfactual Explanation Using Conditional Flow Matching
by: Cao, Zhuo, et al.
Published: (2025)
by: Cao, Zhuo, et al.
Published: (2025)
Decentralized Autoregressive Generation
by: Maschan, Stepan, et al.
Published: (2026)
by: Maschan, Stepan, et al.
Published: (2026)
Budgeting Counterfactual for Offline RL
by: Liu, Yao, et al.
Published: (2023)
by: Liu, Yao, et al.
Published: (2023)
Similar Items
-
FreDF: Learning to Forecast in the Frequency Domain
by: Wang, Hao, et al.
Published: (2024) -
Proximity Matters: Local Proximity Enhanced Balancing for Treatment Effect Estimation
by: Wang, Hao, et al.
Published: (2024) -
An Accurate and Interpretable Framework for Trustworthy Process Monitoring
by: Wang, Hao, et al.
Published: (2023) -
Mixture of Low Rank Adaptation with Partial Parameter Sharing for Time Series Forecasting
by: Pan, Licheng, et al.
Published: (2025) -
CausalRM: Causal-Theoretic Reward Modeling for RLHF from Observational User Feedbacks
by: Wang, Hao, et al.
Published: (2026)