Saved in:
| Main Authors: | Liu, Pengwei, Hao, Zhongkai, Ren, Xingyu, Yuan, Hangjie, Ren, Jiayang, Ni, Dong |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2407.05232 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Discovering Physical Directions in Weight Space: Composing Neural PDE Experts
by: Wang, Pengkai, et al.
Published: (2026)
by: Wang, Pengkai, et al.
Published: (2026)
An Efficient Graph-Transformer Operator for Learning Physical Dynamics with Manifolds Embedding
by: Liu, Pengwei, et al.
Published: (2025)
by: Liu, Pengwei, et al.
Published: (2025)
Quantum Semi-Random Forests for Qubit-Efficient Recommender Systems
by: Alavi, Azadeh, et al.
Published: (2025)
by: Alavi, Azadeh, et al.
Published: (2025)
Where We Have Arrived in Proving the Emergence of Sparse Symbolic Concepts in AI Models
by: Ren, Qihan, et al.
Published: (2023)
by: Ren, Qihan, et al.
Published: (2023)
A Global Optimization Algorithm for K-Center Clustering of One Billion Samples
by: Ren, Jiayang, et al.
Published: (2022)
by: Ren, Jiayang, et al.
Published: (2022)
Learning-driven Physically-aware Large-scale Circuit Gate Sizing
by: Ye, Yuyang, et al.
Published: (2024)
by: Ye, Yuyang, et al.
Published: (2024)
Correcting Mean Bias in Text Embeddings: A Refined Renormalization with Training-Free Improvements on MMTEB
by: Ren, Xingyu, et al.
Published: (2025)
by: Ren, Xingyu, et al.
Published: (2025)
Task Aware Dreamer for Task Generalization in Reinforcement Learning
by: Ying, Chengyang, et al.
Published: (2023)
by: Ying, Chengyang, et al.
Published: (2023)
Rethinking the Stability-Plasticity Trade-off in Continual Learning from an Architectural Perspective
by: Lu, Aojun, et al.
Published: (2025)
by: Lu, Aojun, et al.
Published: (2025)
Proxy Compression for Language Modeling
by: Zheng, Lin, et al.
Published: (2026)
by: Zheng, Lin, et al.
Published: (2026)
On the Reuse Bias in Off-Policy Reinforcement Learning
by: Ying, Chengyang, et al.
Published: (2022)
by: Ying, Chengyang, et al.
Published: (2022)
A Physics-preserved Transfer Learning Method for Differential Equations
by: Yang, Hao-Ran, et al.
Published: (2025)
by: Yang, Hao-Ran, et al.
Published: (2025)
Preconditioning for Physics-Informed Neural Networks
by: Liu, Songming, et al.
Published: (2024)
by: Liu, Songming, et al.
Published: (2024)
Task-Oriented Multimodal Token Transmission in Resource-Constrained Multiuser Networks
by: Zhang, Junhe, et al.
Published: (2025)
by: Zhang, Junhe, et al.
Published: (2025)
A Faster Path to Continual Learning
by: Li, Wei, et al.
Published: (2026)
by: Li, Wei, et al.
Published: (2026)
Learning Semantic Proxies from Visual Prompts for Parameter-Efficient Fine-Tuning in Deep Metric Learning
by: Ren, Li, et al.
Published: (2024)
by: Ren, Li, et al.
Published: (2024)
Towards Mitigating Excessive Forgetting in LLM Unlearning via Entanglement-Guidance with Proxy Constraint
by: Liu, Zhihao, et al.
Published: (2025)
by: Liu, Zhihao, et al.
Published: (2025)
A Survey on Diffusion Models for Anomaly Detection
by: Liu, Jing, et al.
Published: (2025)
by: Liu, Jing, et al.
Published: (2025)
Why Does RL Generalize Better Than SFT? A Data-Centric Perspective on VLM Post-Training
by: Lu, Aojun, et al.
Published: (2026)
by: Lu, Aojun, et al.
Published: (2026)
Curriculum Sampling: A Two-Phase Curriculum for Efficient Training of Flow Matching
by: Sun, Pengwei
Published: (2026)
by: Sun, Pengwei
Published: (2026)
Uncertainty-aware Knowledge Tracing
by: Cheng, Weihua, et al.
Published: (2025)
by: Cheng, Weihua, et al.
Published: (2025)
Verbalized Graph Representation Learning: A Fully Interpretable Graph Model Based on Large Language Models Throughout the Entire Process
by: Ji, Xingyu, et al.
Published: (2024)
by: Ji, Xingyu, et al.
Published: (2024)
Cognitive Activation and Chaotic Dynamics in Large Language Models: A Quasi-Lyapunov Analysis of Reasoning Mechanisms
by: Li, Xiaojian, et al.
Published: (2025)
by: Li, Xiaojian, et al.
Published: (2025)
Are Your Reasoning Models Reasoning or Guessing? A Mechanistic Analysis of Hierarchical Reasoning Models
by: Ren, Zirui, et al.
Published: (2026)
by: Ren, Zirui, et al.
Published: (2026)
SAMora: Enhancing SAM through Hierarchical Self-Supervised Pre-Training for Medical Images
by: Chen, Shuhang, et al.
Published: (2025)
by: Chen, Shuhang, et al.
Published: (2025)
Your Diffusion Model is Secretly a Certifiably Robust Classifier
by: Chen, Huanran, et al.
Published: (2024)
by: Chen, Huanran, et al.
Published: (2024)
Proxy-Guided Measurement Calibration
by: Vishnubhatla, Saketh, et al.
Published: (2026)
by: Vishnubhatla, Saketh, et al.
Published: (2026)
Generating In-Distribution Proxy Graphs for Explaining Graph Neural Networks
by: Chen, Zhuomin, et al.
Published: (2024)
by: Chen, Zhuomin, et al.
Published: (2024)
Revisiting Neural Networks for Continual Learning: An Architectural Perspective
by: Lu, Aojun, et al.
Published: (2024)
by: Lu, Aojun, et al.
Published: (2024)
Adapt before Continual Learning
by: Lu, Aojun, et al.
Published: (2025)
by: Lu, Aojun, et al.
Published: (2025)
Sharpness-aware Federated Graph Learning
by: Li, Ruiyu, et al.
Published: (2025)
by: Li, Ruiyu, et al.
Published: (2025)
Improved Operator Learning by Orthogonal Attention
by: Xiao, Zipeng, et al.
Published: (2023)
by: Xiao, Zipeng, et al.
Published: (2023)
Human-AI Collaborative Autonomous Experimentation With Proxy Modeling for Comparative Observation
by: Biswas, Arpan, et al.
Published: (2026)
by: Biswas, Arpan, et al.
Published: (2026)
Linking Process to Outcome: Conditional Reward Modeling for LLM Reasoning
by: Zhang, Zheng, et al.
Published: (2025)
by: Zhang, Zheng, et al.
Published: (2025)
Exploratory Diffusion Model for Unsupervised Reinforcement Learning
by: Ying, Chengyang, et al.
Published: (2025)
by: Ying, Chengyang, et al.
Published: (2025)
Modeling Latent Non-Linear Dynamical System over Time Series
by: Fujiwara, Ren, et al.
Published: (2024)
by: Fujiwara, Ren, et al.
Published: (2024)
Amortized Network Intervention to Steer the Excitatory Point Processes
by: Song, Zitao, et al.
Published: (2023)
by: Song, Zitao, et al.
Published: (2023)
Bi-directional Model Cascading with Proxy Confidence
by: Warren, David, et al.
Published: (2025)
by: Warren, David, et al.
Published: (2025)
GIT-Mol: A Multi-modal Large Language Model for Molecular Science with Graph, Image, and Text
by: Liu, Pengfei, et al.
Published: (2023)
by: Liu, Pengfei, et al.
Published: (2023)
ProxyKV: Cross-Model Proxy Pruning for Efficient Long-Context LLM Inference
by: Li, Junjie, et al.
Published: (2026)
by: Li, Junjie, et al.
Published: (2026)
Similar Items
-
Discovering Physical Directions in Weight Space: Composing Neural PDE Experts
by: Wang, Pengkai, et al.
Published: (2026) -
An Efficient Graph-Transformer Operator for Learning Physical Dynamics with Manifolds Embedding
by: Liu, Pengwei, et al.
Published: (2025) -
Quantum Semi-Random Forests for Qubit-Efficient Recommender Systems
by: Alavi, Azadeh, et al.
Published: (2025) -
Where We Have Arrived in Proving the Emergence of Sparse Symbolic Concepts in AI Models
by: Ren, Qihan, et al.
Published: (2023) -
A Global Optimization Algorithm for K-Center Clustering of One Billion Samples
by: Ren, Jiayang, et al.
Published: (2022)