| Main Authors: | Zhang, Junpeng; Cheng, Lei; Zhang, Guoxi; Cai, Hua; Xu, Qing; Zhang, Quanshi |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.17967 |
Similar Items
Randomness of Low-Layer Parameters Determines Confusing Samples in Terms of Interaction Representations of a DNN
by: Zhang, Junpeng, et al.
Published: (2025)
Revisiting Generalization Power of a DNN in Terms of Symbolic Interactions
by: Cheng, Lei, et al.
Published: (2025)
Two-Phase Dynamics of Interactions Explains the Starting Point of a DNN Learning Over-Fitted Features
by: Zhang, Junpeng, et al.
Published: (2024)
Technical Report: Quantifying and Analyzing the Generalization Power of a DNN
by: He, Yuxuan, et al.
Published: (2025)
Towards the Dynamics of a DNN Learning Symbolic Interactions
by: Ren, Qihan, et al.
Published: (2024)
Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability
by: Ren, Qihan, et al.
Published: (2026)
Technical Note: Defining and Quantifying AND-OR Interactions for Faithful and Concise Explanation of DNNs
by: Li, Mingjie, et al.
Published: (2023)
Attribution Explanations for Deep Neural Networks: A Theoretical Perspective
by: Deng, Huiqi, et al.
Published: (2025)
Does a Neural Network Really Encode Symbolic Concepts?
by: Li, Mingjie, et al.
Published: (2023)
Layerwise Change of Knowledge in Neural Networks
by: Cheng, Xu, et al.
Published: (2024)
Disentangling Regional Primitives for Image Generation
by: Chen, Zhengting, et al.
Published: (2024)
End-to-End Neuro-Symbolic Reinforcement Learning with Textual Explanations
by: Luo, Lirui, et al.
Published: (2024)
Dissecting Dissonance: Benchmarking Large Multimodal Models Against Self-Contradictory Instructions
by: Gao, Jin, et al.
Published: (2024)
Debunk the Myth of SFT Generalization
by: Lin, Xiaofeng, et al.
Published: (2025)
Quantifying In-Context Reasoning Effects and Memorization Effects in LLMs
by: Lou, Siyu, et al.
Published: (2024)
Towards Attributions of Input Variables in a Coalition
by: Zheng, Xinhao, et al.
Published: (2023)
The Interaction Bottleneck of Deep Neural Networks: Discovery, Proof, and Modulation
by: Deng, Huiqi, et al.
Published: (2025)
FedTreeLoRA: Reconciling Statistical and Functional Heterogeneity in Federated LoRA Fine-Tuning
by: Bian, Jieming, et al.
Published: (2026)
Implicit Reward as the Bridge: A Unified View of SFT and DPO Connections
by: Wang, Bo, et al.
Published: (2025)
VickreyFeedback: Cost-efficient Data Construction for Reinforcement Learning from Human Feedback
by: Zhang, Guoxi, et al.
Published: (2024)
MVR: Multi-view Video Reward Shaping for Reinforcement Learning
by: Luo, Lirui, et al.
Published: (2026)
RL Squeezes, SFT Expands: A Comparative Study of Reasoning LLMs
by: Matsutani, Kohsei, et al.
Published: (2025)
Explaining Generalization Power of a DNN Using Interactive Concepts
by: Zhou, Huilin, et al.
Published: (2023)
Red Teaming Language Models for Processing Contradictory Dialogues
by: Wen, Xiaofei, et al.
Published: (2024)
A Unified and Stable Risk Minimization Framework for Weakly Supervised Learning with Theoretical Guarantees
by: Zhang, Miao, et al.
Published: (2025)
Cost-Sensitive Unbiased Risk Estimation for Multi-Class Positive-Unlabeled Learning
by: Zhang, Miao, et al.
Published: (2025)
Utilizing Autoregressive Networks for Full Lifecycle Data Generation of Rolling Bearings for RUL Prediction
by: Wang, Junliang, et al.
Published: (2024)
Defining and Extracting generalizable interaction primitives from DNNs
by: Chen, Lu, et al.
Published: (2024)
Where We Have Arrived in Proving the Emergence of Sparse Symbolic Concepts in AI Models
by: Ren, Qihan, et al.
Published: (2023)
A Game-Theoretic Negotiation Framework for Cross-Cultural Consensus in LLMs
by: Zhang, Guoxi, et al.
Published: (2025)
TEMPLE: Incentivizing Temporal Understanding of Video Large Language Models via Progressive Pre-SFT Alignment
by: Li, Shicheng, et al.
Published: (2025)
Can LLMs Reconcile Knowledge Conflicts in Counterfactual Reasoning
by: Yamin, Khurram, et al.
Published: (2025)
CID-TKG: Collaborative Historical Invariance and Evolutionary Dynamics Learning for Temporal Knowledge Graph Reasoning
by: Lei, Shuai-Long, et al.
Published: (2026)
An Empirical Study of SFT-DPO Interaction and Parameterization in Small Language Models
by: Feng, Yuming, et al.
Published: (2026)
Image Generation from Contextually-Contradictory Prompts
by: Huberman, Saar, et al.
Published: (2025)
Towards the Resistance of Neural Network Watermarking to Fine-tuning
by: Tang, Ling, et al.
Published: (2025)
mSFT: Addressing Dataset Mixtures Overfitting Heterogeneously in Multi-task SFT
by: Koh, Woosung, et al.
Published: (2026)
Complementing reinforcement learning with SFT through logit averaging in the post training of LLMs
by: Gan, Xingwei, et al.
Published: (2026)
Entropy-Gradient Inversion: Moving Toward Internal Mechanism of Large Reasoning Models
by: Yang, Junyao, et al.
Published: (2026)
How Much Backtracking is Enough? Exploring the Interplay of SFT and RL in Enhancing LLM Reasoning
by: Cai, Hongyi James, et al.
Published: (2025)