:: Library Catalog

Bibliographic Details
Main Authors:	Zhang, Junpeng; Cheng, Lei; Zhang, Guoxi; Cai, Hua; Xu, Qing; Zhang, Quanshi
Format:	Preprint
Published:	2026
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2605.17967

Similar Items

Randomness of Low-Layer Parameters Determines Confusing Samples in Terms of Interaction Representations of a DNN
by: Zhang, Junpeng, et al.
Published: (2025)

Revisiting Generalization Power of a DNN in Terms of Symbolic Interactions
by: Cheng, Lei, et al.
Published: (2025)

Two-Phase Dynamics of Interactions Explains the Starting Point of a DNN Learning Over-Fitted Features
by: Zhang, Junpeng, et al.
Published: (2024)

Technical Report: Quantifying and Analyzing the Generalization Power of a DNN
by: He, Yuxuan, et al.
Published: (2025)

Towards the Dynamics of a DNN Learning Symbolic Interactions
by: Ren, Qihan, et al.
Published: (2024)

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability
by: Ren, Qihan, et al.
Published: (2026)

Technical Note: Defining and Quantifying AND-OR Interactions for Faithful and Concise Explanation of DNNs
by: Li, Mingjie, et al.
Published: (2023)

Attribution Explanations for Deep Neural Networks: A Theoretical Perspective
by: Deng, Huiqi, et al.
Published: (2025)

Does a Neural Network Really Encode Symbolic Concepts?
by: Li, Mingjie, et al.
Published: (2023)

Layerwise Change of Knowledge in Neural Networks
by: Cheng, Xu, et al.
Published: (2024)

Disentangling Regional Primitives for Image Generation
by: Chen, Zhengting, et al.
Published: (2024)

End-to-End Neuro-Symbolic Reinforcement Learning with Textual Explanations
by: Luo, Lirui, et al.
Published: (2024)

Dissecting Dissonance: Benchmarking Large Multimodal Models Against Self-Contradictory Instructions
by: Gao, Jin, et al.
Published: (2024)

Debunk the Myth of SFT Generalization
by: Lin, Xiaofeng, et al.
Published: (2025)

Quantifying In-Context Reasoning Effects and Memorization Effects in LLMs
by: Lou, Siyu, et al.
Published: (2024)

Towards Attributions of Input Variables in a Coalition
by: Zheng, Xinhao, et al.
Published: (2023)

The Interaction Bottleneck of Deep Neural Networks: Discovery, Proof, and Modulation
by: Deng, Huiqi, et al.
Published: (2025)

FedTreeLoRA: Reconciling Statistical and Functional Heterogeneity in Federated LoRA Fine-Tuning
by: Bian, Jieming, et al.
Published: (2026)

Implicit Reward as the Bridge: A Unified View of SFT and DPO Connections
by: Wang, Bo, et al.
Published: (2025)

VickreyFeedback: Cost-efficient Data Construction for Reinforcement Learning from Human Feedback
by: Zhang, Guoxi, et al.
Published: (2024)

MVR: Multi-view Video Reward Shaping for Reinforcement Learning
by: Luo, Lirui, et al.
Published: (2026)

RL Squeezes, SFT Expands: A Comparative Study of Reasoning LLMs
by: Matsutani, Kohsei, et al.
Published: (2025)

Explaining Generalization Power of a DNN Using Interactive Concepts
by: Zhou, Huilin, et al.
Published: (2023)

Red Teaming Language Models for Processing Contradictory Dialogues
by: Wen, Xiaofei, et al.
Published: (2024)

A Unified and Stable Risk Minimization Framework for Weakly Supervised Learning with Theoretical Guarantees
by: Zhang, Miao, et al.
Published: (2025)

Cost-Sensitive Unbiased Risk Estimation for Multi-Class Positive-Unlabeled Learning
by: Zhang, Miao, et al.
Published: (2025)

Utilizing Autoregressive Networks for Full Lifecycle Data Generation of Rolling Bearings for RUL Prediction
by: Wang, Junliang, et al.
Published: (2024)

Defining and Extracting generalizable interaction primitives from DNNs
by: Chen, Lu, et al.
Published: (2024)

Where We Have Arrived in Proving the Emergence of Sparse Symbolic Concepts in AI Models
by: Ren, Qihan, et al.
Published: (2023)

A Game-Theoretic Negotiation Framework for Cross-Cultural Consensus in LLMs
by: Zhang, Guoxi, et al.
Published: (2025)

TEMPLE: Incentivizing Temporal Understanding of Video Large Language Models via Progressive Pre-SFT Alignment
by: Li, Shicheng, et al.
Published: (2025)

Can LLMs Reconcile Knowledge Conflicts in Counterfactual Reasoning
by: Yamin, Khurram, et al.
Published: (2025)

CID-TKG: Collaborative Historical Invariance and Evolutionary Dynamics Learning for Temporal Knowledge Graph Reasoning
by: Lei, Shuai-Long, et al.
Published: (2026)

An Empirical Study of SFT-DPO Interaction and Parameterization in Small Language Models
by: Feng, Yuming, et al.
Published: (2026)

Image Generation from Contextually-Contradictory Prompts
by: Huberman, Saar, et al.
Published: (2025)

Towards the Resistance of Neural Network Watermarking to Fine-tuning
by: Tang, Ling, et al.
Published: (2025)

mSFT: Addressing Dataset Mixtures Overfitting Heterogeneously in Multi-task SFT
by: Koh, Woosung, et al.
Published: (2026)

Complementing reinforcement learning with SFT through logit averaging in the post training of LLMs
by: Gan, Xingwei, et al.
Published: (2026)

Entropy-Gradient Inversion: Moving Toward Internal Mechanism of Large Reasoning Models
by: Yang, Junyao, et al.
Published: (2026)

How Much Backtracking is Enough? Exploring the Interplay of SFT and RL in Enhancing LLM Reasoning
by: Cai, Hongyi James, et al.
Published: (2025)