Saved in:
| Main Authors: | Li, Yang, Zhang, Zirui, Liu, Yang, Mao, Chengzhi |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2604.15529 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
R-C2: Cycle-Consistent Reinforcement Learning Improves Multimodal Reasoning
by: Zhang, Zirui, et al.
Published: (2026)
by: Zhang, Zirui, et al.
Published: (2026)
SPIN: Self-Supervised Prompt INjection
by: Zhou, Leon, et al.
Published: (2024)
by: Zhou, Leon, et al.
Published: (2024)
I Can Hear You: Selective Robust Training for Deepfake Audio Detection
by: Zhang, Zirui, et al.
Published: (2024)
by: Zhang, Zirui, et al.
Published: (2024)
Understanding Temporal Logic Consistency in Video-Language Models through Cross-Modal Attention Discriminability
by: Li, Chengzhi, et al.
Published: (2025)
by: Li, Chengzhi, et al.
Published: (2025)
INT-FlashAttention: Enabling Flash Attention for INT8 Quantization
by: Chen, Shimao, et al.
Published: (2024)
by: Chen, Shimao, et al.
Published: (2024)
SageAttention2: Efficient Attention with Thorough Outlier Smoothing and Per-thread INT4 Quantization
by: Zhang, Jintao, et al.
Published: (2024)
by: Zhang, Jintao, et al.
Published: (2024)
SelfIE: Self-Interpretation of Large Language Model Embeddings
by: Chen, Haozhe, et al.
Published: (2024)
by: Chen, Haozhe, et al.
Published: (2024)
CompilerKV: Risk-Adaptive KV Compression via Offline Experience Compilation
by: Yang, Ning, et al.
Published: (2026)
by: Yang, Ning, et al.
Published: (2026)
Attention Editing: A Versatile Framework for Cross-Architecture Attention Conversion
by: Cheng, Zhen, et al.
Published: (2026)
by: Cheng, Zhen, et al.
Published: (2026)
Exploring Over-stationarization in Deep Learning-based Bus/Tram Arrival Time Prediction: Analysis and Non-stationary Effect Recovery
by: Li, Zirui, et al.
Published: (2025)
by: Li, Zirui, et al.
Published: (2025)
OPRIDE: Offline Preference-based Reinforcement Learning via In-Dataset Exploration
by: Yang, Yiqin, et al.
Published: (2026)
by: Yang, Yiqin, et al.
Published: (2026)
Differences That Matter: Auditing Models for Capability Gap Discovery and Rectification
by: Liu, Qihao, et al.
Published: (2025)
by: Liu, Qihao, et al.
Published: (2025)
Aligned but Blind: Alignment Increases Implicit Bias by Reducing Awareness of Race
by: Sun, Lihao, et al.
Published: (2025)
by: Sun, Lihao, et al.
Published: (2025)
CaDA: Cross-Problem Routing Solver with Constraint-Aware Dual-Attention
by: Li, Han, et al.
Published: (2024)
by: Li, Han, et al.
Published: (2024)
The Struggle Between Continuation and Refusal: A Mechanistic Analysis of the Continuation-Triggered Jailbreak in LLMs
by: Deng, Yonghong, et al.
Published: (2026)
by: Deng, Yonghong, et al.
Published: (2026)
Cross-Modal Attention Network with Dual Graph Learning in Multimodal Recommendation
by: Dai, Ji, et al.
Published: (2026)
by: Dai, Ji, et al.
Published: (2026)
Software-Hardware Co-optimization for Modular E2E AV Paradigm: A Unified Framework of Optimization Approaches, Simulation Environment and Evaluation Metrics
by: Ji, Chengzhi, et al.
Published: (2026)
by: Ji, Chengzhi, et al.
Published: (2026)
Enhancing Stress-Strain Predictions with Seq2Seq and Cross-Attention based on Small Punch Test
by: Yang, Zhengni, et al.
Published: (2025)
by: Yang, Zhengni, et al.
Published: (2025)
How Do LLMs and VLMs Understand Viewpoint Rotation Without Vision? An Interpretability Study
by: Yang, Zhen, et al.
Published: (2026)
by: Yang, Zhen, et al.
Published: (2026)
Looking Back and Forth: Cross-Image Attention Calibration and Attentive Preference Learning for Multi-Image Hallucination Mitigation
by: Yang, Xiaochen, et al.
Published: (2026)
by: Yang, Xiaochen, et al.
Published: (2026)
LLMs are Single-threaded Reasoners: Demystifying the Working Mechanism of Soft Thinking
by: Wu, Junhong, et al.
Published: (2025)
by: Wu, Junhong, et al.
Published: (2025)
MAXS: Meta-Adaptive Exploration with LLM Agents
by: Zhang, Jian, et al.
Published: (2026)
by: Zhang, Jian, et al.
Published: (2026)
Cross-Stage Attention Multi-Expert Network for Radiologist-Inspired Breast Ultrasound Diagnosis
by: Zhai, Xinyang, et al.
Published: (2026)
by: Zhai, Xinyang, et al.
Published: (2026)
Cooperative Autonomous Driving in Diverse Behavioral Traffic: A Heterogeneous Graph Reinforcement Learning Approach
by: Liu, Qi, et al.
Published: (2025)
by: Liu, Qi, et al.
Published: (2025)
ImageNet-D: Benchmarking Neural Network Robustness on Diffusion Synthetic Object
by: Zhang, Chenshuang, et al.
Published: (2024)
by: Zhang, Chenshuang, et al.
Published: (2024)
ConvD: Attention Enhanced Dynamic Convolutional Embeddings for Knowledge Graph Completion
by: Guo, Wenbin, et al.
Published: (2023)
by: Guo, Wenbin, et al.
Published: (2023)
Attention-guided Evidence Grounding for Spoken Question Answering
by: Yang, Ke, et al.
Published: (2026)
by: Yang, Ke, et al.
Published: (2026)
CATP: Cross-Attention Token Pruning for Accuracy Preserved Multimodal Model Inference
by: Liao, Ruqi, et al.
Published: (2024)
by: Liao, Ruqi, et al.
Published: (2024)
Diffusion Model with Cross Attention as an Inductive Bias for Disentanglement
by: Yang, Tao, et al.
Published: (2024)
by: Yang, Tao, et al.
Published: (2024)
Steering Vision-Language-Action Models as Anti-Exploration: A Test-Time Scaling Approach
by: Yang, Siyuan, et al.
Published: (2025)
by: Yang, Siyuan, et al.
Published: (2025)
Enhancing Research Idea Generation through Combinatorial Innovation and Multi-Agent Iterative Search Strategies
by: Chen, Shuai, et al.
Published: (2026)
by: Chen, Shuai, et al.
Published: (2026)
Student Guides Teacher: Weak-to-Strong Inference via Spectral Orthogonal Exploration
by: Wang, Dayu, et al.
Published: (2026)
by: Wang, Dayu, et al.
Published: (2026)
Video Diffusion Models Excel at Tracking Similar-Looking Objects Without Supervision
by: Zhang, Chenshuang, et al.
Published: (2025)
by: Zhang, Chenshuang, et al.
Published: (2025)
Are Your Reasoning Models Reasoning or Guessing? A Mechanistic Analysis of Hierarchical Reasoning Models
by: Ren, Zirui, et al.
Published: (2026)
by: Ren, Zirui, et al.
Published: (2026)
Less Is More: Fast and Accurate Reasoning with Cross-Head Unified Sparse Attention
by: Yang, Lijie, et al.
Published: (2025)
by: Yang, Lijie, et al.
Published: (2025)
Generative Auto-Bidding with Unified Modeling and Exploration
by: Zhang, Mingming, et al.
Published: (2026)
by: Zhang, Mingming, et al.
Published: (2026)
Revisiting Cross-Attention Mechanisms: Leveraging Beneficial Noise for Domain-Adaptive Learning
by: Zang, Zelin, et al.
Published: (2026)
by: Zang, Zelin, et al.
Published: (2026)
Enhancing Abstractive Summarization of Scientific Papers Using Structure Information
by: Bao, Tong, et al.
Published: (2025)
by: Bao, Tong, et al.
Published: (2025)
RS-Claw: Progressive Active Tool Exploration via Hierarchical Skill Trees for Remote Sensing Agents
by: Liu, Liangtian, et al.
Published: (2026)
by: Liu, Liangtian, et al.
Published: (2026)
SeerAttention-R: Sparse Attention Adaptation for Long Reasoning
by: Gao, Yizhao, et al.
Published: (2025)
by: Gao, Yizhao, et al.
Published: (2025)
Similar Items
-
R-C2: Cycle-Consistent Reinforcement Learning Improves Multimodal Reasoning
by: Zhang, Zirui, et al.
Published: (2026) -
SPIN: Self-Supervised Prompt INjection
by: Zhou, Leon, et al.
Published: (2024) -
I Can Hear You: Selective Robust Training for Deepfake Audio Detection
by: Zhang, Zirui, et al.
Published: (2024) -
Understanding Temporal Logic Consistency in Video-Language Models through Cross-Modal Attention Discriminability
by: Li, Chengzhi, et al.
Published: (2025) -
INT-FlashAttention: Enabling Flash Attention for INT8 Quantization
by: Chen, Shimao, et al.
Published: (2024)