:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Wang, Rui, Cao, Yaoguang, Chen, Yuyi, Xu, Jianyi, Li, Zhuoyang, Shang, Jiachen, Yang, Shichun
Format:	Preprint
Published:	2026
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2602.01832
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

A Multi-modal Fusion Network for Terrain Perception Based on Illumination Aware
by: Wang, Rui, et al.
Published: (2025)

FocalAD: Local Motion Planning for End-to-End Autonomous Driving
by: Sun, Bin, et al.
Published: (2025)

Synesthesia of Machines (SoM)-Enhanced ISAC Precoding for Vehicular Networks with Double Dynamics
by: Yang, Zonghui, et al.
Published: (2024)

ForeAct: Steering Your VLA with Efficient Visual Foresight Planning
by: Zhang, Zhuoyang, et al.
Published: (2026)

Empowering Backbone Models for Visual Text Generation with Input Granularity Control and Glyph-Aware Training
by: Li, Wenbo, et al.
Published: (2024)

Synesthesia of Machines (SoM)-Enhanced Sub-THz ISAC Transmission for Air-Ground Network
by: Yang, Zonghui, et al.
Published: (2025)

Dimensions of Vulnerability in Visual Working Memory: An AI-Driven Approach to Perceptual Comparison
by: Cao, Yuang, et al.
Published: (2025)

Sample-and-Bound for Non-Convex Optimization
by: Zhai, Yaoguang, et al.
Published: (2024)

Synesthesia of Machines (SoM)-Based Task-Driven MIMO System for Image Transmission
by: Li, Sijiang, et al.
Published: (2025)

Stable Asynchrony: Variance-Controlled Off-Policy RL for LLMs
by: Huang, Luke J., et al.
Published: (2026)

Unbiased Visual Reasoning with Controlled Visual Inputs
by: Li, Zhaonan, et al.
Published: (2025)

HART: Efficient Visual Generation with Hybrid Autoregressive Transformer
by: Tang, Haotian, et al.
Published: (2024)

TransDex: Pre-training Visuo-Tactile Policy with Point Cloud Reconstruction for Dexterous Manipulation of Transparent Objects
by: Li, Fengguan, et al.
Published: (2026)

Cross-Paradigm Evaluation of Gaze-Based Semantic Object Identification for Intelligent Vehicles
by: Deng, Penghao, et al.
Published: (2026)

Oedipus and the Sphinx: Benchmarking and Improving Visual Language Models for Complex Graphic Reasoning
by: Zhang, Jianyi, et al.
Published: (2025)

Bridging the Sim-to-Real Gap in Semiconductor Visual Program Synthesis via Input Binarization
by: Ohtsubo, Yusuke, et al.
Published: (2026)

Tracking Drift: Variation-Aware Entropy Scheduling for Non-Stationary Reinforcement Learning
by: Wang, Tongxi, et al.
Published: (2026)

Visual Inception: Compromising Long-term Planning in Agentic Recommenders via Multimodal Memory Poisoning
by: Qian, Jiachen
Published: (2026)

VTD: Visual and Tactile Database for Driver State and Behavior Perception
by: Wang, Jie, et al.
Published: (2024)

Advancing Deep Learning through Probability Engineering: A Pragmatic Paradigm for Modern AI
by: Zhang, Jianyi
Published: (2025)

Differentiable Rule Induction from Raw Sequence Inputs
by: Gao, Kun, et al.
Published: (2026)

Music Style Transfer With Diffusion Model
by: Huang, Hong, et al.
Published: (2024)

Clarifying Semantics of In-Context Examples for Unit Test Generation
by: Yang, Chen, et al.
Published: (2025)

Topology Optimization of Random Memristors for Input-Aware Dynamic SNN
by: Wang, Bo, et al.
Published: (2024)

Transferring Tactile Data Across Sensors
by: Amri, Wadhah Zai El, et al.
Published: (2024)

DocSeeker: Structured Visual Reasoning with Evidence Grounding for Long Document Understanding
by: Yan, Hao, et al.
Published: (2026)

SpatialDreamer: Self-supervised Stereo Video Synthesis from Monocular Input
by: Lv, Zhen, et al.
Published: (2024)

TaCo: A Benchmark for Lossless and Lossy Codecs of Heterogeneous Tactile Data
by: Cheng, Zhengxue, et al.
Published: (2026)

Aerial Vision-Language Navigation with a Unified Framework for Spatial, Temporal and Embodied Reasoning
by: Xu, Huilin, et al.
Published: (2025)

Towards Environmentally Equitable AI via Geographical Load Balancing
by: Li, Pengfei, et al.
Published: (2023)

Demonstrating the Octopi-1.5 Visual-Tactile-Language Model
by: Yu, Samson, et al.
Published: (2025)

Physics-Driven Learning Framework for Tomographic Tactile Sensing
by: Yang, Xuanxuan, et al.
Published: (2025)

DA-Cramming: Enhancing Cost-Effective Language Model Pretraining with Dependency Agreement Integration
by: Kuo, Martin, et al.
Published: (2023)

Tactile MNIST: Benchmarking Active Tactile Perception
by: Schneider, Tim, et al.
Published: (2025)

Vital Insight: Assisting Experts' Context-Driven Sensemaking of Multi-modal Personal Tracking Data Using Visualization and Human-In-The-Loop LLM
by: Li, Jiachen, et al.
Published: (2024)

Adversarial Generative Flow Network for Solving Vehicle Routing Problems
by: Zhang, Ni, et al.
Published: (2025)

Reactive Diffusion Policy: Slow-Fast Visual-Tactile Policy Learning for Contact-Rich Manipulation
by: Xue, Han, et al.
Published: (2025)

An Enhanced Prompt-Based LLM Reasoning Scheme via Knowledge Graph-Integrated Collaboration
by: Li, Yihao, et al.
Published: (2024)

The Concept of the Tactile Signature System for Individuals with Visual Impairments
by: Kremenchutskiy, Anatoliy, et al.
Published: (2024)

VisualWebInstruct: Scaling up Multimodal Instruction Data through Web Search
by: Jia, Yiming, et al.
Published: (2025)