Saved in:
| Main Authors: | Wang, Rui, Cao, Yaoguang, Chen, Yuyi, Xu, Jianyi, Li, Zhuoyang, Shang, Jiachen, Yang, Shichun |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.01832 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
A Multi-modal Fusion Network for Terrain Perception Based on Illumination Aware
by: Wang, Rui, et al.
Published: (2025)
by: Wang, Rui, et al.
Published: (2025)
FocalAD: Local Motion Planning for End-to-End Autonomous Driving
by: Sun, Bin, et al.
Published: (2025)
by: Sun, Bin, et al.
Published: (2025)
Synesthesia of Machines (SoM)-Enhanced ISAC Precoding for Vehicular Networks with Double Dynamics
by: Yang, Zonghui, et al.
Published: (2024)
by: Yang, Zonghui, et al.
Published: (2024)
ForeAct: Steering Your VLA with Efficient Visual Foresight Planning
by: Zhang, Zhuoyang, et al.
Published: (2026)
by: Zhang, Zhuoyang, et al.
Published: (2026)
Empowering Backbone Models for Visual Text Generation with Input Granularity Control and Glyph-Aware Training
by: Li, Wenbo, et al.
Published: (2024)
by: Li, Wenbo, et al.
Published: (2024)
Synesthesia of Machines (SoM)-Enhanced Sub-THz ISAC Transmission for Air-Ground Network
by: Yang, Zonghui, et al.
Published: (2025)
by: Yang, Zonghui, et al.
Published: (2025)
Dimensions of Vulnerability in Visual Working Memory: An AI-Driven Approach to Perceptual Comparison
by: Cao, Yuang, et al.
Published: (2025)
by: Cao, Yuang, et al.
Published: (2025)
Sample-and-Bound for Non-Convex Optimization
by: Zhai, Yaoguang, et al.
Published: (2024)
by: Zhai, Yaoguang, et al.
Published: (2024)
Synesthesia of Machines (SoM)-Based Task-Driven MIMO System for Image Transmission
by: Li, Sijiang, et al.
Published: (2025)
by: Li, Sijiang, et al.
Published: (2025)
Stable Asynchrony: Variance-Controlled Off-Policy RL for LLMs
by: Huang, Luke J., et al.
Published: (2026)
by: Huang, Luke J., et al.
Published: (2026)
Unbiased Visual Reasoning with Controlled Visual Inputs
by: Li, Zhaonan, et al.
Published: (2025)
by: Li, Zhaonan, et al.
Published: (2025)
HART: Efficient Visual Generation with Hybrid Autoregressive Transformer
by: Tang, Haotian, et al.
Published: (2024)
by: Tang, Haotian, et al.
Published: (2024)
TransDex: Pre-training Visuo-Tactile Policy with Point Cloud Reconstruction for Dexterous Manipulation of Transparent Objects
by: Li, Fengguan, et al.
Published: (2026)
by: Li, Fengguan, et al.
Published: (2026)
Cross-Paradigm Evaluation of Gaze-Based Semantic Object Identification for Intelligent Vehicles
by: Deng, Penghao, et al.
Published: (2026)
by: Deng, Penghao, et al.
Published: (2026)
Oedipus and the Sphinx: Benchmarking and Improving Visual Language Models for Complex Graphic Reasoning
by: Zhang, Jianyi, et al.
Published: (2025)
by: Zhang, Jianyi, et al.
Published: (2025)
Bridging the Sim-to-Real Gap in Semiconductor Visual Program Synthesis via Input Binarization
by: Ohtsubo, Yusuke, et al.
Published: (2026)
by: Ohtsubo, Yusuke, et al.
Published: (2026)
Tracking Drift: Variation-Aware Entropy Scheduling for Non-Stationary Reinforcement Learning
by: Wang, Tongxi, et al.
Published: (2026)
by: Wang, Tongxi, et al.
Published: (2026)
Visual Inception: Compromising Long-term Planning in Agentic Recommenders via Multimodal Memory Poisoning
by: Qian, Jiachen
Published: (2026)
by: Qian, Jiachen
Published: (2026)
VTD: Visual and Tactile Database for Driver State and Behavior Perception
by: Wang, Jie, et al.
Published: (2024)
by: Wang, Jie, et al.
Published: (2024)
Advancing Deep Learning through Probability Engineering: A Pragmatic Paradigm for Modern AI
by: Zhang, Jianyi
Published: (2025)
by: Zhang, Jianyi
Published: (2025)
Differentiable Rule Induction from Raw Sequence Inputs
by: Gao, Kun, et al.
Published: (2026)
by: Gao, Kun, et al.
Published: (2026)
Music Style Transfer With Diffusion Model
by: Huang, Hong, et al.
Published: (2024)
by: Huang, Hong, et al.
Published: (2024)
Clarifying Semantics of In-Context Examples for Unit Test Generation
by: Yang, Chen, et al.
Published: (2025)
by: Yang, Chen, et al.
Published: (2025)
Topology Optimization of Random Memristors for Input-Aware Dynamic SNN
by: Wang, Bo, et al.
Published: (2024)
by: Wang, Bo, et al.
Published: (2024)
Transferring Tactile Data Across Sensors
by: Amri, Wadhah Zai El, et al.
Published: (2024)
by: Amri, Wadhah Zai El, et al.
Published: (2024)
DocSeeker: Structured Visual Reasoning with Evidence Grounding for Long Document Understanding
by: Yan, Hao, et al.
Published: (2026)
by: Yan, Hao, et al.
Published: (2026)
SpatialDreamer: Self-supervised Stereo Video Synthesis from Monocular Input
by: Lv, Zhen, et al.
Published: (2024)
by: Lv, Zhen, et al.
Published: (2024)
TaCo: A Benchmark for Lossless and Lossy Codecs of Heterogeneous Tactile Data
by: Cheng, Zhengxue, et al.
Published: (2026)
by: Cheng, Zhengxue, et al.
Published: (2026)
Aerial Vision-Language Navigation with a Unified Framework for Spatial, Temporal and Embodied Reasoning
by: Xu, Huilin, et al.
Published: (2025)
by: Xu, Huilin, et al.
Published: (2025)
Towards Environmentally Equitable AI via Geographical Load Balancing
by: Li, Pengfei, et al.
Published: (2023)
by: Li, Pengfei, et al.
Published: (2023)
Demonstrating the Octopi-1.5 Visual-Tactile-Language Model
by: Yu, Samson, et al.
Published: (2025)
by: Yu, Samson, et al.
Published: (2025)
Physics-Driven Learning Framework for Tomographic Tactile Sensing
by: Yang, Xuanxuan, et al.
Published: (2025)
by: Yang, Xuanxuan, et al.
Published: (2025)
DA-Cramming: Enhancing Cost-Effective Language Model Pretraining with Dependency Agreement Integration
by: Kuo, Martin, et al.
Published: (2023)
by: Kuo, Martin, et al.
Published: (2023)
Tactile MNIST: Benchmarking Active Tactile Perception
by: Schneider, Tim, et al.
Published: (2025)
by: Schneider, Tim, et al.
Published: (2025)
Vital Insight: Assisting Experts' Context-Driven Sensemaking of Multi-modal Personal Tracking Data Using Visualization and Human-In-The-Loop LLM
by: Li, Jiachen, et al.
Published: (2024)
by: Li, Jiachen, et al.
Published: (2024)
Adversarial Generative Flow Network for Solving Vehicle Routing Problems
by: Zhang, Ni, et al.
Published: (2025)
by: Zhang, Ni, et al.
Published: (2025)
Reactive Diffusion Policy: Slow-Fast Visual-Tactile Policy Learning for Contact-Rich Manipulation
by: Xue, Han, et al.
Published: (2025)
by: Xue, Han, et al.
Published: (2025)
An Enhanced Prompt-Based LLM Reasoning Scheme via Knowledge Graph-Integrated Collaboration
by: Li, Yihao, et al.
Published: (2024)
by: Li, Yihao, et al.
Published: (2024)
The Concept of the Tactile Signature System for Individuals with Visual Impairments
by: Kremenchutskiy, Anatoliy, et al.
Published: (2024)
by: Kremenchutskiy, Anatoliy, et al.
Published: (2024)
VisualWebInstruct: Scaling up Multimodal Instruction Data through Web Search
by: Jia, Yiming, et al.
Published: (2025)
by: Jia, Yiming, et al.
Published: (2025)
Similar Items
-
A Multi-modal Fusion Network for Terrain Perception Based on Illumination Aware
by: Wang, Rui, et al.
Published: (2025) -
FocalAD: Local Motion Planning for End-to-End Autonomous Driving
by: Sun, Bin, et al.
Published: (2025) -
Synesthesia of Machines (SoM)-Enhanced ISAC Precoding for Vehicular Networks with Double Dynamics
by: Yang, Zonghui, et al.
Published: (2024) -
ForeAct: Steering Your VLA with Efficient Visual Foresight Planning
by: Zhang, Zhuoyang, et al.
Published: (2026) -
Empowering Backbone Models for Visual Text Generation with Input Granularity Control and Glyph-Aware Training
by: Li, Wenbo, et al.
Published: (2024)