Saved in:
| Main Authors: | Shao, Run, Yang, Cheng, Li, Qiujun, Zhu, Qing, Zhang, Yongjun, Li, YanSheng, Liu, Yu, Tang, Yong, Liu, Dapeng, Yang, Shizhong, Li, Haifeng |
|---|---|
| Format: | Preprint |
| Published: |
2023
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2401.00546 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
AllSpark: Reborn Labeled Features from Unlabeled in Transformer for Semi-Supervised Semantic Segmentation
by: Wang, Haonan, et al.
Published: (2024)
by: Wang, Haonan, et al.
Published: (2024)
LocationAgent: A Hierarchical Agent for Image Geolocation via Decoupling Strategy and Evidence from Parametric Knowledge
by: Li, Qiujun, et al.
Published: (2026)
by: Li, Qiujun, et al.
Published: (2026)
RS-GPT4V: A Unified Multimodal Instruction-Following Dataset for Remote Sensing Image Understanding
by: Xu, Linrui, et al.
Published: (2024)
by: Xu, Linrui, et al.
Published: (2024)
Fabrication and Analysis of the Wear Properties of High‐Vanadium High‐Speed Steel through Spark Plasma Sintering
by: Shuaiwu Tong, et al.
Published: (2024)
by: Shuaiwu Tong, et al.
Published: (2024)
Adaptive Channel Estimation and Hybrid Beamforming for RIS aided Vehicular Communication
by: Li, Tianyou, et al.
Published: (2026)
by: Li, Tianyou, et al.
Published: (2026)
STA-GANN: A Valid and Generalizable Spatio-Temporal Kriging Approach
by: Li, Yujie, et al.
Published: (2025)
by: Li, Yujie, et al.
Published: (2025)
PixelRefer: A Unified Framework for Spatio-Temporal Object Referring with Arbitrary Granularity
by: Yuan, Yuqian, et al.
Published: (2025)
by: Yuan, Yuqian, et al.
Published: (2025)
The Wittgensteinian Representation Hypothesis: Is Language the Attractor of Multimodal Convergence?
by: Zhang, Zhaoyang, et al.
Published: (2026)
by: Zhang, Zhaoyang, et al.
Published: (2026)
Genesis: Multimodal Driving Scene Generation with Spatio-Temporal and Cross-Modal Consistency
by: Guo, Xiangyu, et al.
Published: (2025)
by: Guo, Xiangyu, et al.
Published: (2025)
SpaceVLLM: Endowing Multimodal Large Language Model with Spatio-Temporal Video Grounding Capability
by: Wang, Jiankang, et al.
Published: (2025)
by: Wang, Jiankang, et al.
Published: (2025)
Spatio-Temporal Few-Shot Learning via Diffusive Neural Network Generation
by: Yuan, Yuan, et al.
Published: (2024)
by: Yuan, Yuan, et al.
Published: (2024)
Value-Decomposed Reinforcement Learning Framework for Taxiway Routing with Hierarchical Conflict-Aware Observations
by: Zhou, Shizhong, et al.
Published: (2026)
by: Zhou, Shizhong, et al.
Published: (2026)
A Refer-and-Ground Multimodal Large Language Model for Biomedicine
by: Huang, Xiaoshuang, et al.
Published: (2024)
by: Huang, Xiaoshuang, et al.
Published: (2024)
LiDAR Prompted Spatio-Temporal Multi-View Stereo for Autonomous Driving
by: Sun, Qihao, et al.
Published: (2026)
by: Sun, Qihao, et al.
Published: (2026)
Unleashing the Potential of Multimodal LLMs for Zero-Shot Spatio-Temporal Video Grounding
by: Yang, Zaiquan, et al.
Published: (2025)
by: Yang, Zaiquan, et al.
Published: (2025)
Jointly Modeling Spatio-Temporal Features of Tactile Signals for Action Classification
by: Lin, Jimmy, et al.
Published: (2024)
by: Lin, Jimmy, et al.
Published: (2024)
Crip Spacetime: Access, Failure, and Accountability in Academic Life. By MargaretPrice, Durham: Duke University Press, 2024. 240 pp. $26.95 (paper). ISBN: 978‐1‐47‐803037‐9; $102.95 (hardcover). ISBN: 978‐1‐47‐802613‐6
by: Leyan Zheng, et al.
Published: (2025)
by: Leyan Zheng, et al.
Published: (2025)
STaR-Attack: A Spatio-Temporal and Narrative Reasoning Attack Framework for Unified Multimodal Understanding and Generation Models
by: Guo, Shaoxiong, et al.
Published: (2025)
by: Guo, Shaoxiong, et al.
Published: (2025)
Air Quality Prediction with A Meteorology-Guided Modality-Decoupled Spatio-Temporal Network
by: Yin, Hang, et al.
Published: (2025)
by: Yin, Hang, et al.
Published: (2025)
Transformer RGBT Tracking with Spatio-Temporal Multimodal Tokens
by: Sun, Dengdi, et al.
Published: (2024)
by: Sun, Dengdi, et al.
Published: (2024)
Robust Multimodal Semantic Segmentation with Balanced Modality Contributions
by: Tan, Jiaqi, et al.
Published: (2025)
by: Tan, Jiaqi, et al.
Published: (2025)
Multimodal Contrastive Learning via Uni-Modal Coding and Cross-Modal Prediction for Multimodal Sentiment Analysis
by: Lin, Ronghao, et al.
Published: (2022)
by: Lin, Ronghao, et al.
Published: (2022)
UniFlow: A Foundation Model for Unified Urban Spatio-Temporal Flow Prediction
by: Yuan, Yuan, et al.
Published: (2024)
by: Yuan, Yuan, et al.
Published: (2024)
GLIDE: Graph-guided Leap Inference for Diffusion Estimation of Spatio-Temporal Point Processes
by: Zhou, Guanyu, et al.
Published: (2026)
by: Zhou, Guanyu, et al.
Published: (2026)
Mining Multi-Modality Spatio-Temporal Cues for Video Important Person Identification
by: Wang, Xiao, et al.
Published: (2026)
by: Wang, Xiao, et al.
Published: (2026)
UrbanGPT: Spatio-Temporal Large Language Models
by: Li, Zhonghang, et al.
Published: (2024)
by: Li, Zhonghang, et al.
Published: (2024)
AsyReC: A Multimodal Graph-based Framework for Spatio-Temporal Asymmetric Dyadic Relationship Classification
by: Tang, Wang, et al.
Published: (2025)
by: Tang, Wang, et al.
Published: (2025)
TreeMIL: A Multi-instance Learning Framework for Time Series Anomaly Detection with Inexact Supervision
by: Liu, Chen, et al.
Published: (2024)
by: Liu, Chen, et al.
Published: (2024)
Learnability in Online Kernel Selection with Memory Constraint via Data-dependent Regret Analysis
by: Li, Junfan, et al.
Published: (2024)
by: Li, Junfan, et al.
Published: (2024)
Improved Kernel Alignment Regret Bound for Online Kernel Learning
by: Li, Junfan, et al.
Published: (2022)
by: Li, Junfan, et al.
Published: (2022)
Graph Learning-Driven Multi-Vessel Association: Fusing Multimodal Data for Maritime Intelligence
by: Lu, Yuxu, et al.
Published: (2025)
by: Lu, Yuxu, et al.
Published: (2025)
Interacted Object Grounding in Spatio-Temporal Human-Object Interactions
by: Liu, Xiaoyang, et al.
Published: (2024)
by: Liu, Xiaoyang, et al.
Published: (2024)
Decoupled Diffusion Sparks Adaptive Scene Generation
by: Zhou, Yunsong, et al.
Published: (2025)
by: Zhou, Yunsong, et al.
Published: (2025)
DMTrack: Spatio-Temporal Multimodal Tracking via Dual-Adapter
by: Li, Weihong, et al.
Published: (2025)
by: Li, Weihong, et al.
Published: (2025)
STQuant: Spatio-Temporal Adaptive Framework for Optimizer Quantization in Large Multimodal Model Training
by: Liu, Minglu, et al.
Published: (2026)
by: Liu, Minglu, et al.
Published: (2026)
Enhancing SNN-based Spatio-Temporal Learning: A Benchmark Dataset and Cross-Modality Attention Model
by: Zhou, Shibo, et al.
Published: (2024)
by: Zhou, Shibo, et al.
Published: (2024)
Long Context is Not Long at All: A Prospector of Long-Dependency Data for Large Language Models
by: Chen, Longze, et al.
Published: (2024)
by: Chen, Longze, et al.
Published: (2024)
Modality-Guided Dynamic Graph Fusion and Temporal Diffusion for Self-Supervised RGB-T Tracking
by: Li, Shenglan, et al.
Published: (2025)
by: Li, Shenglan, et al.
Published: (2025)
Multimodal Classification via Modal-Aware Interactive Enhancement
by: Jiang, Qing-Yuan, et al.
Published: (2024)
by: Jiang, Qing-Yuan, et al.
Published: (2024)
Seismic analysis based on a new interval method with incomplete information
by: Liang, Shizhong, et al.
Published: (2025)
by: Liang, Shizhong, et al.
Published: (2025)
Similar Items
-
AllSpark: Reborn Labeled Features from Unlabeled in Transformer for Semi-Supervised Semantic Segmentation
by: Wang, Haonan, et al.
Published: (2024) -
LocationAgent: A Hierarchical Agent for Image Geolocation via Decoupling Strategy and Evidence from Parametric Knowledge
by: Li, Qiujun, et al.
Published: (2026) -
RS-GPT4V: A Unified Multimodal Instruction-Following Dataset for Remote Sensing Image Understanding
by: Xu, Linrui, et al.
Published: (2024) -
Fabrication and Analysis of the Wear Properties of High‐Vanadium High‐Speed Steel through Spark Plasma Sintering
by: Shuaiwu Tong, et al.
Published: (2024) -
Adaptive Channel Estimation and Hybrid Beamforming for RIS aided Vehicular Communication
by: Li, Tianyou, et al.
Published: (2026)