:: Library Catalog

Bibliographic Details
Main Authors:	Shao, Run; Yang, Cheng; Li, Qiujun; Zhu, Qing; Zhang, Yongjun; Li, YanSheng; Liu, Yu; Tang, Yong; Liu, Dapeng; Yang, Shizhong; Li, Haifeng
Format:	Preprint
Published:	2023
Subjects:	Artificial Intelligence; Machine Learning
Online Access:	https://arxiv.org/abs/2401.00546

Similar Items

AllSpark: Reborn Labeled Features from Unlabeled in Transformer for Semi-Supervised Semantic Segmentation
by: Wang, Haonan, et al.
Published: (2024)

LocationAgent: A Hierarchical Agent for Image Geolocation via Decoupling Strategy and Evidence from Parametric Knowledge
by: Li, Qiujun, et al.
Published: (2026)

RS-GPT4V: A Unified Multimodal Instruction-Following Dataset for Remote Sensing Image Understanding
by: Xu, Linrui, et al.
Published: (2024)

Fabrication and Analysis of the Wear Properties of High‐Vanadium High‐Speed Steel through Spark Plasma Sintering
by: Shuaiwu Tong, et al.
Published: (2024)

Adaptive Channel Estimation and Hybrid Beamforming for RIS aided Vehicular Communication
by: Li, Tianyou, et al.
Published: (2026)

STA-GANN: A Valid and Generalizable Spatio-Temporal Kriging Approach
by: Li, Yujie, et al.
Published: (2025)

PixelRefer: A Unified Framework for Spatio-Temporal Object Referring with Arbitrary Granularity
by: Yuan, Yuqian, et al.
Published: (2025)

The Wittgensteinian Representation Hypothesis: Is Language the Attractor of Multimodal Convergence?
by: Zhang, Zhaoyang, et al.
Published: (2026)

Genesis: Multimodal Driving Scene Generation with Spatio-Temporal and Cross-Modal Consistency
by: Guo, Xiangyu, et al.
Published: (2025)

SpaceVLLM: Endowing Multimodal Large Language Model with Spatio-Temporal Video Grounding Capability
by: Wang, Jiankang, et al.
Published: (2025)

Spatio-Temporal Few-Shot Learning via Diffusive Neural Network Generation
by: Yuan, Yuan, et al.
Published: (2024)

Value-Decomposed Reinforcement Learning Framework for Taxiway Routing with Hierarchical Conflict-Aware Observations
by: Zhou, Shizhong, et al.
Published: (2026)

A Refer-and-Ground Multimodal Large Language Model for Biomedicine
by: Huang, Xiaoshuang, et al.
Published: (2024)

LiDAR Prompted Spatio-Temporal Multi-View Stereo for Autonomous Driving
by: Sun, Qihao, et al.
Published: (2026)

Unleashing the Potential of Multimodal LLMs for Zero-Shot Spatio-Temporal Video Grounding
by: Yang, Zaiquan, et al.
Published: (2025)

Jointly Modeling Spatio-Temporal Features of Tactile Signals for Action Classification
by: Lin, Jimmy, et al.
Published: (2024)

Crip Spacetime: Access, Failure, and Accountability in Academic Life. By Margaret Price, Durham: Duke University Press, 2024. 240 pp. $26.95 (paper). ISBN: 978‐1‐47‐803037‐9; $102.95 (hardcover). ISBN: 978‐1‐47‐802613‐6
by: Leyan Zheng, et al.
Published: (2025)

STaR-Attack: A Spatio-Temporal and Narrative Reasoning Attack Framework for Unified Multimodal Understanding and Generation Models
by: Guo, Shaoxiong, et al.
Published: (2025)

Air Quality Prediction with A Meteorology-Guided Modality-Decoupled Spatio-Temporal Network
by: Yin, Hang, et al.
Published: (2025)

Transformer RGBT Tracking with Spatio-Temporal Multimodal Tokens
by: Sun, Dengdi, et al.
Published: (2024)

Robust Multimodal Semantic Segmentation with Balanced Modality Contributions
by: Tan, Jiaqi, et al.
Published: (2025)

Multimodal Contrastive Learning via Uni-Modal Coding and Cross-Modal Prediction for Multimodal Sentiment Analysis
by: Lin, Ronghao, et al.
Published: (2022)

UniFlow: A Foundation Model for Unified Urban Spatio-Temporal Flow Prediction
by: Yuan, Yuan, et al.
Published: (2024)

GLIDE: Graph-guided Leap Inference for Diffusion Estimation of Spatio-Temporal Point Processes
by: Zhou, Guanyu, et al.
Published: (2026)

Mining Multi-Modality Spatio-Temporal Cues for Video Important Person Identification
by: Wang, Xiao, et al.
Published: (2026)

UrbanGPT: Spatio-Temporal Large Language Models
by: Li, Zhonghang, et al.
Published: (2024)

AsyReC: A Multimodal Graph-based Framework for Spatio-Temporal Asymmetric Dyadic Relationship Classification
by: Tang, Wang, et al.
Published: (2025)

TreeMIL: A Multi-instance Learning Framework for Time Series Anomaly Detection with Inexact Supervision
by: Liu, Chen, et al.
Published: (2024)

Learnability in Online Kernel Selection with Memory Constraint via Data-dependent Regret Analysis
by: Li, Junfan, et al.
Published: (2024)

Improved Kernel Alignment Regret Bound for Online Kernel Learning
by: Li, Junfan, et al.
Published: (2022)

Graph Learning-Driven Multi-Vessel Association: Fusing Multimodal Data for Maritime Intelligence
by: Lu, Yuxu, et al.
Published: (2025)

Interacted Object Grounding in Spatio-Temporal Human-Object Interactions
by: Liu, Xiaoyang, et al.
Published: (2024)

Decoupled Diffusion Sparks Adaptive Scene Generation
by: Zhou, Yunsong, et al.
Published: (2025)

DMTrack: Spatio-Temporal Multimodal Tracking via Dual-Adapter
by: Li, Weihong, et al.
Published: (2025)

STQuant: Spatio-Temporal Adaptive Framework for Optimizer Quantization in Large Multimodal Model Training
by: Liu, Minglu, et al.
Published: (2026)

Enhancing SNN-based Spatio-Temporal Learning: A Benchmark Dataset and Cross-Modality Attention Model
by: Zhou, Shibo, et al.
Published: (2024)

Long Context is Not Long at All: A Prospector of Long-Dependency Data for Large Language Models
by: Chen, Longze, et al.
Published: (2024)

Modality-Guided Dynamic Graph Fusion and Temporal Diffusion for Self-Supervised RGB-T Tracking
by: Li, Shenglan, et al.
Published: (2025)

Multimodal Classification via Modal-Aware Interactive Enhancement
by: Jiang, Qing-Yuan, et al.
Published: (2024)

Seismic analysis based on a new interval method with incomplete information
by: Liang, Shizhong, et al.
Published: (2025)