| Main Authors: | Chang, Shen; Tian, Renran; Adams, Nicole; Kong, Nan |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2510.03558 |
Similar Items
VLMs Guided Interpretable Decision Making for Autonomous Driving
by: Hu, Xin, et al.
Published: (2025)
SAW-Bench: Learning Situated Awareness in the Real World
by: Li, Chuhan, et al.
Published: (2026)
Real-time Ship Recognition and Georeferencing for the Improvement of Maritime Situational Awareness
by: Perez, Borja Carrillo
Published: (2024)
PSI: A Benchmark for Human Interpretation and Response in Traffic Interactions
by: Jing, Taotao, et al.
Published: (2021)
Real-Time Drone Detection in Event Cameras via Per-Pixel Frequency Analysis
by: Bezick, Michael, et al.
Published: (2026)
Drones Help Drones: A Collaborative Framework for Multi-Drone Object Trajectory Prediction and Beyond
by: Wang, Zhechao, et al.
Published: (2024)
Emotion-Aware Classroom Quality Assessment Leveraging IoT-Based Real-Time Student Monitoring
by: Nguyen, Hai, et al.
Published: (2026)
SpectraSentinel: LightWeight Dual-Stream Real-Time Drone Detection, Tracking and Payload Identification
by: Kabir, Shahriar, et al.
Published: (2025)
RTGaze: Real-Time 3D-Aware Gaze Redirection from a Single Image
by: Wang, Hengfei, et al.
Published: (2025)
Empowering Large Language Models with 3D Situation Awareness
by: Yuan, Zhihao, et al.
Published: (2025)
Assessing Situational and Spatial Awareness of VLMs with Synthetically Generated Video
by: Benschop, Pascal, et al.
Published: (2026)
Motion-Compensated Latent Semantic Canvases for Visual Situational Awareness on Edge
by: Lodin, Igor, et al.
Published: (2025)
Efficient Multi-branch Segmentation Network for Situation Awareness in Autonomous Navigation
by: Zhou, Guan-Cheng, et al.
Published: (2024)
MMOT: The First Challenging Benchmark for Drone-based Multispectral Multi-Object Tracking
by: Li, Tianhao, et al.
Published: (2025)
DreamDrone: Text-to-Image Diffusion Models are Zero-shot Perpetual View Generators
by: Kong, Hanyang, et al.
Published: (2023)
Uni-SLAM: Uncertainty-Aware Neural Implicit SLAM for Real-Time Dense Indoor Scene Reconstruction
by: Wang, Shaoxiang, et al.
Published: (2024)
A Real-Time Human Action Recognition Model for Assisted Living
by: Wang, Yixuan, et al.
Published: (2025)
KV-Tracker: Real-Time Pose Tracking with Transformers
by: Taher, Marwan, et al.
Published: (2025)
Situational Scene Graph for Structured Human-centric Situation Understanding
by: Sugandhika, Chinthani, et al.
Published: (2024)
UAV-Assisted Real-Time Disaster Detection Using Optimized Transformer Model
by: Jankovic, Branislava, et al.
Published: (2025)
Cross-Domain Semantic Segmentation with Large Language Model-Assisted Descriptor Generation
by: Hughes, Philip, et al.
Published: (2025)
Model-Based Real-Time Pose and Sag Estimation of Overhead Power Lines Using LiDAR for Drone Inspection
by: Girard, Alexandre, et al.
Published: (2025)
Learning Camera Movement Control from Real-World Drone Videos
by: Hou, Yunzhong, et al.
Published: (2024)
DroneMOT: Drone-based Multi-Object Tracking Considering Detection Difficulties and Simultaneous Moving of Drones and Objects
by: Wang, Peng, et al.
Published: (2024)
Explaining the Unseen: Multimodal Vision-Language Reasoning for Situational Awareness in Underground Mining Disasters
by: Jewel, Mizanur Rahman, et al.
Published: (2025)
Enhancing Maritime Situational Awareness through End-to-End Onboard Raw Data Analysis
by: Del Prete, Roberto, et al.
Published: (2024)
Predictive Temporal Attention on Event-based Video Stream for Energy-efficient Situation Awareness
by: Bu, Yiming, et al.
Published: (2024)
Enhancing Vision Language Models with Logic Reasoning for Situational Awareness
by: Pradeep, Pavana, et al.
Published: (2026)
Real-Time 3D Object Detection with Inference-Aligned Learning
by: Zhao, Chenyu, et al.
Published: (2025)
Hardware Acceleration for Real-Time Wildfire Detection Onboard Drone Networks
by: Briley, Austin, et al.
Published: (2024)
DARTS: A Drone-Based AI-Powered Real-Time Traffic Incident Detection System
by: Li, Bai, et al.
Published: (2025)
UAL-Bench: The First Comprehensive Unusual Activity Localization Benchmark
by: Abdullah, Hasnat Md, et al.
Published: (2024)
SurgOnAir: Hierarchy-Aware Real-Time Surgical Video Commentary
by: He, Jingyi, et al.
Published: (2026)
DeTrack: A Benchmark and Altitude-Aware Dual World Model for Drone-embodied Tracking
by: Hu, Guyue, et al.
Published: (2026)
RefDrone: A Challenging Benchmark for Referring Expression Comprehension in Drone Scenes
by: Sun, Zhichao, et al.
Published: (2025)
Drone-type-Set: Drone types detection benchmark for drone detection and tracking
by: AlDosari, Kholoud, et al.
Published: (2024)
STAR: A Benchmark for Situated Reasoning in Real-World Videos
by: Wu, Bo, et al.
Published: (2024)
Voice-Assisted Real-Time Traffic Sign Recognition System Using Convolutional Neural Network
by: Manawadu, Mayura, et al.
Published: (2024)
Vision-TTT: Efficient and Expressive Visual Representation Learning with Test-Time Training
by: Kong, Quan, et al.
Published: (2026)
Exploiting Supervised Poison Vulnerability to Strengthen Self-Supervised Defense
by: Styborski, Jeremy, et al.
Published: (2024)