:: Library Catalog

Bibliographic Details
Main Author:	Meng, Wei
Format:	Preprint
Published:	2025
Subjects:	Computers and Society; Artificial Intelligence; Computer Vision and Pattern Recognition; 05C82, 68T07, 68T05, 62H30; I.2.10; I.4.8; H.5.1; H.2.8
Online Access:	https://arxiv.org/abs/2507.21100

Similar Items

TerraSeg: Self-Supervised Ground Segmentation for Any LiDAR
by: Lentsch, Ted, et al.
Published: (2026)

UNION: Unsupervised 3D Object Detection using Object Appearance-based Pseudo-Classes
by: Lentsch, Ted, et al.
Published: (2024)

Self-Attention And Beyond the Infinite: Towards Linear Transformers with Infinite Self-Attention
by: Roffo, Giorgio, et al.
Published: (2026)

Classifier Calibration at Scale: An Empirical Study of Model-Agnostic Post-Hoc Methods
by: Manokhin, Valery, et al.
Published: (2026)

See What You Need: Query-Aware Visual Intelligence through Reasoning-Perception Loops
by: Dong, Zixuan, et al.
Published: (2025)

DeepShade: Enable Shade Simulation by Text-conditioned Image Generation
by: Da, Longchao, et al.
Published: (2025)

Dense Video Understanding with Gated Residual Tokenization
by: Zhang, Haichao, et al.
Published: (2025)

Active Negative Loss: A Robust Framework for Learning with Noisy Labels
by: Ye, Xichen, et al.
Published: (2024)

Akasha 2: Hamiltonian State Space Duality and Visual-Language Joint Embedding Predictive Architectur
by: Meziani, Yani
Published: (2026)

IAUNet: Instance-Aware U-Net
by: Prytula, Yaroslav, et al.
Published: (2025)

AI-Powered Augmented Reality for Satellite Assembly, Integration and Test
by: Patricio, Alvaro, et al.
Published: (2024)

AVATAAR: Agentic Video Answering via Temporal Adaptive Alignment and Reasoning
by: Patel, Urjitkumar, et al.
Published: (2025)

Deep Learning-Based Multi-Object Tracking: A Comprehensive Survey from Foundations to State-of-the-Art
by: Adžemović, Momir
Published: (2025)

GeoJEPA: Towards Eliminating Augmentation- and Sampling Bias in Multimodal Geospatial Learning
by: Lundqvist, Theodor, et al.
Published: (2025)

WSCIF: A Weakly-Supervised Color Intelligence Framework for Tactical Anomaly Detection in Surveillance Keyframes
by: Meng, Wei
Published: (2025)

VideoMind: An Omni-Modal Video Dataset with Intent Grounding for Deep-Cognitive Video Understanding
by: Yang, Baoyao, et al.
Published: (2025)

Pixel-Wise Multimodal Contrastive Learning for Remote Sensing Images
by: Stival, Leandro, et al.
Published: (2026)

A deep learning approach to track eye movements based on events
by: Seth, Chirag, et al.
Published: (2025)

MVTamperBench: Evaluating Robustness of Vision-Language Models
by: Agarwal, Amit, et al.
Published: (2024)

LinkedOut: Linking World Knowledge Representation Out of Video LLM for Next-Generation Video Recommendation
by: Zhang, Haichao, et al.
Published: (2025)

Modeling and Visualization Reasoning for Stakeholders in Education and Industry Integration Systems: Research on Structured Synthetic Dialogue Data Generation Based on NIST Standards
by: Meng, Wei
Published: (2025)

Polarization-Based Eye Tracking with Personalized Siamese Architectures
by: Kalkanli, Beyza, et al.
Published: (2026)

Visualizing the Evolution of Twitter (X.com) Conversations: A Comprehensive Methodology Applied to AI Training Discussions on ChatGPT
by: Jess, Nicole, et al.
Published: (2024)

Divergence-Based Similarity Function for Multi-View Contrastive Learning
by: Jeon, Jae Hyoung, et al.
Published: (2025)

Enhanced Single-Cell RNA-seq Embedding through Gene Expression and Data-Driven Gene-Gene Interaction Integration
by: Goudarzi, Hojjat Torabi, et al.
Published: (2025)

Mitigating Catastrophic Forgetting in Streaming Generative and Predictive Learning via Stateful Replay
by: Du, Wenzhang
Published: (2025)

Single-Step Reconstruction-Free Anomaly Detection and Segmentation via Diffusion Models
by: Moradi, Mehrdad, et al.
Published: (2025)

Efficient and Privacy-Protecting Background Removal for 2D Video Streaming using iPhone 15 Pro Max LiDAR
by: Kinnevan, Jessica, et al.
Published: (2025)

Enhancing Diversity in Multi-objective Feature Selection
by: Miyandoab, Sevil Zanjani, et al.
Published: (2024)

Network Analysis of the Egyptian Reddit Community
by: Shaawat, Samy, et al.
Published: (2026)

Measuring Similarity in Causal Graphs: A Framework for Semantic and Structural Analysis
by: Liu, Ning-Yuan Georgia, et al.
Published: (2025)

OpenFusion++: An Open-vocabulary Real-time Scene Understanding System
by: Jin, Xiaofeng, et al.
Published: (2025)

LightPFP: A Lightweight Route to Ab Initio Accuracy at Scale
by: Li, Wenwen, et al.
Published: (2025)

Vis-CoT: A Human-in-the-Loop Framework for Interactive Visualization and Intervention in LLM Chain-of-Thought Reasoning
by: Pather, Kaviraj, et al.
Published: (2025)

VisChainBench: A Benchmark for Multi-Turn, Multi-Image Visual Reasoning Beyond Language Priors
by: Lyu, Wenbo, et al.
Published: (2025)

Geo2Sound: A Scalable Geo-Aligned Framework for Soundscape Generation from Satellite Imagery
by: Wu, Kunlin, et al.
Published: (2026)

Hierarchical Point-Patch Fusion with Adaptive Patch Codebook for 3D Shape Anomaly Detection
by: Kang, Xueyang, et al.
Published: (2026)

A Single Image Is All You Need: Zero-Shot Anomaly Localization Without Training Data
by: Moradi, Mehrdad, et al.
Published: (2025)

AssistedDS: Benchmarking How External Domain Knowledge Assists LLMs in Automated Data Science
by: Luo, An, et al.
Published: (2025)

Butter: Frequency Consistency and Hierarchical Fusion for Autonomous Driving Object Detection
by: Lin, Xiaojian, et al.
Published: (2025)