:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Khai, Nguyen Truong, Vinh, Luong Duc
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2512.23594
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Multi-Perspective Data Augmentation for Few-shot Object Detection
by: Vu, Anh-Khoa Nguyen, et al.
Published: (2025)

Enhanced Generative Data Augmentation for Semantic Segmentation via Stronger Guidance
by: Che, Quang-Huy, et al.
Published: (2024)

TwinLiteNet+: An Enhanced Multi-Task Segmentation Model for Autonomous Driving
by: Che, Quang-Huy, et al.
Published: (2024)

Rethinking Top Probability from Multi-view for Distracted Driver Behaviour Localization
by: Nguyen, Quang Vinh, et al.
Published: (2024)

TinyGiantVLM: A Lightweight Vision-Language Architecture for Spatial Reasoning under Resource Constraints
by: Ly, Vinh-Thuan, et al.
Published: (2025)

TriLiteNet: Lightweight Model for Multi-Task Visual Perception
by: Che, Quang-Huy, et al.
Published: (2025)

Enhancing person re-identification via Uncertainty Feature Fusion Method and Auto-weighted Measure Combination
by: Che, Quang-Huy, et al.
Published: (2024)

ViCocktail: Automated Multi-Modal Data Collection for Vietnamese Audio-Visual Speech Recognition
by: Nguyen, Thai-Binh, et al.
Published: (2025)

Enhancing the Fairness and Performance of Edge Cameras with Explainable AI
by: Nguyen, Truong Thanh Hung, et al.
Published: (2024)

A Time Series Dataset of NIR Spectra and RGB and NIR-HSI Images of the Barley Germination Process
by: Engstrøm, Ole-Christian Galbo, et al.
Published: (2025)

ODExAI: A Comprehensive Object Detection Explainable AI Evaluation
by: Nguyen, Loc Phuc Truong, et al.
Published: (2025)

Towards RGB-NIR Cross-modality Image Registration and Beyond
by: Li, Huadong, et al.
Published: (2024)

The Art of Camouflage: Few-Shot Learning for Animal Detection and Segmentation
by: Nguyen, Thanh-Danh, et al.
Published: (2023)

MonoSelfRecon: Purely Self-Supervised Explicit Generalizable 3D Reconstruction of Indoor Scenes from Monocular RGB Views
by: Li, Runfa, et al.
Published: (2024)

PerspectiveNet: Multi-View Perception for Dynamic Scene Understanding
by: Nguyen, Vinh
Published: (2024)

Multimodal Object Detection using Depth and Image Data for Manufacturing Parts
by: Mahjourian, Nazanin, et al.
Published: (2024)

SDPA++: A General Framework for Self-Supervised Denoising with Patch Aggregation
by: Nguyen, Huy Minh Nhat, et al.
Published: (2025)

Pixel-aligned RGB-NIR Stereo Imaging and Dataset for Robot Vision
by: Kim, Jinnyeong, et al.
Published: (2024)

EA-Swin: An Embedding-Agnostic Swin Transformer for AI-Generated Video Detection
by: Mai, Hung, et al.
Published: (2026)

ColorMamba: Towards High-quality NIR-to-RGB Spectral Translation with Mamba
by: Zhai, Huiyu, et al.
Published: (2024)

IQBench: How "Smart'' Are Vision-Language Models? A Study with Human IQ Tests
by: Pham, Tan-Hanh, et al.
Published: (2025)

Dual-Path Enhancements in Event-Based Eye Tracking: Augmented Robustness and Adaptive Temporal Modeling
by: Truong, Hoang M., et al.
Published: (2025)

SilVar: Speech Driven Multimodal Model for Reasoning Visual Question Answering and Object Localization
by: Pham, Tan-Hanh, et al.
Published: (2024)

MambaCAFU: Hybrid Multi-Scale and Multi-Attention Model with Mamba-Based Fusion for Medical Image Segmentation
by: Bui, T-Mai, et al.
Published: (2025)

Anatomical Attention Alignment representation for Radiology Report Generation
by: Nguyen, Quang Vinh, et al.
Published: (2025)

Vision-Language Models for Infrared Industrial Sensing in Additive Manufacturing Scene Description
by: Mahjourian, Nazanin, et al.
Published: (2025)

FA-Seg: A Fast and Accurate Diffusion-Based Method for Open-Vocabulary Segmentation
by: Che, Huy, et al.
Published: (2025)

Ambient-robust Inverse Rendering using Active RGB-NIR Imaging
by: Chung, Hoon-Gyu, et al.
Published: (2026)

Hierarchical Neural Collapse Detection Transformer for Class Incremental Object Detection
by: Pham, Duc Thanh, et al.
Published: (2025)

LP-OVOD: Open-Vocabulary Object Detection by Linear Probing
by: Pham, Chau, et al.
Published: (2023)

RGB-Only Supervised Camera Parameter Optimization in Dynamic Scenes
by: Li, Fang, et al.
Published: (2025)

VEIGAR: View-consistent Explicit Inpainting and Geometry Alignment for 3D object Removal
by: Do, Pham Khai Nguyen, et al.
Published: (2025)

CLEAR: Causal Learning Framework For Robust Histopathology Tumor Detection Under Out-Of-Distribution Shifts
by: Thi, Kieu-Anh Truong, et al.
Published: (2025)

Preliminary analysis of RGB-NIR Image Registration techniques for off-road forestry environments
by: Deoli, Pankaj, et al.
Published: (2026)

R&D: Balancing Reliability and Diversity in Synthetic Data Augmentation for Semantic Segmentation
by: Che, Huy, et al.
Published: (2026)

Smart Camera Parking System With Auto Parking Spot Detection
by: Nguyen, Tuan T., et al.
Published: (2024)

Energy-Based Sliced Wasserstein Distance
by: Nguyen, Khai, et al.
Published: (2023)

Sliced Wasserstein Estimation with Control Variates
by: Nguyen, Khai, et al.
Published: (2023)

Self-supervised Video Object Segmentation with Distillation Learning of Deformable Attention
by: Truong, Quang-Trung, et al.
Published: (2024)

Camera Motion Estimation from RGB-D-Inertial Scene Flow
by: Cerezo, Samuel, et al.
Published: (2024)