Saved in:
| Main Authors: | Khai, Nguyen Truong, Vinh, Luong Duc |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2512.23594 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Multi-Perspective Data Augmentation for Few-shot Object Detection
by: Vu, Anh-Khoa Nguyen, et al.
Published: (2025)
by: Vu, Anh-Khoa Nguyen, et al.
Published: (2025)
Enhanced Generative Data Augmentation for Semantic Segmentation via Stronger Guidance
by: Che, Quang-Huy, et al.
Published: (2024)
by: Che, Quang-Huy, et al.
Published: (2024)
TwinLiteNet+: An Enhanced Multi-Task Segmentation Model for Autonomous Driving
by: Che, Quang-Huy, et al.
Published: (2024)
by: Che, Quang-Huy, et al.
Published: (2024)
Rethinking Top Probability from Multi-view for Distracted Driver Behaviour Localization
by: Nguyen, Quang Vinh, et al.
Published: (2024)
by: Nguyen, Quang Vinh, et al.
Published: (2024)
TinyGiantVLM: A Lightweight Vision-Language Architecture for Spatial Reasoning under Resource Constraints
by: Ly, Vinh-Thuan, et al.
Published: (2025)
by: Ly, Vinh-Thuan, et al.
Published: (2025)
TriLiteNet: Lightweight Model for Multi-Task Visual Perception
by: Che, Quang-Huy, et al.
Published: (2025)
by: Che, Quang-Huy, et al.
Published: (2025)
Enhancing person re-identification via Uncertainty Feature Fusion Method and Auto-weighted Measure Combination
by: Che, Quang-Huy, et al.
Published: (2024)
by: Che, Quang-Huy, et al.
Published: (2024)
ViCocktail: Automated Multi-Modal Data Collection for Vietnamese Audio-Visual Speech Recognition
by: Nguyen, Thai-Binh, et al.
Published: (2025)
by: Nguyen, Thai-Binh, et al.
Published: (2025)
Enhancing the Fairness and Performance of Edge Cameras with Explainable AI
by: Nguyen, Truong Thanh Hung, et al.
Published: (2024)
by: Nguyen, Truong Thanh Hung, et al.
Published: (2024)
A Time Series Dataset of NIR Spectra and RGB and NIR-HSI Images of the Barley Germination Process
by: Engstrøm, Ole-Christian Galbo, et al.
Published: (2025)
by: Engstrøm, Ole-Christian Galbo, et al.
Published: (2025)
ODExAI: A Comprehensive Object Detection Explainable AI Evaluation
by: Nguyen, Loc Phuc Truong, et al.
Published: (2025)
by: Nguyen, Loc Phuc Truong, et al.
Published: (2025)
Towards RGB-NIR Cross-modality Image Registration and Beyond
by: Li, Huadong, et al.
Published: (2024)
by: Li, Huadong, et al.
Published: (2024)
The Art of Camouflage: Few-Shot Learning for Animal Detection and Segmentation
by: Nguyen, Thanh-Danh, et al.
Published: (2023)
by: Nguyen, Thanh-Danh, et al.
Published: (2023)
MonoSelfRecon: Purely Self-Supervised Explicit Generalizable 3D Reconstruction of Indoor Scenes from Monocular RGB Views
by: Li, Runfa, et al.
Published: (2024)
by: Li, Runfa, et al.
Published: (2024)
PerspectiveNet: Multi-View Perception for Dynamic Scene Understanding
by: Nguyen, Vinh
Published: (2024)
by: Nguyen, Vinh
Published: (2024)
Multimodal Object Detection using Depth and Image Data for Manufacturing Parts
by: Mahjourian, Nazanin, et al.
Published: (2024)
by: Mahjourian, Nazanin, et al.
Published: (2024)
SDPA++: A General Framework for Self-Supervised Denoising with Patch Aggregation
by: Nguyen, Huy Minh Nhat, et al.
Published: (2025)
by: Nguyen, Huy Minh Nhat, et al.
Published: (2025)
Pixel-aligned RGB-NIR Stereo Imaging and Dataset for Robot Vision
by: Kim, Jinnyeong, et al.
Published: (2024)
by: Kim, Jinnyeong, et al.
Published: (2024)
EA-Swin: An Embedding-Agnostic Swin Transformer for AI-Generated Video Detection
by: Mai, Hung, et al.
Published: (2026)
by: Mai, Hung, et al.
Published: (2026)
ColorMamba: Towards High-quality NIR-to-RGB Spectral Translation with Mamba
by: Zhai, Huiyu, et al.
Published: (2024)
by: Zhai, Huiyu, et al.
Published: (2024)
IQBench: How "Smart'' Are Vision-Language Models? A Study with Human IQ Tests
by: Pham, Tan-Hanh, et al.
Published: (2025)
by: Pham, Tan-Hanh, et al.
Published: (2025)
Dual-Path Enhancements in Event-Based Eye Tracking: Augmented Robustness and Adaptive Temporal Modeling
by: Truong, Hoang M., et al.
Published: (2025)
by: Truong, Hoang M., et al.
Published: (2025)
SilVar: Speech Driven Multimodal Model for Reasoning Visual Question Answering and Object Localization
by: Pham, Tan-Hanh, et al.
Published: (2024)
by: Pham, Tan-Hanh, et al.
Published: (2024)
MambaCAFU: Hybrid Multi-Scale and Multi-Attention Model with Mamba-Based Fusion for Medical Image Segmentation
by: Bui, T-Mai, et al.
Published: (2025)
by: Bui, T-Mai, et al.
Published: (2025)
Anatomical Attention Alignment representation for Radiology Report Generation
by: Nguyen, Quang Vinh, et al.
Published: (2025)
by: Nguyen, Quang Vinh, et al.
Published: (2025)
Vision-Language Models for Infrared Industrial Sensing in Additive Manufacturing Scene Description
by: Mahjourian, Nazanin, et al.
Published: (2025)
by: Mahjourian, Nazanin, et al.
Published: (2025)
FA-Seg: A Fast and Accurate Diffusion-Based Method for Open-Vocabulary Segmentation
by: Che, Huy, et al.
Published: (2025)
by: Che, Huy, et al.
Published: (2025)
Ambient-robust Inverse Rendering using Active RGB-NIR Imaging
by: Chung, Hoon-Gyu, et al.
Published: (2026)
by: Chung, Hoon-Gyu, et al.
Published: (2026)
Hierarchical Neural Collapse Detection Transformer for Class Incremental Object Detection
by: Pham, Duc Thanh, et al.
Published: (2025)
by: Pham, Duc Thanh, et al.
Published: (2025)
LP-OVOD: Open-Vocabulary Object Detection by Linear Probing
by: Pham, Chau, et al.
Published: (2023)
by: Pham, Chau, et al.
Published: (2023)
RGB-Only Supervised Camera Parameter Optimization in Dynamic Scenes
by: Li, Fang, et al.
Published: (2025)
by: Li, Fang, et al.
Published: (2025)
VEIGAR: View-consistent Explicit Inpainting and Geometry Alignment for 3D object Removal
by: Do, Pham Khai Nguyen, et al.
Published: (2025)
by: Do, Pham Khai Nguyen, et al.
Published: (2025)
CLEAR: Causal Learning Framework For Robust Histopathology Tumor Detection Under Out-Of-Distribution Shifts
by: Thi, Kieu-Anh Truong, et al.
Published: (2025)
by: Thi, Kieu-Anh Truong, et al.
Published: (2025)
Preliminary analysis of RGB-NIR Image Registration techniques for off-road forestry environments
by: Deoli, Pankaj, et al.
Published: (2026)
by: Deoli, Pankaj, et al.
Published: (2026)
R&D: Balancing Reliability and Diversity in Synthetic Data Augmentation for Semantic Segmentation
by: Che, Huy, et al.
Published: (2026)
by: Che, Huy, et al.
Published: (2026)
Smart Camera Parking System With Auto Parking Spot Detection
by: Nguyen, Tuan T., et al.
Published: (2024)
by: Nguyen, Tuan T., et al.
Published: (2024)
Energy-Based Sliced Wasserstein Distance
by: Nguyen, Khai, et al.
Published: (2023)
by: Nguyen, Khai, et al.
Published: (2023)
Sliced Wasserstein Estimation with Control Variates
by: Nguyen, Khai, et al.
Published: (2023)
by: Nguyen, Khai, et al.
Published: (2023)
Self-supervised Video Object Segmentation with Distillation Learning of Deformable Attention
by: Truong, Quang-Trung, et al.
Published: (2024)
by: Truong, Quang-Trung, et al.
Published: (2024)
Camera Motion Estimation from RGB-D-Inertial Scene Flow
by: Cerezo, Samuel, et al.
Published: (2024)
by: Cerezo, Samuel, et al.
Published: (2024)
Similar Items
-
Multi-Perspective Data Augmentation for Few-shot Object Detection
by: Vu, Anh-Khoa Nguyen, et al.
Published: (2025) -
Enhanced Generative Data Augmentation for Semantic Segmentation via Stronger Guidance
by: Che, Quang-Huy, et al.
Published: (2024) -
TwinLiteNet+: An Enhanced Multi-Task Segmentation Model for Autonomous Driving
by: Che, Quang-Huy, et al.
Published: (2024) -
Rethinking Top Probability from Multi-view for Distracted Driver Behaviour Localization
by: Nguyen, Quang Vinh, et al.
Published: (2024) -
TinyGiantVLM: A Lightweight Vision-Language Architecture for Spatial Reasoning under Resource Constraints
by: Ly, Vinh-Thuan, et al.
Published: (2025)