:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Nguyen, Tuan Dung, Ho, Minh Khoi, Chen, Qi, Xie, Yutong, Cam-Tu, Nguyen, Nguyen, Minh Khoi, Nguyen, Dang Huy Pham, Hengel, Anton van den, Verjans, Johan W., Nguyen, Phi Le, Phan, Vu Minh Hieu
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2604.04863
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Overthinking Causes Hallucination: Tracing Confounder Propagation in Vision Language Models
by: Shoby, Abin, et al.
Published: (2026)

Localizing Before Answering: A Hallucination Evaluation Benchmark for Grounded Medical Multimodal LLMs
by: Nguyen, Dung, et al.
Published: (2025)

Interactive Medical Image Analysis with Concept-based Similarity Reasoning
by: Huy, Ta Duc, et al.
Published: (2025)

Enhanced Multimodal Video Retrieval System: Integrating Query Expansion and Cross-modal Temporal Event Retrieval
by: Vo, Van-Thinh, et al.
Published: (2025)

Med-StepBench: A Hierarchical Reasoning Framework for Evaluating Hallucinations in Medical Vision-Language Models
by: Nguyen, Minh Khoi, et al.
Published: (2026)

Any3DIS: Class-Agnostic 3D Instance Segmentation by 2D Mask Tracking
by: Nguyen, Phuc, et al.
Published: (2024)

SwiftPie: Lightning-fast Subject-driven Image Personalization via One step Diffusion
by: Duong, Huy, et al.
Published: (2026)

Fourier-Attentive Representation Learning: A Fourier-Guided Framework for Few-Shot Generalization in Vision-Language Models
by: Pham, Hieu Dinh Trung, et al.
Published: (2025)

Eco‐Friendly Synthesis of Zinc Oxide and Magnesium Oxide Nanoparticles: Comparative Insights into Characterization, Electrochemical, and Photocatalytic Properties
by: Nguyen Duc Huy, et al.
Published: (2026)

CSD-VAR: Content-Style Decomposition in Visual Autoregressive Models
by: Nguyen, Quang-Binh, et al.
Published: (2025)

Giant Cutaneous Horn of the Cheek: A Case Report
by: Thuc Xuan Nguyen, et al.
Published: (2026)

AdaCBM: An Adaptive Concept Bottleneck Model for Explainable and Accurate Diagnosis
by: Chowdhury, Townim F., et al.
Published: (2024)

Optimizing Electric Vehicle Charging Station Placement Using Reinforcement Learning and Agent-Based Simulations
by: Nguyen, Minh-Duc, et al.
Published: (2025)

OE3DIS: Open-Ended 3D Point Cloud Instance Segmentation
by: Nguyen, Phuc D. A., et al.
Published: (2024)

Robust Aggregation for Federated Sequential Recommendation with Sparse and Poisoned Data
by: Nguyen, Minh Hieu
Published: (2026)

Semi-supervised 3D Semantic Scene Completion with 2D Vision Foundation Model Guidance
by: Pham, Duc-Hai, et al.
Published: (2024)

Seeing the Trees for the Forest: Rethinking Weakly-Supervised Medical Visual Grounding
by: Huy, Ta Duc, et al.
Published: (2025)

KiseKloset for Fashion Retrieval and Recommendation
by: Phan-Nguyen, Thanh-Tung, et al.
Published: (2025)

VLegal-Bench: Cognitively Grounded Benchmark for Vietnamese Legal Reasoning of Large Language Models
by: Dong, Nguyen Tien, et al.
Published: (2025)

ModeDreamer: Mode Guiding Score Distillation for Text-to-3D Generation using Reference Image Prompts
by: Tran, Uy Dieu, et al.
Published: (2024)

VLQA: The First Comprehensive, Large, and High-Quality Vietnamese Dataset for Legal Question Answering
by: Nguyen, Tan-Minh, et al.
Published: (2025)

"True" self-avoiding walks on general trees
by: Nguyen, Tuan-Minh
Published: (2026)

LP-OVOD: Open-Vocabulary Object Detection by Linear Probing
by: Pham, Chau, et al.
Published: (2023)

Alibaba International E-commerce Product Search Competition DcuRAGONs Team Technical Report
by: Nguyen-Ho, Thang-Long, et al.
Published: (2025)

Looking in the mirror: A faithful counterfactual explanation method for interpreting deep image classification models
by: Chowdhury, Townim Faisal, et al.
Published: (2025)

ReFineVLA: Reasoning-Aware Teacher-Guided Transfer Fine-Tuning
by: Van Vo, Tuan, et al.
Published: (2025)

Dieu khien he da tac tu
by: Trinh, Minh Hoang, et al.
Published: (2026)

A Survey of Theory of Mind in Large Language Models: Evaluations, Representations, and Safety Risks
by: Nguyen, Hieu Minh "Jord"
Published: (2025)

KGAlign: Joint Semantic-Structural Knowledge Encoding for Multimodal Fake News Detection
by: La, Tuan-Vinh, et al.
Published: (2025)

Dual Strategies for Test-Time Adaptation
by: Phuong, Nam Nguyen, et al.
Published: (2026)

Collaboration Between Human–Robot Interaction Based on CDPR in a Virtual Reality Game Environment
by: Dang Tri Dung, et al.
Published: (2025)

MMAP: A Multi-Magnification and Prototype-Aware Architecture for Predicting Spatial Gene Expression
by: Nguyen, Hai Dang, et al.
Published: (2025)

Qualitative Properties of Solutions of Nonlinear Fractional Diffusion Equations Perturbed by a Multiplicative H ‐Regular Space‐Time White Noise
by: Dang Duc Trong, et al.
Published: (2025)

Biopolymer Application for Preservation of Tropical Fruits and Vegetables
by: Dung Thuy Nguyen Pham, et al.
Published: (2025)

Metacognitive Sensitivity for Test-Time Dynamic Model Selection
by: Trinh, Le Tuan Minh, et al.
Published: (2025)

On some Sobolev and Pólya-Szegö type inequalities with weights and applications
by: Giang, Trung Hieu, et al.
Published: (2024)

PAT: Pixel-wise Adaptive Training for Long-tailed Segmentation
by: Do, Khoi, et al.
Published: (2024)

Link prediction Graph Neural Networks for structure recognition of Handwritten Mathematical Expressions
by: Nguyen, Cuong Tuan, et al.
Published: (2025)

Stable Messenger: Steganography for Message-Concealed Image Generation
by: Nguyen, Quang, et al.
Published: (2023)

On the maximum purity of absolutely separable bipartite states
by: Dung, Hoang Phi, et al.
Published: (2025)