Saved in:
| Main Authors: | Nguyen, Tuan Dung, Ho, Minh Khoi, Chen, Qi, Xie, Yutong, Cam-Tu, Nguyen, Nguyen, Minh Khoi, Nguyen, Dang Huy Pham, Hengel, Anton van den, Verjans, Johan W., Nguyen, Phi Le, Phan, Vu Minh Hieu |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2604.04863 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Overthinking Causes Hallucination: Tracing Confounder Propagation in Vision Language Models
by: Shoby, Abin, et al.
Published: (2026)
by: Shoby, Abin, et al.
Published: (2026)
Localizing Before Answering: A Hallucination Evaluation Benchmark for Grounded Medical Multimodal LLMs
by: Nguyen, Dung, et al.
Published: (2025)
by: Nguyen, Dung, et al.
Published: (2025)
Interactive Medical Image Analysis with Concept-based Similarity Reasoning
by: Huy, Ta Duc, et al.
Published: (2025)
by: Huy, Ta Duc, et al.
Published: (2025)
Enhanced Multimodal Video Retrieval System: Integrating Query Expansion and Cross-modal Temporal Event Retrieval
by: Vo, Van-Thinh, et al.
Published: (2025)
by: Vo, Van-Thinh, et al.
Published: (2025)
Med-StepBench: A Hierarchical Reasoning Framework for Evaluating Hallucinations in Medical Vision-Language Models
by: Nguyen, Minh Khoi, et al.
Published: (2026)
by: Nguyen, Minh Khoi, et al.
Published: (2026)
Any3DIS: Class-Agnostic 3D Instance Segmentation by 2D Mask Tracking
by: Nguyen, Phuc, et al.
Published: (2024)
by: Nguyen, Phuc, et al.
Published: (2024)
SwiftPie: Lightning-fast Subject-driven Image Personalization via One step Diffusion
by: Duong, Huy, et al.
Published: (2026)
by: Duong, Huy, et al.
Published: (2026)
Fourier-Attentive Representation Learning: A Fourier-Guided Framework for Few-Shot Generalization in Vision-Language Models
by: Pham, Hieu Dinh Trung, et al.
Published: (2025)
by: Pham, Hieu Dinh Trung, et al.
Published: (2025)
Eco‐Friendly Synthesis of Zinc Oxide and Magnesium Oxide Nanoparticles: Comparative Insights into Characterization, Electrochemical, and Photocatalytic Properties
by: Nguyen Duc Huy, et al.
Published: (2026)
by: Nguyen Duc Huy, et al.
Published: (2026)
CSD-VAR: Content-Style Decomposition in Visual Autoregressive Models
by: Nguyen, Quang-Binh, et al.
Published: (2025)
by: Nguyen, Quang-Binh, et al.
Published: (2025)
Giant Cutaneous Horn of the Cheek: A Case Report
by: Thuc Xuan Nguyen, et al.
Published: (2026)
by: Thuc Xuan Nguyen, et al.
Published: (2026)
AdaCBM: An Adaptive Concept Bottleneck Model for Explainable and Accurate Diagnosis
by: Chowdhury, Townim F., et al.
Published: (2024)
by: Chowdhury, Townim F., et al.
Published: (2024)
Optimizing Electric Vehicle Charging Station Placement Using Reinforcement Learning and Agent-Based Simulations
by: Nguyen, Minh-Duc, et al.
Published: (2025)
by: Nguyen, Minh-Duc, et al.
Published: (2025)
OE3DIS: Open-Ended 3D Point Cloud Instance Segmentation
by: Nguyen, Phuc D. A., et al.
Published: (2024)
by: Nguyen, Phuc D. A., et al.
Published: (2024)
Robust Aggregation for Federated Sequential Recommendation with Sparse and Poisoned Data
by: Nguyen, Minh Hieu
Published: (2026)
by: Nguyen, Minh Hieu
Published: (2026)
Semi-supervised 3D Semantic Scene Completion with 2D Vision Foundation Model Guidance
by: Pham, Duc-Hai, et al.
Published: (2024)
by: Pham, Duc-Hai, et al.
Published: (2024)
Seeing the Trees for the Forest: Rethinking Weakly-Supervised Medical Visual Grounding
by: Huy, Ta Duc, et al.
Published: (2025)
by: Huy, Ta Duc, et al.
Published: (2025)
KiseKloset for Fashion Retrieval and Recommendation
by: Phan-Nguyen, Thanh-Tung, et al.
Published: (2025)
by: Phan-Nguyen, Thanh-Tung, et al.
Published: (2025)
VLegal-Bench: Cognitively Grounded Benchmark for Vietnamese Legal Reasoning of Large Language Models
by: Dong, Nguyen Tien, et al.
Published: (2025)
by: Dong, Nguyen Tien, et al.
Published: (2025)
ModeDreamer: Mode Guiding Score Distillation for Text-to-3D Generation using Reference Image Prompts
by: Tran, Uy Dieu, et al.
Published: (2024)
by: Tran, Uy Dieu, et al.
Published: (2024)
VLQA: The First Comprehensive, Large, and High-Quality Vietnamese Dataset for Legal Question Answering
by: Nguyen, Tan-Minh, et al.
Published: (2025)
by: Nguyen, Tan-Minh, et al.
Published: (2025)
"True" self-avoiding walks on general trees
by: Nguyen, Tuan-Minh
Published: (2026)
by: Nguyen, Tuan-Minh
Published: (2026)
LP-OVOD: Open-Vocabulary Object Detection by Linear Probing
by: Pham, Chau, et al.
Published: (2023)
by: Pham, Chau, et al.
Published: (2023)
Alibaba International E-commerce Product Search Competition DcuRAGONs Team Technical Report
by: Nguyen-Ho, Thang-Long, et al.
Published: (2025)
by: Nguyen-Ho, Thang-Long, et al.
Published: (2025)
Looking in the mirror: A faithful counterfactual explanation method for interpreting deep image classification models
by: Chowdhury, Townim Faisal, et al.
Published: (2025)
by: Chowdhury, Townim Faisal, et al.
Published: (2025)
ReFineVLA: Reasoning-Aware Teacher-Guided Transfer Fine-Tuning
by: Van Vo, Tuan, et al.
Published: (2025)
by: Van Vo, Tuan, et al.
Published: (2025)
Dieu khien he da tac tu
by: Trinh, Minh Hoang, et al.
Published: (2026)
by: Trinh, Minh Hoang, et al.
Published: (2026)
A Survey of Theory of Mind in Large Language Models: Evaluations, Representations, and Safety Risks
by: Nguyen, Hieu Minh "Jord"
Published: (2025)
by: Nguyen, Hieu Minh "Jord"
Published: (2025)
KGAlign: Joint Semantic-Structural Knowledge Encoding for Multimodal Fake News Detection
by: La, Tuan-Vinh, et al.
Published: (2025)
by: La, Tuan-Vinh, et al.
Published: (2025)
Dual Strategies for Test-Time Adaptation
by: Phuong, Nam Nguyen, et al.
Published: (2026)
by: Phuong, Nam Nguyen, et al.
Published: (2026)
Collaboration Between Human–Robot Interaction Based on CDPR in a Virtual Reality Game Environment
by: Dang Tri Dung, et al.
Published: (2025)
by: Dang Tri Dung, et al.
Published: (2025)
MMAP: A Multi-Magnification and Prototype-Aware Architecture for Predicting Spatial Gene Expression
by: Nguyen, Hai Dang, et al.
Published: (2025)
by: Nguyen, Hai Dang, et al.
Published: (2025)
Qualitative Properties of Solutions of Nonlinear Fractional Diffusion Equations Perturbed by a Multiplicative H ‐Regular Space‐Time White Noise
by: Dang Duc Trong, et al.
Published: (2025)
by: Dang Duc Trong, et al.
Published: (2025)
Biopolymer Application for Preservation of Tropical Fruits and Vegetables
by: Dung Thuy Nguyen Pham, et al.
Published: (2025)
by: Dung Thuy Nguyen Pham, et al.
Published: (2025)
Metacognitive Sensitivity for Test-Time Dynamic Model Selection
by: Trinh, Le Tuan Minh, et al.
Published: (2025)
by: Trinh, Le Tuan Minh, et al.
Published: (2025)
On some Sobolev and Pólya-Szegö type inequalities with weights and applications
by: Giang, Trung Hieu, et al.
Published: (2024)
by: Giang, Trung Hieu, et al.
Published: (2024)
PAT: Pixel-wise Adaptive Training for Long-tailed Segmentation
by: Do, Khoi, et al.
Published: (2024)
by: Do, Khoi, et al.
Published: (2024)
Link prediction Graph Neural Networks for structure recognition of Handwritten Mathematical Expressions
by: Nguyen, Cuong Tuan, et al.
Published: (2025)
by: Nguyen, Cuong Tuan, et al.
Published: (2025)
Stable Messenger: Steganography for Message-Concealed Image Generation
by: Nguyen, Quang, et al.
Published: (2023)
by: Nguyen, Quang, et al.
Published: (2023)
On the maximum purity of absolutely separable bipartite states
by: Dung, Hoang Phi, et al.
Published: (2025)
by: Dung, Hoang Phi, et al.
Published: (2025)
Similar Items
-
Overthinking Causes Hallucination: Tracing Confounder Propagation in Vision Language Models
by: Shoby, Abin, et al.
Published: (2026) -
Localizing Before Answering: A Hallucination Evaluation Benchmark for Grounded Medical Multimodal LLMs
by: Nguyen, Dung, et al.
Published: (2025) -
Interactive Medical Image Analysis with Concept-based Similarity Reasoning
by: Huy, Ta Duc, et al.
Published: (2025) -
Enhanced Multimodal Video Retrieval System: Integrating Query Expansion and Cross-modal Temporal Event Retrieval
by: Vo, Van-Thinh, et al.
Published: (2025) -
Med-StepBench: A Hierarchical Reasoning Framework for Evaluating Hallucinations in Medical Vision-Language Models
by: Nguyen, Minh Khoi, et al.
Published: (2026)