Saved in:
| Main Authors: | Zhang, Xi, Meng, Zaiqiao, Lever, Jake, Ho, Edmond S. L. |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2411.19378 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
CCD: Mitigating Hallucinations in Radiology MLLMs via Clinical Contrastive Decoding
by: Zhang, Xi, et al.
Published: (2025)
by: Zhang, Xi, et al.
Published: (2025)
Towards Accurate and Efficient Waste Image Classification: A Hybrid Deep Learning and Machine Learning Approach
by: Nguyen, Ngoc-Bao-Quang, et al.
Published: (2025)
by: Nguyen, Ngoc-Bao-Quang, et al.
Published: (2025)
UTAL-GNN: Unsupervised Temporal Action Localization using Graph Neural Networks
by: Badatya, Bikash Kumar, et al.
Published: (2025)
by: Badatya, Bikash Kumar, et al.
Published: (2025)
DeltaVLM: Interactive Remote Sensing Image Change Analysis via Instruction-guided Difference Perception
by: Deng, Pei, et al.
Published: (2025)
by: Deng, Pei, et al.
Published: (2025)
FAME: Feature Activation Map Explanation on Image Classification and Face Recognition
by: Zhang, Xinyi, et al.
Published: (2026)
by: Zhang, Xinyi, et al.
Published: (2026)
Dynamic Arthroscopic Navigation System for Anterior Cruciate Ligament Reconstruction Based on Multi-level Memory Architecture
by: Wang, Shuo, et al.
Published: (2025)
by: Wang, Shuo, et al.
Published: (2025)
Quantized Vision-Language Models for Damage Assessment: A Comparative Study of LLaVA-1.5-7B Quantization Levels
by: Yasuno, Takato
Published: (2026)
by: Yasuno, Takato
Published: (2026)
Image-Based Leopard Seal Recognition: Approaches and Challenges in Current Automated Systems
by: Salazar, Jorge Yero, et al.
Published: (2024)
by: Salazar, Jorge Yero, et al.
Published: (2024)
Product Review Based on Optimized Facial Expression Detection
by: Chaugule, Vikrant, et al.
Published: (2026)
by: Chaugule, Vikrant, et al.
Published: (2026)
Learning Association via Track-Detection Matching for Multi-Object Tracking
by: Adžemović, Momir
Published: (2025)
by: Adžemović, Momir
Published: (2025)
The Influence of Iconicity in Transfer Learning for Sign Language Recognition
by: Artiaga, Keren, et al.
Published: (2026)
by: Artiaga, Keren, et al.
Published: (2026)
RAM-H1200: A Unified Evaluation and Dataset on Hand Radiographs for Rheumatoid Arthritis
by: Yang, Songxiao, et al.
Published: (2026)
by: Yang, Songxiao, et al.
Published: (2026)
OmniFall: From Staged Through Synthetic to Wild, A Unified Multi-Domain Dataset for Robust Fall Detection
by: Schneider, David, et al.
Published: (2025)
by: Schneider, David, et al.
Published: (2025)
Learning from Semantic Dictionaries: Discriminative Codebook Contrastive Learning for Unified Visual Representation and Generation
by: Estepa, Imanol G., et al.
Published: (2026)
by: Estepa, Imanol G., et al.
Published: (2026)
All4One: Symbiotic Neighbour Contrastive Learning via Self-Attention and Redundancy Reduction
by: Estepa, Imanol G., et al.
Published: (2023)
by: Estepa, Imanol G., et al.
Published: (2023)
Taming the Tail: Leveraging Asymmetric Loss and Pade Approximation to Overcome Medical Image Long-Tailed Class Imbalance
by: Kashyap, Pankhi, et al.
Published: (2024)
by: Kashyap, Pankhi, et al.
Published: (2024)
Motion-Guided Semantic Alignment with Negative Prompts for Zero-Shot Video Action Recognition
by: Wang, Yiming, et al.
Published: (2026)
by: Wang, Yiming, et al.
Published: (2026)
Mistake Attribution: Fine-Grained Mistake Understanding in Egocentric Videos
by: Li, Yayuan, et al.
Published: (2025)
by: Li, Yayuan, et al.
Published: (2025)
HSDA: High-frequency Shuffle Data Augmentation for Bird's-Eye-View Map Segmentation
by: Glisson, Calvin, et al.
Published: (2024)
by: Glisson, Calvin, et al.
Published: (2024)
SelvaBox: A high-resolution dataset for tropical tree crown detection
by: Baudchon, Hugo, et al.
Published: (2025)
by: Baudchon, Hugo, et al.
Published: (2025)
Reference Dataset and Benchmark for Reconstructing Laser Parameters from On-axis Video in Powder Bed Fusion of Bulk Stainless Steel
by: Blanc, Cyril, et al.
Published: (2024)
by: Blanc, Cyril, et al.
Published: (2024)
Semi-Supervised Segmentation via Embedding Matching
by: Xie, Weiyi, et al.
Published: (2024)
by: Xie, Weiyi, et al.
Published: (2024)
Dense Motion Captioning
by: Xu, Shiyao, et al.
Published: (2025)
by: Xu, Shiyao, et al.
Published: (2025)
A Novel Dataset for Flood Detection Robust to Seasonal Changes in Satellite Imagery
by: Jang, Youngsun, et al.
Published: (2025)
by: Jang, Youngsun, et al.
Published: (2025)
TD3Net: A temporal densely connected multi-dilated convolutional network for lipreading
by: Lee, Byung Hoon, et al.
Published: (2025)
by: Lee, Byung Hoon, et al.
Published: (2025)
CoMatcher: Multi-View Collaborative Feature Matching
by: Zhang, Jintao, et al.
Published: (2025)
by: Zhang, Jintao, et al.
Published: (2025)
MoDE: Mixture of Diffusion Experts for Any Occluded Face Recognition
by: Fan, Qiannan, et al.
Published: (2025)
by: Fan, Qiannan, et al.
Published: (2025)
NeuroGaze-Distill: Brain-informed Distillation and Depression-Inspired Geometric Priors for Robust Facial Emotion Recognition
by: Li, Zilin, et al.
Published: (2025)
by: Li, Zilin, et al.
Published: (2025)
Prompt Sensitivity in Vision-Language Grounding: How Small Changes in Wording Affect Object Detection
by: Deka, Dawar Jyoti, et al.
Published: (2026)
by: Deka, Dawar Jyoti, et al.
Published: (2026)
Leveraging GNSS and Onboard Visual Data from Consumer Vehicles for Robust Road Network Estimation
by: Opra, Balázs, et al.
Published: (2024)
by: Opra, Balázs, et al.
Published: (2024)
Sparse Concept Bottleneck Models: Gumbel Tricks in Contrastive Learning
by: Semenov, Andrei, et al.
Published: (2024)
by: Semenov, Andrei, et al.
Published: (2024)
FeedbackSTS-Det: Sparse Frames-Based Spatio-Temporal Semantic Feedback Network for Moving Infrared Small Target Detection
by: Huang, Yian, et al.
Published: (2026)
by: Huang, Yian, et al.
Published: (2026)
From Latent to Engine Manifolds: Analyzing ImageBind's Multimodal Embedding Space
by: Hamara, Andrew, et al.
Published: (2024)
by: Hamara, Andrew, et al.
Published: (2024)
Capacity Constraint Analysis Using Object Detection for Smart Manufacturing
by: Ahmad, Hafiz Mughees, et al.
Published: (2024)
by: Ahmad, Hafiz Mughees, et al.
Published: (2024)
CG-HOI: Contact-Guided 3D Human-Object Interaction Generation
by: Diller, Christian, et al.
Published: (2023)
by: Diller, Christian, et al.
Published: (2023)
Pan-Arctic Permafrost Landform and Human-built Infrastructure Feature Detection with Vision Transformers and Location Embeddings
by: Perera, Amal S., et al.
Published: (2025)
by: Perera, Amal S., et al.
Published: (2025)
Canonical Space Representation for 4D Panoptic Segmentation of Articulated Objects
by: Gomes, Manuel, et al.
Published: (2025)
by: Gomes, Manuel, et al.
Published: (2025)
Conjuring Positive Pairs for Efficient Unification of Representation Learning and Image Synthesis
by: Estepa, Imanol G., et al.
Published: (2025)
by: Estepa, Imanol G., et al.
Published: (2025)
WaveMix: A Resource-efficient Neural Network for Image Analysis
by: Jeevan, Pranav, et al.
Published: (2022)
by: Jeevan, Pranav, et al.
Published: (2022)
Artificial Intelligence can Recognize Whether a Job Applicant is Selling and/or Lying According to Facial Expressions and Head Movements Much More Correctly Than Human Interviewers
by: Suen, Hung-Yue, et al.
Published: (2026)
by: Suen, Hung-Yue, et al.
Published: (2026)
Similar Items
-
CCD: Mitigating Hallucinations in Radiology MLLMs via Clinical Contrastive Decoding
by: Zhang, Xi, et al.
Published: (2025) -
Towards Accurate and Efficient Waste Image Classification: A Hybrid Deep Learning and Machine Learning Approach
by: Nguyen, Ngoc-Bao-Quang, et al.
Published: (2025) -
UTAL-GNN: Unsupervised Temporal Action Localization using Graph Neural Networks
by: Badatya, Bikash Kumar, et al.
Published: (2025) -
DeltaVLM: Interactive Remote Sensing Image Change Analysis via Instruction-guided Difference Perception
by: Deng, Pei, et al.
Published: (2025) -
FAME: Feature Activation Map Explanation on Image Classification and Face Recognition
by: Zhang, Xinyi, et al.
Published: (2026)