:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Park, Ji-Hoon, Ju, Yeong-Joon, Lee, Seong-Whan
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition Artificial Intelligence 68T01
Online Access:	https://arxiv.org/abs/2402.10404
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

MIRe: Enhancing Multimodal Queries Representation via Fusion-Free Modality Interaction for Multimodal Retrieval
by: Ju, Yeong-Joon, et al.
Published: (2024)

DarSwin: Distortion Aware Radial Swin Transformer
by: Athwale, Akshaya, et al.
Published: (2023)

Semantic Depth Matters: Explaining Errors of Deep Vision Networks through Perceived Class Similarities
by: Filus, Katarzyna, et al.
Published: (2025)

Identity-free Artificial Emotional Intelligence via Micro-Gesture Understanding
by: Gao, Rong, et al.
Published: (2024)

Deepfakes on Demand: the rise of accessible non-consensual deepfake image generators
by: Hawkins, Will, et al.
Published: (2025)

Matrix-Valued LogSumExp Approximation for Colour Morphology
by: Kahra, Marvin, et al.
Published: (2024)

Colour Morphological Distance Ordering based on the Log-Exp-Supremum
by: Kahra, Marvin, et al.
Published: (2025)

Sequence Transferability and Task Order Selection in Continual Learning
by: Nguyen, Thinh, et al.
Published: (2025)

From Volume Rendering to 3D Gaussian Splatting: Theory and Applications
by: Matias, Vitor Pereira, et al.
Published: (2025)

What is the Visual Cognition Gap between Humans and Multimodal LLMs?
by: Cao, Xu, et al.
Published: (2024)

Animation Needs Attention: A Holistic Approach to Slides Animation Comprehension with Visual-Language Models
by: Jiang, Yifan, et al.
Published: (2025)

DQE-CIR: Distinctive Query Embeddings through Learnable Attribute Weights and Target Relative Negative Sampling in Composed Image Retrieval
by: Park, Geon, et al.
Published: (2026)

MedShapeNet -- A Large-Scale Dataset of 3D Medical Shapes for Computer Vision
by: Li, Jianning, et al.
Published: (2023)

A Hierarchically Feature Reconstructed Autoencoder for Unsupervised Anomaly Detection
by: Chen, Honghui, et al.
Published: (2024)

Semantic Prioritization in Visual Counterfactual Explanations with Weighted Segmentation and Auto-Adaptive Region Selection
by: Zhang, Lintong, et al.
Published: (2025)

Universal Adversarial Perturbations for Vision-Language Pre-trained Models
by: Zhang, Peng-Fei, et al.
Published: (2024)

AIDOVECL: AI-generated Dataset of Outpainted Vehicles for Eye-level Classification and Localization
by: Kazemi, Amir, et al.
Published: (2024)

Accelerating Large Kernel Convolutions with Nested Winograd Transformation.pdf
by: Jiang, Jingbo, et al.
Published: (2021)

Empowering Manufacturers with Privacy-Preserving AI Tools: A Case Study in Privacy-Preserving Machine Learning to Solve Real-World Problems
by: Ji, Xiaoyu, et al.
Published: (2025)

Pedestrian intention prediction in Adverse Weather Conditions with Spiking Neural Networks and Dynamic Vision Sensors
by: Sakhai, Mustafa, et al.
Published: (2024)

VACoDe: Visual Augmented Contrastive Decoding
by: Kim, Sihyeon, et al.
Published: (2024)

HAAP: Vision-context Hierarchical Attention Autoregressive with Adaptive Permutation for Scene Text Recognition
by: Chen, Honghui, et al.
Published: (2024)

Rethinking Evaluation of Multiple Sclerosis (MS) Lesion Segmentation Models
by: Basit, Abdul, et al.
Published: (2026)

A lifted Bregman strategy for training unfolded proximal neural network Gaussian denoisers
by: Wang, Xiaoyu, et al.
Published: (2024)

Socratic Planner: Self-QA-Based Zero-Shot Planning for Embodied Instruction Following
by: Shin, Suyeon, et al.
Published: (2024)

Upper-body free-breathing Magnetic Resonance Fingerprinting applied to the quantification of water T1 and fat fraction
by: Slioussarenko, Constantin, et al.
Published: (2024)

African Gender Classification Using Clothing Identification Via Deep Learning
by: Ozechi, Samuel
Published: (2025)

On Discrete Prompt Optimization for Diffusion Models
by: Wang, Ruochen, et al.
Published: (2024)

Representative Feature Extraction During Diffusion Process for Sketch Extraction with One Example
by: Yun, Kwan, et al.
Published: (2024)

Semantic2Graph: Graph-based Multi-modal Feature Fusion for Action Segmentation in Videos
by: Zhang, Junbin, et al.
Published: (2022)

Leveraging CNN and IoT for Effective E-Waste Management
by: Nadar, Ajesh Thangaraj, et al.
Published: (2025)

SASWISE-UE: Segmentation and Synthesis with Interpretable Scalable Ensembles for Uncertainty Estimation
by: Chen, Weijie, et al.
Published: (2024)

Reconstructing Gridded Data from Higher Autocorrelations
by: Casper, W. Riley, et al.
Published: (2025)

FST.ai 2.0: An Explainable AI Ecosystem for Fair, Fast, and Inclusive Decision-Making in Olympic and Paralympic Taekwondo
by: Shariatmadar, Keivan, et al.
Published: (2025)

Vision transformer-based multi-camera multi-object tracking framework for dairy cow monitoring
by: Abbas, Kumail, et al.
Published: (2025)

MCoT-RE: Multi-Faceted Chain-of-Thought and Re-Ranking for Training-Free Zero-Shot Composed Image Retrieval
by: Park, Jeong-Woo, et al.
Published: (2025)

Evaluation of (Un-)Supervised Machine Learning Methods for GNSS Interference Classification with Real-World Data Discrepancies
by: Heublein, Lucas, et al.
Published: (2025)

Representation Selection via Cross-Model Agreement using Canonical Correlation Analysis
by: Lewis, Dylan B., et al.
Published: (2026)

Surrealistic-like Image Generation with Vision-Language Models
by: Ayten, Elif, et al.
Published: (2024)

TIFu: Tri-directional Implicit Function for High-Fidelity 3D Character Reconstruction
by: Lim, Byoungsung, et al.
Published: (2024)