| Main Authors: | Park, Ji-Hoon; Ju, Yeong-Joon; Lee, Seong-Whan |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2402.10404 |
Similar Items
MIRe: Enhancing Multimodal Queries Representation via Fusion-Free Modality Interaction for Multimodal Retrieval
by: Ju, Yeong-Joon, et al.
Published: (2024)
DarSwin: Distortion Aware Radial Swin Transformer
by: Athwale, Akshaya, et al.
Published: (2023)
Semantic Depth Matters: Explaining Errors of Deep Vision Networks through Perceived Class Similarities
by: Filus, Katarzyna, et al.
Published: (2025)
Identity-free Artificial Emotional Intelligence via Micro-Gesture Understanding
by: Gao, Rong, et al.
Published: (2024)
Deepfakes on Demand: the rise of accessible non-consensual deepfake image generators
by: Hawkins, Will, et al.
Published: (2025)
Matrix-Valued LogSumExp Approximation for Colour Morphology
by: Kahra, Marvin, et al.
Published: (2024)
Colour Morphological Distance Ordering based on the Log-Exp-Supremum
by: Kahra, Marvin, et al.
Published: (2025)
Sequence Transferability and Task Order Selection in Continual Learning
by: Nguyen, Thinh, et al.
Published: (2025)
From Volume Rendering to 3D Gaussian Splatting: Theory and Applications
by: Matias, Vitor Pereira, et al.
Published: (2025)
What is the Visual Cognition Gap between Humans and Multimodal LLMs?
by: Cao, Xu, et al.
Published: (2024)
Animation Needs Attention: A Holistic Approach to Slides Animation Comprehension with Visual-Language Models
by: Jiang, Yifan, et al.
Published: (2025)
DQE-CIR: Distinctive Query Embeddings through Learnable Attribute Weights and Target Relative Negative Sampling in Composed Image Retrieval
by: Park, Geon, et al.
Published: (2026)
MedShapeNet -- A Large-Scale Dataset of 3D Medical Shapes for Computer Vision
by: Li, Jianning, et al.
Published: (2023)
A Hierarchically Feature Reconstructed Autoencoder for Unsupervised Anomaly Detection
by: Chen, Honghui, et al.
Published: (2024)
Semantic Prioritization in Visual Counterfactual Explanations with Weighted Segmentation and Auto-Adaptive Region Selection
by: Zhang, Lintong, et al.
Published: (2025)
Universal Adversarial Perturbations for Vision-Language Pre-trained Models
by: Zhang, Peng-Fei, et al.
Published: (2024)
AIDOVECL: AI-generated Dataset of Outpainted Vehicles for Eye-level Classification and Localization
by: Kazemi, Amir, et al.
Published: (2024)
Accelerating Large Kernel Convolutions with Nested Winograd Transformation
by: Jiang, Jingbo, et al.
Published: (2021)
Empowering Manufacturers with Privacy-Preserving AI Tools: A Case Study in Privacy-Preserving Machine Learning to Solve Real-World Problems
by: Ji, Xiaoyu, et al.
Published: (2025)
Pedestrian intention prediction in Adverse Weather Conditions with Spiking Neural Networks and Dynamic Vision Sensors
by: Sakhai, Mustafa, et al.
Published: (2024)
VACoDe: Visual Augmented Contrastive Decoding
by: Kim, Sihyeon, et al.
Published: (2024)
HAAP: Vision-context Hierarchical Attention Autoregressive with Adaptive Permutation for Scene Text Recognition
by: Chen, Honghui, et al.
Published: (2024)
Rethinking Evaluation of Multiple Sclerosis (MS) Lesion Segmentation Models
by: Basit, Abdul, et al.
Published: (2026)
A lifted Bregman strategy for training unfolded proximal neural network Gaussian denoisers
by: Wang, Xiaoyu, et al.
Published: (2024)
Socratic Planner: Self-QA-Based Zero-Shot Planning for Embodied Instruction Following
by: Shin, Suyeon, et al.
Published: (2024)
Upper-body free-breathing Magnetic Resonance Fingerprinting applied to the quantification of water T1 and fat fraction
by: Slioussarenko, Constantin, et al.
Published: (2024)
African Gender Classification Using Clothing Identification Via Deep Learning
by: Ozechi, Samuel
Published: (2025)
On Discrete Prompt Optimization for Diffusion Models
by: Wang, Ruochen, et al.
Published: (2024)
Representative Feature Extraction During Diffusion Process for Sketch Extraction with One Example
by: Yun, Kwan, et al.
Published: (2024)
Semantic2Graph: Graph-based Multi-modal Feature Fusion for Action Segmentation in Videos
by: Zhang, Junbin, et al.
Published: (2022)
Leveraging CNN and IoT for Effective E-Waste Management
by: Nadar, Ajesh Thangaraj, et al.
Published: (2025)
SASWISE-UE: Segmentation and Synthesis with Interpretable Scalable Ensembles for Uncertainty Estimation
by: Chen, Weijie, et al.
Published: (2024)
Reconstructing Gridded Data from Higher Autocorrelations
by: Casper, W. Riley, et al.
Published: (2025)
FST.ai 2.0: An Explainable AI Ecosystem for Fair, Fast, and Inclusive Decision-Making in Olympic and Paralympic Taekwondo
by: Shariatmadar, Keivan, et al.
Published: (2025)
Vision transformer-based multi-camera multi-object tracking framework for dairy cow monitoring
by: Abbas, Kumail, et al.
Published: (2025)
MCoT-RE: Multi-Faceted Chain-of-Thought and Re-Ranking for Training-Free Zero-Shot Composed Image Retrieval
by: Park, Jeong-Woo, et al.
Published: (2025)
Evaluation of (Un-)Supervised Machine Learning Methods for GNSS Interference Classification with Real-World Data Discrepancies
by: Heublein, Lucas, et al.
Published: (2025)
Representation Selection via Cross-Model Agreement using Canonical Correlation Analysis
by: Lewis, Dylan B., et al.
Published: (2026)
Surrealistic-like Image Generation with Vision-Language Models
by: Ayten, Elif, et al.
Published: (2024)
TIFu: Tri-directional Implicit Function for High-Fidelity 3D Character Reconstruction
by: Lim, Byoungsung, et al.
Published: (2024)