:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Storonkin, Daniil, Dziub, Ilia, Golyadkin, Maksim, Makarov, Ilya
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition I.4.0
Online Access:	https://arxiv.org/abs/2602.07062
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

CerberusDet: Unified Multi-Dataset Object Detection
by: Tolstykh, Irina, et al.
Published: (2024)

Beyond Specialization: Assessing the Capabilities of MLLMs in Age and Gender Estimation
by: Kuprashevich, Maksim, et al.
Published: (2024)

A Review of Pseudo-Labeling for Computer Vision
by: Kage, Patrick, et al.
Published: (2024)

Synthetic Photography Detection: A Visual Guidance for Identifying Synthetic Images Created by AI
by: Mathys, Melanie, et al.
Published: (2024)

Designing UNICORN: a Unified Benchmark for Imaging in Computational Pathology, Radiology, and Natural Language
by: Stegeman, Michelle, et al.
Published: (2026)

Synthetic Image Generation in Cyber Influence Operations: An Emergent Threat?
by: Mathys, Melanie, et al.
Published: (2024)

Stereo Vision Based Robot for Remote Monitoring with VR Support
by: S., Mohamed Fazil M., et al.
Published: (2024)

COLORA: Efficient Fine-Tuning for Convolutional Models with a Study Case on Optical Coherence Tomography Image Classification
by: Rivera, Mariano, et al.
Published: (2025)

High-Entropy Tokens as Multimodal Failure Points in Vision-Language Models
by: He, Mengqi, et al.
Published: (2025)

Vision Transformer-based Model for Severity Quantification of Lung Pneumonia Using Chest X-ray Images
by: Slika, Bouthaina, et al.
Published: (2023)

A Survey on Dynamic Neural Networks: from Computer Vision to Multi-modal Sensor Fusion
by: Montello, Fabio, et al.
Published: (2025)

Unified Local and Global Attention Interaction Modeling for Vision Transformers
by: Nguyen, Tan, et al.
Published: (2024)

Efficient Diffusion Training through Parallelization with Truncated Karhunen-Loève Expansion
by: Ren, Yumeng, et al.
Published: (2025)

Proto-FG3D: Prototype-based Interpretable Fine-Grained 3D Shape Classification
by: Ma, Shuxian, et al.
Published: (2025)

Context-Aware Full Body Anonymization using Text-to-Image Diffusion Models
by: Zwick, Pascal, et al.
Published: (2024)

Foreground Focus: Enhancing Coherence and Fidelity in Camouflaged Image Generation
by: Chen, Pei-Chi, et al.
Published: (2025)

Event-based Solutions for Human-centered Applications: A Comprehensive Review
by: Adra, Mira, et al.
Published: (2025)

Collaborative Control for Geometry-Conditioned PBR Image Generation
by: Vainer, Shimon, et al.
Published: (2024)

BlanketGen2-Fit3D: Synthetic Blanket Augmentation Towards Improving Real-World In-Bed Blanket Occluded Human Pose Estimation
by: Karácsony, Tamás, et al.
Published: (2025)

Doodle Your Keypoints: Sketch-Based Few-Shot Keypoint Detection
by: Maity, Subhajit, et al.
Published: (2025)

Geometry and Perception Guided Gaussians for Multiview-consistent 3D Generation from a Single Image
by: Li, Pufan, et al.
Published: (2025)

StereoCrafter: Diffusion-based Generation of Long and High-fidelity Stereoscopic 3D from Monocular Videos
by: Zhao, Sijie, et al.
Published: (2024)

FLOWING: Implicit Neural Flows for Structure-Preserving Morphing
by: Bizzi, Arthur, et al.
Published: (2025)

Data-Augmented Multimodal Feature Fusion for Multiclass Visual Recognition of Oral Cancer Lesions
by: Naoum, Joy, et al.
Published: (2025)

PlacidDreamer: Advancing Harmony in Text-to-3D Generation
by: Huang, Shuo, et al.
Published: (2024)

Robust Self-calibration of Focal Lengths from the Fundamental Matrix
by: Kocur, Viktor, et al.
Published: (2023)

Textured-GS: Gaussian Splatting with Spatially Defined Color and Opacity
by: Huang, Zhentao, et al.
Published: (2024)

Robust automatic brain vessel segmentation in 3D CTA scans using dynamic 4D-CTA data
by: Ceballos-Arroyo, Alberto Mario, et al.
Published: (2026)

View-Consistent 3D Scene Editing via Dual-Path Structural Correspondense and Semantic Continuity
by: Li, Pufan, et al.
Published: (2026)

Goal-conditioned reinforcement learning for ultrasound navigation guidance
by: Amadou, Abdoul Aziz, et al.
Published: (2024)

Autoregressive Omni-Aware Outpainting for Open-Vocabulary 360-Degree Image Generation
by: Lu, Zhuqiang, et al.
Published: (2023)

Ada-adapter:Fast Few-shot Style Personlization of Diffusion Model with Pre-trained Image Encoder
by: Liu, Jia, et al.
Published: (2024)

Multi-Objective Optimization for Synthetic-to-Real Style Transfer
by: Chigot, Estelle, et al.
Published: (2026)

Diffusion Lens: Interpreting Text Encoders in Text-to-Image Pipelines
by: Toker, Michael, et al.
Published: (2024)

Context-Enriched Contrastive Loss: Enhancing Presentation of Inherent Sample Connections in Contrastive Learning Framework
by: Deng, Haojin, et al.
Published: (2025)

Enhancing Explainable AI: A Hybrid Approach Combining GradCAM and LRP for CNN Interpretability
by: Dhore, Vaibhav, et al.
Published: (2024)

Inference-Time Scaling for Visual AutoRegressive modeling by Searching Representative Samples
by: Tang, Weidong, et al.
Published: (2026)

Composite Data Augmentations for Synthetic Image Detection Against Real-World Perturbations
by: Amarantidou, Efthymia, et al.
Published: (2025)

RipVIS: Rip Currents Video Instance Segmentation Benchmark for Beach Monitoring and Safety
by: Dumitriu, Andrei, et al.
Published: (2025)

Label Delay in Online Continual Learning
by: Csaba, Botos, et al.
Published: (2023)