| Main Authors: | Storonkin, Daniil, Dziub, Ilia, Golyadkin, Maksim, Makarov, Ilya |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.07062 |
Similar Items
CerberusDet: Unified Multi-Dataset Object Detection
by: Tolstykh, Irina, et al.
Published: (2024)
Beyond Specialization: Assessing the Capabilities of MLLMs in Age and Gender Estimation
by: Kuprashevich, Maksim, et al.
Published: (2024)
A Review of Pseudo-Labeling for Computer Vision
by: Kage, Patrick, et al.
Published: (2024)
Synthetic Photography Detection: A Visual Guidance for Identifying Synthetic Images Created by AI
by: Mathys, Melanie, et al.
Published: (2024)
Designing UNICORN: a Unified Benchmark for Imaging in Computational Pathology, Radiology, and Natural Language
by: Stegeman, Michelle, et al.
Published: (2026)
Synthetic Image Generation in Cyber Influence Operations: An Emergent Threat?
by: Mathys, Melanie, et al.
Published: (2024)
Stereo Vision Based Robot for Remote Monitoring with VR Support
by: S., Mohamed Fazil M., et al.
Published: (2024)
COLORA: Efficient Fine-Tuning for Convolutional Models with a Study Case on Optical Coherence Tomography Image Classification
by: Rivera, Mariano, et al.
Published: (2025)
High-Entropy Tokens as Multimodal Failure Points in Vision-Language Models
by: He, Mengqi, et al.
Published: (2025)
Vision Transformer-based Model for Severity Quantification of Lung Pneumonia Using Chest X-ray Images
by: Slika, Bouthaina, et al.
Published: (2023)
A Survey on Dynamic Neural Networks: from Computer Vision to Multi-modal Sensor Fusion
by: Montello, Fabio, et al.
Published: (2025)
Unified Local and Global Attention Interaction Modeling for Vision Transformers
by: Nguyen, Tan, et al.
Published: (2024)
Efficient Diffusion Training through Parallelization with Truncated Karhunen-Loève Expansion
by: Ren, Yumeng, et al.
Published: (2025)
Proto-FG3D: Prototype-based Interpretable Fine-Grained 3D Shape Classification
by: Ma, Shuxian, et al.
Published: (2025)
Context-Aware Full Body Anonymization using Text-to-Image Diffusion Models
by: Zwick, Pascal, et al.
Published: (2024)
Foreground Focus: Enhancing Coherence and Fidelity in Camouflaged Image Generation
by: Chen, Pei-Chi, et al.
Published: (2025)
Event-based Solutions for Human-centered Applications: A Comprehensive Review
by: Adra, Mira, et al.
Published: (2025)
Collaborative Control for Geometry-Conditioned PBR Image Generation
by: Vainer, Shimon, et al.
Published: (2024)
BlanketGen2-Fit3D: Synthetic Blanket Augmentation Towards Improving Real-World In-Bed Blanket Occluded Human Pose Estimation
by: Karácsony, Tamás, et al.
Published: (2025)
Doodle Your Keypoints: Sketch-Based Few-Shot Keypoint Detection
by: Maity, Subhajit, et al.
Published: (2025)
Geometry and Perception Guided Gaussians for Multiview-consistent 3D Generation from a Single Image
by: Li, Pufan, et al.
Published: (2025)
StereoCrafter: Diffusion-based Generation of Long and High-fidelity Stereoscopic 3D from Monocular Videos
by: Zhao, Sijie, et al.
Published: (2024)
FLOWING: Implicit Neural Flows for Structure-Preserving Morphing
by: Bizzi, Arthur, et al.
Published: (2025)
Data-Augmented Multimodal Feature Fusion for Multiclass Visual Recognition of Oral Cancer Lesions
by: Naoum, Joy, et al.
Published: (2025)
PlacidDreamer: Advancing Harmony in Text-to-3D Generation
by: Huang, Shuo, et al.
Published: (2024)
Robust Self-calibration of Focal Lengths from the Fundamental Matrix
by: Kocur, Viktor, et al.
Published: (2023)
Textured-GS: Gaussian Splatting with Spatially Defined Color and Opacity
by: Huang, Zhentao, et al.
Published: (2024)
Robust automatic brain vessel segmentation in 3D CTA scans using dynamic 4D-CTA data
by: Ceballos-Arroyo, Alberto Mario, et al.
Published: (2026)
View-Consistent 3D Scene Editing via Dual-Path Structural Correspondense and Semantic Continuity
by: Li, Pufan, et al.
Published: (2026)
Goal-conditioned reinforcement learning for ultrasound navigation guidance
by: Amadou, Abdoul Aziz, et al.
Published: (2024)
Autoregressive Omni-Aware Outpainting for Open-Vocabulary 360-Degree Image Generation
by: Lu, Zhuqiang, et al.
Published: (2023)
Ada-adapter:Fast Few-shot Style Personlization of Diffusion Model with Pre-trained Image Encoder
by: Liu, Jia, et al.
Published: (2024)
Multi-Objective Optimization for Synthetic-to-Real Style Transfer
by: Chigot, Estelle, et al.
Published: (2026)
Diffusion Lens: Interpreting Text Encoders in Text-to-Image Pipelines
by: Toker, Michael, et al.
Published: (2024)
Context-Enriched Contrastive Loss: Enhancing Presentation of Inherent Sample Connections in Contrastive Learning Framework
by: Deng, Haojin, et al.
Published: (2025)
Enhancing Explainable AI: A Hybrid Approach Combining GradCAM and LRP for CNN Interpretability
by: Dhore, Vaibhav, et al.
Published: (2024)
Inference-Time Scaling for Visual AutoRegressive modeling by Searching Representative Samples
by: Tang, Weidong, et al.
Published: (2026)
Composite Data Augmentations for Synthetic Image Detection Against Real-World Perturbations
by: Amarantidou, Efthymia, et al.
Published: (2025)
RipVIS: Rip Currents Video Instance Segmentation Benchmark for Beach Monitoring and Safety
by: Dumitriu, Andrei, et al.
Published: (2025)
Label Delay in Online Continual Learning
by: Csaba, Botos, et al.
Published: (2023)