Saved in:
| Main Authors: | Tokhchukov, Danil, Mirzoeva, Aysel, Kuznetsov, Andrey, Sobolev, Konstantin |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2603.24800 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
T-LoRA: Single Image Diffusion Model Customization Without Overfitting
by: Soboleva, Vera, et al.
Published: (2025)
by: Soboleva, Vera, et al.
Published: (2025)
DynoSLAM: Dynamic SLAM with Generative Graph Neural Networks for Real-World Social Navigation
by: Tokhchukov, Danil, et al.
Published: (2026)
by: Tokhchukov, Danil, et al.
Published: (2026)
Test-Time Reasoning Through Visual Human Preferences with VLMs and Soft Rewards
by: Gambashidze, Alexander, et al.
Published: (2025)
by: Gambashidze, Alexander, et al.
Published: (2025)
Listener-Rewarded Thinking in VLMs for Image Preferences
by: Gambashidze, Alexander, et al.
Published: (2025)
by: Gambashidze, Alexander, et al.
Published: (2025)
FastFace: Tuning Identity Preservation in Distilled Diffusion via Guidance and Attention
by: Karpukhin, Sergey, et al.
Published: (2025)
by: Karpukhin, Sergey, et al.
Published: (2025)
SHIFT: Steering Hidden Intermediates in Flow Transformers
by: Konovalova, Nina, et al.
Published: (2026)
by: Konovalova, Nina, et al.
Published: (2026)
Forecast then Calibrate: Feature Caching as ODE for Efficient Diffusion Transformers
by: Zheng, Shikang, et al.
Published: (2025)
by: Zheng, Shikang, et al.
Published: (2025)
A High-Accuracy Fast Hough Transform with Linear-Log-Cubed Computational Complexity for Arbitrary-Shaped Images
by: Kazimirov, Danil, et al.
Published: (2025)
by: Kazimirov, Danil, et al.
Published: (2025)
ImageReFL: Balancing Quality and Diversity in Human-Aligned Diffusion Models
by: Sorokin, Dmitrii, et al.
Published: (2025)
by: Sorokin, Dmitrii, et al.
Published: (2025)
MaterialFusion: High-Quality, Zero-Shot, and Controllable Material Transfer with Diffusion Models
by: Garifullin, Kamil, et al.
Published: (2025)
by: Garifullin, Kamil, et al.
Published: (2025)
One Small Step in Latent, One Giant Leap for Pixels: Fast Latent Upscale Adapter for Your Diffusion Models
by: Razin, Aleksandr, et al.
Published: (2025)
by: Razin, Aleksandr, et al.
Published: (2025)
Optimizing Active Learning in Vision-Language Models via Parameter-Efficient Uncertainty Calibration
by: Narayanan, Athmanarayanan Lakshmi, et al.
Published: (2025)
by: Narayanan, Athmanarayanan Lakshmi, et al.
Published: (2025)
Heeding the Inner Voice: Aligning ControlNet Training via Intermediate Features Feedback
by: Konovalova, Nina, et al.
Published: (2025)
by: Konovalova, Nina, et al.
Published: (2025)
Simple Vision-Language Math Reasoning via Rendered Text
by: Skripkin, Matvey, et al.
Published: (2025)
by: Skripkin, Matvey, et al.
Published: (2025)
Generalization of Brady-Yong Algorithm for Fast Hough Transform to Arbitrary Image Size
by: Kazimirov, Danil, et al.
Published: (2024)
by: Kazimirov, Danil, et al.
Published: (2024)
BREPS: Bounding-Box Robustness Evaluation of Promptable Segmentation
by: Moskalenko, Andrey, et al.
Published: (2026)
by: Moskalenko, Andrey, et al.
Published: (2026)
Scaling Diffusion Transformers to 16 Billion Parameters
by: Fei, Zhengcong, et al.
Published: (2024)
by: Fei, Zhengcong, et al.
Published: (2024)
Inverse-and-Edit: Effective and Fast Image Editing by Cycle Consistency Models
by: Beletskii, Ilia, et al.
Published: (2025)
by: Beletskii, Ilia, et al.
Published: (2025)
RNNs, CNNs and Transformers in Human Action Recognition: A Survey and a Hybrid Model
by: Alomar, Khaled, et al.
Published: (2024)
by: Alomar, Khaled, et al.
Published: (2024)
Real-World Transferable Adversarial Attack on Face-Recognition Systems
by: Kaznacheev, Andrey, et al.
Published: (2025)
by: Kaznacheev, Andrey, et al.
Published: (2025)
Designing Parameter and Compute Efficient Diffusion Transformers using Distillation
by: Sundaresha, Vignesh
Published: (2025)
by: Sundaresha, Vignesh
Published: (2025)
ExVideo: Extending Video Diffusion Models via Parameter-Efficient Post-Tuning
by: Duan, Zhongjie, et al.
Published: (2024)
by: Duan, Zhongjie, et al.
Published: (2024)
Parameter-Efficient Interventions for Enhanced Model Merging
by: Osial, Marcin, et al.
Published: (2024)
by: Osial, Marcin, et al.
Published: (2024)
Prior-Guided Residual Diffusion: Calibrated and Efficient Medical Image Segmentation
by: Mao, Fuyou, et al.
Published: (2025)
by: Mao, Fuyou, et al.
Published: (2025)
DiTAS: Quantizing Diffusion Transformers via Enhanced Activation Smoothing
by: Dong, Zhenyuan, et al.
Published: (2024)
by: Dong, Zhenyuan, et al.
Published: (2024)
Spread them Apart: Towards Robust Watermarking of Generated Content
by: Pautov, Mikhail, et al.
Published: (2025)
by: Pautov, Mikhail, et al.
Published: (2025)
An Efficient Framework for Enhancing Discriminative Models via Diffusion Techniques
by: Li, Chunxiao, et al.
Published: (2024)
by: Li, Chunxiao, et al.
Published: (2024)
Parameter-Efficient and Memory-Efficient Tuning for Vision Transformer: A Disentangled Approach
by: Zhang, Taolin, et al.
Published: (2024)
by: Zhang, Taolin, et al.
Published: (2024)
Mixed Text Recognition with Efficient Parameter Fine-Tuning and Transformer
by: Chang, Da, et al.
Published: (2024)
by: Chang, Da, et al.
Published: (2024)
IMG: Calibrating Diffusion Models via Implicit Multimodal Guidance
by: Guo, Jiayi, et al.
Published: (2025)
by: Guo, Jiayi, et al.
Published: (2025)
CST: Calibration Side-Tuning for Parameter and Memory Efficient Transfer Learning
by: Chen, Feng
Published: (2024)
by: Chen, Feng
Published: (2024)
CalFuse: Multi-Modal Continual Learning via Feature Calibration and Parameter Fusion
by: Guo, Juncen, et al.
Published: (2025)
by: Guo, Juncen, et al.
Published: (2025)
MaxInfo: A Training-Free Key-Frame Selection Method Using Maximum Volume for Enhanced Video Understanding
by: Li, Pengyi, et al.
Published: (2025)
by: Li, Pengyi, et al.
Published: (2025)
SOFI: Multi-Scale Deformable Transformer for Camera Calibration with Enhanced Line Queries
by: Janampa, Sebastian, et al.
Published: (2024)
by: Janampa, Sebastian, et al.
Published: (2024)
EfficientMorph: Parameter-Efficient Transformer-Based Architecture for 3D Image Registration
by: Aziz, Abu Zahid Bin, et al.
Published: (2024)
by: Aziz, Abu Zahid Bin, et al.
Published: (2024)
VDT-Auto: End-to-end Autonomous Driving with VLM-Guided Diffusion Transformers
by: Guo, Ziang, et al.
Published: (2025)
by: Guo, Ziang, et al.
Published: (2025)
Enhancing Parameter-Efficient Fine-Tuning of Vision Transformers through Frequency-Based Adaptation
by: Ly, Son Thai, et al.
Published: (2024)
by: Ly, Son Thai, et al.
Published: (2024)
PECTP: Parameter-Efficient Cross-Task Prompts for Incremental Vision Transformer
by: Feng, Qian, et al.
Published: (2024)
by: Feng, Qian, et al.
Published: (2024)
DiffuseKronA: A Parameter Efficient Fine-tuning Method for Personalized Diffusion Models
by: Marjit, Shyam, et al.
Published: (2024)
by: Marjit, Shyam, et al.
Published: (2024)
StyleInject: Parameter Efficient Tuning of Text-to-Image Diffusion Models
by: Zhou, Mohan, et al.
Published: (2024)
by: Zhou, Mohan, et al.
Published: (2024)
Similar Items
-
T-LoRA: Single Image Diffusion Model Customization Without Overfitting
by: Soboleva, Vera, et al.
Published: (2025) -
DynoSLAM: Dynamic SLAM with Generative Graph Neural Networks for Real-World Social Navigation
by: Tokhchukov, Danil, et al.
Published: (2026) -
Test-Time Reasoning Through Visual Human Preferences with VLMs and Soft Rewards
by: Gambashidze, Alexander, et al.
Published: (2025) -
Listener-Rewarded Thinking in VLMs for Image Preferences
by: Gambashidze, Alexander, et al.
Published: (2025) -
FastFace: Tuning Identity Preservation in Distilled Diffusion via Guidance and Attention
by: Karpukhin, Sergey, et al.
Published: (2025)