:: Library Catalog

Buchumschlag

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Wu, Yichen, Liu, Xu, Zhao, Chenxuan, Wu, Xinyu
Format:	Preprint
Veröffentlicht:	2025
Schlagworte:	Computer Vision and Pattern Recognition
Online-Zugang:	https://arxiv.org/abs/2509.18619
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Ähnliche Einträge

CARE: Training-Free Controllable Restoration for Medical Images via Dual-Latent Steering
von: Liu, Xu
Veröffentlicht: (2026)

SteerFlow: Steering Rectified Flows for Faithful Inversion-Based Image Editing
von: Dao, Thinh, et al.
Veröffentlicht: (2026)

ModalPrompt: Towards Efficient Multimodal Continual Instruction Tuning with Dual-Modality Guided Prompt
von: Zeng, Fanhu, et al.
Veröffentlicht: (2024)

DepthMOT: Depth Cues Lead to a Strong Multi-Object Tracker
von: Wu, Jiapeng, et al.
Veröffentlicht: (2024)

Forest Before Trees: Latent Superposition for Efficient Visual Reasoning
von: Wang, Yubo, et al.
Veröffentlicht: (2026)

Dual-Schedule Inversion: Training- and Tuning-Free Inversion for Real Image Editing
von: Huang, Jiancheng, et al.
Veröffentlicht: (2024)

Dual Diffusion Models for Multi-modal Guided 3D Avatar Generation
von: Li, Hong, et al.
Veröffentlicht: (2026)

ProEdit: Inversion-based Editing From Prompts Done Right
von: Ouyang, Zhi, et al.
Veröffentlicht: (2025)

Learning Multimodal Volumetric Features for Large-Scale Neuron Tracing
von: Chen, Qihua, et al.
Veröffentlicht: (2024)

EntroAD: Structural Entropy-Guided Prompt Adaptation for Zero-Shot Anomaly Detection
von: Zhao, Xinyu, et al.
Veröffentlicht: (2026)

Blind Inversion using Latent Diffusion Priors
von: Bai, Weimin, et al.
Veröffentlicht: (2024)

ArGue: Attribute-Guided Prompt Tuning for Vision-Language Models
von: Tian, Xinyu, et al.
Veröffentlicht: (2023)

No Calibration, No Depth, No Problem: Cross-Sensor View Synthesis with 3D Consistency
von: Wu, Cho-Ying, et al.
Veröffentlicht: (2026)

GenieDrive: Towards Physics-Aware Driving World Model with 4D Occupancy Guided Video Generation
von: Yang, Zhenya, et al.
Veröffentlicht: (2025)

Clue Matters: Leveraging Latent Visual Clues to Empower Video Reasoning
von: zhang, Kaixin, et al.
Veröffentlicht: (2026)

Referring Multiple Regions with Large Multimodal Models via Contextual Latent Steering
von: Xing, Yun, et al.
Veröffentlicht: (2026)

Latent Diffusion Inversion Requires Understanding the Latent Space
von: Rao, Mingxing, et al.
Veröffentlicht: (2025)

In-context Prompt Learning for Test-time Vision Recognition with Frozen Vision-language Model
von: Yin, Junhui, et al.
Veröffentlicht: (2024)

Solving Minimal Problems Without Matrix Inversion Using FFT-Based Interpolation
von: Wu, Haidong, et al.
Veröffentlicht: (2026)

Position-Guided Prompt Learning for Anomaly Detection in Chest X-Rays
von: Sun, Zhichao, et al.
Veröffentlicht: (2024)

MSF: Efficient Diffusion Model Via Multi-Scale Latent Factorize
von: Xu, Haohang, et al.
Veröffentlicht: (2025)

Visually Guided Decoding: Gradient-Free Hard Prompt Inversion with Language Models
von: Kim, Donghoon, et al.
Veröffentlicht: (2025)

Solving Inverse Problems with Latent Diffusion Models via Hard Data Consistency
von: Song, Bowen, et al.
Veröffentlicht: (2023)

Dual-Modality Anchor-Guided Filtering for Test-time Prompt Tuning
von: Choi, Jungwon, et al.
Veröffentlicht: (2026)

DCI: Dual-Conditional Inversion for Boosting Diffusion-Based Image Editing
von: Li, Zixiang, et al.
Veröffentlicht: (2025)

Counterfactual Visual Explanation via Causally-Guided Adversarial Steering
von: Qiao, Yiran, et al.
Veröffentlicht: (2025)

FilterPrompt: A Simple yet Efficient Approach to Guide Image Appearance Transfer in Diffusion Models
von: Wang, Xi, et al.
Veröffentlicht: (2024)

VLM-Guided Adaptive Negative Prompting for Creative Generation
von: Golan, Shelly, et al.
Veröffentlicht: (2025)

Segment Anyword: Mask Prompt Inversion for Open-Set Grounded Segmentation
von: Liu, Zhihua, et al.
Veröffentlicht: (2025)

DualEdit: Dual Editing for Knowledge Updating in Vision-Language Models
von: Shi, Zhiyi, et al.
Veröffentlicht: (2025)

Enhancing Image Aesthetics with Dual-Conditioned Diffusion Models Guided by Multimodal Perception
von: Nan, Xinyu, et al.
Veröffentlicht: (2026)

CLIP-SCGI: Synthesized Caption-Guided Inversion for Person Re-Identification
von: Han, Qianru, et al.
Veröffentlicht: (2024)

Suppressing Prior-Comparison Hallucinations in Radiology Report Generation via Semantically Decoupled Latent Steering
von: Li, Ao, et al.
Veröffentlicht: (2026)

SDPT: Synchronous Dual Prompt Tuning for Fusion-based Visual-Language Pre-trained Models
von: Zhou, Yang, et al.
Veröffentlicht: (2024)

Seeing It or Not? Interpretable Vision-aware Latent Steering to Mitigate Object Hallucinations
von: Chen, Boxu, et al.
Veröffentlicht: (2025)

InvSeg: Test-Time Prompt Inversion for Semantic Segmentation
von: Lin, Jiayi, et al.
Veröffentlicht: (2024)

StyleDiffusion: Prompt-Embedding Inversion for Text-Based Editing
von: Li, Senmao, et al.
Veröffentlicht: (2023)

Unpaired Multi-Domain Histopathology Virtual Staining using Dual Path Prompted Inversion
von: Xiong, Bing, et al.
Veröffentlicht: (2024)

InverseMeetInsert: Robust Real Image Editing via Geometric Accumulation Inversion in Guided Diffusion Models
von: Zheng, Yan, et al.
Veröffentlicht: (2024)

The Devil is in the Prompts: Retrieval-Augmented Prompt Optimization for Text-to-Video Generation
von: Gao, Bingjie, et al.
Veröffentlicht: (2025)