:: Library Catalog

Bibliographic Details
Main Authors:	Hu, Luyin; Gholami, Soheil; Dindelegan, George; Meling, Torstein R.; Billard, Aude
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2508.18836

Similar Items

Streamlining Image Editing with Layered Diffusion Brushes
by: Gholami, Peyman, et al.
Published: (2024)

Pixel-level Quality Assessment for Oriented Object Detection
by: Zhu, Yunhui, et al.
Published: (2025)

Monocular Markerless Motion Capture Enables Quantitative Assessment of Upper Extremity Reachable Workspace
by: Donahue, Seth, et al.
Published: (2026)

Microsurgical Instrument Segmentation for Robot-Assisted Surgery
by: Jeong, Tae Kyeong, et al.
Published: (2025)

Deep Tiny Network for Recognition-Oriented Face Image Quality Assessment
by: Peng, Baoyun, et al.
Published: (2021)

Distilling Knowledge from Text-to-Image Generative Models Improves Visio-Linguistic Reasoning in CLIP
by: Basu, Samyadeep, et al.
Published: (2023)

CASP: Compression of Large Multimodal Models Based on Attention Sparsity
by: Gholami, Mohsen, et al.
Published: (2025)

Weather-Aware Object Detection Transformer for Domain Adaptation
by: Gharatappeh, Soheil, et al.
Published: (2025)

Prompt-based Ingredient-Oriented All-in-One Image Restoration
by: Gao, Hu, et al.
Published: (2023)

AI-Driven Three-Dimensional Reconstruction and Quantitative Analysis for Burn Injury Assessment
by: Kalaycioglu, S., et al.
Published: (2026)

Automated Assessment of Aesthetic Outcomes in Facial Plastic Surgery
by: Varghaei, Pegah, et al.
Published: (2025)

SVGEditBench: A Benchmark Dataset for Quantitative Assessment of LLM's SVG Editing Capabilities
by: Nishina, Kunato, et al.
Published: (2024)

Vector-Quantized Soft Label Compression for Dataset Distillation
by: Abbasi, Ali, et al.
Published: (2026)

PRIME: Prioritizing Interpretability in Failure Mode Extraction
by: Rezaei, Keivan, et al.
Published: (2023)

BEAR: A Video Dataset For Fine-grained Behaviors Recognition Oriented with Action and Environment Factors
by: Hu, Chengyang, et al.
Published: (2025)

Improving Autoregressive Visual Generation with Cluster-Oriented Token Prediction
by: Hu, Teng, et al.
Published: (2025)

UniICL: Systematizing Unified Multimodal In-context Learning through a Capability-Oriented Taxonomy
by: Xu, Yicheng, et al.
Published: (2026)

AgentComp: From Agentic Reasoning to Compositional Mastery in Text-to-Image Models
by: Zarei, Arman, et al.
Published: (2025)

Emotion-Director: Bridging Affective Shortcut in Emotion-Oriented Image Generation
by: Jia, Guoli, et al.
Published: (2025)

Temporal-Oriented Recipe for Transferring Large Vision-Language Model to Video Understanding
by: Nguyen, Thong, et al.
Published: (2025)

Illumination Histogram Consistency Metric for Quantitative Assessment of Video Sequences
by: Chen, Long, et al.
Published: (2024)

Gamma: Toward Generic Image Assessment with Mixture of Assessment Experts
by: Zhou, Hantao, et al.
Published: (2025)

RiO-DETR: DETR for Real-time Oriented Object Detection
by: Hu, Zhangchi, et al.
Published: (2026)

Zero-shot Prompt-based Video Encoder for Surgical Gesture Recognition
by: Rao, Mingxing, et al.
Published: (2024)

OrientedFormer: An End-to-End Transformer-Based Oriented Object Detector in Remote Sensing Images
by: Zhao, Jiaqi, et al.
Published: (2024)

Target-Oriented Object Grasping via Multimodal Human Guidance
by: Xie, Pengwei, et al.
Published: (2024)

NeuroNURBS: Learning Efficient Surface Representations for 3D Solids
by: Fan, Jiajie, et al.
Published: (2024)

Anatomy-Aware Unsupervised Detection and Localization of Retinal Abnormalities in Optical Coherence Tomography
by: Haghighi, Tania, et al.
Published: (2026)

Spatial Reasoning with Vision-Language Models in Ego-Centric Multi-View Scenes
by: Gholami, Mohsen, et al.
Published: (2025)

GauCho: Gaussian Distributions with Cholesky Decomposition for Oriented Object Detection
by: Murrugarra-LLerena, Jeffri, et al.
Published: (2025)

SceneDecorator: Towards Scene-Oriented Story Generation with Scene Planning and Scene Consistency
by: Song, Quanjian, et al.
Published: (2025)

Center-Oriented Prototype Contrastive Clustering
by: Dong, Shihao, et al.
Published: (2025)

ROSE: Retrieval-Oriented Segmentation Enhancement
by: Tang, Song, et al.
Published: (2026)

SOVC: Subject-Oriented Video Captioning
by: Teng, Chang, et al.
Published: (2023)

Efficient Transferable Optimal Transport via Min-Sliced Transport Plans
by: Liu, Xinran, et al.
Published: (2025)

SliderEdit: Continuous Image Editing with Fine-Grained Instruction Control
by: Zarei, Arman, et al.
Published: (2025)

Localizing Knowledge in Diffusion Transformers
by: Zarei, Arman, et al.
Published: (2025)

Understanding Information Storage and Transfer in Multi-modal Large Language Models
by: Basu, Samyadeep, et al.
Published: (2024)

Dome-DETR: DETR with Density-Oriented Feature-Query Manipulation for Efficient Tiny Object Detection
by: Hu, Zhangchi, et al.
Published: (2025)

ComDrive: Comfort-Oriented End-to-End Autonomous Driving
by: Wang, Junming, et al.
Published: (2024)