| Main Authors: | Hu, Luyin, Gholami, Soheil, Dindelegan, George, Meling, Torstein R., Billard, Aude |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2508.18836 |
Similar Items
Streamlining Image Editing with Layered Diffusion Brushes
by: Gholami, Peyman, et al.
Published: (2024)
Pixel-level Quality Assessment for Oriented Object Detection
by: Zhu, Yunhui, et al.
Published: (2025)
Monocular Markerless Motion Capture Enables Quantitative Assessment of Upper Extremity Reachable Workspace
by: Donahue, Seth, et al.
Published: (2026)
Microsurgical Instrument Segmentation for Robot-Assisted Surgery
by: Jeong, Tae Kyeong, et al.
Published: (2025)
Deep Tiny Network for Recognition-Oriented Face Image Quality Assessment
by: Peng, Baoyun, et al.
Published: (2021)
Distilling Knowledge from Text-to-Image Generative Models Improves Visio-Linguistic Reasoning in CLIP
by: Basu, Samyadeep, et al.
Published: (2023)
CASP: Compression of Large Multimodal Models Based on Attention Sparsity
by: Gholami, Mohsen, et al.
Published: (2025)
Weather-Aware Object Detection Transformer for Domain Adaptation
by: Gharatappeh, Soheil, et al.
Published: (2025)
Prompt-based Ingredient-Oriented All-in-One Image Restoration
by: Gao, Hu, et al.
Published: (2023)
AI-Driven Three-Dimensional Reconstruction and Quantitative Analysis for Burn Injury Assessment
by: Kalaycioglu, S., et al.
Published: (2026)
Automated Assessment of Aesthetic Outcomes in Facial Plastic Surgery
by: Varghaei, Pegah, et al.
Published: (2025)
SVGEditBench: A Benchmark Dataset for Quantitative Assessment of LLM's SVG Editing Capabilities
by: Nishina, Kunato, et al.
Published: (2024)
Vector-Quantized Soft Label Compression for Dataset Distillation
by: Abbasi, Ali, et al.
Published: (2026)
PRIME: Prioritizing Interpretability in Failure Mode Extraction
by: Rezaei, Keivan, et al.
Published: (2023)
BEAR: A Video Dataset For Fine-grained Behaviors Recognition Oriented with Action and Environment Factors
by: Hu, Chengyang, et al.
Published: (2025)
Improving Autoregressive Visual Generation with Cluster-Oriented Token Prediction
by: Hu, Teng, et al.
Published: (2025)
UniICL: Systematizing Unified Multimodal In-context Learning through a Capability-Oriented Taxonomy
by: Xu, Yicheng, et al.
Published: (2026)
AgentComp: From Agentic Reasoning to Compositional Mastery in Text-to-Image Models
by: Zarei, Arman, et al.
Published: (2025)
Emotion-Director: Bridging Affective Shortcut in Emotion-Oriented Image Generation
by: Jia, Guoli, et al.
Published: (2025)
Temporal-Oriented Recipe for Transferring Large Vision-Language Model to Video Understanding
by: Nguyen, Thong, et al.
Published: (2025)
Illumination Histogram Consistency Metric for Quantitative Assessment of Video Sequences
by: Chen, Long, et al.
Published: (2024)
Gamma: Toward Generic Image Assessment with Mixture of Assessment Experts
by: Zhou, Hantao, et al.
Published: (2025)
RiO-DETR: DETR for Real-time Oriented Object Detection
by: Hu, Zhangchi, et al.
Published: (2026)
Zero-shot Prompt-based Video Encoder for Surgical Gesture Recognition
by: Rao, Mingxing, et al.
Published: (2024)
OrientedFormer: An End-to-End Transformer-Based Oriented Object Detector in Remote Sensing Images
by: Zhao, Jiaqi, et al.
Published: (2024)
Target-Oriented Object Grasping via Multimodal Human Guidance
by: Xie, Pengwei, et al.
Published: (2024)
NeuroNURBS: Learning Efficient Surface Representations for 3D Solids
by: Fan, Jiajie, et al.
Published: (2024)
Anatomy-Aware Unsupervised Detection and Localization of Retinal Abnormalities in Optical Coherence Tomography
by: Haghighi, Tania, et al.
Published: (2026)
Spatial Reasoning with Vision-Language Models in Ego-Centric Multi-View Scenes
by: Gholami, Mohsen, et al.
Published: (2025)
GauCho: Gaussian Distributions with Cholesky Decomposition for Oriented Object Detection
by: Murrugarra-LLerena, Jeffri, et al.
Published: (2025)
SceneDecorator: Towards Scene-Oriented Story Generation with Scene Planning and Scene Consistency
by: Song, Quanjian, et al.
Published: (2025)
Center-Oriented Prototype Contrastive Clustering
by: Dong, Shihao, et al.
Published: (2025)
ROSE: Retrieval-Oriented Segmentation Enhancement
by: Tang, Song, et al.
Published: (2026)
SOVC: Subject-Oriented Video Captioning
by: Teng, Chang, et al.
Published: (2023)
Efficient Transferable Optimal Transport via Min-Sliced Transport Plans
by: Liu, Xinran, et al.
Published: (2025)
SliderEdit: Continuous Image Editing with Fine-Grained Instruction Control
by: Zarei, Arman, et al.
Published: (2025)
Localizing Knowledge in Diffusion Transformers
by: Zarei, Arman, et al.
Published: (2025)
Understanding Information Storage and Transfer in Multi-modal Large Language Models
by: Basu, Samyadeep, et al.
Published: (2024)
Dome-DETR: DETR with Density-Oriented Feature-Query Manipulation for Efficient Tiny Object Detection
by: Hu, Zhangchi, et al.
Published: (2025)
ComDrive: Comfort-Oriented End-to-End Autonomous Driving
by: Wang, Junming, et al.
Published: (2024)