Guardado en:
| Autores principales: | Espinosa, Miguel, Yang, Chenhongyi, Ericsson, Linus, McDonagh, Steven, Crowley, Elliot J. |
|---|---|
| Formato: | Preprint |
| Publicado: |
2025
|
| Materias: | |
| Acceso en línea: | https://arxiv.org/abs/2507.02798 |
| Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
Ejemplares similares
There is no SAMantics! Exploring SAM as a Backbone for Visual Understanding Tasks
por: Espinosa, Miguel, et al.
Publicado: (2024)
por: Espinosa, Miguel, et al.
Publicado: (2024)
einspace: Searching for Neural Architectures from Fundamental Operations
por: Ericsson, Linus, et al.
Publicado: (2024)
por: Ericsson, Linus, et al.
Publicado: (2024)
Label-Efficient Object Detection via Region Proposal Network Pre-Training
por: Dong, Nanqing, et al.
Publicado: (2022)
por: Dong, Nanqing, et al.
Publicado: (2022)
PlainMamba: Improving Non-Hierarchical Mamba in Visual Recognition
por: Yang, Chenhongyi, et al.
Publicado: (2024)
por: Yang, Chenhongyi, et al.
Publicado: (2024)
Plug and Play Active Learning for Object Detection
por: Yang, Chenhongyi, et al.
Publicado: (2022)
por: Yang, Chenhongyi, et al.
Publicado: (2022)
WidthFormer: Toward Efficient Transformer-based BEV View Transformation
por: Yang, Chenhongyi, et al.
Publicado: (2024)
por: Yang, Chenhongyi, et al.
Publicado: (2024)
GaussianHeadTalk: Wobble-Free 3D Talking Heads with Audio Driven Gaussian Splatting
por: Agarwal, Madhav, et al.
Publicado: (2025)
por: Agarwal, Madhav, et al.
Publicado: (2025)
Mind the Gap: Benchmarking Spatial Reasoning in Vision-Language Models
por: Stogiannidis, Ilias, et al.
Publicado: (2025)
por: Stogiannidis, Ilias, et al.
Publicado: (2025)
EgoPoseFormer: A Simple Baseline for Stereo Egocentric 3D Human Pose Estimation
por: Yang, Chenhongyi, et al.
Publicado: (2024)
por: Yang, Chenhongyi, et al.
Publicado: (2024)
Improving Object Detection via Local-global Contrastive Learning
por: Triantafyllidou, Danai, et al.
Publicado: (2024)
por: Triantafyllidou, Danai, et al.
Publicado: (2024)
Concept-based Adversarial Attack: a Probabilistic Perspective
por: Zhang, Andi, et al.
Publicado: (2025)
por: Zhang, Andi, et al.
Publicado: (2025)
Why Do Vision Language Models Struggle To Recognize Human Emotions?
por: Agarwal, Madhav, et al.
Publicado: (2026)
por: Agarwal, Madhav, et al.
Publicado: (2026)
Erase to Enhance: Data-Efficient Machine Unlearning in MRI Reconstruction
por: Xue, Yuyang, et al.
Publicado: (2024)
por: Xue, Yuyang, et al.
Publicado: (2024)
CheXGenBench: A Unified Benchmark For Fidelity, Privacy and Utility of Synthetic Chest Radiographs
por: Dutt, Raman, et al.
Publicado: (2025)
por: Dutt, Raman, et al.
Publicado: (2025)
View-Consistent Diffusion Representations for 3D-Consistent Video Generation
por: Danier, Duolikun, et al.
Publicado: (2025)
por: Danier, Duolikun, et al.
Publicado: (2025)
Rethinking Inter-LoRA Orthogonality in Adapter Merging: Insights from Orthogonal Monte Carlo Dropout
por: Zhang, Andi, et al.
Publicado: (2025)
por: Zhang, Andi, et al.
Publicado: (2025)
COP-GEN-Beta: Unified Generative Modelling of COPernicus Imagery Thumbnails
por: Espinosa, Miguel, et al.
Publicado: (2025)
por: Espinosa, Miguel, et al.
Publicado: (2025)
COP-GEN: Latent Diffusion Transformer for Copernicus Earth Observation Data
por: Espinosa, Miguel, et al.
Publicado: (2026)
por: Espinosa, Miguel, et al.
Publicado: (2026)
Training-Free Dataset Pruning for Instance Segmentation
por: Dai, Yalun, et al.
Publicado: (2025)
por: Dai, Yalun, et al.
Publicado: (2025)
MULAN: A Multi Layer Annotated Dataset for Controllable Text-to-Image Generation
por: Tudosiu, Petru-Daniel, et al.
Publicado: (2024)
por: Tudosiu, Petru-Daniel, et al.
Publicado: (2024)
Evolutionary Architecture Search through Grammar-Based Sequence Alignment
por: Martín, Adri Gómez, et al.
Publicado: (2025)
por: Martín, Adri Gómez, et al.
Publicado: (2025)
SWiFT: Soft-Mask Weight Fine-tuning for Bias Mitigation
por: Yan, Junyu, et al.
Publicado: (2025)
por: Yan, Junyu, et al.
Publicado: (2025)
CRCE: Coreference-Retention Concept Erasure in Text-to-Image Diffusion Models
por: Xue, Yuyang, et al.
Publicado: (2025)
por: Xue, Yuyang, et al.
Publicado: (2025)
Exploiting Mixture-of-Experts Redundancy Unlocks Multimodal Generative Abilities
por: Dutt, Raman, et al.
Publicado: (2025)
por: Dutt, Raman, et al.
Publicado: (2025)
PerSense: Training-Free Personalized Instance Segmentation in Dense Images
por: Siddiqui, Muhammad Ibraheem, et al.
Publicado: (2024)
por: Siddiqui, Muhammad Ibraheem, et al.
Publicado: (2024)
Towards PerSense++: Advancing Training-Free Personalized Instance Segmentation in Dense Images
por: Siddiqui, Muhammad Ibraheem, et al.
Publicado: (2025)
por: Siddiqui, Muhammad Ibraheem, et al.
Publicado: (2025)
Beyond Pixel Histories: World Models with Persistent 3D State
por: Garcin, Samuel, et al.
Publicado: (2026)
por: Garcin, Samuel, et al.
Publicado: (2026)
Image-Conditioned Instance Prompt Network for Referring Remote Sensing Image Segmentation
por: Ren, Biaoyu, et al.
Publicado: (2026)
por: Ren, Biaoyu, et al.
Publicado: (2026)
Unlocking the Potential of Weakly Labeled Data: A Co-Evolutionary Learning Framework for Abnormality Detection and Report Generation
por: Sun, Jinghan, et al.
Publicado: (2024)
por: Sun, Jinghan, et al.
Publicado: (2024)
Reference Twice: A Simple and Unified Baseline for Few-Shot Instance Segmentation
por: Han, Yue, et al.
Publicado: (2023)
por: Han, Yue, et al.
Publicado: (2023)
Parameter-Efficient Fine-Tuning for Medical Image Analysis: The Missed Opportunity
por: Dutt, Raman, et al.
Publicado: (2023)
por: Dutt, Raman, et al.
Publicado: (2023)
Pose2Seg: Detection Free Human Instance Segmentation
por: Zhang, Song-Hai, et al.
Publicado: (2018)
por: Zhang, Song-Hai, et al.
Publicado: (2018)
Phrase-Instance Alignment for Generalized Referring Segmentation
por: Nguyen, E-Ro, et al.
Publicado: (2024)
por: Nguyen, E-Ro, et al.
Publicado: (2024)
FreePoint: Unsupervised Point Cloud Instance Segmentation
por: Zhang, Zhikai, et al.
Publicado: (2023)
por: Zhang, Zhikai, et al.
Publicado: (2023)
ISAC: Training-Free Instance-to-Semantic Attention Control for Improving Multi-Instance Generation
por: Jo, Sanghyun, et al.
Publicado: (2025)
por: Jo, Sanghyun, et al.
Publicado: (2025)
Hierarchical Collaborative Fusion for 3D Instance-aware Referring Expression Segmentation
por: Zhou, Keshen, et al.
Publicado: (2026)
por: Zhou, Keshen, et al.
Publicado: (2026)
Unsupervised Pre-training with Language-Vision Prompts for Low-Data Instance Segmentation
por: Zhang, Dingwen, et al.
Publicado: (2024)
por: Zhang, Dingwen, et al.
Publicado: (2024)
Unsupervised Pre-Training for 3D Leaf Instance Segmentation
por: Roggiolani, Gianmarco, et al.
Publicado: (2024)
por: Roggiolani, Gianmarco, et al.
Publicado: (2024)
Hard-aware Instance Adaptive Self-training for Unsupervised Cross-domain Semantic Segmentation
por: Zhu, Chuang, et al.
Publicado: (2023)
por: Zhu, Chuang, et al.
Publicado: (2023)
ALISE: Annotation-Free LiDAR Instance Segmentation for Autonomous Driving
por: Lyu, Yongxuan, et al.
Publicado: (2025)
por: Lyu, Yongxuan, et al.
Publicado: (2025)
Ejemplares similares
-
There is no SAMantics! Exploring SAM as a Backbone for Visual Understanding Tasks
por: Espinosa, Miguel, et al.
Publicado: (2024) -
einspace: Searching for Neural Architectures from Fundamental Operations
por: Ericsson, Linus, et al.
Publicado: (2024) -
Label-Efficient Object Detection via Region Proposal Network Pre-Training
por: Dong, Nanqing, et al.
Publicado: (2022) -
PlainMamba: Improving Non-Hierarchical Mamba in Visual Recognition
por: Yang, Chenhongyi, et al.
Publicado: (2024) -
Plug and Play Active Learning for Object Detection
por: Yang, Chenhongyi, et al.
Publicado: (2022)