Saved in:
| Main Authors: | Petsangourakis, Giorgos, Sgouropoulos, Christos, Psomas, Bill, Giannakopoulos, Theodoros, Sfikas, Giorgos, Kakogeorgiou, Ioannis |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2512.16636 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Composed Image Retrieval for Remote Sensing
by: Psomas, Bill, et al.
Published: (2024)
by: Psomas, Bill, et al.
Published: (2024)
Attention, Please! Revisiting Attentive Probing Through the Lens of Efficiency
by: Psomas, Bill, et al.
Published: (2025)
by: Psomas, Bill, et al.
Published: (2025)
Benchmarking Composed Image Retrieval for Applied Earth Observation
by: Psomas, Bill, et al.
Published: (2026)
by: Psomas, Bill, et al.
Published: (2026)
Designing Practical Models for Isolated Word Visual Speech Recognition
by: Panagos, Iason Ioannis, et al.
Published: (2025)
by: Panagos, Iason Ioannis, et al.
Published: (2025)
Lightweight Operations for Visual Speech Recognition
by: Panagos, Iason Ioannis, et al.
Published: (2025)
by: Panagos, Iason Ioannis, et al.
Published: (2025)
DiffusionPen: Towards Controlling the Style of Handwritten Text Generation
by: Nikolaidou, Konstantina, et al.
Published: (2024)
by: Nikolaidou, Konstantina, et al.
Published: (2024)
Retrieve and Segment: Are a Few Examples Enough to Bridge the Supervision Gap in Open-Vocabulary Segmentation?
by: Aravanis, Tilemachos, et al.
Published: (2026)
by: Aravanis, Tilemachos, et al.
Published: (2026)
Best Practices for a Handwritten Text Recognition System
by: Retsinas, George, et al.
Published: (2024)
by: Retsinas, George, et al.
Published: (2024)
Rethinking HTG Evaluation: Bridging Generation and Recognition
by: Nikolaidou, Konstantina, et al.
Published: (2024)
by: Nikolaidou, Konstantina, et al.
Published: (2024)
Bridging Mini-Batch and Asymptotic Analysis in Contrastive Learning: From InfoNCE to Kernel-Based Losses
by: Koromilas, Panagiotis, et al.
Published: (2024)
by: Koromilas, Panagiotis, et al.
Published: (2024)
Optimal Transport for Handwritten Text Recognition in a Low-Resource Regime
by: Wraight, Petros Georgoulas, et al.
Published: (2025)
by: Wraight, Petros Georgoulas, et al.
Published: (2025)
Structure Your Data: Towards Semantic Graph Counterfactuals
by: Dimitriou, Angeliki, et al.
Published: (2024)
by: Dimitriou, Angeliki, et al.
Published: (2024)
Dual Orthogonal Guidance for Robust Diffusion-based Handwritten Text Generation
by: Nikolaidou, Konstantina, et al.
Published: (2025)
by: Nikolaidou, Konstantina, et al.
Published: (2025)
On the Matrix Form of the Quaternion Fourier Transform and Quaternion Convolution
by: Sfikas, Giorgos, et al.
Published: (2023)
by: Sfikas, Giorgos, et al.
Published: (2023)
Composed Image Retrieval for Training-Free Domain Conversion
by: Efthymiadis, Nikos, et al.
Published: (2024)
by: Efthymiadis, Nikos, et al.
Published: (2024)
Are Euler angles a useful rotation parameterisation for pose estimation with Normalizing Flows?
by: Sfikas, Giorgos, et al.
Published: (2025)
by: Sfikas, Giorgos, et al.
Published: (2025)
A Principled Framework for Multi-View Contrastive Learning
by: Koromilas, Panagiotis, et al.
Published: (2025)
by: Koromilas, Panagiotis, et al.
Published: (2025)
Masked Generative Story Transformer with Character Guidance and Caption Augmentation
by: Papadimitriou, Christos, et al.
Published: (2024)
by: Papadimitriou, Christos, et al.
Published: (2024)
InDistill: Information flow-preserving knowledge distillation for model compression
by: Sarridis, Ioannis, et al.
Published: (2022)
by: Sarridis, Ioannis, et al.
Published: (2022)
Boosting Generative Image Modeling via Joint Image-Feature Synthesis
by: Kouzelis, Theodoros, et al.
Published: (2025)
by: Kouzelis, Theodoros, et al.
Published: (2025)
Instance-Level Composed Image Retrieval
by: Psomas, Bill, et al.
Published: (2025)
by: Psomas, Bill, et al.
Published: (2025)
ELViS: Efficient Visual Similarity from Local Descriptors that Generalizes Across Domains
by: Suma, Pavel, et al.
Published: (2026)
by: Suma, Pavel, et al.
Published: (2026)
Advancing Semantic Future Prediction through Multimodal Visual Sequence Transformers
by: Karypidis, Efstathios, et al.
Published: (2025)
by: Karypidis, Efstathios, et al.
Published: (2025)
Quo Vadis Handwritten Text Generation for Handwritten Text Recognition?
by: Pippi, Vittorio, et al.
Published: (2025)
by: Pippi, Vittorio, et al.
Published: (2025)
AMES: Asymmetric and Memory-Efficient Similarity Estimation for Instance-level Retrieval
by: Suma, Pavel, et al.
Published: (2024)
by: Suma, Pavel, et al.
Published: (2024)
Prompt2Fashion: An automatically generated fashion dataset
by: Argyrou, Georgia, et al.
Published: (2024)
by: Argyrou, Georgia, et al.
Published: (2024)
Automatic Generation of Fashion Images using Prompting in Generative Machine Learning Models
by: Argyrou, Georgia, et al.
Published: (2024)
by: Argyrou, Georgia, et al.
Published: (2024)
Counterfactual Edits for Generative Evaluation
by: Lymperaiou, Maria, et al.
Published: (2023)
by: Lymperaiou, Maria, et al.
Published: (2023)
SPOT: Self-Training with Patch-Order Permutation for Object-Centric Learning with Autoregressive Transformers
by: Kakogeorgiou, Ioannis, et al.
Published: (2023)
by: Kakogeorgiou, Ioannis, et al.
Published: (2023)
DINO-Foresight: Looking into the Future with DINO
by: Karypidis, Efstathios, et al.
Published: (2024)
by: Karypidis, Efstathios, et al.
Published: (2024)
ENACT: Entropy-based Clustering of Attention Input for Reducing the Computational Needs of Object Detection Transformers
by: Savathrakis, Giorgos, et al.
Published: (2024)
by: Savathrakis, Giorgos, et al.
Published: (2024)
SeLoRA: Self-Expanding Low-Rank Adaptation of Latent Diffusion Model for Medical Image Synthesis
by: Mao, Yuchen, et al.
Published: (2024)
by: Mao, Yuchen, et al.
Published: (2024)
Instance-Level Generation for Representation Learning
by: Wu, Yankun, et al.
Published: (2025)
by: Wu, Yankun, et al.
Published: (2025)
Global-Aware Edge Prioritization for Pose Graph Initialization
by: Wei, Tong, et al.
Published: (2026)
by: Wei, Tong, et al.
Published: (2026)
HalCECE: A Framework for Explainable Hallucination Detection through Conceptual Counterfactuals in Image Captioning
by: Lymperaiou, Maria, et al.
Published: (2025)
by: Lymperaiou, Maria, et al.
Published: (2025)
U-CECE: A Universal Multi-Resolution Framework for Conceptual Counterfactual Explanations
by: Dimitriou, Angeliki, et al.
Published: (2026)
by: Dimitriou, Angeliki, et al.
Published: (2026)
Processing and acquisition traces in visual encoders: What does CLIP know about your camera?
by: Ramos, Ryan, et al.
Published: (2025)
by: Ramos, Ryan, et al.
Published: (2025)
SCENIR: Visual Semantic Clarity through Unsupervised Scene Graph Retrieval
by: Chaidos, Nikolaos, et al.
Published: (2025)
by: Chaidos, Nikolaos, et al.
Published: (2025)
LPOSS: Label Propagation Over Patches and Pixels for Open-vocabulary Semantic Segmentation
by: Stojnić, Vladan, et al.
Published: (2025)
by: Stojnić, Vladan, et al.
Published: (2025)
The Contribution of Knowledge in Visiolinguistic Learning: A Survey on Tasks and Challenges
by: Lymperaiou, Maria, et al.
Published: (2023)
by: Lymperaiou, Maria, et al.
Published: (2023)
Similar Items
-
Composed Image Retrieval for Remote Sensing
by: Psomas, Bill, et al.
Published: (2024) -
Attention, Please! Revisiting Attentive Probing Through the Lens of Efficiency
by: Psomas, Bill, et al.
Published: (2025) -
Benchmarking Composed Image Retrieval for Applied Earth Observation
by: Psomas, Bill, et al.
Published: (2026) -
Designing Practical Models for Isolated Word Visual Speech Recognition
by: Panagos, Iason Ioannis, et al.
Published: (2025) -
Lightweight Operations for Visual Speech Recognition
by: Panagos, Iason Ioannis, et al.
Published: (2025)