:: Library Catalog

Bibliographic Details
Main Authors:	Schäfer, Frederik; Mandl, Luis; Kälber, Lars; Ricken, Tim
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2605.05908

Similar Items

Self-supervised pretraining for an iterative image size agnostic vision transformer
by: Prisadnikov, Nedyalko, et al.
Published: (2026)

An interpretable vision transformer framework for automated brain tumor classification
by: Mbonu, Chinedu Emmanuel, et al.
Published: (2026)

HSFusion: A high-level vision task-driven infrared and visible image fusion network via semantic and geometric domain transformation
by: Jiang, Chengjie, et al.
Published: (2024)

A novel network for classification of cuneiform tablet metadata
by: Hagelskjær, Frederik
Published: (2026)

TREAD: Token Routing for Efficient Architecture-agnostic Diffusion Training
by: Krause, Felix, et al.
Published: (2025)

Architecture and evaluation protocol for transformer-based visual object tracking in UAV applications
by: Borne, Augustin, et al.
Published: (2026)

A hierarchical semantic segmentation framework for computer vision-based bridge damage detection
by: Liu, Jingxiao, et al.
Published: (2022)

Enhancing the vision-language foundation model with key semantic knowledge-emphasized report refinement
by: Huang, Weijian, et al.
Published: (2024)

On the effectiveness of multimodal privileged knowledge distillation in two vision transformer based diagnostic applications
by: Baur, Simon, et al.
Published: (2025)

STARS: Sensor-agnostic Transformer Architecture for Remote Sensing
by: King, Ethan, et al.
Published: (2024)

Separable DeepONet: Breaking the Curse of Dimensionality in Physics-Informed Machine Learning
by: Mandl, Luis, et al.
Published: (2024)

Physics-Informed Time-Integrated DeepONet: Temporal Tangent Space Operator Learning for High-Accuracy Inference
by: Mandl, Luis, et al.
Published: (2025)

MinkOcc: Towards real-time label-efficient semantic occupancy prediction
by: Sze, Samuel, et al.
Published: (2025)

NormalView: sensor-agnostic tree species classification from backpack and aerial lidar data using geometric projections
by: Korkeala, Juho, et al.
Published: (2025)

A comprehensive overview of deep learning techniques for 3D point cloud classification and semantic segmentation
by: Sarker, Sushmita, et al.
Published: (2024)

Real-time 3D semantic occupancy prediction for autonomous vehicles using memory-efficient sparse convolution
by: Sze, Samuel, et al.
Published: (2024)

Cross multiscale vision transformer for deep fake detection
by: P, Akhshan, et al.
Published: (2025)

Automated diagnosis of lung diseases using vision transformer: a comparative study on chest x-ray classification
by: Ahmad, Muhammad, et al.
Published: (2025)

Beyond the final layer: Attentive multilayer fusion for vision transformers
by: Ciernik, Laure, et al.
Published: (2026)

Category-aware EEG image generation based on wavelet transform and contrast semantic loss
by: Zhang, Enshang, et al.
Published: (2025)

Depth-agnostic Single Image Dehazing
by: Xu, Honglei, et al.
Published: (2024)

Towards Domain-agnostic Depth Completion
by: Xu, Guangkai, et al.
Published: (2022)

FISHing in Uncertainty: Synthetic Contrastive Learning for Genetic Aberration Detection
by: Gutwein, Simon, et al.
Published: (2024)

Interpreting vision transformers via residual replacement model
by: Kim, Jinyeong, et al.
Published: (2025)

A survey on efficient vision transformers: algorithms, techniques, and performance benchmarking
by: Papa, Lorenzo, et al.
Published: (2023)

METER: a mobile vision transformer architecture for monocular depth estimation
by: Papa, L., et al.
Published: (2024)

Steering CLIP's vision transformer with sparse autoencoders
by: Joseph, Sonia, et al.
Published: (2025)

SPRINT: Script-agnostic Structure Recognition in Tables
by: Kudale, Dhruv, et al.
Published: (2025)

Clothing agnostic Pre-inpainting Virtual Try-ON
by: Kim, Sehyun, et al.
Published: (2025)

Scene-agnostic Pose Regression for Visual Localization
by: Zheng, Junwei, et al.
Published: (2025)

SAFER: Sharpness Aware layer-selective Finetuning for Enhanced Robustness in vision transformers
by: Gopal, Bhavna, et al.
Published: (2025)

Low-latency vision transformers via large-scale multi-head attention
by: Gross, Ronit D., et al.
Published: (2025)

Initialization matters in few-shot adaptation of vision-language models for histopathological image classification
by: Meseguer, Pablo, et al.
Published: (2026)

Are vision-language models ready to zero-shot replace supervised classification models in agriculture?
by: Ranario, Earl, et al.
Published: (2025)

On the application of the Wasserstein metric to 2D curves classification
by: Kaliszewska, Agnieszka, et al.
Published: (2026)

PIV3CAMS: a multi-camera dataset for multiple computer vision problems and its application to novel view-point synthesis
by: Kim, Sohyeong, et al.
Published: (2024)

$L^3$: Scene-agnostic Visual Localization in the Wild
by: Zhang, Yu, et al.
Published: (2026)

Preserving Marker Specificity with Lightweight Channel-Independent Representation Learning
by: Gutwein, Simon, et al.
Published: (2025)

PTQ4ViT: Post-training quantization for vision transformers with twin uniform quantization
by: Yuan, Zhihang, et al.
Published: (2021)

Two-Stream temporal transformer for video action classification
by: Kurpukdee, Nattapong, et al.
Published: (2026)