:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Tran, Manuel, Cid, Yashin Dicente, Lahiani, Amal, Theis, Fabian J., Peng, Tingying, Klaiman, Eldad
Format:	Preprint
Published:	2023
Subjects:	Artificial Intelligence Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2305.14243
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

B-Cos Aligned Transformers Learn Human-Interpretable Features
by: Tran, Manuel, et al.
Published: (2024)

Vision Transformer-Conditioned UNet for Domain-Adaptive Semantic Segmentation
by: Ortega, Joel Valdivia, et al.
Published: (2026)

A Large-Scale Benchmark of Cross-Modal Learning for Histology and Gene Expression in Spatial Transcriptomics
by: Gindra, Rushin H., et al.
Published: (2025)

Lightweight Data-Free Denoising for Detail-Preserving Biomedical Image Restoration
by: Chobola, Tomáš, et al.
Published: (2025)

LoReTrack: Efficient and Accurate Low-Resolution Transformer Tracking
by: Dong, Shaohua, et al.
Published: (2024)

Fast Context-Based Low-Light Image Enhancement via Neural Implicit Representations
by: Chobola, Tomáš, et al.
Published: (2024)

NOA: a versatile, extensible tool for AI-based organoid analysis
by: Konov, Mikhail, et al.
Published: (2025)

Multi-Modality Microscopy Image Style Transfer for Nuclei Segmentation
by: Liu, Ye, et al.
Published: (2021)

Graph Residual Noise Learner Network for Brain Connectivity Graph Prediction
by: Demirbilek, Oytun, et al.
Published: (2024)

FreeLoRA: Enabling Training-Free LoRA Fusion for Autoregressive Multi-Subject Personalization
by: Zheng, Peng, et al.
Published: (2025)

Low-resource finetuning of foundation models beats state-of-the-art in histopathology
by: Roth, Benedikt, et al.
Published: (2024)

An Over Complete Deep Learning Method for Inverse Problems
by: Eliasof, Moshe, et al.
Published: (2024)

LoRAShop: Training-Free Multi-Concept Image Generation and Editing with Rectified Flow Transformers
by: Dalva, Yusuf, et al.
Published: (2025)

Randomized-MLP Regularization Improves Domain Adaptation and Interpretability in DINOv2
by: Ortega, Joel Valdivia, et al.
Published: (2025)

Social-MAE: A Transformer-Based Multimodal Autoencoder for Face and Voice
by: Bohy, Hugo, et al.
Published: (2025)

DAWN-FM: Data-Aware and Noise-Informed Flow Matching for Solving Inverse Problems
by: Ahamed, Shadab, et al.
Published: (2024)

Fully invertible hyperbolic neural networks for segmenting large-scale surface and sub-surface data
by: Peters, Bas, et al.
Published: (2024)

TAS-LoRA: Transformer Architecture Search with Mixture-of-LoRA Experts
by: Jeon, Jeimin, et al.
Published: (2026)

Graph Flow Matching: Enhancing Image Generation with Neighbor-Aware Flow Fields
by: Siddiqui, Md Shahriar Rahim, et al.
Published: (2025)

Inducing Spatial Locality in Vision Transformers through the Training Protocol
by: Toledo, Eduardo Santiago, et al.
Published: (2026)

LoFT: LoRA-fused Training Dataset Generation with Few-shot Guidance
by: Kim, Jae Myung, et al.
Published: (2025)

Disentangled Representation Learning with the Gromov-Monge Gap
by: Uscidda, Théo, et al.
Published: (2024)

MV-CoRe: Multimodal Visual-Conceptual Reasoning for Complex Visual Question Answering
by: Peng, Jingwei, et al.
Published: (2025)

Improved Training Technique for Shortcut Models
by: Nguyen, Anh, et al.
Published: (2025)

Towards Agentic AI for Multimodal-Guided Video Object Segmentation
by: Tran, Tuyen, et al.
Published: (2025)

Multimodal Instruction Tuning with Conditional Mixture of LoRA
by: Shen, Ying, et al.
Published: (2024)

K-LoRA: Unlocking Training-Free Fusion of Any Subject and Style LoRAs
by: Ouyang, Ziheng, et al.
Published: (2025)

E-M3RF: An Equivariant Multimodal 3D Re-assembly Framework
by: Islam, Adeela, et al.
Published: (2025)

HyperPointFormer: Multimodal Fusion in 3D Space with Dual-Branch Cross-Attention Transformers
by: Rizaldy, Aldino, et al.
Published: (2025)

LoRASculpt: Sculpting LoRA for Harmonizing General and Specialized Knowledge in Multimodal Large Language Models
by: Liang, Jian, et al.
Published: (2025)

Improving Transformer Based Line Segment Detection with Matched Predicting and Re-ranking
by: Tong, Xin, et al.
Published: (2025)

In-Context LoRA for Diffusion Transformers
by: Huang, Lianghua, et al.
Published: (2024)

LoFormer: Local Frequency Transformer for Image Deblurring
by: Mao, Xintian, et al.
Published: (2024)

LLaVA-ReID: Selective Multi-image Questioner for Interactive Person Re-Identification
by: Lu, Yiding, et al.
Published: (2025)

PS-ReID: Advancing Person Re-Identification and Precise Segmentation with Multimodal Retrieval
by: Yan, Jincheng, et al.
Published: (2025)

A Re-ranking Method using K-nearest Weighted Fusion for Person Re-identification
by: Che, Huy, et al.
Published: (2025)

HiFi-Syn: Hierarchical Granularity Discrimination for High-Fidelity Synthesis of MR Images with Structure Preservation
by: Yu, Ziqi, et al.
Published: (2023)

Efficient Bayesian Inference from Noisy Pairwise Comparisons
by: Aczel, Till, et al.
Published: (2025)

A Transformer-based Multimodal Fusion Model for Efficient Crowd Counting Using Visual and Wireless Signals
by: Cui, Zhe, et al.
Published: (2025)

LoLA-SpecViT: Local Attention SwiGLU Vision Transformer with LoRA for Hyperspectral Imaging
by: Zidi, Fadi Abdeladhim, et al.
Published: (2025)