:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Kolesnikov, Alexander, Pinto, André Susano, Tschannen, Michael
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition Artificial Intelligence Machine Learning
Online Access:	https://arxiv.org/abs/2412.15129
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

JetFormer: An Autoregressive Generative Model of Raw Images and Text
by: Tschannen, Michael, et al.
Published: (2024)

Towards Truly Zero-shot Compositional Visual Reasoning with LLMs as Programmers
by: Stanić, Aleksandar, et al.
Published: (2024)

NinA: Normalizing Flows in Action. Training VLA Models with Normalizing Flows
by: Tarasov, Denis, et al.
Published: (2025)

PaliGemma: A versatile 3B VLM for transfer
by: Beyer, Lucas, et al.
Published: (2024)

Identity Curvature Laplace Approximation for Improved Out-of-Distribution Detection
by: Zhdanov, Maksim, et al.
Published: (2023)

Transformers without Normalization
by: Zhu, Jiachen, et al.
Published: (2025)

Diversifying Deep Ensembles: A Saliency Map Approach for Enhanced OOD Detection, Calibration, and Accuracy
by: Dereka, Stanislav, et al.
Published: (2023)

Stronger Normalization-Free Transformers
by: Chen, Mingzhi, et al.
Published: (2025)

Fast & Efficient Normalizing Flows and Applications of Image Generative Models
by: Nagar, Sandeep
Published: (2025)

STARFlow: Scaling Latent Normalizing Flows for High-resolution Image Synthesis
by: Gu, Jiatao, et al.
Published: (2025)

VAN-AD: Visual Masked Autoencoder with Normalizing Flow For Time Series Anomaly Detection
by: Chen, PengYu, et al.
Published: (2026)

MeanFlow Transformers with Representation Autoencoders
by: Hu, Zheyuan, et al.
Published: (2025)

Evaluation and Analysis of Deep Neural Transformers and Convolutional Neural Networks on Modern Remote Sensing Datasets
by: Hurt, J. Alex, et al.
Published: (2025)

HOFAR: High-Order Augmentation of Flow Autoregressive Transformers
by: Liang, Yingyu, et al.
Published: (2025)

Back Home: A Computer Vision Solution to Seashell Identification for Ecological Restoration
by: Valverde, Alexander, et al.
Published: (2025)

PINGS-X: Physics-Informed Normalized Gaussian Splatting with Axes Alignment for Efficient Super-Resolution of 4D Flow MRI
by: Jo, Sun, et al.
Published: (2025)

Enhancing Diffusion Models for High-Quality Image Generation
by: Shah, Jaineet, et al.
Published: (2024)

Context Normalization Layer with Applications
by: Faye, Bilal, et al.
Published: (2023)

LibraGrad: Balancing Gradient Flow for Universally Better Vision Transformer Attributions
by: Mehri, Faridoun, et al.
Published: (2024)

Evaluating Modern Approaches in 3D Scene Reconstruction: NeRF vs Gaussian-Based Methods
by: Zhou, Yiming, et al.
Published: (2024)

Video Motion Transfer with Diffusion Transformers
by: Pondaven, Alexander, et al.
Published: (2024)

The Disappearance of Timestep Embedding in Modern Time-Dependent Neural Networks
by: Kim, Bum Jun, et al.
Published: (2024)

Simplifying Multi-Task Architectures Through Task-Specific Normalization
by: Suteu, Mihai, et al.
Published: (2025)

Visual Encoders for Data-Efficient Imitation Learning in Modern Video Games
by: Schäfer, Lukas, et al.
Published: (2023)

Beyond Scalars: Concept-Based Alignment Analysis in Vision Transformers
by: Vielhaben, Johanna, et al.
Published: (2024)

Transformer-Based Classification Outcome Prediction for Multimodal Stroke Treatment
by: Ma, Danqing, et al.
Published: (2024)

ScriptViT: Vision Transformer-Based Personalized Handwriting Generation
by: Acharya, Sajjan, et al.
Published: (2025)

Learning to Balance: Diverse Normalization for Cloth-Changing Person Re-Identification
by: Wang, Hongjun, et al.
Published: (2024)

pi-Flow: Policy-Based Few-Step Generation via Imitation Distillation
by: Chen, Hansheng, et al.
Published: (2025)

DragFlow: Unleashing DiT Priors with Region Based Supervision for Drag Editing
by: Zhou, Zihan, et al.
Published: (2025)

Partial Information Decomposition via Normalizing Flows in Latent Gaussian Distributions
by: Zhao, Wenyuan, et al.
Published: (2025)

ULTra: Unveiling Latent Token Interpretability in Transformer-Based Understanding and Segmentation
by: Hosseini, Hesam, et al.
Published: (2024)

Case Study: Transformer-Based Solution for the Automatic Digitization of Gas Plants
by: Bailo, I., et al.
Published: (2025)

CNC: Cross-modal Normality Constraint for Unsupervised Multi-class Anomaly Detection
by: Wang, Xiaolei, et al.
Published: (2024)

Can You Count to Nine? A Human Evaluation Benchmark for Counting Limits in Modern Text-to-Video Models
by: Guo, Xuyang, et al.
Published: (2025)

Exposure-Normalized Bed and Chair Fall Rates via Continuous AI Monitoring
by: Gabriel, Paolo, et al.
Published: (2026)

Transformer-Based Contrastive Meta-Learning For Low-Resource Generalizable Activity Recognition
by: Wang, Junyao, et al.
Published: (2024)

Hourglass Tokenizer for Efficient Transformer-Based 3D Human Pose Estimation
by: Li, Wenhao, et al.
Published: (2023)

MimicNorm: Weight Mean and Last BN Layer Mimic the Dynamic of Batch Normalization
by: Fei, Wen, et al.
Published: (2020)

FlowDPS: Flow-Driven Posterior Sampling for Inverse Problems
by: Kim, Jeongsol, et al.
Published: (2025)