Saved in:
| Main Authors: | Kolesnikov, Alexander, Pinto, André Susano, Tschannen, Michael |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2412.15129 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
JetFormer: An Autoregressive Generative Model of Raw Images and Text
by: Tschannen, Michael, et al.
Published: (2024)
by: Tschannen, Michael, et al.
Published: (2024)
Towards Truly Zero-shot Compositional Visual Reasoning with LLMs as Programmers
by: Stanić, Aleksandar, et al.
Published: (2024)
by: Stanić, Aleksandar, et al.
Published: (2024)
NinA: Normalizing Flows in Action. Training VLA Models with Normalizing Flows
by: Tarasov, Denis, et al.
Published: (2025)
by: Tarasov, Denis, et al.
Published: (2025)
PaliGemma: A versatile 3B VLM for transfer
by: Beyer, Lucas, et al.
Published: (2024)
by: Beyer, Lucas, et al.
Published: (2024)
Identity Curvature Laplace Approximation for Improved Out-of-Distribution Detection
by: Zhdanov, Maksim, et al.
Published: (2023)
by: Zhdanov, Maksim, et al.
Published: (2023)
Transformers without Normalization
by: Zhu, Jiachen, et al.
Published: (2025)
by: Zhu, Jiachen, et al.
Published: (2025)
Diversifying Deep Ensembles: A Saliency Map Approach for Enhanced OOD Detection, Calibration, and Accuracy
by: Dereka, Stanislav, et al.
Published: (2023)
by: Dereka, Stanislav, et al.
Published: (2023)
Stronger Normalization-Free Transformers
by: Chen, Mingzhi, et al.
Published: (2025)
by: Chen, Mingzhi, et al.
Published: (2025)
Fast & Efficient Normalizing Flows and Applications of Image Generative Models
by: Nagar, Sandeep
Published: (2025)
by: Nagar, Sandeep
Published: (2025)
STARFlow: Scaling Latent Normalizing Flows for High-resolution Image Synthesis
by: Gu, Jiatao, et al.
Published: (2025)
by: Gu, Jiatao, et al.
Published: (2025)
VAN-AD: Visual Masked Autoencoder with Normalizing Flow For Time Series Anomaly Detection
by: Chen, PengYu, et al.
Published: (2026)
by: Chen, PengYu, et al.
Published: (2026)
MeanFlow Transformers with Representation Autoencoders
by: Hu, Zheyuan, et al.
Published: (2025)
by: Hu, Zheyuan, et al.
Published: (2025)
Evaluation and Analysis of Deep Neural Transformers and Convolutional Neural Networks on Modern Remote Sensing Datasets
by: Hurt, J. Alex, et al.
Published: (2025)
by: Hurt, J. Alex, et al.
Published: (2025)
HOFAR: High-Order Augmentation of Flow Autoregressive Transformers
by: Liang, Yingyu, et al.
Published: (2025)
by: Liang, Yingyu, et al.
Published: (2025)
Back Home: A Computer Vision Solution to Seashell Identification for Ecological Restoration
by: Valverde, Alexander, et al.
Published: (2025)
by: Valverde, Alexander, et al.
Published: (2025)
PINGS-X: Physics-Informed Normalized Gaussian Splatting with Axes Alignment for Efficient Super-Resolution of 4D Flow MRI
by: Jo, Sun, et al.
Published: (2025)
by: Jo, Sun, et al.
Published: (2025)
Enhancing Diffusion Models for High-Quality Image Generation
by: Shah, Jaineet, et al.
Published: (2024)
by: Shah, Jaineet, et al.
Published: (2024)
Context Normalization Layer with Applications
by: Faye, Bilal, et al.
Published: (2023)
by: Faye, Bilal, et al.
Published: (2023)
LibraGrad: Balancing Gradient Flow for Universally Better Vision Transformer Attributions
by: Mehri, Faridoun, et al.
Published: (2024)
by: Mehri, Faridoun, et al.
Published: (2024)
Evaluating Modern Approaches in 3D Scene Reconstruction: NeRF vs Gaussian-Based Methods
by: Zhou, Yiming, et al.
Published: (2024)
by: Zhou, Yiming, et al.
Published: (2024)
Video Motion Transfer with Diffusion Transformers
by: Pondaven, Alexander, et al.
Published: (2024)
by: Pondaven, Alexander, et al.
Published: (2024)
The Disappearance of Timestep Embedding in Modern Time-Dependent Neural Networks
by: Kim, Bum Jun, et al.
Published: (2024)
by: Kim, Bum Jun, et al.
Published: (2024)
Simplifying Multi-Task Architectures Through Task-Specific Normalization
by: Suteu, Mihai, et al.
Published: (2025)
by: Suteu, Mihai, et al.
Published: (2025)
Visual Encoders for Data-Efficient Imitation Learning in Modern Video Games
by: Schäfer, Lukas, et al.
Published: (2023)
by: Schäfer, Lukas, et al.
Published: (2023)
Beyond Scalars: Concept-Based Alignment Analysis in Vision Transformers
by: Vielhaben, Johanna, et al.
Published: (2024)
by: Vielhaben, Johanna, et al.
Published: (2024)
Transformer-Based Classification Outcome Prediction for Multimodal Stroke Treatment
by: Ma, Danqing, et al.
Published: (2024)
by: Ma, Danqing, et al.
Published: (2024)
ScriptViT: Vision Transformer-Based Personalized Handwriting Generation
by: Acharya, Sajjan, et al.
Published: (2025)
by: Acharya, Sajjan, et al.
Published: (2025)
Learning to Balance: Diverse Normalization for Cloth-Changing Person Re-Identification
by: Wang, Hongjun, et al.
Published: (2024)
by: Wang, Hongjun, et al.
Published: (2024)
pi-Flow: Policy-Based Few-Step Generation via Imitation Distillation
by: Chen, Hansheng, et al.
Published: (2025)
by: Chen, Hansheng, et al.
Published: (2025)
DragFlow: Unleashing DiT Priors with Region Based Supervision for Drag Editing
by: Zhou, Zihan, et al.
Published: (2025)
by: Zhou, Zihan, et al.
Published: (2025)
Partial Information Decomposition via Normalizing Flows in Latent Gaussian Distributions
by: Zhao, Wenyuan, et al.
Published: (2025)
by: Zhao, Wenyuan, et al.
Published: (2025)
ULTra: Unveiling Latent Token Interpretability in Transformer-Based Understanding and Segmentation
by: Hosseini, Hesam, et al.
Published: (2024)
by: Hosseini, Hesam, et al.
Published: (2024)
Case Study: Transformer-Based Solution for the Automatic Digitization of Gas Plants
by: Bailo, I., et al.
Published: (2025)
by: Bailo, I., et al.
Published: (2025)
CNC: Cross-modal Normality Constraint for Unsupervised Multi-class Anomaly Detection
by: Wang, Xiaolei, et al.
Published: (2024)
by: Wang, Xiaolei, et al.
Published: (2024)
Can You Count to Nine? A Human Evaluation Benchmark for Counting Limits in Modern Text-to-Video Models
by: Guo, Xuyang, et al.
Published: (2025)
by: Guo, Xuyang, et al.
Published: (2025)
Exposure-Normalized Bed and Chair Fall Rates via Continuous AI Monitoring
by: Gabriel, Paolo, et al.
Published: (2026)
by: Gabriel, Paolo, et al.
Published: (2026)
Transformer-Based Contrastive Meta-Learning For Low-Resource Generalizable Activity Recognition
by: Wang, Junyao, et al.
Published: (2024)
by: Wang, Junyao, et al.
Published: (2024)
Hourglass Tokenizer for Efficient Transformer-Based 3D Human Pose Estimation
by: Li, Wenhao, et al.
Published: (2023)
by: Li, Wenhao, et al.
Published: (2023)
MimicNorm: Weight Mean and Last BN Layer Mimic the Dynamic of Batch Normalization
by: Fei, Wen, et al.
Published: (2020)
by: Fei, Wen, et al.
Published: (2020)
FlowDPS: Flow-Driven Posterior Sampling for Inverse Problems
by: Kim, Jeongsol, et al.
Published: (2025)
by: Kim, Jeongsol, et al.
Published: (2025)
Similar Items
-
JetFormer: An Autoregressive Generative Model of Raw Images and Text
by: Tschannen, Michael, et al.
Published: (2024) -
Towards Truly Zero-shot Compositional Visual Reasoning with LLMs as Programmers
by: Stanić, Aleksandar, et al.
Published: (2024) -
NinA: Normalizing Flows in Action. Training VLA Models with Normalizing Flows
by: Tarasov, Denis, et al.
Published: (2025) -
PaliGemma: A versatile 3B VLM for transfer
by: Beyer, Lucas, et al.
Published: (2024) -
Identity Curvature Laplace Approximation for Improved Out-of-Distribution Detection
by: Zhdanov, Maksim, et al.
Published: (2023)