| Main Authors: | Esteves, Carlos, Suhail, Mohammed, Makadia, Ameesh |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2412.09607 |
Similar Items
Spectrally-Guided Diffusion Noise Schedules
by: Esteves, Carlos, et al.
Published: (2026)
Factorized Video Autoencoders for Efficient Generative Modelling
by: Suhail, Mohammed, et al.
Published: (2024)
Single Mesh Diffusion Models with Field Latents for Texture Generation
by: Mitchel, Thomas W., et al.
Published: (2023)
Learning to Transform for Generalizable Instance-wise Invariance
by: Singhal, Utkarsh, et al.
Published: (2023)
Decomposing Private Image Generation via Coarse-to-Fine Wavelet Modeling
by: Bayrooti, Jasmine, et al.
Published: (2026)
Wide-Baseline Relative Camera Pose Estimation with Directional Learning
by: Chen, Kefan, et al.
Published: (2021)
3DCodeBench: Benchmarking Agentic Procedural 3D Modeling Via Code
by: Gao, Yipeng, et al.
Published: (2026)
No Training Wheels: Steering Vectors for Bias Correction at Inference Time
by: Gupta, Aviral, et al.
Published: (2025)
Network Inversion of Convolutional Neural Nets
by: Suhail, Pirzada, et al.
Published: (2024)
PoissonNet: A Local-Global Approach for Learning on Surfaces
by: Maesumi, Arman, et al.
Published: (2025)
Network Inversion for Generating Confidently Classified Counterfeits
by: Suhail, Pirzada, et al.
Published: (2025)
Activation Matching for Explanation Generation
by: Suhail, Pirzada, et al.
Published: (2025)
Shortcut Learning Susceptibility in Vision Classifiers
by: Suhail, Pirzada, et al.
Published: (2025)
BATR-FST: Bi-Level Adaptive Token Refinement for Few-Shot Transformers
by: Al-Habib, Mohammed, et al.
Published: (2025)
Privacy Preserving Properties of Vision Classifiers
by: Suhail, Pirzada, et al.
Published: (2025)
Network Inversion for Uncertainty-Aware Out-of-Distribution Detection
by: Suhail, Pirzada, et al.
Published: (2025)
Semantic Prompting with Image-Token for Continual Learning
by: Han, Jisu, et al.
Published: (2024)
VibeToken: Scaling 1D Image Tokenizers and Autoregressive Models for Dynamic Resolution Generations
by: Patel, Maitreya, et al.
Published: (2026)
Controllable Image Generation with Composed Parallel Token Prediction
by: Stirling, Jamie, et al.
Published: (2024)
Network Inversion and Its Applications
by: Suhail, Pirzada, et al.
Published: (2024)
LGQ: Learning Discretization Geometry for Scalable and Stable Image Tokenization
by: Altun, Idil Bilge, et al.
Published: (2026)
MaskBit: Embedding-free Image Generation via Bit Tokens
by: Weber, Mark, et al.
Published: (2024)
End-to-End Autoregressive Image Generation with 1D Semantic Tokenizer
by: Chu, Wenda, et al.
Published: (2026)
One-D-Piece: Image Tokenizer Meets Quality-Controllable Compression
by: Miwa, Keita, et al.
Published: (2025)
Spectral and Spatial Graph Learning for Multispectral Solar Image Compression
by: Siwakoti, Prasiddha, et al.
Published: (2025)
Clustering-Guided Spatial-Spectral Mamba for Hyperspectral Image Classification
by: Dewis, Zack, et al.
Published: (2026)
FlexTok: Resampling Images into 1D Token Sequences of Flexible Length
by: Bachmann, Roman, et al.
Published: (2025)
GeoToken: Hierarchical Geolocalization of Images via Next Token Prediction
by: Ghasemi, Narges, et al.
Published: (2025)
Language-Guided Image Tokenization for Generation
by: Zha, Kaiwen, et al.
Published: (2024)
Diffusion Autoencoders are Scalable Image Tokenizers
by: Chen, Yinbo, et al.
Published: (2025)
TIE: A Training-Inversion-Exclusion Framework for Visually Interpretable and Uncertainty-Guided Out-of-Distribution Detection
by: Suhail, Pirzada, et al.
Published: (2025)
Direct Motion Models for Assessing Generated Videos
by: Allen, Kelsey, et al.
Published: (2025)
MetaToken: Detecting Hallucination in Image Descriptions by Meta Classification
by: Fieback, Laura, et al.
Published: (2024)
Boosting Text-to-Image Diffusion Models via Core Token Attention-Based Seed Selection
by: Zhang, Yunzhe, et al.
Published: (2026)
HyperTokens: Controlling Token Dynamics for Continual Video-Language Understanding
by: Nguyen, Toan, et al.
Published: (2026)
High-Resolution Image Reconstruction with Unsupervised Learning and Noisy Data Applied to Ion-Beam Dynamics for Particle Accelerators
by: Osswald, Francis, et al.
Published: (2026)
Communication-Inspired Tokenization for Structured Image Representations
by: Davtyan, Aram, et al.
Published: (2026)
Prioritizing Image-Related Tokens Enhances Vision-Language Pre-Training
by: Chen, Yangyi, et al.
Published: (2025)
Hallucinatory Image Tokens: A Training-free EAZY Approach on Detecting and Mitigating Object Hallucinations in LVLMs
by: Che, Liwei, et al.
Published: (2025)
Hyperspectral Image Spectral-Spatial Feature Extraction via Tensor Principal Component Analysis
by: Ren, Yuemei, et al.
Published: (2024)