:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Esteves, Carlos, Suhail, Mohammed, Makadia, Ameesh
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition Machine Learning
Online Access:	https://arxiv.org/abs/2412.09607
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Spectrally-Guided Diffusion Noise Schedules
by: Esteves, Carlos, et al.
Published: (2026)

Factorized Video Autoencoders for Efficient Generative Modelling
by: Suhail, Mohammed, et al.
Published: (2024)

Single Mesh Diffusion Models with Field Latents for Texture Generation
by: Mitchel, Thomas W., et al.
Published: (2023)

Learning to Transform for Generalizable Instance-wise Invariance
by: Singhal, Utkarsh, et al.
Published: (2023)

Decomposing Private Image Generation via Coarse-to-Fine Wavelet Modeling
by: Bayrooti, Jasmine, et al.
Published: (2026)

Wide-Baseline Relative Camera Pose Estimation with Directional Learning
by: Chen, Kefan, et al.
Published: (2021)

3DCodeBench: Benchmarking Agentic Procedural 3D Modeling Via Code
by: Gao, Yipeng, et al.
Published: (2026)

No Training Wheels: Steering Vectors for Bias Correction at Inference Time
by: Gupta, Aviral, et al.
Published: (2025)

Network Inversion of Convolutional Neural Nets
by: Suhail, Pirzada, et al.
Published: (2024)

PoissonNet: A Local-Global Approach for Learning on Surfaces
by: Maesumi, Arman, et al.
Published: (2025)

Network Inversion for Generating Confidently Classified Counterfeits
by: Suhail, Pirzada, et al.
Published: (2025)

Activation Matching for Explanation Generation
by: Suhail, Pirzada, et al.
Published: (2025)

Shortcut Learning Susceptibility in Vision Classifiers
by: Suhail, Pirzada, et al.
Published: (2025)

BATR-FST: Bi-Level Adaptive Token Refinement for Few-Shot Transformers
by: Al-Habib, Mohammed, et al.
Published: (2025)

Privacy Preserving Properties of Vision Classifiers
by: Suhail, Pirzada, et al.
Published: (2025)

Network Inversion for Uncertainty-Aware Out-of-Distribution Detection
by: Suhail, Pirzada, et al.
Published: (2025)

Semantic Prompting with Image-Token for Continual Learning
by: Han, Jisu, et al.
Published: (2024)

VibeToken: Scaling 1D Image Tokenizers and Autoregressive Models for Dynamic Resolution Generations
by: Patel, Maitreya, et al.
Published: (2026)

Controllable Image Generation with Composed Parallel Token Prediction
by: Stirling, Jamie, et al.
Published: (2024)

Network Inversion and Its Applications
by: Suhail, Pirzada, et al.
Published: (2024)

LGQ: Learning Discretization Geometry for Scalable and Stable Image Tokenization
by: Altun, Idil Bilge, et al.
Published: (2026)

MaskBit: Embedding-free Image Generation via Bit Tokens
by: Weber, Mark, et al.
Published: (2024)

End-to-End Autoregressive Image Generation with 1D Semantic Tokenizer
by: Chu, Wenda, et al.
Published: (2026)

One-D-Piece: Image Tokenizer Meets Quality-Controllable Compression
by: Miwa, Keita, et al.
Published: (2025)

Spectral and Spatial Graph Learning for Multispectral Solar Image Compression
by: Siwakoti, Prasiddha, et al.
Published: (2025)

Clustering-Guided Spatial-Spectral Mamba for Hyperspectral Image Classification
by: Dewis, Zack, et al.
Published: (2026)

FlexTok: Resampling Images into 1D Token Sequences of Flexible Length
by: Bachmann, Roman, et al.
Published: (2025)

GeoToken: Hierarchical Geolocalization of Images via Next Token Prediction
by: Ghasemi, Narges, et al.
Published: (2025)

Language-Guided Image Tokenization for Generation
by: Zha, Kaiwen, et al.
Published: (2024)

Diffusion Autoencoders are Scalable Image Tokenizers
by: Chen, Yinbo, et al.
Published: (2025)

TIE: A Training-Inversion-Exclusion Framework for Visually Interpretable and Uncertainty-Guided Out-of-Distribution Detection
by: Suhail, Pirzada, et al.
Published: (2025)

Direct Motion Models for Assessing Generated Videos
by: Allen, Kelsey, et al.
Published: (2025)

MetaToken: Detecting Hallucination in Image Descriptions by Meta Classification
by: Fieback, Laura, et al.
Published: (2024)

Boosting Text-to-Image Diffusion Models via Core Token Attention-Based Seed Selection
by: Zhang, Yunzhe, et al.
Published: (2026)

HyperTokens: Controlling Token Dynamics for Continual Video-Language Understanding
by: Nguyen, Toan, et al.
Published: (2026)

High-Resolution Image Reconstruction with Unsupervised Learning and Noisy Data Applied to Ion-Beam Dynamics for Particle Accelerators
by: Osswald, Francis, et al.
Published: (2026)

Communication-Inspired Tokenization for Structured Image Representations
by: Davtyan, Aram, et al.
Published: (2026)

Prioritizing Image-Related Tokens Enhances Vision-Language Pre-Training
by: Chen, Yangyi, et al.
Published: (2025)

Hallucinatory Image Tokens: A Training-free EAZY Approach on Detecting and Mitigating Object Hallucinations in LVLMs
by: Che, Liwei, et al.
Published: (2025)

Hyperspectral Image Spectral-Spatial Feature Extraction via Tensor Principal Component Analysis
by: Ren, Yuemei, et al.
Published: (2024)