Saved in:
Bibliographic Details
Main Authors: You, Haochen, Zhang, Heng, He, Hongyang, Li, Yuqi, Liu, Baojing
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2602.01140
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866911413964898304
author You, Haochen
Zhang, Heng
He, Hongyang
Li, Yuqi
Liu, Baojing
author_facet You, Haochen
Zhang, Heng
He, Hongyang
Li, Yuqi
Liu, Baojing
contents Vector quantization (VQ) underpins modern generative and representation models by turning continuous latents into discrete tokens. Yet hard nearest-neighbor assignments are non-differentiable and are typically optimized with heuristic straight-through estimators, which couple the update step size to the quantization gap and train each code in isolation, leading to unstable gradients and severe codebook under-utilization at scale. In this paper, we introduce GRIT-VQ (Generalized Radius and Integrated Transform-Vector Quantization), a unified surrogate framework that keeps hard assignments in the forward pass while making VQ fully differentiable. GRIT-VQ replaces the straight-through estimator with a radius-based update that moves latents along the quantization direction with a controllable, geometry-aware step, and applies a data-agnostic integrated transform to the codebook so that all codes are updated through shared parameters instead of independently. Our theoretical analysis clarifies the fundamental optimization dynamics introduced by GRIT-VQ, establishing conditions for stable gradient flow, coordinated codebook evolution, and reliable avoidance of collapse across a broad family of quantizers. Across image reconstruction, image generation, and recommendation tokenization benchmarks, GRIT-VQ consistently improves reconstruction error, generative quality, and recommendation accuracy while substantially increasing codebook utilization compared to existing VQ variants.
format Preprint
id arxiv_https___arxiv_org_abs_2602_01140
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Generalized Radius and Integrated Codebook Transforms for Differentiable Vector Quantization
You, Haochen
Zhang, Heng
He, Hongyang
Li, Yuqi
Liu, Baojing
Machine Learning
Vector quantization (VQ) underpins modern generative and representation models by turning continuous latents into discrete tokens. Yet hard nearest-neighbor assignments are non-differentiable and are typically optimized with heuristic straight-through estimators, which couple the update step size to the quantization gap and train each code in isolation, leading to unstable gradients and severe codebook under-utilization at scale. In this paper, we introduce GRIT-VQ (Generalized Radius and Integrated Transform-Vector Quantization), a unified surrogate framework that keeps hard assignments in the forward pass while making VQ fully differentiable. GRIT-VQ replaces the straight-through estimator with a radius-based update that moves latents along the quantization direction with a controllable, geometry-aware step, and applies a data-agnostic integrated transform to the codebook so that all codes are updated through shared parameters instead of independently. Our theoretical analysis clarifies the fundamental optimization dynamics introduced by GRIT-VQ, establishing conditions for stable gradient flow, coordinated codebook evolution, and reliable avoidance of collapse across a broad family of quantizers. Across image reconstruction, image generation, and recommendation tokenization benchmarks, GRIT-VQ consistently improves reconstruction error, generative quality, and recommendation accuracy while substantially increasing codebook utilization compared to existing VQ variants.
title Generalized Radius and Integrated Codebook Transforms for Differentiable Vector Quantization
topic Machine Learning
url https://arxiv.org/abs/2602.01140