Saved in:
Bibliographic Details
Main Authors: Pan, Sicheng, Tang, Chen, Xie, Shuzhao, Yang, Ke, Zhang, Weixiang, Li, Jiawei, Chen, Bin, Xia, Shu-Tao, Wang, Zhi
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2602.01741
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866910008535416832
author Pan, Sicheng
Tang, Chen
Xie, Shuzhao
Yang, Ke
Zhang, Weixiang
Li, Jiawei
Chen, Bin
Xia, Shu-Tao
Wang, Zhi
author_facet Pan, Sicheng
Tang, Chen
Xie, Shuzhao
Yang, Ke
Zhang, Weixiang
Li, Jiawei
Chen, Bin
Xia, Shu-Tao
Wang, Zhi
contents The burgeoning complexity and scale of 3D geometry models pose significant challenges for deployment on resource-constrained platforms. While Post-Training Quantization (PTQ) enables efficient inference without retraining, conventional methods, primarily optimized for 2D Vision Transformers, fail to transfer effectively to 3D models due to intricate feature distributions and prohibitive calibration overhead. To address these challenges, we propose TAPTQ, a Tail-Aware Post-Training Quantization pipeline specifically engineered for 3D geometric learning. Our contribution is threefold: (1) To overcome the data-scale bottleneck in 3D datasets, we develop a progressive coarse-to-fine calibration construction strategy that constructs a highly compact subset to achieve both statistical purity and geometric representativeness. (2) We reformulate the quantization interval search as an optimization problem and introduce a ternary-search-based solver, reducing the computational complexity from $\mathcal{O}(N)$ to $\mathcal{O}(\log N)$ for accelerated deployment. (3) To mitigate quantization error accumulation, we propose TRE-Guided Module-wise Compensation, which utilizes a Tail Relative Error (TRE) metric to adaptively identify and rectify distortions in modules sensitive to long-tailed activation outliers. Extensive experiments on the VGGT and Pi3 benchmarks demonstrate that TAPTQ consistently outperforms state-of-the-art PTQ methods in accuracy while significantly reducing calibration time. The code will be released soon.
format Preprint
id arxiv_https___arxiv_org_abs_2602_01741
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Tail-Aware Post-Training Quantization for 3D Geometry Models
Pan, Sicheng
Tang, Chen
Xie, Shuzhao
Yang, Ke
Zhang, Weixiang
Li, Jiawei
Chen, Bin
Xia, Shu-Tao
Wang, Zhi
Computer Vision and Pattern Recognition
The burgeoning complexity and scale of 3D geometry models pose significant challenges for deployment on resource-constrained platforms. While Post-Training Quantization (PTQ) enables efficient inference without retraining, conventional methods, primarily optimized for 2D Vision Transformers, fail to transfer effectively to 3D models due to intricate feature distributions and prohibitive calibration overhead. To address these challenges, we propose TAPTQ, a Tail-Aware Post-Training Quantization pipeline specifically engineered for 3D geometric learning. Our contribution is threefold: (1) To overcome the data-scale bottleneck in 3D datasets, we develop a progressive coarse-to-fine calibration construction strategy that constructs a highly compact subset to achieve both statistical purity and geometric representativeness. (2) We reformulate the quantization interval search as an optimization problem and introduce a ternary-search-based solver, reducing the computational complexity from $\mathcal{O}(N)$ to $\mathcal{O}(\log N)$ for accelerated deployment. (3) To mitigate quantization error accumulation, we propose TRE-Guided Module-wise Compensation, which utilizes a Tail Relative Error (TRE) metric to adaptively identify and rectify distortions in modules sensitive to long-tailed activation outliers. Extensive experiments on the VGGT and Pi3 benchmarks demonstrate that TAPTQ consistently outperforms state-of-the-art PTQ methods in accuracy while significantly reducing calibration time. The code will be released soon.
title Tail-Aware Post-Training Quantization for 3D Geometry Models
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2602.01741