Saved in:
| Main Author: | Lee, Jae Joong |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2604.05366 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Language-Guided Invariance Probing of Vision-Language Models
by: Lee, Jae Joong
Published: (2025)
by: Lee, Jae Joong
Published: (2025)
Quant Experts: Token-aware Adaptive Error Reconstruction with Mixture of Experts for Large Vision-Language Models Quantization
by: Jia, Chenwei, et al.
Published: (2026)
by: Jia, Chenwei, et al.
Published: (2026)
SegQuant: A Semantics-Aware and Generalizable Quantization Framework for Diffusion Models
by: Zhang, Jiaji, et al.
Published: (2025)
by: Zhang, Jiaji, et al.
Published: (2025)
DMQ: Dissecting Outliers of Diffusion Models for Post-Training Quantization
by: Lee, Dongyeun, et al.
Published: (2025)
by: Lee, Dongyeun, et al.
Published: (2025)
DilateQuant: Accurate and Efficient Diffusion Quantization via Weight Dilation
by: Liu, Xuewen, et al.
Published: (2024)
by: Liu, Xuewen, et al.
Published: (2024)
ReSpinQuant: Efficient Layer-Wise LLM Quantization via Subspace Residual Rotation Approximation
by: Kim, Suyoung, et al.
Published: (2026)
by: Kim, Suyoung, et al.
Published: (2026)
ProtoQuant: Quantization of Prototypical Parts For General and Fine-Grained Image Classification
by: Janusz, Mikołaj, et al.
Published: (2026)
by: Janusz, Mikołaj, et al.
Published: (2026)
DuQuant++: Fine-grained Rotation Enhances Microscaling FP4 Quantization
by: Lin, Haokun, et al.
Published: (2026)
by: Lin, Haokun, et al.
Published: (2026)
CacheQuant: Comprehensively Accelerated Diffusion Models
by: Liu, Xuewen, et al.
Published: (2025)
by: Liu, Xuewen, et al.
Published: (2025)
PlaneCycle: Training-Free 2D-to-3D Lifting of Foundation Models Without Adapters
by: Yu, Yinghong, et al.
Published: (2026)
by: Yu, Yinghong, et al.
Published: (2026)
Q-HyViT: Post-Training Quantization of Hybrid Vision Transformers with Bridge Block Reconstruction for IoT Systems
by: Lee, Jemin, et al.
Published: (2023)
by: Lee, Jemin, et al.
Published: (2023)
HouseLayout3D: A Benchmark and Training-Free Baseline for 3D Layout Estimation in the Wild
by: Bieri, Valentin, et al.
Published: (2025)
by: Bieri, Valentin, et al.
Published: (2025)
VGGT-CD: Training-Free Robust Registration for 3D Change Detection
by: Zhang, Wei, et al.
Published: (2026)
by: Zhang, Wei, et al.
Published: (2026)
Post-Training Quantization for 3D Medical Image Segmentation: A Practical Study on Real Inference Engines
by: Qu, Chongyu, et al.
Published: (2025)
by: Qu, Chongyu, et al.
Published: (2025)
WebAccessVL: Violation-Aware VLM for Web Accessibility
by: Zheng, Amber Yijia, et al.
Published: (2025)
by: Zheng, Amber Yijia, et al.
Published: (2025)
Trio-ViT: Post-Training Quantization and Acceleration for Softmax-Free Efficient Vision Transformer
by: Shi, Huihong, et al.
Published: (2024)
by: Shi, Huihong, et al.
Published: (2024)
Progressive Fine-to-Coarse Reconstruction for Accurate Low-Bit Post-Training Quantization in Vision Transformers
by: Ding, Rui, et al.
Published: (2024)
by: Ding, Rui, et al.
Published: (2024)
Speed3R: Sparse Feed-forward 3D Reconstruction Models
by: Ren, Weining, et al.
Published: (2026)
by: Ren, Weining, et al.
Published: (2026)
GTR: Improving Large 3D Reconstruction Models through Geometry and Texture Refinement
by: Zhuang, Peiye, et al.
Published: (2024)
by: Zhuang, Peiye, et al.
Published: (2024)
TeHOR: Text-Guided 3D Human and Object Reconstruction with Textures
by: Nam, Hyeongjin, et al.
Published: (2026)
by: Nam, Hyeongjin, et al.
Published: (2026)
Masking Matters: Unlocking the Spatial Reasoning Capabilities of LLMs for 3D Scene-Language Understanding
by: Jeon, Yerim, et al.
Published: (2025)
by: Jeon, Yerim, et al.
Published: (2025)
OuroMamba: A Data-Free Quantization Framework for Vision Mamba
by: Ramachandran, Akshat, et al.
Published: (2025)
by: Ramachandran, Akshat, et al.
Published: (2025)
PhysQuantAgent: An Inference Pipeline of Mass Estimation for Vision-Language Models
by: Yokomizo, Hisayuki, et al.
Published: (2026)
by: Yokomizo, Hisayuki, et al.
Published: (2026)
Cross-scale Aligned Supervision for Training GANs
by: Hyun, Sangeek, et al.
Published: (2026)
by: Hyun, Sangeek, et al.
Published: (2026)
RGB2Point: 3D Point Cloud Generation from Single RGB Images
by: Lee, Jae Joong, et al.
Published: (2024)
by: Lee, Jae Joong, et al.
Published: (2024)
D3: Training-Free AI-Generated Video Detection Using Second-Order Features
by: Zheng, Chende, et al.
Published: (2025)
by: Zheng, Chende, et al.
Published: (2025)
FreeOrbit4D: Training-Free Arbitrary Camera Redirection for Monocular Videos via Foreground-Complete 4D Reconstruction
by: Cao, Wei, et al.
Published: (2026)
by: Cao, Wei, et al.
Published: (2026)
Fine-Grained Post-Training Quantization for Large Vision Language Models with Quantization-Aware Integrated Gradients
by: Xiang, Ziwei, et al.
Published: (2026)
by: Xiang, Ziwei, et al.
Published: (2026)
MedPruner: Training-Free Hierarchical Token Pruning for Efficient 3D Medical Image Understanding in Vision-Language Models
by: Liu, Shengyuan, et al.
Published: (2026)
by: Liu, Shengyuan, et al.
Published: (2026)
Occlusion-Aware Temporally Consistent Amodal Completion for 3D Human-Object Interaction Reconstruction
by: Doh, Hyungjun, et al.
Published: (2025)
by: Doh, Hyungjun, et al.
Published: (2025)
S3D: Sketch-Driven 3D Model Generation
by: Song, Hail, et al.
Published: (2025)
by: Song, Hail, et al.
Published: (2025)
Post-Training Quantization for Video Matting
by: Zhu, Tianrui, et al.
Published: (2025)
by: Zhu, Tianrui, et al.
Published: (2025)
Image-Conditioned 3D Gaussian Splat Quantization
by: Liu, Xinshuang, et al.
Published: (2025)
by: Liu, Xinshuang, et al.
Published: (2025)
MIRe: Enhancing Multimodal Queries Representation via Fusion-Free Modality Interaction for Multimodal Retrieval
by: Ju, Yeong-Joon, et al.
Published: (2024)
by: Ju, Yeong-Joon, et al.
Published: (2024)
IPTQ-ViT: Post-Training Quantization of Non-linear Functions for Integer-only Vision Transformers
by: Kim, Gihwan, et al.
Published: (2025)
by: Kim, Gihwan, et al.
Published: (2025)
Training-Free Reward-Guided Image Editing via Trajectory Optimal Control
by: Chang, Jinho, et al.
Published: (2025)
by: Chang, Jinho, et al.
Published: (2025)
ClipTBP: Clip-Pair based Temporal Boundary Prediction with Boundary-Aware Learning for Moment Retrieval
by: Kim, Ji-Hyeon, et al.
Published: (2026)
by: Kim, Ji-Hyeon, et al.
Published: (2026)
FIQ: Fundamental Question Generation with the Integration of Question Embeddings for Video Question Answering
by: Oh, Ju-Young, et al.
Published: (2025)
by: Oh, Ju-Young, et al.
Published: (2025)
FreeAct: Freeing Activations for LLM Quantization
by: Liu, Xiaohao, et al.
Published: (2026)
by: Liu, Xiaohao, et al.
Published: (2026)
MARR: Module-Adaptive Residual Reconstruction for Low-Bit Post-Training Quantization
by: Su, Le, et al.
Published: (2026)
by: Su, Le, et al.
Published: (2026)
Similar Items
-
Language-Guided Invariance Probing of Vision-Language Models
by: Lee, Jae Joong
Published: (2025) -
Quant Experts: Token-aware Adaptive Error Reconstruction with Mixture of Experts for Large Vision-Language Models Quantization
by: Jia, Chenwei, et al.
Published: (2026) -
SegQuant: A Semantics-Aware and Generalizable Quantization Framework for Diffusion Models
by: Zhang, Jiaji, et al.
Published: (2025) -
DMQ: Dissecting Outliers of Diffusion Models for Post-Training Quantization
by: Lee, Dongyeun, et al.
Published: (2025) -
DilateQuant: Accurate and Efficient Diffusion Quantization via Weight Dilation
by: Liu, Xuewen, et al.
Published: (2024)