:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Yin, Zeyuan, Liu, Xiaoming
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2511.16642
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

ST$^3$: Accelerating Multimodal Large Language Model by Spatial-Temporal Visual Token Trimming
by: Zhuang, Jiedong, et al.
Published: (2024)

Splat4D: Diffusion-Enhanced 4D Gaussian Splatting for Temporally and Spatially Consistent Content Creation
by: Yin, Minghao, et al.
Published: (2025)

Trim 3D Gaussian Splatting for Accurate Geometry Representation
by: Fan, Lue, et al.
Published: (2024)

Cross-Temporal 3D Gaussian Splatting for Sparse-View Guided Scene Update
by: An, Zeyuan, et al.
Published: (2025)

Trimming the Fat: Efficient Compression of 3D Gaussian Splats through Pruning
by: Ali, Muhammad Salman, et al.
Published: (2024)

Gaussian Swaying: Surface-Based Framework for Aerodynamic Simulation with 3D Gaussians
by: Yan, Hongru, et al.
Published: (2025)

STAG4D: Spatial-Temporal Anchored Generative 4D Gaussians
by: Zeng, Yifei, et al.
Published: (2024)

GaussianTrimmer: Online Trimming Boundaries for 3DGS Segmentation
by: Liao, Liwei, et al.
Published: (2026)

TRiGS: Temporal Rigid-Body Motion for Scalable 4D Gaussian Splatting
by: Yeom, Suwoong, et al.
Published: (2026)

Binary-Gaussian: Compact and Progressive Representation for 3D Gaussian Segmentation
by: Yang, An, et al.
Published: (2025)

TRIM: A Self-Supervised Video Summarization Framework Maximizing Temporal Relative Information and Representativeness
by: Mishra, Pritam, et al.
Published: (2025)

StarPose: 3D Human Pose Estimation via Spatial-Temporal Autoregressive Diffusion
by: Yang, Haoxin, et al.
Published: (2025)

Disentangled Diffusion-Based 3D Human Pose Estimation with Hierarchical Spatial and Temporal Denoiser
by: Cai, Qingyuan, et al.
Published: (2024)

4DSTR: Advancing Generative 4D Gaussians with Spatial-Temporal Rectification for High-Quality and Consistent 4D Generation
by: Liu, Mengmeng, et al.
Published: (2025)

Agent-based Video Trimming
by: Yang, Lingfeng, et al.
Published: (2024)

MLLM-4D: Towards Visual-based Spatial-Temporal Intelligence
by: Yin, Xingyilang, et al.
Published: (2026)

GSMem: 3D Gaussian Splatting as Persistent Spatial Memory for Zero-Shot Embodied Exploration and Reasoning
by: Lu, Yiren, et al.
Published: (2026)

TIGaussian: Disentangle Gaussians for Spatial-Awared Text-Image-3D Alignment
by: Liu, Jiarun, et al.
Published: (2026)

ST-GS: Vision-Based 3D Semantic Occupancy Prediction with Spatial-Temporal Gaussian Splatting
by: Yan, Xiaoyang, et al.
Published: (2025)

Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D Generation and Reconstruction
by: Cai, Yuanhao, et al.
Published: (2024)

TGSFormer: Scalable Temporal Gaussian Splatting for Embodied Semantic Scene Completion
by: Qian, Rui, et al.
Published: (2025)

Beyond Gaussians: Fast and High-Fidelity 3D Splatting with Linear Kernels
by: Chen, Haodong, et al.
Published: (2024)

Atlas Gaussians Diffusion for 3D Generation
by: Yang, Haitao, et al.
Published: (2024)

TokenTrim: Inference-Time Token Pruning for Autoregressive Long Video Generation
by: Shaulov, Ariel, et al.
Published: (2026)

Fed3DGS: Scalable 3D Gaussian Splatting with Federated Learning
by: Suzuki, Teppei
Published: (2024)

GaussianDiffusion: 3D Gaussian Splatting for Denoising Diffusion Probabilistic Models with Structured Noise
by: Li, Xinhai, et al.
Published: (2023)

GaussianSR: 3D Gaussian Super-Resolution with 2D Diffusion Priors
by: Yu, Xiqian, et al.
Published: (2024)

ST-GDance++: A Scalable Spatial-Temporal Diffusion for Long-Duration Group Choreography
by: Xu, Jing, et al.
Published: (2026)

Bayesian Diffusion Models for 3D Shape Reconstruction
by: Xu, Haiyang, et al.
Published: (2024)

GSFixer: Improving 3D Gaussian Splatting with Reference-Guided Video Diffusion Priors
by: Yin, Xingyilang, et al.
Published: (2025)

Spatial-Temporal Perception with Causal Inference for Naturalistic Driving Action Recognition
by: Chang, Qing, et al.
Published: (2025)

StreamSTGS: Streaming Spatial and Temporal Gaussian Grids for Real-Time Free-Viewpoint Video
by: Ke, Zhihui, et al.
Published: (2025)

GR-Diffusion: 3D Gaussian Representation Meets Diffusion in Whole-Body PET Reconstruction
by: Geng, Mengxiao, et al.
Published: (2026)

HFGS: 4D Gaussian Splatting with Emphasis on Spatial and Temporal High-Frequency Components for Endoscopic Scene Reconstruction
by: Zhao, Haoyu, et al.
Published: (2024)

FB-4D: Spatial-Temporal Coherent Dynamic 3D Content Generation with Feature Banks
by: Li, Jinwei, et al.
Published: (2025)

Diffusion4D: Fast Spatial-temporal Consistent 4D Generation via Video Diffusion Models
by: Liang, Hanwen, et al.
Published: (2024)

LLaVA-ST: A Multimodal Large Language Model for Fine-Grained Spatial-Temporal Understanding
by: Li, Hongyu, et al.
Published: (2025)

HiddenObjects: Scalable Diffusion-Distilled Spatial Priors for Object Placement
by: Schouten, Marco, et al.
Published: (2026)

ResDiT: Evoking the Intrinsic Resolution Scalability in Diffusion Transformers
by: Ma, Yiyang, et al.
Published: (2025)

GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding
by: Jiang, Haoyi, et al.
Published: (2024)