Saved in:
| Main Authors: | Yin, Zeyuan, Liu, Xiaoming |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2511.16642 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
ST$^3$: Accelerating Multimodal Large Language Model by Spatial-Temporal Visual Token Trimming
by: Zhuang, Jiedong, et al.
Published: (2024)
by: Zhuang, Jiedong, et al.
Published: (2024)
Splat4D: Diffusion-Enhanced 4D Gaussian Splatting for Temporally and Spatially Consistent Content Creation
by: Yin, Minghao, et al.
Published: (2025)
by: Yin, Minghao, et al.
Published: (2025)
Trim 3D Gaussian Splatting for Accurate Geometry Representation
by: Fan, Lue, et al.
Published: (2024)
by: Fan, Lue, et al.
Published: (2024)
Cross-Temporal 3D Gaussian Splatting for Sparse-View Guided Scene Update
by: An, Zeyuan, et al.
Published: (2025)
by: An, Zeyuan, et al.
Published: (2025)
Trimming the Fat: Efficient Compression of 3D Gaussian Splats through Pruning
by: Ali, Muhammad Salman, et al.
Published: (2024)
by: Ali, Muhammad Salman, et al.
Published: (2024)
Gaussian Swaying: Surface-Based Framework for Aerodynamic Simulation with 3D Gaussians
by: Yan, Hongru, et al.
Published: (2025)
by: Yan, Hongru, et al.
Published: (2025)
STAG4D: Spatial-Temporal Anchored Generative 4D Gaussians
by: Zeng, Yifei, et al.
Published: (2024)
by: Zeng, Yifei, et al.
Published: (2024)
GaussianTrimmer: Online Trimming Boundaries for 3DGS Segmentation
by: Liao, Liwei, et al.
Published: (2026)
by: Liao, Liwei, et al.
Published: (2026)
TRiGS: Temporal Rigid-Body Motion for Scalable 4D Gaussian Splatting
by: Yeom, Suwoong, et al.
Published: (2026)
by: Yeom, Suwoong, et al.
Published: (2026)
Binary-Gaussian: Compact and Progressive Representation for 3D Gaussian Segmentation
by: Yang, An, et al.
Published: (2025)
by: Yang, An, et al.
Published: (2025)
TRIM: A Self-Supervised Video Summarization Framework Maximizing Temporal Relative Information and Representativeness
by: Mishra, Pritam, et al.
Published: (2025)
by: Mishra, Pritam, et al.
Published: (2025)
StarPose: 3D Human Pose Estimation via Spatial-Temporal Autoregressive Diffusion
by: Yang, Haoxin, et al.
Published: (2025)
by: Yang, Haoxin, et al.
Published: (2025)
Disentangled Diffusion-Based 3D Human Pose Estimation with Hierarchical Spatial and Temporal Denoiser
by: Cai, Qingyuan, et al.
Published: (2024)
by: Cai, Qingyuan, et al.
Published: (2024)
4DSTR: Advancing Generative 4D Gaussians with Spatial-Temporal Rectification for High-Quality and Consistent 4D Generation
by: Liu, Mengmeng, et al.
Published: (2025)
by: Liu, Mengmeng, et al.
Published: (2025)
Agent-based Video Trimming
by: Yang, Lingfeng, et al.
Published: (2024)
by: Yang, Lingfeng, et al.
Published: (2024)
MLLM-4D: Towards Visual-based Spatial-Temporal Intelligence
by: Yin, Xingyilang, et al.
Published: (2026)
by: Yin, Xingyilang, et al.
Published: (2026)
GSMem: 3D Gaussian Splatting as Persistent Spatial Memory for Zero-Shot Embodied Exploration and Reasoning
by: Lu, Yiren, et al.
Published: (2026)
by: Lu, Yiren, et al.
Published: (2026)
TIGaussian: Disentangle Gaussians for Spatial-Awared Text-Image-3D Alignment
by: Liu, Jiarun, et al.
Published: (2026)
by: Liu, Jiarun, et al.
Published: (2026)
ST-GS: Vision-Based 3D Semantic Occupancy Prediction with Spatial-Temporal Gaussian Splatting
by: Yan, Xiaoyang, et al.
Published: (2025)
by: Yan, Xiaoyang, et al.
Published: (2025)
Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D Generation and Reconstruction
by: Cai, Yuanhao, et al.
Published: (2024)
by: Cai, Yuanhao, et al.
Published: (2024)
TGSFormer: Scalable Temporal Gaussian Splatting for Embodied Semantic Scene Completion
by: Qian, Rui, et al.
Published: (2025)
by: Qian, Rui, et al.
Published: (2025)
Beyond Gaussians: Fast and High-Fidelity 3D Splatting with Linear Kernels
by: Chen, Haodong, et al.
Published: (2024)
by: Chen, Haodong, et al.
Published: (2024)
Atlas Gaussians Diffusion for 3D Generation
by: Yang, Haitao, et al.
Published: (2024)
by: Yang, Haitao, et al.
Published: (2024)
TokenTrim: Inference-Time Token Pruning for Autoregressive Long Video Generation
by: Shaulov, Ariel, et al.
Published: (2026)
by: Shaulov, Ariel, et al.
Published: (2026)
Fed3DGS: Scalable 3D Gaussian Splatting with Federated Learning
by: Suzuki, Teppei
Published: (2024)
by: Suzuki, Teppei
Published: (2024)
GaussianDiffusion: 3D Gaussian Splatting for Denoising Diffusion Probabilistic Models with Structured Noise
by: Li, Xinhai, et al.
Published: (2023)
by: Li, Xinhai, et al.
Published: (2023)
GaussianSR: 3D Gaussian Super-Resolution with 2D Diffusion Priors
by: Yu, Xiqian, et al.
Published: (2024)
by: Yu, Xiqian, et al.
Published: (2024)
ST-GDance++: A Scalable Spatial-Temporal Diffusion for Long-Duration Group Choreography
by: Xu, Jing, et al.
Published: (2026)
by: Xu, Jing, et al.
Published: (2026)
Bayesian Diffusion Models for 3D Shape Reconstruction
by: Xu, Haiyang, et al.
Published: (2024)
by: Xu, Haiyang, et al.
Published: (2024)
GSFixer: Improving 3D Gaussian Splatting with Reference-Guided Video Diffusion Priors
by: Yin, Xingyilang, et al.
Published: (2025)
by: Yin, Xingyilang, et al.
Published: (2025)
Spatial-Temporal Perception with Causal Inference for Naturalistic Driving Action Recognition
by: Chang, Qing, et al.
Published: (2025)
by: Chang, Qing, et al.
Published: (2025)
StreamSTGS: Streaming Spatial and Temporal Gaussian Grids for Real-Time Free-Viewpoint Video
by: Ke, Zhihui, et al.
Published: (2025)
by: Ke, Zhihui, et al.
Published: (2025)
GR-Diffusion: 3D Gaussian Representation Meets Diffusion in Whole-Body PET Reconstruction
by: Geng, Mengxiao, et al.
Published: (2026)
by: Geng, Mengxiao, et al.
Published: (2026)
HFGS: 4D Gaussian Splatting with Emphasis on Spatial and Temporal High-Frequency Components for Endoscopic Scene Reconstruction
by: Zhao, Haoyu, et al.
Published: (2024)
by: Zhao, Haoyu, et al.
Published: (2024)
FB-4D: Spatial-Temporal Coherent Dynamic 3D Content Generation with Feature Banks
by: Li, Jinwei, et al.
Published: (2025)
by: Li, Jinwei, et al.
Published: (2025)
Diffusion4D: Fast Spatial-temporal Consistent 4D Generation via Video Diffusion Models
by: Liang, Hanwen, et al.
Published: (2024)
by: Liang, Hanwen, et al.
Published: (2024)
LLaVA-ST: A Multimodal Large Language Model for Fine-Grained Spatial-Temporal Understanding
by: Li, Hongyu, et al.
Published: (2025)
by: Li, Hongyu, et al.
Published: (2025)
HiddenObjects: Scalable Diffusion-Distilled Spatial Priors for Object Placement
by: Schouten, Marco, et al.
Published: (2026)
by: Schouten, Marco, et al.
Published: (2026)
ResDiT: Evoking the Intrinsic Resolution Scalability in Diffusion Transformers
by: Ma, Yiyang, et al.
Published: (2025)
by: Ma, Yiyang, et al.
Published: (2025)
GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding
by: Jiang, Haoyi, et al.
Published: (2024)
by: Jiang, Haoyi, et al.
Published: (2024)
Similar Items
-
ST$^3$: Accelerating Multimodal Large Language Model by Spatial-Temporal Visual Token Trimming
by: Zhuang, Jiedong, et al.
Published: (2024) -
Splat4D: Diffusion-Enhanced 4D Gaussian Splatting for Temporally and Spatially Consistent Content Creation
by: Yin, Minghao, et al.
Published: (2025) -
Trim 3D Gaussian Splatting for Accurate Geometry Representation
by: Fan, Lue, et al.
Published: (2024) -
Cross-Temporal 3D Gaussian Splatting for Sparse-View Guided Scene Update
by: An, Zeyuan, et al.
Published: (2025) -
Trimming the Fat: Efficient Compression of 3D Gaussian Splats through Pruning
by: Ali, Muhammad Salman, et al.
Published: (2024)