Guardado en:
| Autores principales: | Zhang, Borui, Zheng, Wenzhao, Zhou, Jie, Lu, Jiwen |
|---|---|
| Formato: | Preprint |
| Publicado: |
2024
|
| Materias: | |
| Acceso en línea: | https://arxiv.org/abs/2401.10442 |
| Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
Ejemplares similares
Preventing Local Pitfalls in Vector Quantization via Optimal Transport
por: Zhang, Borui, et al.
Publicado: (2024)
por: Zhang, Borui, et al.
Publicado: (2024)
SFTok: Bridging the Performance Gap in Discrete Tokenizers
por: Rao, Qihang, et al.
Publicado: (2025)
por: Rao, Qihang, et al.
Publicado: (2025)
Quantize-then-Rectify: Efficient VQ-VAE Training
por: Zhang, Borui, et al.
Publicado: (2025)
por: Zhang, Borui, et al.
Publicado: (2025)
Fast Shapley Value Estimation: A Unified Approach
por: Zhang, Borui, et al.
Publicado: (2023)
por: Zhang, Borui, et al.
Publicado: (2023)
Point3R: Streaming 3D Reconstruction with Explicit Spatial Pointer Memory
por: Wu, Yuqi, et al.
Publicado: (2025)
por: Wu, Yuqi, et al.
Publicado: (2025)
Learning Counterfactually Decoupled Attention for Open-World Model Attribution
por: Zheng, Yu, et al.
Publicado: (2025)
por: Zheng, Yu, et al.
Publicado: (2025)
GaussianWorld: Gaussian World Model for Streaming 3D Occupancy Prediction
por: Zuo, Sicheng, et al.
Publicado: (2024)
por: Zuo, Sicheng, et al.
Publicado: (2024)
DriveTok: 3D Driving Scene Tokenization for Unified Multi-View Reconstruction and Understanding
por: Zhuo, Dong, et al.
Publicado: (2026)
por: Zhuo, Dong, et al.
Publicado: (2026)
Doe-1: Closed-Loop Autonomous Driving with Large World Model
por: Zheng, Wenzhao, et al.
Publicado: (2024)
por: Zheng, Wenzhao, et al.
Publicado: (2024)
EmbodiedOcc: Embodied 3D Occupancy Prediction for Vision-based Online Scene Understanding
por: Wu, Yuqi, et al.
Publicado: (2024)
por: Wu, Yuqi, et al.
Publicado: (2024)
SpectralAR: Spectral Autoregressive Visual Generation
por: Huang, Yuanhui, et al.
Publicado: (2025)
por: Huang, Yuanhui, et al.
Publicado: (2025)
Streaming 4D Visual Geometry Transformer
por: Zhuo, Dong, et al.
Publicado: (2025)
por: Zhuo, Dong, et al.
Publicado: (2025)
Hardness-Aware Scene Synthesis for Semi-Supervised 3D Object Detection
por: Zeng, Shuai, et al.
Publicado: (2024)
por: Zeng, Shuai, et al.
Publicado: (2024)
A Faster Path to Continual Learning
por: Li, Wei, et al.
Publicado: (2026)
por: Li, Wei, et al.
Publicado: (2026)
Terra: Explorable Native 3D World Model with Point Latents
por: Huang, Yuanhui, et al.
Publicado: (2025)
por: Huang, Yuanhui, et al.
Publicado: (2025)
Owl-1: Omni World Model for Consistent Long Video Generation
por: Huang, Yuanhui, et al.
Publicado: (2024)
por: Huang, Yuanhui, et al.
Publicado: (2024)
GPD-1: Generative Pre-training for Driving
por: Xie, Zixun, et al.
Publicado: (2024)
por: Xie, Zixun, et al.
Publicado: (2024)
GaussianFormer-2: Probabilistic Gaussian Superposition for Efficient 3D Occupancy Prediction
por: Huang, Yuanhui, et al.
Publicado: (2024)
por: Huang, Yuanhui, et al.
Publicado: (2024)
Astra: General Interactive World Model with Autoregressive Denoising
por: Zhu, Yixuan, et al.
Publicado: (2025)
por: Zhu, Yixuan, et al.
Publicado: (2025)
GlobalMamba: Global Image Serialization for Vision Mamba
por: Wang, Chengkun, et al.
Publicado: (2024)
por: Wang, Chengkun, et al.
Publicado: (2024)
WaterVIB: Learning Minimal Sufficient Watermark Representations via Variational Information Bottleneck
por: He, Haoyuan, et al.
Publicado: (2026)
por: He, Haoyuan, et al.
Publicado: (2026)
PixelGaussian: Generalizable 3D Gaussian Reconstruction from Arbitrary Views
por: Fei, Xin, et al.
Publicado: (2024)
por: Fei, Xin, et al.
Publicado: (2024)
Driv3R: Learning Dense 4D Reconstruction for Autonomous Driving
por: Fei, Xin, et al.
Publicado: (2024)
por: Fei, Xin, et al.
Publicado: (2024)
Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model
por: Wang, Lening, et al.
Publicado: (2024)
por: Wang, Lening, et al.
Publicado: (2024)
V2M: Visual 2-Dimensional Mamba for Image Representation Learning
por: Wang, Chengkun, et al.
Publicado: (2024)
por: Wang, Chengkun, et al.
Publicado: (2024)
OGGSplat: Open Gaussian Growing for Generalizable Reconstruction with Expanded Field-of-View
por: Wang, Yanbo, et al.
Publicado: (2025)
por: Wang, Yanbo, et al.
Publicado: (2025)
Joint 3D Geometry Reconstruction and Motion Generation for 4D Synthesis from a Single Image
por: Zhang, Yanran, et al.
Publicado: (2025)
por: Zhang, Yanran, et al.
Publicado: (2025)
GaussianFormer: Scene as Gaussians for Vision-Based 3D Semantic Occupancy Prediction
por: Huang, Yuanhui, et al.
Publicado: (2024)
por: Huang, Yuanhui, et al.
Publicado: (2024)
TSP3D: Text-guided Sparse Voxel Pruning for Efficient 3D Visual Grounding
por: Guo, Wenxuan, et al.
Publicado: (2025)
por: Guo, Wenxuan, et al.
Publicado: (2025)
BAMI: Training-Free Bias Mitigation in GUI Grounding
por: Zhang, Borui, et al.
Publicado: (2026)
por: Zhang, Borui, et al.
Publicado: (2026)
Dual-Path Stable Soft Prompt Generation for Domain Generalization
por: Zhang, Yuedi, et al.
Publicado: (2025)
por: Zhang, Yuedi, et al.
Publicado: (2025)
Measuring 3D Spatial Geometric Consistency in Dynamic Generated Videos
por: Dou, Weijia, et al.
Publicado: (2026)
por: Dou, Weijia, et al.
Publicado: (2026)
Pruning Self-attentions into Convolutional Layers in Single Path
por: He, Haoyu, et al.
Publicado: (2021)
por: He, Haoyu, et al.
Publicado: (2021)
Revisiting Mixout: An Overlooked Path to Robust Finetuning
por: Aminbeidokhti, Masih, et al.
Publicado: (2025)
por: Aminbeidokhti, Masih, et al.
Publicado: (2025)
Cluster Paths: Navigating Interpretability in Neural Networks
por: Kroeger, Nicholas M., et al.
Publicado: (2025)
por: Kroeger, Nicholas M., et al.
Publicado: (2025)
GenWorld: Towards Detecting AI-generated Real-world Simulation Videos
por: Chen, Weiliang, et al.
Publicado: (2025)
por: Chen, Weiliang, et al.
Publicado: (2025)
ClearGCD: Mitigating Shortcut Learning For Robust Generalized Category Discovery
por: Lyu, Kailin, et al.
Publicado: (2025)
por: Lyu, Kailin, et al.
Publicado: (2025)
Improving Integrated Gradient-based Transferable Adversarial Examples by Refining the Integration Path
por: Ren, Yuchen, et al.
Publicado: (2024)
por: Ren, Yuchen, et al.
Publicado: (2024)
Neural Path Guiding with Distribution Factorization
por: Figueiredo, Pedro, et al.
Publicado: (2025)
por: Figueiredo, Pedro, et al.
Publicado: (2025)
ODE$_t$(ODE$_l$): Shortcutting the Time and the Length in Diffusion and Flow Models for Faster Sampling
por: Gudovskiy, Denis, et al.
Publicado: (2025)
por: Gudovskiy, Denis, et al.
Publicado: (2025)
Ejemplares similares
-
Preventing Local Pitfalls in Vector Quantization via Optimal Transport
por: Zhang, Borui, et al.
Publicado: (2024) -
SFTok: Bridging the Performance Gap in Discrete Tokenizers
por: Rao, Qihang, et al.
Publicado: (2025) -
Quantize-then-Rectify: Efficient VQ-VAE Training
por: Zhang, Borui, et al.
Publicado: (2025) -
Fast Shapley Value Estimation: A Unified Approach
por: Zhang, Borui, et al.
Publicado: (2023) -
Point3R: Streaming 3D Reconstruction with Explicit Spatial Pointer Memory
por: Wu, Yuqi, et al.
Publicado: (2025)