:: Library Catalog

Imagen de Portada

Guardado en:

Detalles Bibliográficos
Autores principales:	Zhang, Borui, Zheng, Wenzhao, Zhou, Jie, Lu, Jiwen
Formato:	Preprint
Publicado:	2024
Materias:	Computer Vision and Pattern Recognition Machine Learning
Acceso en línea:	https://arxiv.org/abs/2401.10442
Etiquetas:	Agregar Etiqueta Sin Etiquetas, Sea el primero en etiquetar este registro!

Ejemplares similares

Preventing Local Pitfalls in Vector Quantization via Optimal Transport
por: Zhang, Borui, et al.
Publicado: (2024)

SFTok: Bridging the Performance Gap in Discrete Tokenizers
por: Rao, Qihang, et al.
Publicado: (2025)

Quantize-then-Rectify: Efficient VQ-VAE Training
por: Zhang, Borui, et al.
Publicado: (2025)

Fast Shapley Value Estimation: A Unified Approach
por: Zhang, Borui, et al.
Publicado: (2023)

Point3R: Streaming 3D Reconstruction with Explicit Spatial Pointer Memory
por: Wu, Yuqi, et al.
Publicado: (2025)

Learning Counterfactually Decoupled Attention for Open-World Model Attribution
por: Zheng, Yu, et al.
Publicado: (2025)

GaussianWorld: Gaussian World Model for Streaming 3D Occupancy Prediction
por: Zuo, Sicheng, et al.
Publicado: (2024)

DriveTok: 3D Driving Scene Tokenization for Unified Multi-View Reconstruction and Understanding
por: Zhuo, Dong, et al.
Publicado: (2026)

Doe-1: Closed-Loop Autonomous Driving with Large World Model
por: Zheng, Wenzhao, et al.
Publicado: (2024)

EmbodiedOcc: Embodied 3D Occupancy Prediction for Vision-based Online Scene Understanding
por: Wu, Yuqi, et al.
Publicado: (2024)

SpectralAR: Spectral Autoregressive Visual Generation
por: Huang, Yuanhui, et al.
Publicado: (2025)

Streaming 4D Visual Geometry Transformer
por: Zhuo, Dong, et al.
Publicado: (2025)

Hardness-Aware Scene Synthesis for Semi-Supervised 3D Object Detection
por: Zeng, Shuai, et al.
Publicado: (2024)

A Faster Path to Continual Learning
por: Li, Wei, et al.
Publicado: (2026)

Terra: Explorable Native 3D World Model with Point Latents
por: Huang, Yuanhui, et al.
Publicado: (2025)

Owl-1: Omni World Model for Consistent Long Video Generation
por: Huang, Yuanhui, et al.
Publicado: (2024)

GPD-1: Generative Pre-training for Driving
por: Xie, Zixun, et al.
Publicado: (2024)

GaussianFormer-2: Probabilistic Gaussian Superposition for Efficient 3D Occupancy Prediction
por: Huang, Yuanhui, et al.
Publicado: (2024)

Astra: General Interactive World Model with Autoregressive Denoising
por: Zhu, Yixuan, et al.
Publicado: (2025)

GlobalMamba: Global Image Serialization for Vision Mamba
por: Wang, Chengkun, et al.
Publicado: (2024)

WaterVIB: Learning Minimal Sufficient Watermark Representations via Variational Information Bottleneck
por: He, Haoyuan, et al.
Publicado: (2026)

PixelGaussian: Generalizable 3D Gaussian Reconstruction from Arbitrary Views
por: Fei, Xin, et al.
Publicado: (2024)

Driv3R: Learning Dense 4D Reconstruction for Autonomous Driving
por: Fei, Xin, et al.
Publicado: (2024)

Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model
por: Wang, Lening, et al.
Publicado: (2024)

V2M: Visual 2-Dimensional Mamba for Image Representation Learning
por: Wang, Chengkun, et al.
Publicado: (2024)

OGGSplat: Open Gaussian Growing for Generalizable Reconstruction with Expanded Field-of-View
por: Wang, Yanbo, et al.
Publicado: (2025)

Joint 3D Geometry Reconstruction and Motion Generation for 4D Synthesis from a Single Image
por: Zhang, Yanran, et al.
Publicado: (2025)

GaussianFormer: Scene as Gaussians for Vision-Based 3D Semantic Occupancy Prediction
por: Huang, Yuanhui, et al.
Publicado: (2024)

TSP3D: Text-guided Sparse Voxel Pruning for Efficient 3D Visual Grounding
por: Guo, Wenxuan, et al.
Publicado: (2025)

BAMI: Training-Free Bias Mitigation in GUI Grounding
por: Zhang, Borui, et al.
Publicado: (2026)

Dual-Path Stable Soft Prompt Generation for Domain Generalization
por: Zhang, Yuedi, et al.
Publicado: (2025)

Measuring 3D Spatial Geometric Consistency in Dynamic Generated Videos
por: Dou, Weijia, et al.
Publicado: (2026)

Pruning Self-attentions into Convolutional Layers in Single Path
por: He, Haoyu, et al.
Publicado: (2021)

Revisiting Mixout: An Overlooked Path to Robust Finetuning
por: Aminbeidokhti, Masih, et al.
Publicado: (2025)

Cluster Paths: Navigating Interpretability in Neural Networks
por: Kroeger, Nicholas M., et al.
Publicado: (2025)

GenWorld: Towards Detecting AI-generated Real-world Simulation Videos
por: Chen, Weiliang, et al.
Publicado: (2025)

ClearGCD: Mitigating Shortcut Learning For Robust Generalized Category Discovery
por: Lyu, Kailin, et al.
Publicado: (2025)

Improving Integrated Gradient-based Transferable Adversarial Examples by Refining the Integration Path
por: Ren, Yuchen, et al.
Publicado: (2024)

Neural Path Guiding with Distribution Factorization
por: Figueiredo, Pedro, et al.
Publicado: (2025)

ODE$_t$(ODE$_l$): Shortcutting the Time and the Length in Diffusion and Flow Models for Faster Sampling
por: Gudovskiy, Denis, et al.
Publicado: (2025)