Guardado en:
| Autores principales: | Bratulić, Jelena, Mittal, Sudhanshu, Brox, Thomas, Rupprecht, Christian |
|---|---|
| Formato: | Preprint |
| Publicado: |
2025
|
| Materias: | |
| Acceso en línea: | https://arxiv.org/abs/2512.11508 |
| Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
Ejemplares similares
Label-Efficient LiDAR Semantic Segmentation with 2D-3D Vision Transformer Adapters
por: Hindel, Julia, et al.
Publicado: (2025)
por: Hindel, Julia, et al.
Publicado: (2025)
Eureka-Moments in Transformers: Multi-Step Tasks Reveal Softmax Induced Optimization Problems
por: Hoffmann, David T., et al.
Publicado: (2023)
por: Hoffmann, David T., et al.
Publicado: (2023)
Using Knowledge Graphs to harvest datasets for efficient CLIP model training
por: Ging, Simon, et al.
Publicado: (2025)
por: Ging, Simon, et al.
Publicado: (2025)
Orbis: Overcoming Challenges of Long-Horizon Prediction in Driving World Models
por: Mousakhan, Arian, et al.
Publicado: (2025)
por: Mousakhan, Arian, et al.
Publicado: (2025)
Assessing Multimodal Chronic Wound Embeddings with Expert Triplet Agreement
por: Kabus, Fabian, et al.
Publicado: (2026)
por: Kabus, Fabian, et al.
Publicado: (2026)
Unlocking In-Context Learning for Natural Datasets Beyond Language Modelling
por: Bratulić, Jelena, et al.
Publicado: (2025)
por: Bratulić, Jelena, et al.
Publicado: (2025)
Flash3D: Feed-Forward Generalisable 3D Scene Reconstruction from a Single Image
por: Szymanowicz, Stanislaw, et al.
Publicado: (2024)
por: Szymanowicz, Stanislaw, et al.
Publicado: (2024)
Mix3R: Mixing Feed-forward Reconstruction and Generative 3D Priors for Joint Multi-view Aligned 3D Reconstruction and Pose Estimation
por: Lin, Siyou, et al.
Publicado: (2026)
por: Lin, Siyou, et al.
Publicado: (2026)
Reliev3R: Relieving Feed-forward Reconstruction from Multi-View Geometric Annotations
por: Chen, Youyu, et al.
Publicado: (2026)
por: Chen, Youyu, et al.
Publicado: (2026)
FLARE: Feed-forward Geometry, Appearance and Camera Estimation from Uncalibrated Sparse Views
por: Zhang, Shangzhan, et al.
Publicado: (2025)
por: Zhang, Shangzhan, et al.
Publicado: (2025)
Splatter Image: Ultra-Fast Single-View 3D Reconstruction
por: Szymanowicz, Stanislaw, et al.
Publicado: (2023)
por: Szymanowicz, Stanislaw, et al.
Publicado: (2023)
Speed3R: Sparse Feed-forward 3D Reconstruction Models
por: Ren, Weining, et al.
Publicado: (2026)
por: Ren, Weining, et al.
Publicado: (2026)
Taming Feed-forward Reconstruction Models as Latent Encoders for 3D Generative Models
por: Wizadwongsa, Suttisak, et al.
Publicado: (2024)
por: Wizadwongsa, Suttisak, et al.
Publicado: (2024)
AMB3R: Accurate Feed-forward Metric-scale 3D Reconstruction with Backend
por: Wang, Hengyi, et al.
Publicado: (2025)
por: Wang, Hengyi, et al.
Publicado: (2025)
AnchorSplat: Feed-Forward 3D Gaussian Splatting with 3D Geometric Priors
por: Zhang, Xiaoxue, et al.
Publicado: (2026)
por: Zhang, Xiaoxue, et al.
Publicado: (2026)
MoRe: Motion-aware Feed-forward 4D Reconstruction Transformer
por: Fang, Juntong, et al.
Publicado: (2026)
por: Fang, Juntong, et al.
Publicado: (2026)
Advances in Feed-Forward 3D Reconstruction and View Synthesis: A Survey
por: Zhang, Jiahui, et al.
Publicado: (2025)
por: Zhang, Jiahui, et al.
Publicado: (2025)
Fin3R: Fine-tuning Feed-forward 3D Reconstruction Models via Monocular Knowledge Distillation
por: Ren, Weining, et al.
Publicado: (2025)
por: Ren, Weining, et al.
Publicado: (2025)
DragAPart: Learning a Part-Level Motion Prior for Articulated Objects
por: Li, Ruining, et al.
Publicado: (2024)
por: Li, Ruining, et al.
Publicado: (2024)
sshELF: Single-Shot Hierarchical Extrapolation of Latent Features for 3D Reconstruction from Sparse-Views
por: Najafli, Eyvaz, et al.
Publicado: (2025)
por: Najafli, Eyvaz, et al.
Publicado: (2025)
Review of Feed-forward 3D Reconstruction: From DUSt3R to VGGT
por: Zhang, Wei, et al.
Publicado: (2025)
por: Zhang, Wei, et al.
Publicado: (2025)
TokenSplat: Token-aligned 3D Gaussian Splatting for Feed-forward Pose-free Reconstruction
por: Li, Yihui, et al.
Publicado: (2026)
por: Li, Yihui, et al.
Publicado: (2026)
Make-It-Poseable: Feed-forward Latent Posing Model for 3D Characters
por: Guo, Zhiyang, et al.
Publicado: (2025)
por: Guo, Zhiyang, et al.
Publicado: (2025)
Particulate: Feed-Forward 3D Object Articulation
por: Li, Ruining, et al.
Publicado: (2025)
por: Li, Ruining, et al.
Publicado: (2025)
DualPM: Dual Posed-Canonical Point Maps for 3D Shape and Pose Reconstruction
por: Kaye, Ben, et al.
Publicado: (2024)
por: Kaye, Ben, et al.
Publicado: (2024)
ForeHOI: Feed-forward 3D Object Reconstruction from Daily Hand-Object Interaction Videos
por: Chen, Yuantao, et al.
Publicado: (2026)
por: Chen, Yuantao, et al.
Publicado: (2026)
HumanRAM: Feed-forward Human Reconstruction and Animation Model using Transformers
por: Yu, Zhiyuan, et al.
Publicado: (2025)
por: Yu, Zhiyuan, et al.
Publicado: (2025)
Farm3D: Learning Articulated 3D Animals by Distilling 2D Diffusion
por: Jakab, Tomas, et al.
Publicado: (2023)
por: Jakab, Tomas, et al.
Publicado: (2023)
Tracking-Guided 4D Generation: Foundation-Tracker Motion Priors for 3D Model Animation
por: Sun, Su, et al.
Publicado: (2025)
por: Sun, Su, et al.
Publicado: (2025)
Learning Dynamic Scene Reconstruction with Sinusoidal Geometric Priors
por: Guo, Tian, et al.
Publicado: (2025)
por: Guo, Tian, et al.
Publicado: (2025)
Probing into Camera Control of Video Models
por: Hou, Chen, et al.
Publicado: (2026)
por: Hou, Chen, et al.
Publicado: (2026)
Anomaly Detection with Conditioned Denoising Diffusion Models
por: Mousakhan, Arian, et al.
Publicado: (2023)
por: Mousakhan, Arian, et al.
Publicado: (2023)
DrivingForward: Feed-forward 3D Gaussian Splatting for Driving Scene Reconstruction from Flexible Surround-view Input
por: Tian, Qijian, et al.
Publicado: (2024)
por: Tian, Qijian, et al.
Publicado: (2024)
AnySplat: Feed-forward 3D Gaussian Splatting from Unconstrained Views
por: Jiang, Lihan, et al.
Publicado: (2025)
por: Jiang, Lihan, et al.
Publicado: (2025)
Scene-Conditional 3D Object Stylization and Composition
por: Zhou, Jinghao, et al.
Publicado: (2023)
por: Zhou, Jinghao, et al.
Publicado: (2023)
FAST3DIS: Feed-forward Anchored Scene Transformer for 3D Instance Segmentation
por: Li, Changyang, et al.
Publicado: (2026)
por: Li, Changyang, et al.
Publicado: (2026)
Feed-Forward SceneDINO for Unsupervised Semantic Scene Completion
por: Jevtić, Aleksandar, et al.
Publicado: (2025)
por: Jevtić, Aleksandar, et al.
Publicado: (2025)
Learning 3D Reconstruction with Priors in Test Time
por: Zhou, Lei, et al.
Publicado: (2026)
por: Zhou, Lei, et al.
Publicado: (2026)
Fus3D: Decoding Consolidated 3D Geometry from Feed-forward Geometry Transformer Latents
por: Fink, Laura, et al.
Publicado: (2026)
por: Fink, Laura, et al.
Publicado: (2026)
Invisible Stitch: Generating Smooth 3D Scenes with Depth Inpainting
por: Engstler, Paul, et al.
Publicado: (2024)
por: Engstler, Paul, et al.
Publicado: (2024)
Ejemplares similares
-
Label-Efficient LiDAR Semantic Segmentation with 2D-3D Vision Transformer Adapters
por: Hindel, Julia, et al.
Publicado: (2025) -
Eureka-Moments in Transformers: Multi-Step Tasks Reveal Softmax Induced Optimization Problems
por: Hoffmann, David T., et al.
Publicado: (2023) -
Using Knowledge Graphs to harvest datasets for efficient CLIP model training
por: Ging, Simon, et al.
Publicado: (2025) -
Orbis: Overcoming Challenges of Long-Horizon Prediction in Driving World Models
por: Mousakhan, Arian, et al.
Publicado: (2025) -
Assessing Multimodal Chronic Wound Embeddings with Expert Triplet Agreement
por: Kabus, Fabian, et al.
Publicado: (2026)