| Main Authors: | Zhu, Lingyu, Rahtu, Esa, Zhao, Hang |
|---|---|
| Format: | Preprint |
| Published: | 2022 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2207.01136 |
Similar Items
3D Gaussian Splatting with Fisheye Images: Field of View Analysis and Depth-Based Initialization
by: Gunes, Ulas, et al.
Published: (2025)
GS-Pose: Generalizable Segmentation-based 6D Object Pose Estimation with 3D Gaussian Splatting
by: Cai, Dingding, et al.
Published: (2024)
PanDepth: Joint Panoptic Segmentation and Depth Completion
by: Lagos, Juan, et al.
Published: (2022)
The Weighting Game: Evaluating Quality of Explainability Methods
by: Raatikainen, Lassi, et al.
Published: (2022)
MuSHRoom: Multi-Sensor Hybrid Room Dataset for Joint 3D Reconstruction and Novel View Synthesis
by: Ren, Xuqian, et al.
Published: (2023)
Video Object Segmentation-Aware Audio Generation
by: Viertola, Ilpo, et al.
Published: (2025)
SemSegDepth: A Combined Model for Semantic Segmentation and Depth Completion
by: Lagos, Juan Pablo, et al.
Published: (2022)
HybVIO: Pushing the Limits of Real-time Visual-inertial Odometry
by: Seiskari, Otto, et al.
Published: (2021)
FIORD: A Fisheye Indoor-Outdoor Dataset with LIDAR Ground Truth for 3D Scene Reconstruction and Benchmarking
by: Gunes, Ulas, et al.
Published: (2025)
Temporally Aligned Audio for Video with Autoregression
by: Viertola, Ilpo, et al.
Published: (2024)
UDGS-SLAM : UniDepth Assisted Gaussian Splatting for Monocular SLAM
by: Mansour, Mostafa, et al.
Published: (2024)
DN-Splatter: Depth and Normal Priors for Gaussian Splatting and Meshing
by: Turkulainen, Matias, et al.
Published: (2024)
Synchformer: Efficient Synchronization from Sparse Cues
by: Iashin, Vladimir, et al.
Published: (2024)
AGS-Mesh: Adaptive Gaussian Splatting and Meshing with Geometric Priors for Indoor Room Reconstruction Using Smartphones
by: Ren, Xuqian, et al.
Published: (2024)
Gaussian Splatting on the Move: Blur and Rolling Shutter Compensation for Natural Camera Motion
by: Seiskari, Otto, et al.
Published: (2024)
Boosting Multi-View Indoor 3D Object Detection via Adaptive 3D Volume Construction
by: Zhang, Runmin, et al.
Published: (2025)
From Flatland to Space: Teaching Vision-Language Models to Perceive and Reason in 3D
by: Zhang, Jiahui, et al.
Published: (2025)
V2V3D: View-to-View Denoised 3D Reconstruction for Light-Field Microscopy
by: Zhao, Jiayin, et al.
Published: (2025)
View-on-Graph: Zero-shot 3D Visual Grounding via Vision-Language Reasoning on Scene Graphs
by: Liu, Yuanyuan, et al.
Published: (2025)
3D Reconstruction and New View Synthesis of Indoor Environments based on a Dual Neural Radiance Field
by: Bao, Zhenyu, et al.
Published: (2024)
Neural Radiance and Gaze Fields for Visual Attention Modeling in 3D Environments
by: Chubarau, Andrei, et al.
Published: (2025)
RCNet: Deep Recurrent Collaborative Network for Multi-View Low-Light Image Enhancement
by: Luo, Hao, et al.
Published: (2024)
Vision Remember: Recovering Visual Information in Efficient LVLM with Vision Feature Resampling
by: Feng, Ze, et al.
Published: (2025)
Reinforced Embodied Active Defense: Exploiting Adaptive Interaction for Robust Visual Perception in Adversarial 3D Environments
by: Yang, Xiao, et al.
Published: (2025)
NN-VVC: Versatile Video Coding boosted by self-supervisedly learned image coding for machines
by: Ahonen, Jukka I., et al.
Published: (2024)
Bridging the gap between image coding for machines and humans
by: Le, Nam, et al.
Published: (2024)
A Neural Field-Based Approach for View Computation & Data Exploration in 3D Urban Environments
by: Cobeli, Stefan, et al.
Published: (2025)
GS-Occ3D: Scaling Vision-only Occupancy Reconstruction with Gaussian Splatting
by: Ye, Baijun, et al.
Published: (2025)
MCGS: Multiview Consistency Enhancement for Sparse-View 3D Gaussian Radiance Fields
by: Xiao, Yuru, et al.
Published: (2024)
ViewSRD: 3D Visual Grounding via Structured Multi-View Decomposition
by: Huang, Ronggang, et al.
Published: (2025)
ViPOcc: Leveraging Visual Priors from Vision Foundation Models for Single-View 3D Occupancy Prediction
by: Feng, Yi, et al.
Published: (2024)
BabyVision: Visual Reasoning Beyond Language
by: Chen, Liang, et al.
Published: (2026)
A Modular Framework for Single-View 3D Reconstruction of Indoor Environments
by: Li, Yuxiao
Published: (2025)
Polar Parametrization for Vision-based Surround-View 3D Detection
by: Chen, Shaoyu, et al.
Published: (2022)
Beyond Visual Cues: Synchronously Exploring Target-Centric Semantics for Vision-Language Tracking
by: Ge, Jiawei, et al.
Published: (2023)
Agent Skills Should Go Beyond Text: The Case for Visual Skills
by: Xu, Binxiao, et al.
Published: (2026)
Perceiving Beyond Language Priors: Enhancing Visual Comprehension and Attention in Multimodal Models
by: Ghatkesar, Aarti, et al.
Published: (2025)
Toward Ambulatory Vision: Learning Visually-Grounded Active View Selection
by: Koo, Juil, et al.
Published: (2025)
TextField3D: Towards Enhancing Open-Vocabulary 3D Generation with Noisy Text Fields
by: Huang, Tianyu, et al.
Published: (2023)
FoVA-Depth: Field-of-View Agnostic Depth Estimation for Cross-Dataset Generalization
by: Lichy, Daniel, et al.
Published: (2024)