:: Library Catalog

Bibliographic Details
Main Authors:	Zhu, Lingyu; Rahtu, Esa; Zhao, Hang
Format:	Preprint
Published:	2022
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2207.01136

Similar Items

3D Gaussian Splatting with Fisheye Images: Field of View Analysis and Depth-Based Initialization
by: Gunes, Ulas, et al.
Published: (2025)

GS-Pose: Generalizable Segmentation-based 6D Object Pose Estimation with 3D Gaussian Splatting
by: Cai, Dingding, et al.
Published: (2024)

PanDepth: Joint Panoptic Segmentation and Depth Completion
by: Lagos, Juan, et al.
Published: (2022)

The Weighting Game: Evaluating Quality of Explainability Methods
by: Raatikainen, Lassi, et al.
Published: (2022)

MuSHRoom: Multi-Sensor Hybrid Room Dataset for Joint 3D Reconstruction and Novel View Synthesis
by: Ren, Xuqian, et al.
Published: (2023)

Video Object Segmentation-Aware Audio Generation
by: Viertola, Ilpo, et al.
Published: (2025)

SemSegDepth: A Combined Model for Semantic Segmentation and Depth Completion
by: Lagos, Juan Pablo, et al.
Published: (2022)

HybVIO: Pushing the Limits of Real-time Visual-inertial Odometry
by: Seiskari, Otto, et al.
Published: (2021)

FIORD: A Fisheye Indoor-Outdoor Dataset with LIDAR Ground Truth for 3D Scene Reconstruction and Benchmarking
by: Gunes, Ulas, et al.
Published: (2025)

Temporally Aligned Audio for Video with Autoregression
by: Viertola, Ilpo, et al.
Published: (2024)

UDGS-SLAM : UniDepth Assisted Gaussian Splatting for Monocular SLAM
by: Mansour, Mostafa, et al.
Published: (2024)

DN-Splatter: Depth and Normal Priors for Gaussian Splatting and Meshing
by: Turkulainen, Matias, et al.
Published: (2024)

Synchformer: Efficient Synchronization from Sparse Cues
by: Iashin, Vladimir, et al.
Published: (2024)

AGS-Mesh: Adaptive Gaussian Splatting and Meshing with Geometric Priors for Indoor Room Reconstruction Using Smartphones
by: Ren, Xuqian, et al.
Published: (2024)

Gaussian Splatting on the Move: Blur and Rolling Shutter Compensation for Natural Camera Motion
by: Seiskari, Otto, et al.
Published: (2024)

Boosting Multi-View Indoor 3D Object Detection via Adaptive 3D Volume Construction
by: Zhang, Runmin, et al.
Published: (2025)

From Flatland to Space: Teaching Vision-Language Models to Perceive and Reason in 3D
by: Zhang, Jiahui, et al.
Published: (2025)

V2V3D: View-to-View Denoised 3D Reconstruction for Light-Field Microscopy
by: Zhao, Jiayin, et al.
Published: (2025)

View-on-Graph: Zero-shot 3D Visual Grounding via Vision-Language Reasoning on Scene Graphs
by: Liu, Yuanyuan, et al.
Published: (2025)

3D Reconstruction and New View Synthesis of Indoor Environments based on a Dual Neural Radiance Field
by: Bao, Zhenyu, et al.
Published: (2024)

Neural Radiance and Gaze Fields for Visual Attention Modeling in 3D Environments
by: Chubarau, Andrei, et al.
Published: (2025)

RCNet: Deep Recurrent Collaborative Network for Multi-View Low-Light Image Enhancement
by: Luo, Hao, et al.
Published: (2024)

Vision Remember: Recovering Visual Information in Efficient LVLM with Vision Feature Resampling
by: Feng, Ze, et al.
Published: (2025)

Reinforced Embodied Active Defense: Exploiting Adaptive Interaction for Robust Visual Perception in Adversarial 3D Environments
by: Yang, Xiao, et al.
Published: (2025)

NN-VVC: Versatile Video Coding boosted by self-supervisedly learned image coding for machines
by: Ahonen, Jukka I., et al.
Published: (2024)

Bridging the gap between image coding for machines and humans
by: Le, Nam, et al.
Published: (2024)

A Neural Field-Based Approach for View Computation & Data Exploration in 3D Urban Environments
by: Cobeli, Stefan, et al.
Published: (2025)

GS-Occ3D: Scaling Vision-only Occupancy Reconstruction with Gaussian Splatting
by: Ye, Baijun, et al.
Published: (2025)

MCGS: Multiview Consistency Enhancement for Sparse-View 3D Gaussian Radiance Fields
by: Xiao, Yuru, et al.
Published: (2024)

ViewSRD: 3D Visual Grounding via Structured Multi-View Decomposition
by: Huang, Ronggang, et al.
Published: (2025)

ViPOcc: Leveraging Visual Priors from Vision Foundation Models for Single-View 3D Occupancy Prediction
by: Feng, Yi, et al.
Published: (2024)

BabyVision: Visual Reasoning Beyond Language
by: Chen, Liang, et al.
Published: (2026)

A Modular Framework for Single-View 3D Reconstruction of Indoor Environments
by: Li, Yuxiao
Published: (2025)

Polar Parametrization for Vision-based Surround-View 3D Detection
by: Chen, Shaoyu, et al.
Published: (2022)

Beyond Visual Cues: Synchronously Exploring Target-Centric Semantics for Vision-Language Tracking
by: Ge, Jiawei, et al.
Published: (2023)

Agent Skills Should Go Beyond Text: The Case for Visual Skills
by: Xu, Binxiao, et al.
Published: (2026)

Perceiving Beyond Language Priors: Enhancing Visual Comprehension and Attention in Multimodal Models
by: Ghatkesar, Aarti, et al.
Published: (2025)

Toward Ambulatory Vision: Learning Visually-Grounded Active View Selection
by: Koo, Juil, et al.
Published: (2025)

TextField3D: Towards Enhancing Open-Vocabulary 3D Generation with Noisy Text Fields
by: Huang, Tianyu, et al.
Published: (2023)

FoVA-Depth: Field-of-View Agnostic Depth Estimation for Cross-Dataset Generalization
by: Lichy, Daniel, et al.
Published: (2024)