Saved in:
| Main Authors: | Pahari, Soham, Kumain, Sandeep C. |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.06419 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Air Quality Prediction Using LOESS-ARIMA and Multi-Scale CNN-BiLSTM with Residual-Gated Attention
by: Pahari, Soham, et al.
Published: (2025)
by: Pahari, Soham, et al.
Published: (2025)
Geometry Matters: 3D Foundation Priors for Learning Semantic Correspondence
by: Jesslen, Artur, et al.
Published: (2026)
by: Jesslen, Artur, et al.
Published: (2026)
MonoDGP: Monocular 3D Object Detection with Decoupled-Query and Geometry-Error Priors
by: Pu, Fanqi, et al.
Published: (2024)
by: Pu, Fanqi, et al.
Published: (2024)
InterGSEdit: Interactive 3D Gaussian Splatting Editing with 3D Geometry-Consistent Attention Prior
by: Wen, Minghao, et al.
Published: (2025)
by: Wen, Minghao, et al.
Published: (2025)
Robust 4D Visual Geometry Transformer with Uncertainty-Aware Priors
by: Zang, Ying, et al.
Published: (2026)
by: Zang, Ying, et al.
Published: (2026)
Divide and Conquer: Improving Multi-Camera 3D Perception with 2D Semantic-Depth Priors and Input-Dependent Queries
by: Song, Qi, et al.
Published: (2024)
by: Song, Qi, et al.
Published: (2024)
VIDMP3: Video Editing by Representing Motion with Pose and Position Priors
by: Mishra, Sandeep, et al.
Published: (2025)
by: Mishra, Sandeep, et al.
Published: (2025)
Spurfies: Sparse Surface Reconstruction using Local Geometry Priors
by: Raj, Kevin, et al.
Published: (2024)
by: Raj, Kevin, et al.
Published: (2024)
EAGLE: Episodic Appearance- and Geometry-aware Memory for Unified 2D-3D Visual Query Localization in Egocentric Vision
by: Cao, Yifei, et al.
Published: (2025)
by: Cao, Yifei, et al.
Published: (2025)
Layout2Scene: 3D Semantic Layout Guided Scene Generation via Geometry and Appearance Diffusion Priors
by: Chen, Minglin, et al.
Published: (2025)
by: Chen, Minglin, et al.
Published: (2025)
3D-PointZshotS: Geometry-Aware 3D Point Cloud Zero-Shot Semantic Segmentation Narrowing the Visual-Semantic Gap
by: Yang, Minmin, et al.
Published: (2025)
by: Yang, Minmin, et al.
Published: (2025)
Normal Transformer: Extracting Surface Geometry from LiDAR Points Enhanced by Visual Semantics
by: Lin, Ancheng, et al.
Published: (2022)
by: Lin, Ancheng, et al.
Published: (2022)
Generalizing Visual Geometry Priors to Sparse Gaussian Occupancy Prediction
by: Zhou, Changqing, et al.
Published: (2026)
by: Zhou, Changqing, et al.
Published: (2026)
PGAHum: Prior-Guided Geometry and Appearance Learning for High-Fidelity Animatable Human Reconstruction
by: Wang, Hao, et al.
Published: (2024)
by: Wang, Hao, et al.
Published: (2024)
Action-Geometry Prediction with 3D Geometric Prior for Bimanual Manipulation
by: Xu, Chongyang, et al.
Published: (2026)
by: Xu, Chongyang, et al.
Published: (2026)
QueryOcc: Query-based Self-Supervision for 3D Semantic Occupancy
by: Lilja, Adam, et al.
Published: (2025)
by: Lilja, Adam, et al.
Published: (2025)
Towards Visual Query Localization in the 3D World
by: Peng, Liang, et al.
Published: (2026)
by: Peng, Liang, et al.
Published: (2026)
TrajVG: 3D Trajectory-Coupled Visual Geometry Learning
by: Miao, Xingyu, et al.
Published: (2026)
by: Miao, Xingyu, et al.
Published: (2026)
Unleashing Semantic and Geometric Priors for 3D Scene Completion
by: Chen, Shiyuan, et al.
Published: (2025)
by: Chen, Shiyuan, et al.
Published: (2025)
Human Scanpath Prediction in Target-Present Visual Search with Semantic-Foveal Bayesian Attention
by: Luzio, João, et al.
Published: (2025)
by: Luzio, João, et al.
Published: (2025)
ContextSeg: Sketch Semantic Segmentation by Querying the Context with Attention
by: Wang, Jiawei, et al.
Published: (2023)
by: Wang, Jiawei, et al.
Published: (2023)
DFormerv2: Geometry Self-Attention for RGBD Semantic Segmentation
by: Yin, Bo-Wen, et al.
Published: (2025)
by: Yin, Bo-Wen, et al.
Published: (2025)
GeoTopoDiff: Learning Geometry--Topology Graph Priors through Boundary-Constrained Mixed Diffusion for Sparse-Slice 3D Porous Reconstruction
by: Shi, Yue, et al.
Published: (2026)
by: Shi, Yue, et al.
Published: (2026)
Learning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry Priors
by: Zheng, Duo, et al.
Published: (2025)
by: Zheng, Duo, et al.
Published: (2025)
SeCG: Semantic-Enhanced 3D Visual Grounding via Cross-modal Graph Attention
by: Xiao, Feng, et al.
Published: (2024)
by: Xiao, Feng, et al.
Published: (2024)
Query-aware Hub Prototype Learning for Few-Shot 3D Point Cloud Semantic Segmentation
by: Zhou, YiLin, et al.
Published: (2025)
by: Zhou, YiLin, et al.
Published: (2025)
GeoHand: Unlocking Prior Geometry Knowledge for Monocular 3D Hand Reconstruction
by: Lin, Weiquan, et al.
Published: (2026)
by: Lin, Weiquan, et al.
Published: (2026)
Lifting Vision: Ground to Aerial Localization with Reasoning Guided Planning
by: Pahari, Soham, et al.
Published: (2025)
by: Pahari, Soham, et al.
Published: (2025)
Per-Query Visual Concept Learning
by: Malca, Ori, et al.
Published: (2025)
by: Malca, Ori, et al.
Published: (2025)
C3DAG: Controlled 3D Animal Generation using 3D pose guidance
by: Mishra, Sandeep, et al.
Published: (2024)
by: Mishra, Sandeep, et al.
Published: (2024)
GeoQuery: Geometry-Query Diffusion for Sparse-View Reconstruction
by: Cao, Xiao, et al.
Published: (2026)
by: Cao, Xiao, et al.
Published: (2026)
PoseEmbroider: Towards a 3D, Visual, Semantic-aware Human Pose Representation
by: Delmas, Ginger, et al.
Published: (2024)
by: Delmas, Ginger, et al.
Published: (2024)
Debiasing Diffusion Priors via 3D Attention for Consistent Gaussian Splatting
by: Jin, Shilong, et al.
Published: (2025)
by: Jin, Shilong, et al.
Published: (2025)
Beyond Semantic Priors: Mitigating Optimization Collapse for Generalizable Visual Forensics
by: Liu, Jipeng, et al.
Published: (2026)
by: Liu, Jipeng, et al.
Published: (2026)
SPARS3R: Semantic Prior Alignment and Regularization for Sparse 3D Reconstruction
by: Tang, Yutao, et al.
Published: (2024)
by: Tang, Yutao, et al.
Published: (2024)
Placing Human Animations into 3D Scenes by Learning Interaction- and Geometry-Driven Keyframes
by: Mullen Jr, James F., et al.
Published: (2022)
by: Mullen Jr, James F., et al.
Published: (2022)
IGGT: Instance-Grounded Geometry Transformer for Semantic 3D Reconstruction
by: Li, Hao, et al.
Published: (2025)
by: Li, Hao, et al.
Published: (2025)
$π^3$: Permutation-Equivariant Visual Geometry Learning
by: Wang, Yifan, et al.
Published: (2025)
by: Wang, Yifan, et al.
Published: (2025)
VIMCAN: Visual-Inertial 3D Human Pose Estimation with Hybrid Mamba-Cross-Attention Network
by: Yang, Zepeng, et al.
Published: (2026)
by: Yang, Zepeng, et al.
Published: (2026)
ViHOI: Human-Object Interaction Synthesis with Visual Priors
by: Cai, Songjin, et al.
Published: (2026)
by: Cai, Songjin, et al.
Published: (2026)
Similar Items
-
Air Quality Prediction Using LOESS-ARIMA and Multi-Scale CNN-BiLSTM with Residual-Gated Attention
by: Pahari, Soham, et al.
Published: (2025) -
Geometry Matters: 3D Foundation Priors for Learning Semantic Correspondence
by: Jesslen, Artur, et al.
Published: (2026) -
MonoDGP: Monocular 3D Object Detection with Decoupled-Query and Geometry-Error Priors
by: Pu, Fanqi, et al.
Published: (2024) -
InterGSEdit: Interactive 3D Gaussian Splatting Editing with 3D Geometry-Consistent Attention Prior
by: Wen, Minghao, et al.
Published: (2025) -
Robust 4D Visual Geometry Transformer with Uncertainty-Aware Priors
by: Zang, Ying, et al.
Published: (2026)