:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Hu, Xin, Qin, Ke, Duan, Guiduo, Li, Ming, Li, Yuan-Fang, He, Tao
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2507.05798
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Towards Lifelong Scene Graph Generation with Knowledge-ware In-context Prompt Learning
by: He, Tao, et al.
Published: (2024)

Can We Build Scene Graphs, Not Classify Them? FlowSG: Progressive Image-Conditioned Scene Graph Generation with Flow Matching
by: Hu, Xin, et al.
Published: (2026)

Relation-aware Hierarchical Prompt for Open-vocabulary Scene Graph Generation
by: Liu, Tao, et al.
Published: (2024)

TiCAL:Typicality-Based Consistency-Aware Learning for Multimodal Emotion Recognition
by: Yin, Wen, et al.
Published: (2025)

PosSAM: Panoptic Open-vocabulary Segment Anything
by: VS, Vibashan, et al.
Published: (2024)

PanopticRecon: Leverage Open-vocabulary Instance Segmentation for Zero-shot Panoptic Reconstruction
by: Yu, Xuan, et al.
Published: (2024)

Knowledge-Aligned Counterfactual-Enhancement Diffusion Perception for Unsupervised Cross-Domain Visual Emotion Recognition
by: Yin, Wen, et al.
Published: (2025)

Panoptic Scene Graph Generation with Semantics-Prototype Learning
by: Li, Li, et al.
Published: (2023)

DreamMask: Boosting Open-vocabulary Panoptic Segmentation with Synthetic Data
by: Tu, Yuanpeng, et al.
Published: (2025)

LangPrecip: Language-Aware Multimodal Precipitation Nowcasting
by: Ling, Xudong, et al.
Published: (2025)

LLaVA-SpaceSGG: Visual Instruct Tuning for Open-vocabulary Scene Graph Generation with Enhanced Spatial Relations
by: Xu, Mingjie, et al.
Published: (2024)

4D Panoptic Scene Graph Generation
by: Yang, Jingkang, et al.
Published: (2024)

Unbiased Dynamic Multimodal Fusion
by: Wei, Shicai, et al.
Published: (2026)

From Data to Modeling: Fully Open-vocabulary Scene Graph Generation
by: Chen, Zuyao, et al.
Published: (2025)

Frequency-guided Multi-level Reasoning for Scene Graph Generation in Video
by: Li, Chenxing, et al.
Published: (2026)

PanoGS: Gaussian-based Panoptic Segmentation for 3D Open Vocabulary Scene Understanding
by: Zhai, Hongjia, et al.
Published: (2025)

Reliable Disentanglement Multi-view Learning Against View Adversarial Attacks
by: Wang, Xuyang, et al.
Published: (2025)

OpenPSG: Open-set Panoptic Scene Graph Generation via Large Multimodal Models
by: Zhou, Zijian, et al.
Published: (2024)

Pair then Relation: Pair-Net for Panoptic Scene Graph Generation
by: Wang, Jinghao, et al.
Published: (2023)

Expanding Scene Graph Boundaries: Fully Open-vocabulary Scene Graph Generation via Visual-Concept Alignment and Retention
by: Chen, Zuyao, et al.
Published: (2023)

PanopticQuery: Unified Query-Time Reasoning for 4D Scenes
by: Tang, Ruilin, et al.
Published: (2026)

OpenFusion++: An Open-vocabulary Real-time Scene Understanding System
by: Jin, Xiaofeng, et al.
Published: (2025)

PanopticPartFormer++: A Unified and Decoupled View for Panoptic Part Segmentation
by: Li, Xiangtai, et al.
Published: (2023)

SPADE: Spectroscopic Photoacoustic Denoising using an Analytical and Data-free Enhancement Framework
by: Lin, Fangzhou, et al.
Published: (2024)

Learning 4D Panoptic Scene Graph Generation from Rich 2D Visual Scene
by: Wu, Shengqiong, et al.
Published: (2025)

DSFlash: Comprehensive Panoptic Scene Graph Generation in Realtime
by: Lorenz, Julian, et al.
Published: (2026)

OpenTie: Open-vocabulary Sequential Rebar Tying System
by: Liu, Mingze, et al.
Published: (2025)

OMCL: Open-vocabulary Monte Carlo Localization
by: Kruzhkov, Evgenii, et al.
Published: (2025)

CAGS: Open-Vocabulary 3D Scene Understanding with Context-Aware Gaussian Splatting
by: Sun, Wei, et al.
Published: (2025)

SPADE: Towards Scalable Path Planning Architecture on Actionable Multi-Domain 3D Scene Graphs
by: Viswanathan, Vignesh Kottayam, et al.
Published: (2025)

Long-range Brain Graph Transformer
by: Yu, Shuo, et al.
Published: (2025)

CLASP: Closed-loop Asynchronous Spatial Perception for Open-vocabulary Desktop Object Grasping
by: Ling, Yiran, et al.
Published: (2026)

VLPrompt: Vision-Language Prompting for Panoptic Scene Graph Generation
by: Zhou, Zijian, et al.
Published: (2023)

PanoSSC: Exploring Monocular Panoptic 3D Scene Reconstruction for Autonomous Driving
by: Shi, Yining, et al.
Published: (2024)

TD^2-Net: Toward Denoising and Debiasing for Dynamic Scene Graph Generation
by: Lin, Xin, et al.
Published: (2024)

Outlier detection in mixed-attribute data: a semi-supervised approach with fuzzy approximations and relative entropy
by: Chen, Baiyang, et al.
Published: (2025)

OpenVIS: Open-vocabulary Video Instance Segmentation
by: Guo, Pinxue, et al.
Published: (2023)

TextPSG: Panoptic Scene Graph Generation from Textual Descriptions
by: Zhao, Chengyang, et al.
Published: (2023)

A Fair Ranking and New Model for Panoptic Scene Graph Generation
by: Lorenz, Julian, et al.
Published: (2024)

Motion-aware Contrastive Learning for Temporal Panoptic Scene Graph Generation
by: Nguyen, Thong Thanh, et al.
Published: (2024)