:: Library Catalog

Bibliographic Details
Main Authors:	Yan, Kun; Ji, Lei; Wu, Chenfei; Liang, Jian; Zhou, Ming; Duan, Nan; Ma, Shuai
Format:	Preprint
Published:	2022
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2210.04522

Similar Items

Voila-A: Aligning Vision-Language Models with User's Gaze Attention
by: Yan, Kun, et al.
Published: (2023)

EDTalk++: Full Disentanglement for Controllable Talking Head Synthesis
by: Tan, Shuai, et al.
Published: (2025)

SeeSR: Towards Semantics-Aware Real-World Image Super-Resolution
by: Wu, Rongyuan, et al.
Published: (2023)

AutoDirector: Online Auto-scheduling Agents for Multi-sensory Composition
by: Ni, Minheng, et al.
Published: (2024)

Low-Resolution Self-Attention for Semantic Segmentation
by: Wu, Yu-Huan, et al.
Published: (2023)

Style2Talker: High-Resolution Talking Head Generation with Emotion Style and Art Style
by: Tan, Shuai, et al.
Published: (2024)

Applying Unsupervised Semantic Segmentation to High-Resolution UAV Imagery for Enhanced Road Scene Parsing
by: Ma, Zihan, et al.
Published: (2024)

BridgeTower: Building Bridges Between Encoders in Vision-Language Representation Learning
by: Xu, Xiao, et al.
Published: (2022)

StrokeNUWA: Tokenizing Strokes for Vector Graphic Synthesis
by: Tang, Zecheng, et al.
Published: (2024)

Exploring the Low-Pass Filtering Behavior in Image Super-Resolution
by: Deng, Haoyu, et al.
Published: (2024)

Panorama Generation From NFoV Image Done Right
by: Zheng, Dian, et al.
Published: (2025)

Infinity: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis
by: Han, Jian, et al.
Published: (2024)

Medverse: A Universal Model for Full-Resolution 3D Medical Image Segmentation, Transformation and Enhancement
by: Hu, Jiesi, et al.
Published: (2025)

Using Left and Right Brains Together: Towards Vision and Language Planning
by: Cen, Jun, et al.
Published: (2024)

SAM2-UNeXT: An Improved High-Resolution Baseline for Adapting Foundation Models to Downstream Segmentation Tasks
by: Xiong, Xinyu, et al.
Published: (2025)

TexVerse: A Universe of 3D Objects with High-Resolution Textures
by: Zhang, Yibo, et al.
Published: (2025)

Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis
by: Bai, Jinbin, et al.
Published: (2024)

Merging and Splitting Diffusion Paths for Semantically Coherent Panoramas
by: Quattrini, Fabio, et al.
Published: (2024)

Real-time High-Resolution Neural Network with Semantic Guidance for Crack Segmentation
by: Li, Yongshang, et al.
Published: (2023)

Dense360: Dense Understanding from Omnidirectional Panoramas
by: Zhou, Yikang, et al.
Published: (2025)

Towards Robust In-Context Learning for Medical Image Segmentation via Data Synthesis
by: Hu, Jiesi, et al.
Published: (2025)

Semantic Lens: Instance-Centric Semantic Alignment for Video Super-Resolution
by: Tang, Qi, et al.
Published: (2023)

360DVD: Controllable Panorama Video Generation with 360-Degree Video Diffusion Model
by: Wang, Qian, et al.
Published: (2024)

HRSeg: High-Resolution Visual Perception and Enhancement for Reasoning Segmentation
by: Lin, Weihuang, et al.
Published: (2025)

PSGS: Text-driven Panorama Sliding Scene Generation via Gaussian Splatting
by: Zhang, Xin, et al.
Published: (2026)

OmniSAM: Omnidirectional Segment Anything Model for UDA in Panoramic Semantic Segmentation
by: Zhong, Ding, et al.
Published: (2025)

Semantic Alignment for Multimodal Large Language Models
by: Wu, Tao, et al.
Published: (2024)

Control Your View: High-Resolution Global Semantic Manipulation in Learned Image Compression
by: Liang, Jiaming, et al.
Published: (2026)

PanSplat: 4K Panorama Synthesis with Feed-Forward Gaussian Splatting
by: Zhang, Cheng, et al.
Published: (2024)

PixelWizard: Towards Efficient High-Fidelity Video Generation at Ultra-Large Spatial Resolution
by: Li, Wenxue, et al.
Published: (2026)

Efficient Geometry-Controlled High-Resolution Satellite Image Synthesis
by: Vasilescu, Vlad, et al.
Published: (2026)

Visual Autoregressive Modeling for Image Super-Resolution
by: Qu, Yunpeng, et al.
Published: (2025)

Layered 3D Human Generation via Semantic-Aware Diffusion Model
by: Wang, Yi, et al.
Published: (2023)

OpenVTON-Bench: A Large-Scale High-Resolution Benchmark for Controllable Virtual Try-On Evaluation
by: Li, Jin, et al.
Published: (2026)

Multi-Scale Representations by Varying Window Attention for Semantic Segmentation
by: Yan, Haotian, et al.
Published: (2024)

MemorySAM: Memorize Modalities and Semantics with Segment Anything Model 2 for Multi-modal Semantic Segmentation
by: Liao, Chenfei, et al.
Published: (2025)

SALVe: Semantic Alignment Verification for Floorplan Reconstruction from Sparse Panoramas
by: Lambert, John, et al.
Published: (2024)

Sparse Refinement for Efficient High-Resolution Semantic Segmentation
by: Liu, Zhijian, et al.
Published: (2024)

SemanticHuman-HD: High-Resolution Semantic Disentangled 3D Human Generation
by: Zheng, Peng, et al.
Published: (2024)

Perception-Oriented Latent Coding for High-Performance Compressed Domain Semantic Inference
by: Zhang, Xu, et al.
Published: (2025)