Saved in:
| Main Authors: | Yan, Kun, Ji, Lei, Wu, Chenfei, Liang, Jian, Zhou, Ming, Duan, Nan, Ma, Shuai |
|---|---|
| Format: | Preprint |
| Published: |
2022
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2210.04522 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Voila-A: Aligning Vision-Language Models with User's Gaze Attention
by: Yan, Kun, et al.
Published: (2023)
by: Yan, Kun, et al.
Published: (2023)
EDTalk++: Full Disentanglement for Controllable Talking Head Synthesis
by: Tan, Shuai, et al.
Published: (2025)
by: Tan, Shuai, et al.
Published: (2025)
SeeSR: Towards Semantics-Aware Real-World Image Super-Resolution
by: Wu, Rongyuan, et al.
Published: (2023)
by: Wu, Rongyuan, et al.
Published: (2023)
AutoDirector: Online Auto-scheduling Agents for Multi-sensory Composition
by: Ni, Minheng, et al.
Published: (2024)
by: Ni, Minheng, et al.
Published: (2024)
Low-Resolution Self-Attention for Semantic Segmentation
by: Wu, Yu-Huan, et al.
Published: (2023)
by: Wu, Yu-Huan, et al.
Published: (2023)
Style2Talker: High-Resolution Talking Head Generation with Emotion Style and Art Style
by: Tan, Shuai, et al.
Published: (2024)
by: Tan, Shuai, et al.
Published: (2024)
Applying Unsupervised Semantic Segmentation to High-Resolution UAV Imagery for Enhanced Road Scene Parsing
by: Ma, Zihan, et al.
Published: (2024)
by: Ma, Zihan, et al.
Published: (2024)
BridgeTower: Building Bridges Between Encoders in Vision-Language Representation Learning
by: Xu, Xiao, et al.
Published: (2022)
by: Xu, Xiao, et al.
Published: (2022)
StrokeNUWA: Tokenizing Strokes for Vector Graphic Synthesis
by: Tang, Zecheng, et al.
Published: (2024)
by: Tang, Zecheng, et al.
Published: (2024)
Exploring the Low-Pass Filtering Behavior in Image Super-Resolution
by: Deng, Haoyu, et al.
Published: (2024)
by: Deng, Haoyu, et al.
Published: (2024)
Panorama Generation From NFoV Image Done Right
by: Zheng, Dian, et al.
Published: (2025)
by: Zheng, Dian, et al.
Published: (2025)
Infinity: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis
by: Han, Jian, et al.
Published: (2024)
by: Han, Jian, et al.
Published: (2024)
Medverse: A Universal Model for Full-Resolution 3D Medical Image Segmentation, Transformation and Enhancement
by: Hu, Jiesi, et al.
Published: (2025)
by: Hu, Jiesi, et al.
Published: (2025)
Using Left and Right Brains Together: Towards Vision and Language Planning
by: Cen, Jun, et al.
Published: (2024)
by: Cen, Jun, et al.
Published: (2024)
SAM2-UNeXT: An Improved High-Resolution Baseline for Adapting Foundation Models to Downstream Segmentation Tasks
by: Xiong, Xinyu, et al.
Published: (2025)
by: Xiong, Xinyu, et al.
Published: (2025)
TexVerse: A Universe of 3D Objects with High-Resolution Textures
by: Zhang, Yibo, et al.
Published: (2025)
by: Zhang, Yibo, et al.
Published: (2025)
Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis
by: Bai, Jinbin, et al.
Published: (2024)
by: Bai, Jinbin, et al.
Published: (2024)
Merging and Splitting Diffusion Paths for Semantically Coherent Panoramas
by: Quattrini, Fabio, et al.
Published: (2024)
by: Quattrini, Fabio, et al.
Published: (2024)
Real-time High-Resolution Neural Network with Semantic Guidance for Crack Segmentation
by: Li, Yongshang, et al.
Published: (2023)
by: Li, Yongshang, et al.
Published: (2023)
Dense360: Dense Understanding from Omnidirectional Panoramas
by: Zhou, Yikang, et al.
Published: (2025)
by: Zhou, Yikang, et al.
Published: (2025)
Towards Robust In-Context Learning for Medical Image Segmentation via Data Synthesis
by: Hu, Jiesi, et al.
Published: (2025)
by: Hu, Jiesi, et al.
Published: (2025)
Semantic Lens: Instance-Centric Semantic Alignment for Video Super-Resolution
by: Tang, Qi, et al.
Published: (2023)
by: Tang, Qi, et al.
Published: (2023)
360DVD: Controllable Panorama Video Generation with 360-Degree Video Diffusion Model
by: Wang, Qian, et al.
Published: (2024)
by: Wang, Qian, et al.
Published: (2024)
HRSeg: High-Resolution Visual Perception and Enhancement for Reasoning Segmentation
by: Lin, Weihuang, et al.
Published: (2025)
by: Lin, Weihuang, et al.
Published: (2025)
PSGS: Text-driven Panorama Sliding Scene Generation via Gaussian Splatting
by: Zhang, Xin, et al.
Published: (2026)
by: Zhang, Xin, et al.
Published: (2026)
OmniSAM: Omnidirectional Segment Anything Model for UDA in Panoramic Semantic Segmentation
by: Zhong, Ding, et al.
Published: (2025)
by: Zhong, Ding, et al.
Published: (2025)
Semantic Alignment for Multimodal Large Language Models
by: Wu, Tao, et al.
Published: (2024)
by: Wu, Tao, et al.
Published: (2024)
Control Your View: High-Resolution Global Semantic Manipulation in Learned Image Compression
by: Liang, Jiaming, et al.
Published: (2026)
by: Liang, Jiaming, et al.
Published: (2026)
PanSplat: 4K Panorama Synthesis with Feed-Forward Gaussian Splatting
by: Zhang, Cheng, et al.
Published: (2024)
by: Zhang, Cheng, et al.
Published: (2024)
PixelWizard: Towards Efficient High-Fidelity Video Generation at Ultra-Large Spatial Resolution
by: Li, Wenxue, et al.
Published: (2026)
by: Li, Wenxue, et al.
Published: (2026)
Efficient Geometry-Controlled High-Resolution Satellite Image Synthesis
by: Vasilescu, Vlad, et al.
Published: (2026)
by: Vasilescu, Vlad, et al.
Published: (2026)
Visual Autoregressive Modeling for Image Super-Resolution
by: Qu, Yunpeng, et al.
Published: (2025)
by: Qu, Yunpeng, et al.
Published: (2025)
Layered 3D Human Generation via Semantic-Aware Diffusion Model
by: Wang, Yi, et al.
Published: (2023)
by: Wang, Yi, et al.
Published: (2023)
OpenVTON-Bench: A Large-Scale High-Resolution Benchmark for Controllable Virtual Try-On Evaluation
by: Li, Jin, et al.
Published: (2026)
by: Li, Jin, et al.
Published: (2026)
Multi-Scale Representations by Varying Window Attention for Semantic Segmentation
by: Yan, Haotian, et al.
Published: (2024)
by: Yan, Haotian, et al.
Published: (2024)
MemorySAM: Memorize Modalities and Semantics with Segment Anything Model 2 for Multi-modal Semantic Segmentation
by: Liao, Chenfei, et al.
Published: (2025)
by: Liao, Chenfei, et al.
Published: (2025)
SALVe: Semantic Alignment Verification for Floorplan Reconstruction from Sparse Panoramas
by: Lambert, John, et al.
Published: (2024)
by: Lambert, John, et al.
Published: (2024)
Sparse Refinement for Efficient High-Resolution Semantic Segmentation
by: Liu, Zhijian, et al.
Published: (2024)
by: Liu, Zhijian, et al.
Published: (2024)
SemanticHuman-HD: High-Resolution Semantic Disentangled 3D Human Generation
by: Zheng, Peng, et al.
Published: (2024)
by: Zheng, Peng, et al.
Published: (2024)
Perception-Oriented Latent Coding for High-Performance Compressed Domain Semantic Inference
by: Zhang, Xu, et al.
Published: (2025)
by: Zhang, Xu, et al.
Published: (2025)
Similar Items
-
Voila-A: Aligning Vision-Language Models with User's Gaze Attention
by: Yan, Kun, et al.
Published: (2023) -
EDTalk++: Full Disentanglement for Controllable Talking Head Synthesis
by: Tan, Shuai, et al.
Published: (2025) -
SeeSR: Towards Semantics-Aware Real-World Image Super-Resolution
by: Wu, Rongyuan, et al.
Published: (2023) -
AutoDirector: Online Auto-scheduling Agents for Multi-sensory Composition
by: Ni, Minheng, et al.
Published: (2024) -
Low-Resolution Self-Attention for Semantic Segmentation
by: Wu, Yu-Huan, et al.
Published: (2023)