:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Mao, Lingjun, Tang, Zineng, Suhr, Alane
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2412.06184
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Grounding Language in Multi-Perspective Referential Communication
by: Tang, Zineng, et al.
Published: (2024)

Images are Worth Variable Length of Representations
by: Mao, Lingjun, et al.
Published: (2025)

TULIP: Towards Unified Language-Image Pretraining
by: Tang, Zineng, et al.
Published: (2025)

ScribbleEdit: Synthetic Data for Image Editing with Scribbles and Text
by: Ji, Anya, et al.
Published: (2026)

Anything in Any Scene: Photorealistic Video Object Insertion
by: Bai, Chen, et al.
Published: (2024)

RealMaster: Lifting Rendered Scenes into Photorealistic Video
by: Cohen-Bar, Dana, et al.
Published: (2026)

PRISM: A Unified Framework for Photorealistic Reconstruction and Intrinsic Scene Modeling
by: Dirik, Alara, et al.
Published: (2025)

Infinigen Indoors: Photorealistic Indoor Scenes using Procedural Generation
by: Raistrick, Alexander, et al.
Published: (2024)

SketchingReality: From Freehand Scene Sketches To Photorealistic Images
by: Bourouis, Ahmed, et al.
Published: (2026)

The Art of Deception: Color Visual Illusions and Diffusion Models
by: Gomez-Villa, Alex, et al.
Published: (2024)

4Real: Towards Photorealistic 4D Scene Generation via Video Diffusion Models
by: Yu, Heng, et al.
Published: (2024)

Sun Off, Lights On: Photorealistic Monocular Nighttime Simulation for Robust Semantic Perception
by: Tzevelekakis, Konstantinos, et al.
Published: (2024)

ShelfGaussian: Shelf-Supervised Open-Vocabulary Gaussian-based 3D Scene Understanding
by: Zhao, Lingjun, et al.
Published: (2025)

Mirage: One-Step Video Diffusion for Photorealistic and Coherent Asset Editing in Driving Scenes
by: Wang, Shuyun, et al.
Published: (2025)

Real-time Photorealistic Dynamic Scene Representation and Rendering with 4D Gaussian Splatting
by: Yang, Zeyu, et al.
Published: (2023)

Illusions in Humans and AI: How Visual Perception Aligns and Diverges
by: Yang, Jianyi, et al.
Published: (2025)

UrbanCAD: Towards Highly Controllable and Photorealistic 3D Vehicles for Urban Scene Simulation
by: Lu, Yichong, et al.
Published: (2024)

Progressive Photorealistic Simplification
by: Rosenthal, Adi, et al.
Published: (2026)

Spatial Colour Mixing Illusions as a Perception Stress Test for Vision-Language Models
by: Basoc, Nicoleta-Nina, et al.
Published: (2026)

Illusion-Aware Visual Preprocessing and Anti-Illusion Prompting for Classic Illusion Understanding in Vision-Language Models
by: Zha, Junli, et al.
Published: (2026)

Visually Prompted Benchmarks Are Surprisingly Fragile
by: Feng, Haiwen, et al.
Published: (2025)

A Perception CNN for Facial Expression Recognition
by: Tian, Chunwei, et al.
Published: (2025)

Reflections Unlock: Geometry-Aware Reflection Disentanglement in 3D Gaussian Splatting for Photorealistic Scenes Rendering
by: Song, Jiayi, et al.
Published: (2025)

Photorealistic Phantom Roads in Real Scenes: Disentangling 3D Hallucinations from Physical Geometry
by: Nguyen, Hoang, et al.
Published: (2025)

HOSIG: Full-Body Human-Object-Scene Interaction Generation with Hierarchical Scene Perception
by: Yao, Wei, et al.
Published: (2025)

The Illusion-Illusion: Vision Language Models See Illusions Where There are None
by: Ullman, Tomer
Published: (2024)

IllusionVQA: A Challenging Optical Illusion Dataset for Vision Language Models
by: Shahgir, Haz Sameen, et al.
Published: (2024)

AutoPresent: Designing Structured Visuals from Scratch
by: Ge, Jiaxin, et al.
Published: (2025)

Color Bind: Exploring Color Perception in Text-to-Image Models
by: Shomer-Chai, Shay, et al.
Published: (2025)

InceptionHuman: Controllable Prompt-to-NeRF for Photorealistic 3D Human Generation
by: Kao, Shiu-hong, et al.
Published: (2023)

IllusionBench+: A Large-scale and Comprehensive Benchmark for Visual Illusion Understanding in Vision-Language Models
by: Zhang, Yiming, et al.
Published: (2025)

Bokehlicious: Photorealistic Bokeh Rendering with Controllable Apertures
by: Seizinger, Tim, et al.
Published: (2025)

Instant Photorealistic Neural Radiance Fields Stylization
by: Li, Shaoxu, et al.
Published: (2023)

Do VLMs Perceive or Recall? Probing Visual Perception vs. Memory with Classic Visual Illusions
by: Sun, Xiaoxiao, et al.
Published: (2026)

Video Perception Models for 3D Scene Synthesis
by: Huang, Rui, et al.
Published: (2025)

GarchingSim: An Autonomous Driving Simulator with Photorealistic Scenes and Minimalist Workflow
by: Zhou, Liguo, et al.
Published: (2024)

AMONGAGENTS: Evaluating Large Language Models in the Interactive Text-Based Social Deduction Game
by: Chi, Yizhou, et al.
Published: (2024)

Global-Local Image Perceptual Score (GLIPS): Evaluating Photorealistic Quality of AI-Generated Images
by: Aziz, Memoona, et al.
Published: (2024)

Large-scale Photorealistic Outdoor 3D Scene Reconstruction from UAV Imagery Using Gaussian Splatting Techniques
by: Maikos, Christos, et al.
Published: (2026)

Comparative Analysis Of Color Models For Human Perception And Visual Color Difference
by: Burambekova, Aruzhan, et al.
Published: (2024)