| Main Authors: | Mao, Lingjun, Tang, Zineng, Suhr, Alane |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2412.06184 |
Similar Items
Grounding Language in Multi-Perspective Referential Communication
by: Tang, Zineng, et al.
Published: (2024)
Images are Worth Variable Length of Representations
by: Mao, Lingjun, et al.
Published: (2025)
TULIP: Towards Unified Language-Image Pretraining
by: Tang, Zineng, et al.
Published: (2025)
ScribbleEdit: Synthetic Data for Image Editing with Scribbles and Text
by: Ji, Anya, et al.
Published: (2026)
Anything in Any Scene: Photorealistic Video Object Insertion
by: Bai, Chen, et al.
Published: (2024)
RealMaster: Lifting Rendered Scenes into Photorealistic Video
by: Cohen-Bar, Dana, et al.
Published: (2026)
PRISM: A Unified Framework for Photorealistic Reconstruction and Intrinsic Scene Modeling
by: Dirik, Alara, et al.
Published: (2025)
Infinigen Indoors: Photorealistic Indoor Scenes using Procedural Generation
by: Raistrick, Alexander, et al.
Published: (2024)
SketchingReality: From Freehand Scene Sketches To Photorealistic Images
by: Bourouis, Ahmed, et al.
Published: (2026)
The Art of Deception: Color Visual Illusions and Diffusion Models
by: Gomez-Villa, Alex, et al.
Published: (2024)
4Real: Towards Photorealistic 4D Scene Generation via Video Diffusion Models
by: Yu, Heng, et al.
Published: (2024)
Sun Off, Lights On: Photorealistic Monocular Nighttime Simulation for Robust Semantic Perception
by: Tzevelekakis, Konstantinos, et al.
Published: (2024)
ShelfGaussian: Shelf-Supervised Open-Vocabulary Gaussian-based 3D Scene Understanding
by: Zhao, Lingjun, et al.
Published: (2025)
Mirage: One-Step Video Diffusion for Photorealistic and Coherent Asset Editing in Driving Scenes
by: Wang, Shuyun, et al.
Published: (2025)
Real-time Photorealistic Dynamic Scene Representation and Rendering with 4D Gaussian Splatting
by: Yang, Zeyu, et al.
Published: (2023)
Illusions in Humans and AI: How Visual Perception Aligns and Diverges
by: Yang, Jianyi, et al.
Published: (2025)
UrbanCAD: Towards Highly Controllable and Photorealistic 3D Vehicles for Urban Scene Simulation
by: Lu, Yichong, et al.
Published: (2024)
Progressive Photorealistic Simplification
by: Rosenthal, Adi, et al.
Published: (2026)
Spatial Colour Mixing Illusions as a Perception Stress Test for Vision-Language Models
by: Basoc, Nicoleta-Nina, et al.
Published: (2026)
Illusion-Aware Visual Preprocessing and Anti-Illusion Prompting for Classic Illusion Understanding in Vision-Language Models
by: Zha, Junli, et al.
Published: (2026)
Visually Prompted Benchmarks Are Surprisingly Fragile
by: Feng, Haiwen, et al.
Published: (2025)
A Perception CNN for Facial Expression Recognition
by: Tian, Chunwei, et al.
Published: (2025)
Reflections Unlock: Geometry-Aware Reflection Disentanglement in 3D Gaussian Splatting for Photorealistic Scenes Rendering
by: Song, Jiayi, et al.
Published: (2025)
Photorealistic Phantom Roads in Real Scenes: Disentangling 3D Hallucinations from Physical Geometry
by: Nguyen, Hoang, et al.
Published: (2025)
HOSIG: Full-Body Human-Object-Scene Interaction Generation with Hierarchical Scene Perception
by: Yao, Wei, et al.
Published: (2025)
The Illusion-Illusion: Vision Language Models See Illusions Where There are None
by: Ullman, Tomer
Published: (2024)
IllusionVQA: A Challenging Optical Illusion Dataset for Vision Language Models
by: Shahgir, Haz Sameen, et al.
Published: (2024)
AutoPresent: Designing Structured Visuals from Scratch
by: Ge, Jiaxin, et al.
Published: (2025)
Color Bind: Exploring Color Perception in Text-to-Image Models
by: Shomer-Chai, Shay, et al.
Published: (2025)
InceptionHuman: Controllable Prompt-to-NeRF for Photorealistic 3D Human Generation
by: Kao, Shiu-hong, et al.
Published: (2023)
IllusionBench+: A Large-scale and Comprehensive Benchmark for Visual Illusion Understanding in Vision-Language Models
by: Zhang, Yiming, et al.
Published: (2025)
Bokehlicious: Photorealistic Bokeh Rendering with Controllable Apertures
by: Seizinger, Tim, et al.
Published: (2025)
Instant Photorealistic Neural Radiance Fields Stylization
by: Li, Shaoxu, et al.
Published: (2023)
Do VLMs Perceive or Recall? Probing Visual Perception vs. Memory with Classic Visual Illusions
by: Sun, Xiaoxiao, et al.
Published: (2026)
Video Perception Models for 3D Scene Synthesis
by: Huang, Rui, et al.
Published: (2025)
GarchingSim: An Autonomous Driving Simulator with Photorealistic Scenes and Minimalist Workflow
by: Zhou, Liguo, et al.
Published: (2024)
AMONGAGENTS: Evaluating Large Language Models in the Interactive Text-Based Social Deduction Game
by: Chi, Yizhou, et al.
Published: (2024)
Global-Local Image Perceptual Score (GLIPS): Evaluating Photorealistic Quality of AI-Generated Images
by: Aziz, Memoona, et al.
Published: (2024)
Large-scale Photorealistic Outdoor 3D Scene Reconstruction from UAV Imagery Using Gaussian Splatting Techniques
by: Maikos, Christos, et al.
Published: (2026)
Comparative Analysis Of Color Models For Human Perception And Visual Color Difference
by: Burambekova, Aruzhan, et al.
Published: (2024)