| Main Authors: | Yang, Jiaxin, Hou, Yu, Liu, Muxin, Liu, Weixuan, Yuan, Ze, Chen, Zeming, Wang, Zhongrui, Qi, Xiaojuan |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Online Access: | https://arxiv.org/abs/2605.13493 |
Similar Items
EditFlow: Benchmarking and Optimizing Code Edit Recommendation Systems via Reconstruction of Developer Flows
by: Liu, Chenyan, et al.
Published: (2026)
PhysUniBench: A Multi-Modal Physics Reasoning Benchmark at Undergraduate Level
by: Wang, Lintao, et al.
Published: (2025)
PhysToolBench: Benchmarking Physical Tool Understanding for MLLMs
by: Zhang, Zixin, et al.
Published: (2025)
SpatialEdit: Benchmarking Fine-Grained Image Spatial Editing
by: Xiao, Yicheng, et al.
Published: (2026)
ImgEdit: A Unified Image Editing Dataset and Benchmark
by: Ye, Yang, et al.
Published: (2025)
CogniEdit: Dense Gradient Flow Optimization for Fine-Grained Image Editing
by: Li, Yan, et al.
Published: (2025)
EditThinker: Unlocking Iterative Reasoning for Any Image Editor
by: Li, Hongyu, et al.
Published: (2025)
LocateEdit-Bench: A Benchmark for Instruction-Based Editing Localization
by: Wu, Shiyu, et al.
Published: (2026)
SafeToolBench: Pioneering a Prospective Benchmark to Evaluating Tool Utilization Safety in LLMs
by: Xia, Hongfei, et al.
Published: (2025)
BilliardPhys-Bench: Benchmarking Physical Reasoning and Visual Dynamics of Multimodal LLMs
by: Wang, Ben, et al.
Published: (2026)
PhysBench: Benchmarking and Enhancing Vision-Language Models for Physical World Understanding
by: Chow, Wei, et al.
Published: (2025)
SeriesBench: A Benchmark for Narrative-Driven Drama Series Understanding
by: Zhang, Chenkai, et al.
Published: (2025)
MG-Nav: Dual-Scale Visual Navigation via Sparse Spatial Memory
by: Wang, Bo, et al.
Published: (2025)
Dense-Face: Personalized Face Generation Model via Dense Annotation Prediction
by: Guo, Xiao, et al.
Published: (2024)
DBellQuant: Breaking the Bell with Double-Bell Transformation for LLMs Post Training Binarization
by: Ye, Zijian, et al.
Published: (2025)
LiFR-Seg: Anytime High-Frame-Rate Segmentation via Event-Guided Propagation
by: Wu, Xiaoshan, et al.
Published: (2026)
BioProBench: Comprehensive Dataset and Benchmark in Biological Protocol Understanding and Reasoning
by: Liu, Yuyang, et al.
Published: (2025)
DenseFormer: Learning Dense Depth Map from Sparse Depth and Image via Conditional Diffusion Model
by: Yuan, Ming, et al.
Published: (2025)
OverLayBench: A Benchmark for Layout-to-Image Generation with Dense Overlaps
by: Li, Bingnan, et al.
Published: (2025)
PhysEdit: Physically-Consistent Region-Aware Image Editing via Adaptive Spatio-Temporal Reasoning
by: Li, Guandong, et al.
Published: (2026)
InEdit-Bench: Benchmarking Intermediate Logical Pathways for Intelligent Image Editing Models
by: Sheng, Zhiqiang, et al.
Published: (2026)
Reason-Then-Retrieve for CoVR-R with Structured Edit Prompts and Dense-Sparse Fusion
by: Liu, DongQing, et al.
Published: (2026)
SafeGen-Bench: Benchmarking Safety in Image-Conditioned Text-to-Video Generation
by: Ma, Yingzi, et al.
Published: (2026)
From Editor to Dense Geometry Estimator
by: Wang, JiYuan, et al.
Published: (2025)
FineEdit: Fine-Grained Image Edit with Bounding Box Guidance
by: Xu, Haohang, et al.
Published: (2026)
ComplexBench-Edit: Benchmarking Complex Instruction-Driven Image Editing via Compositional Dependencies
by: Wang, Chenglin, et al.
Published: (2025)
HomeSafeBench: A Benchmark for Embodied Vision-Language Models in Free-Exploration Home Safety Inspection
by: Gao, Siyuan, et al.
Published: (2025)
Beyond Literal Mapping: Benchmarking and Improving Non-Literal Translation Evaluation
by: Tian, Yanzhi, et al.
Published: (2026)
VTEdit-Bench: A Comprehensive Benchmark for Multi-Reference Image Editing Models in Virtual Try-On
by: Liang, Xiaoye, et al.
Published: (2026)
Norm Anchors Make Model Edits Last
by: Liu, Mingda, et al.
Published: (2026)
Edit-Compass & EditReward-Compass: A Unified Benchmark for Image Editing and Reward Modeling
by: Bai, Xuehai, et al.
Published: (2026)
A Lagrangian Conditional Gaussian Koopman Network for Data Assimilation and Prediction
by: Wang, Zhongrui, et al.
Published: (2026)
PhysAlign: Physics-Coherent Image-to-Video Generation through Feature and 3D Representation Alignment
by: Xiong, Zhexiao, et al.
Published: (2026)
MotionEdit: Benchmarking and Learning Motion-Centric Image Editing
by: Wan, Yixin, et al.
Published: (2025)
EditInspector: A Benchmark for Evaluation of Text-Guided Image Edits
by: Yosef, Ron, et al.
Published: (2025)
WeEdit: A Dataset, Benchmark and Glyph-Guided Framework for Text-centric Image Editing
by: Zhang, Hui, et al.
Published: (2026)
CCrepairBench: A High-Fidelity Benchmark and Reinforcement Learning Framework for C++ Compilation Repair
by: Sun, Weixuan, et al.
Published: (2025)
SeePhys: Does Seeing Help Thinking? -- Benchmarking Vision-Based Physics Reasoning
by: Xiang, Kun, et al.
Published: (2025)
Hamiltonian quantization of complex Chern-Simons theory at level-$k$
by: Han, Muxin
Published: (2025)
On the summation and triangulation independence of Lorentzian spinfoam amplitudes for all LQG
by: Han, Muxin
Published: (2025)