Guardado en:
| Autores principales: | Jiang, Chengjie, Zhou, Yunqi, Yan, Jiafeng, Li, Jing, Li, Jiayang, Zhou, Yue, He, Hongjie, Li, Jonathan |
|---|---|
| Formato: | Preprint |
| Publicado: |
2025
|
| Materias: | |
| Acceso en línea: | https://arxiv.org/abs/2508.17102 |
| Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
Ejemplares similares
Look Where It Matters: Training-Free Ultra-HR Remote Sensing VQA via Adaptive Zoom Search
por: Zhou, Yunqi, et al.
Publicado: (2025)
por: Zhou, Yunqi, et al.
Publicado: (2025)
DCDB: Dynamic Conditional Dual Diffusion Bridge for Ill-posed Multi-Tasks
por: Huang, Chengjie, et al.
Publicado: (2025)
por: Huang, Chengjie, et al.
Publicado: (2025)
Towards Unified Semantic and Controllable Image Fusion: A Diffusion Transformer Approach
por: Li, Jiayang, et al.
Publicado: (2025)
por: Li, Jiayang, et al.
Publicado: (2025)
GeoZero: Incentivizing Reasoning from Scratch on Geospatial Scenes
por: Wang, Di, et al.
Publicado: (2025)
por: Wang, Di, et al.
Publicado: (2025)
Characterization of dim light response in DVS pixel: Discontinuity of event triggering time
por: Jiang, Xiao, et al.
Publicado: (2024)
por: Jiang, Xiao, et al.
Publicado: (2024)
SegEarth-R1: Geospatial Pixel Reasoning via Large Language Model
por: Li, Kaiyu, et al.
Publicado: (2025)
por: Li, Kaiyu, et al.
Publicado: (2025)
Geospatial-Reasoning-Driven Vocabulary-Agnostic Remote Sensing Semantic Segmentation
por: Zhou, Chufeng, et al.
Publicado: (2026)
por: Zhou, Chufeng, et al.
Publicado: (2026)
Reasoning is All You Need for Video Generalization: A Counterfactual Benchmark with Sub-question Evaluation
por: Zhou, Qiji, et al.
Publicado: (2025)
por: Zhou, Qiji, et al.
Publicado: (2025)
GRASP: GRAph-Structured Pyramidal Whole Slide Image Representation
por: Mirabadi, Ali Khajegili, et al.
Publicado: (2024)
por: Mirabadi, Ali Khajegili, et al.
Publicado: (2024)
Artemis: Structured Visual Reasoning for Perception Policy Learning
por: Tang, Wei, et al.
Publicado: (2025)
por: Tang, Wei, et al.
Publicado: (2025)
HSFusion: A high-level vision task-driven infrared and visible image fusion network via semantic and geometric domain transformation
por: Jiang, Chengjie, et al.
Publicado: (2024)
por: Jiang, Chengjie, et al.
Publicado: (2024)
BLINK-Twice: You see, but do you observe? A Reasoning Benchmark on Visual Perception
por: Ye, Junyan, et al.
Publicado: (2025)
por: Ye, Junyan, et al.
Publicado: (2025)
RemoteReasoner: Towards Unifying Geospatial Reasoning Workflow
por: Yao, Liang, et al.
Publicado: (2025)
por: Yao, Liang, et al.
Publicado: (2025)
GRASP: A Rehearsal Policy for Efficient Online Continual Learning
por: Harun, Md Yousuf, et al.
Publicado: (2023)
por: Harun, Md Yousuf, et al.
Publicado: (2023)
Gaussian Building Mesh (GBM): Extract a Building's 3D Mesh with Google Earth and Gaussian Splatting
por: Gao, Kyle, et al.
Publicado: (2024)
por: Gao, Kyle, et al.
Publicado: (2024)
GRASP: Learning to Ground Social Reasoning in Multi-Person Non-Verbal Interactions
por: Kim, Junho, et al.
Publicado: (2026)
por: Kim, Junho, et al.
Publicado: (2026)
Digital Twin Buildings: 3D Modeling, GIS Integration, and Visual Descriptions Using Gaussian Splatting, ChatGPT/Deepseek, and Google Maps Platform
por: Gao, Kyle, et al.
Publicado: (2025)
por: Gao, Kyle, et al.
Publicado: (2025)
Learning county from pixels: corn yield prediction with attention-weighted multiple instance learning
por: Wang, Xiaoyu, et al.
Publicado: (2023)
por: Wang, Xiaoyu, et al.
Publicado: (2023)
GRASP: Guided Residual Adapters with Sample-wise Partitioning
por: Nützel, Felix, et al.
Publicado: (2025)
por: Nützel, Felix, et al.
Publicado: (2025)
Marmot: Object-Level Self-Correction via Multi-Agent Reasoning
por: Sun, Jiayang, et al.
Publicado: (2025)
por: Sun, Jiayang, et al.
Publicado: (2025)
Reasoning to Edit: Hypothetical Instruction-Based Image Editing with Visual Reasoning
por: He, Qingdong, et al.
Publicado: (2025)
por: He, Qingdong, et al.
Publicado: (2025)
RemoteZero: Geospatial Reasoning with Zero Human Annotations
por: Yao, Liang, et al.
Publicado: (2026)
por: Yao, Liang, et al.
Publicado: (2026)
SceneForge: Structured World Supervision from 3D Interventions
por: Li, Jizhizi, et al.
Publicado: (2026)
por: Li, Jizhizi, et al.
Publicado: (2026)
GeoTexDensifier: Geometry-Texture-Aware Densification for High-Quality Photorealistic 3D Gaussian Splatting
por: Jiang, Hanqing, et al.
Publicado: (2024)
por: Jiang, Hanqing, et al.
Publicado: (2024)
Vision-EKIPL: External Knowledge-Infused Policy Learning for Visual Reasoning
por: Wang, Chaoyang, et al.
Publicado: (2025)
por: Wang, Chaoyang, et al.
Publicado: (2025)
Noisy Label Refinement with Semantically Reliable Synthetic Images
por: Li, Yingxuan, et al.
Publicado: (2025)
por: Li, Yingxuan, et al.
Publicado: (2025)
Context-Aware Iteration Policy Network for Efficient Optical Flow Estimation
por: Cheng, Ri, et al.
Publicado: (2023)
por: Cheng, Ri, et al.
Publicado: (2023)
CAR: Controllable Autoregressive Modeling for Visual Generation
por: Yao, Ziyu, et al.
Publicado: (2024)
por: Yao, Ziyu, et al.
Publicado: (2024)
Prior-guided Hierarchical Instance-pixel Contrastive Learning for Ultrasound Speckle Noise Suppression
por: Bu, Zhenyu, et al.
Publicado: (2026)
por: Bu, Zhenyu, et al.
Publicado: (2026)
Application of Multimodal Fusion Deep Learning Model in Disease Recognition
por: Liu, Xiaoyi, et al.
Publicado: (2024)
por: Liu, Xiaoyi, et al.
Publicado: (2024)
Seg the HAB: Language-Guided Geospatial Algae Bloom Reasoning and Segmentation
por: Hsieh, Patterson, et al.
Publicado: (2025)
por: Hsieh, Patterson, et al.
Publicado: (2025)
Dual-scale Enhanced and Cross-generative Consistency Learning for Semi-supervised Medical Image Segmentation
por: Gu, Yunqi, et al.
Publicado: (2023)
por: Gu, Yunqi, et al.
Publicado: (2023)
UniGeoSeg: Towards Unified Open-World Segmentation for Geospatial Scenes
por: Ni, Shuo, et al.
Publicado: (2025)
por: Ni, Shuo, et al.
Publicado: (2025)
MINT-CoT: Enabling Interleaved Visual Tokens in Mathematical Chain-of-Thought Reasoning
por: Chen, Xinyan, et al.
Publicado: (2025)
por: Chen, Xinyan, et al.
Publicado: (2025)
P-SLCR: Unsupervised Point Cloud Semantic Segmentation via Prototypes Structure Learning and Consistent Reasoning
por: Zhan, Lixin, et al.
Publicado: (2026)
por: Zhan, Lixin, et al.
Publicado: (2026)
MedReason-R1: Learning to Reason for CT Diagnosis with Reinforcement Learning and Local Zoom
por: Li, Yifan, et al.
Publicado: (2025)
por: Li, Yifan, et al.
Publicado: (2025)
MDeRainNet: An Efficient Macro-pixel Image Rain Removal Network
por: Yan, Tao, et al.
Publicado: (2024)
por: Yan, Tao, et al.
Publicado: (2024)
Enhanced 3D Urban Scene Reconstruction and Point Cloud Densification using Gaussian Splatting and Google Earth Imagery
por: Gao, Kyle, et al.
Publicado: (2024)
por: Gao, Kyle, et al.
Publicado: (2024)
NeRF: Neural Radiance Field in 3D Vision: A Comprehensive Review (Updated Post-Gaussian Splatting)
por: Gao, Kyle, et al.
Publicado: (2022)
por: Gao, Kyle, et al.
Publicado: (2022)
DT-NeRF: A Diffusion and Transformer-Based Optimization Approach for Neural Radiance Fields in 3D Reconstruction
por: Liu, Bo, et al.
Publicado: (2025)
por: Liu, Bo, et al.
Publicado: (2025)
Ejemplares similares
-
Look Where It Matters: Training-Free Ultra-HR Remote Sensing VQA via Adaptive Zoom Search
por: Zhou, Yunqi, et al.
Publicado: (2025) -
DCDB: Dynamic Conditional Dual Diffusion Bridge for Ill-posed Multi-Tasks
por: Huang, Chengjie, et al.
Publicado: (2025) -
Towards Unified Semantic and Controllable Image Fusion: A Diffusion Transformer Approach
por: Li, Jiayang, et al.
Publicado: (2025) -
GeoZero: Incentivizing Reasoning from Scratch on Geospatial Scenes
por: Wang, Di, et al.
Publicado: (2025) -
Characterization of dim light response in DVS pixel: Discontinuity of event triggering time
por: Jiang, Xiao, et al.
Publicado: (2024)