:: Library Catalog

Buchumschlag

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Zhang, Yabo, Zeng, Yihan, Li, Qingyun, Hu, Zhen, Han, Kavin, Zuo, Wangmeng
Format:	Preprint
Veröffentlicht:	2025
Schlagworte:	Machine Learning Computer Vision and Pattern Recognition
Online-Zugang:	https://arxiv.org/abs/2509.12867
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Ähnliche Einträge

CGL: Advancing Continual GUI Learning via Reinforcement Fine-Tuning
von: Yao, Zhenquan, et al.
Veröffentlicht: (2026)

LPT++: Efficient Training on Mixture of Long-tailed Experts
von: Dong, Bowen, et al.
Veröffentlicht: (2024)

FramePainter: Endowing Interactive Image Editing with Video Diffusion Priors
von: Zhang, Yabo, et al.
Veröffentlicht: (2025)

OctoTools: An Agentic Framework with Extensible Tools for Complex Reasoning
von: Lu, Pan, et al.
Veröffentlicht: (2025)

Reinforcing VLMs to Use Tools for Detailed Visual Reasoning Under Resource Constraints
von: Kumar, Sunil, et al.
Veröffentlicht: (2025)

DeepFRC: An End-to-End Deep Learning Model for Functional Registration and Classification
von: Jiang, Siyuan, et al.
Veröffentlicht: (2025)

MIRAGE: Assessing Hallucination in Multimodal Reasoning Chains of MLLM
von: Dong, Bowen, et al.
Veröffentlicht: (2025)

ParaVT: Taming the Tool Prior Paradox for Parallel Tool Use in Agentic Video Reinforcement Learning
von: Yang, Zuhao, et al.
Veröffentlicht: (2026)

VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use
von: Jiang, Dongfu, et al.
Veröffentlicht: (2025)

ToolTok: Tool Tokenization for Efficient and Generalizable GUI Agents
von: Wang, Xiaoce, et al.
Veröffentlicht: (2026)

Latent Code Augmentation Based on Stable Diffusion for Data-free Substitute Attacks
von: Shao, Mingwen, et al.
Veröffentlicht: (2023)

ScrollScape: Unlocking 32K Image Generation With Video Diffusion Priors
von: Yu, Haodong, et al.
Veröffentlicht: (2026)

GAZE: Grounded Agentic Zero-shot Evaluation with Viewer-Level Tools and Literature Retrieval on Rare Brain MRI
von: Alim, Duaa, et al.
Veröffentlicht: (2026)

Applications and Effect Evaluation of Generative Adversarial Networks in Semi-Supervised Learning
von: Hu, Jiyu, et al.
Veröffentlicht: (2025)

FLIP: Towards Comprehensive and Reliable Evaluation of Federated Prompt Learning
von: Liao, Dongping, et al.
Veröffentlicht: (2025)

Object-Centric World Models from Few-Shot Annotations for Sample-Efficient Reinforcement Learning
von: Zhang, Weipu, et al.
Veröffentlicht: (2025)

Improving Transferability of Adversarial Examples via Bayesian Attacks
von: Li, Qizhang, et al.
Veröffentlicht: (2023)

Modeling Collaborator: Enabling Subjective Vision Classification With Minimal Human Effort via LLM Tool-Use
von: Toubal, Imad Eddine, et al.
Veröffentlicht: (2024)

WALT: Web Agents that Learn Tools
von: Prabhu, Viraj, et al.
Veröffentlicht: (2025)

OPERA: An Agent for Image Restoration with End-to-End Joint Planning-Execution Optimization
von: Zhu, Feng, et al.
Veröffentlicht: (2026)

Evaluating Similitude and Robustness of Deep Image Denoising Models via Adversarial Attack
von: Ning, Jie, et al.
Veröffentlicht: (2023)

LLM as a Complementary Optimizer to Gradient Descent: A Case Study in Prompt Tuning
von: Guo, Zixian, et al.
Veröffentlicht: (2024)

SDM: A Powerful Tool for Evaluating Model Robustness
von: Liu, Xinlei, et al.
Veröffentlicht: (2026)

R1-Omni: Explainable Omni-Multimodal Emotion Recognition with Reinforcement Learning
von: Zhao, Jiaxing, et al.
Veröffentlicht: (2025)

ChatHuman: Chatting about 3D Humans with Tools
von: Lin, Jing, et al.
Veröffentlicht: (2024)

Tutor-Student Reinforcement Learning: A Dynamic Curriculum for Robust Deepfake Detection
von: Lei, Zhanhe, et al.
Veröffentlicht: (2026)

Sims: An Interactive Tool for Geospatial Matching and Clustering
von: Zaytar, Akram, et al.
Veröffentlicht: (2024)

A Novel Approach using CapsNet and Deep Belief Network for Detection and Identification of Oral Leukopenia
von: GV, Hirthik Mathesh, et al.
Veröffentlicht: (2025)

M2CURL: Sample-Efficient Multimodal Reinforcement Learning via Self-Supervised Representation Learning for Robotic Manipulation
von: Lygerakis, Fotios, et al.
Veröffentlicht: (2024)

Weakly Semi-supervised Tool Detection in Minimally Invasive Surgery Videos
von: Fujii, Ryo, et al.
Veröffentlicht: (2024)

UCTB: An Urban Computing Tool Box for Building Spatiotemporal Prediction Services
von: Fang, Jiangyi, et al.
Veröffentlicht: (2023)

PARASIDE: An Automatic Paranasal Sinus Segmentation and Structure Analysis Tool for MRI
von: Möller, Hendrik, et al.
Veröffentlicht: (2025)

ChartAgent: A Chart Understanding Framework with Tool Integrated Reasoning
von: Wang, Boran, et al.
Veröffentlicht: (2025)

B-GRTO: Bootstrapped Group Relative Tool Optimization for Referring Segmentation
von: Markov, Mario, et al.
Veröffentlicht: (2026)

DreamPhysics: Learning Physics-Based 3D Dynamics with Video Diffusion Priors
von: Huang, Tianyu, et al.
Veröffentlicht: (2024)

Performance-guided Reinforced Active Learning for Object Detection
von: Liang, Zhixuan, et al.
Veröffentlicht: (2026)

Robust Principal Component Analysis via Discriminant Sample Weight Learning
von: Deng, Yingzhuo, et al.
Veröffentlicht: (2024)

EgoSurgery-Tool: A Dataset of Surgical Tool and Hand Detection from Egocentric Open Surgery Videos
von: Fujii, Ryo, et al.
Veröffentlicht: (2024)

Nabla-R2D3: Effective and Efficient 3D Diffusion Alignment with 2D Rewards
von: Liu, Qingming, et al.
Veröffentlicht: (2025)

R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Annealing and Reinforce Learning
von: Yang, Qi, et al.
Veröffentlicht: (2025)