:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Sreedharan, Sarath, Sikes, Kelsey, Blanchard, Nathaniel, Mason, Lisa, Krishnaswamy, Nikhil, Zarestky, Jill
Format:	Preprint
Published:	2025
Subjects:	Artificial Intelligence Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2510.01432
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Reducing Human-Robot Goal State Divergence with Environment Design
by: Sikes, Kelsey, et al.
Published: (2024)

AGI Is Coming... Right After AI Learns to Play Wordle
by: Shekkizhar, Sarath, et al.
Published: (2025)

Deep Expert Injection for Anchoring Retinal VLMs with Domain-Specific Knowledge
by: Lu, Shuai, et al.
Published: (2026)

Towards a Multi-Agent Vision-Language System for Zero-Shot Novel Hazardous Object Detection for Autonomous Driving Safety
by: Shriram, Shashank, et al.
Published: (2025)

doScenes: An Autonomous Driving Dataset with Natural Language Instruction for Human Interaction and Vision-Language Navigation
by: Roy, Parthib, et al.
Published: (2024)

Probing the effectiveness of World Models for Spatial Reasoning through Test-time Scaling
by: Jha, Saurav, et al.
Published: (2025)

AIDE: Agentically Improve Visual Language Model with Domain Experts
by: Chiu, Ming-Chang, et al.
Published: (2025)

CADmium: Fine-Tuning Code Language Models for Text-Driven Sequential CAD Design
by: Govindarajan, Prashant, et al.
Published: (2025)

Recall and Refine: A Simple but Effective Source-free Open-set Domain Adaptation Framework
by: Nejjar, Ismail, et al.
Published: (2024)

Item Region-based Style Classification Network (IRSN): A Fashion Style Classifier Based on Domain Knowledge of Fashion Experts
by: Choi, Jinyoung, et al.
Published: (2025)

Generative AI for Visualizing Highway Construction Hazards Through Synthetic Images and Temporal Sequences
by: Neece, Trevor, et al.
Published: (2026)

ExpertSim: Fast Particle Detector Simulation Using Mixture-of-Generative-Experts
by: Będkowski, Patryk, et al.
Published: (2025)

Spatio-Semantic Expert Routing Architecture with Mixture-of-Experts for Referring Image Segmentation
by: Dalaq, Alaa, et al.
Published: (2026)

MeshAnything: Artist-Created Mesh Generation with Autoregressive Transformers
by: Chen, Yiwen, et al.
Published: (2024)

Can Vision-Language Models Understand Construction Workers? An Exploratory Study
by: Bui, Hieu, et al.
Published: (2026)

Product of Experts for Visual Generation
by: Zhang, Yunzhi, et al.
Published: (2025)

A Function-Centric Perspective on Flat and Sharp Minima
by: Mason-Williams, Israel, et al.
Published: (2025)

Designing Production-Scale OCR for India: Multilingual and Domain-Specific Systems
by: Faraz, Ali, et al.
Published: (2026)

Text2Graph VPR: A Text-to-Graph Expert System for Explainable Place Recognition in Changing Environments
by: Yousefzadeh, Saeideh, et al.
Published: (2025)

One Step Closer: Creating the Future to Boost Monocular Semantic Scene Completion
by: Lu, Haoang, et al.
Published: (2025)

Overcoming Visual Clutter in Vision Language Action Models via Concept-Gated Visual Distillation
by: Song, Sangmim, et al.
Published: (2026)

Quant Experts: Token-aware Adaptive Error Reconstruction with Mixture of Experts for Large Vision-Language Models Quantization
by: Jia, Chenwei, et al.
Published: (2026)

American Sign Language Alphabet Recognition using Deep Learning
by: Kasukurthi, Nikhil, et al.
Published: (2019)

Towards Accurate and Efficient 3D Object Detection for Autonomous Driving: A Mixture of Experts Computing System on Edge
by: Liu, Linshen, et al.
Published: (2025)

Visual Reasoning Agent: Robust Vision Systems in Remote Sensing via Inference-Time Scaling
by: Yu, Chung-En Johnny, et al.
Published: (2025)

Why MLLMs Struggle to Determine Object Orientations
by: Gopinath, Anju, et al.
Published: (2026)

Fire360: A Benchmark for Robust Perception and Episodic Memory in Degraded 360-Degree Firefighting Videos
by: Tiwari, Aditi, et al.
Published: (2025)

Satellite to Street : Disaster Impact Estimator
by: Sai, Sreesritha, et al.
Published: (2025)

Creating a Lens of Chinese Culture: A Multimodal Dataset for Chinese Pun Rebus Art Understanding
by: Zhang, Tuo, et al.
Published: (2024)

The Role of Generative Systems in Historical Photography Management: A Case Study on Catalan Archives
by: Śanchez, Èric, et al.
Published: (2024)

GazeMoE: Perception of Gaze Target with Mixture-of-Experts
by: Dai, Zhuangzhuang, et al.
Published: (2026)

Aligning Characteristic Descriptors with Images for Human-Expert-like Explainability
by: Yalavarthi, Bharat Chandra, et al.
Published: (2024)

DiffusionAgent: Navigating Expert Models for Agentic Image Generation
by: Qin, Jie, et al.
Published: (2024)

Remix-DiT: Mixing Diffusion Transformers for Multi-Expert Denoising
by: Fang, Gongfan, et al.
Published: (2024)

Hyperbolic and Evidence-Prioritized Experts for Large Vision-Language Models
by: Zhou, Zijie, et al.
Published: (2026)

Evaluating the Capability of Video Question Generation for Expert Knowledge Elicitation
by: Zhang, Huaying, et al.
Published: (2025)

Simple Agents Outperform Experts in Biomedical Imaging Workflow Optimization
by: Xuefei, et al.
Published: (2025)

VidPrism: Heterogeneous Mixture of Experts for Image-to-Video Transfer
by: Lin, Rui, et al.
Published: (2026)

Multi-Site Class-Incremental Learning with Weighted Experts in Echocardiography
by: Bransby, Kit M., et al.
Published: (2024)

Metis-HOME: Hybrid Optimized Mixture-of-Experts for Multimodal Reasoning
by: Lan, Xiaohan, et al.
Published: (2025)