Saved in:
| Main Authors: | Sreedharan, Sarath, Sikes, Kelsey, Blanchard, Nathaniel, Mason, Lisa, Krishnaswamy, Nikhil, Zarestky, Jill |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2510.01432 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Reducing Human-Robot Goal State Divergence with Environment Design
by: Sikes, Kelsey, et al.
Published: (2024)
by: Sikes, Kelsey, et al.
Published: (2024)
AGI Is Coming... Right After AI Learns to Play Wordle
by: Shekkizhar, Sarath, et al.
Published: (2025)
by: Shekkizhar, Sarath, et al.
Published: (2025)
Deep Expert Injection for Anchoring Retinal VLMs with Domain-Specific Knowledge
by: Lu, Shuai, et al.
Published: (2026)
by: Lu, Shuai, et al.
Published: (2026)
Towards a Multi-Agent Vision-Language System for Zero-Shot Novel Hazardous Object Detection for Autonomous Driving Safety
by: Shriram, Shashank, et al.
Published: (2025)
by: Shriram, Shashank, et al.
Published: (2025)
doScenes: An Autonomous Driving Dataset with Natural Language Instruction for Human Interaction and Vision-Language Navigation
by: Roy, Parthib, et al.
Published: (2024)
by: Roy, Parthib, et al.
Published: (2024)
Probing the effectiveness of World Models for Spatial Reasoning through Test-time Scaling
by: Jha, Saurav, et al.
Published: (2025)
by: Jha, Saurav, et al.
Published: (2025)
AIDE: Agentically Improve Visual Language Model with Domain Experts
by: Chiu, Ming-Chang, et al.
Published: (2025)
by: Chiu, Ming-Chang, et al.
Published: (2025)
CADmium: Fine-Tuning Code Language Models for Text-Driven Sequential CAD Design
by: Govindarajan, Prashant, et al.
Published: (2025)
by: Govindarajan, Prashant, et al.
Published: (2025)
Recall and Refine: A Simple but Effective Source-free Open-set Domain Adaptation Framework
by: Nejjar, Ismail, et al.
Published: (2024)
by: Nejjar, Ismail, et al.
Published: (2024)
Item Region-based Style Classification Network (IRSN): A Fashion Style Classifier Based on Domain Knowledge of Fashion Experts
by: Choi, Jinyoung, et al.
Published: (2025)
by: Choi, Jinyoung, et al.
Published: (2025)
Generative AI for Visualizing Highway Construction Hazards Through Synthetic Images and Temporal Sequences
by: Neece, Trevor, et al.
Published: (2026)
by: Neece, Trevor, et al.
Published: (2026)
ExpertSim: Fast Particle Detector Simulation Using Mixture-of-Generative-Experts
by: Będkowski, Patryk, et al.
Published: (2025)
by: Będkowski, Patryk, et al.
Published: (2025)
Spatio-Semantic Expert Routing Architecture with Mixture-of-Experts for Referring Image Segmentation
by: Dalaq, Alaa, et al.
Published: (2026)
by: Dalaq, Alaa, et al.
Published: (2026)
MeshAnything: Artist-Created Mesh Generation with Autoregressive Transformers
by: Chen, Yiwen, et al.
Published: (2024)
by: Chen, Yiwen, et al.
Published: (2024)
Can Vision-Language Models Understand Construction Workers? An Exploratory Study
by: Bui, Hieu, et al.
Published: (2026)
by: Bui, Hieu, et al.
Published: (2026)
Product of Experts for Visual Generation
by: Zhang, Yunzhi, et al.
Published: (2025)
by: Zhang, Yunzhi, et al.
Published: (2025)
A Function-Centric Perspective on Flat and Sharp Minima
by: Mason-Williams, Israel, et al.
Published: (2025)
by: Mason-Williams, Israel, et al.
Published: (2025)
Designing Production-Scale OCR for India: Multilingual and Domain-Specific Systems
by: Faraz, Ali, et al.
Published: (2026)
by: Faraz, Ali, et al.
Published: (2026)
Text2Graph VPR: A Text-to-Graph Expert System for Explainable Place Recognition in Changing Environments
by: Yousefzadeh, Saeideh, et al.
Published: (2025)
by: Yousefzadeh, Saeideh, et al.
Published: (2025)
One Step Closer: Creating the Future to Boost Monocular Semantic Scene Completion
by: Lu, Haoang, et al.
Published: (2025)
by: Lu, Haoang, et al.
Published: (2025)
Overcoming Visual Clutter in Vision Language Action Models via Concept-Gated Visual Distillation
by: Song, Sangmim, et al.
Published: (2026)
by: Song, Sangmim, et al.
Published: (2026)
Quant Experts: Token-aware Adaptive Error Reconstruction with Mixture of Experts for Large Vision-Language Models Quantization
by: Jia, Chenwei, et al.
Published: (2026)
by: Jia, Chenwei, et al.
Published: (2026)
American Sign Language Alphabet Recognition using Deep Learning
by: Kasukurthi, Nikhil, et al.
Published: (2019)
by: Kasukurthi, Nikhil, et al.
Published: (2019)
Towards Accurate and Efficient 3D Object Detection for Autonomous Driving: A Mixture of Experts Computing System on Edge
by: Liu, Linshen, et al.
Published: (2025)
by: Liu, Linshen, et al.
Published: (2025)
Visual Reasoning Agent: Robust Vision Systems in Remote Sensing via Inference-Time Scaling
by: Yu, Chung-En Johnny, et al.
Published: (2025)
by: Yu, Chung-En Johnny, et al.
Published: (2025)
Why MLLMs Struggle to Determine Object Orientations
by: Gopinath, Anju, et al.
Published: (2026)
by: Gopinath, Anju, et al.
Published: (2026)
Fire360: A Benchmark for Robust Perception and Episodic Memory in Degraded 360-Degree Firefighting Videos
by: Tiwari, Aditi, et al.
Published: (2025)
by: Tiwari, Aditi, et al.
Published: (2025)
Satellite to Street : Disaster Impact Estimator
by: Sai, Sreesritha, et al.
Published: (2025)
by: Sai, Sreesritha, et al.
Published: (2025)
Creating a Lens of Chinese Culture: A Multimodal Dataset for Chinese Pun Rebus Art Understanding
by: Zhang, Tuo, et al.
Published: (2024)
by: Zhang, Tuo, et al.
Published: (2024)
The Role of Generative Systems in Historical Photography Management: A Case Study on Catalan Archives
by: Śanchez, Èric, et al.
Published: (2024)
by: Śanchez, Èric, et al.
Published: (2024)
GazeMoE: Perception of Gaze Target with Mixture-of-Experts
by: Dai, Zhuangzhuang, et al.
Published: (2026)
by: Dai, Zhuangzhuang, et al.
Published: (2026)
Aligning Characteristic Descriptors with Images for Human-Expert-like Explainability
by: Yalavarthi, Bharat Chandra, et al.
Published: (2024)
by: Yalavarthi, Bharat Chandra, et al.
Published: (2024)
DiffusionAgent: Navigating Expert Models for Agentic Image Generation
by: Qin, Jie, et al.
Published: (2024)
by: Qin, Jie, et al.
Published: (2024)
Remix-DiT: Mixing Diffusion Transformers for Multi-Expert Denoising
by: Fang, Gongfan, et al.
Published: (2024)
by: Fang, Gongfan, et al.
Published: (2024)
Hyperbolic and Evidence-Prioritized Experts for Large Vision-Language Models
by: Zhou, Zijie, et al.
Published: (2026)
by: Zhou, Zijie, et al.
Published: (2026)
Evaluating the Capability of Video Question Generation for Expert Knowledge Elicitation
by: Zhang, Huaying, et al.
Published: (2025)
by: Zhang, Huaying, et al.
Published: (2025)
Simple Agents Outperform Experts in Biomedical Imaging Workflow Optimization
by: Xuefei, et al.
Published: (2025)
by: Xuefei, et al.
Published: (2025)
VidPrism: Heterogeneous Mixture of Experts for Image-to-Video Transfer
by: Lin, Rui, et al.
Published: (2026)
by: Lin, Rui, et al.
Published: (2026)
Multi-Site Class-Incremental Learning with Weighted Experts in Echocardiography
by: Bransby, Kit M., et al.
Published: (2024)
by: Bransby, Kit M., et al.
Published: (2024)
Metis-HOME: Hybrid Optimized Mixture-of-Experts for Multimodal Reasoning
by: Lan, Xiaohan, et al.
Published: (2025)
by: Lan, Xiaohan, et al.
Published: (2025)
Similar Items
-
Reducing Human-Robot Goal State Divergence with Environment Design
by: Sikes, Kelsey, et al.
Published: (2024) -
AGI Is Coming... Right After AI Learns to Play Wordle
by: Shekkizhar, Sarath, et al.
Published: (2025) -
Deep Expert Injection for Anchoring Retinal VLMs with Domain-Specific Knowledge
by: Lu, Shuai, et al.
Published: (2026) -
Towards a Multi-Agent Vision-Language System for Zero-Shot Novel Hazardous Object Detection for Autonomous Driving Safety
by: Shriram, Shashank, et al.
Published: (2025) -
doScenes: An Autonomous Driving Dataset with Natural Language Instruction for Human Interaction and Vision-Language Navigation
by: Roy, Parthib, et al.
Published: (2024)