:: Library Catalog

Bibliographic Details
Main Authors:	Lyu, Ruiqi; Turcan, Alistair; Wilder, Bryan
Format:	Preprint
Published:	2026
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2605.06530

Similar Items

Federated Epidemic Surveillance
by: Lyu, Ruiqi, et al.
Published: (2023)

Explaining Concept Shift with Interpretable Feature Attribution
by: Lyu, Ruiqi, et al.
Published: (2025)

Combining digital data streams and epidemic networks for real time outbreak detection
by: Lyu, Ruiqi, et al.
Published: (2025)

EpiCastBench: Datasets and Benchmarks for Multivariate Epidemic Forecasting
by: Panja, Madhurima, et al.
Published: (2026)

Improving constraint-based discovery with robust propagation and reliable LLM priors
by: Lyu, Ruiqi, et al.
Published: (2025)

EpiLLM: Unlocking the Potential of Large Language Models in Epidemic Forecasting
by: Gong, Chenghua, et al.
Published: (2025)

Epi$^2$-Net: Advancing Epidemic Dynamics Forecasting with Physics-Inspired Neural Networks
by: Sun, Rui, et al.
Published: (2025)

SpatialBench: Benchmarking Multimodal Large Language Models for Spatial Cognition
by: Xu, Peiran, et al.
Published: (2025)

EpiPlanAgent: Agentic Automated Epidemic Response Planning
by: Mao, Kangkun, et al.
Published: (2025)

TusoAI: Agentic Optimization for Scientific Methods
by: Turcan, Alistair, et al.
Published: (2025)

EarthSpatialBench: Benchmarking Spatial Reasoning Capabilities of Multimodal LLMs on Earth Imagery
by: Xu, Zelin, et al.
Published: (2026)

SKILLFOUNDRY: Building Self-Evolving Agent Skill Libraries from Heterogeneous Scientific Resources
by: Shen, Shuaike, et al.
Published: (2026)

Cube Bench: A Benchmark for Spatial Visual Reasoning in MLLMs
by: Anand, Dhruv, et al.
Published: (2025)

EmbSpatial-Bench: Benchmarking Spatial Understanding for Embodied Tasks with Large Vision-Language Models
by: Du, Mengfei, et al.
Published: (2024)

GeoAgentBench: A Dynamic Execution Benchmark for Tool-Augmented Agents in Spatial Analysis
by: Yu, Bo, et al.
Published: (2026)

SpatialBench-UC: Uncertainty-Aware Evaluation of Spatial Prompt Following in Text-to-Image Generation
by: Rostane, Amine
Published: (2026)

MMSI-Video-Bench: A Holistic Benchmark for Video-Based Spatial Intelligence
by: Lin, Jingli, et al.
Published: (2025)

Can LLMs Reconcile Knowledge Conflicts in Counterfactual Reasoning
by: Yamin, Khurram, et al.
Published: (2025)

Spatial Competence Benchmark
by: Vira, Jash, et al.
Published: (2026)

Evaluating the Effectiveness of Data Augmentation for Emotion Classification in Low-Resource Settings
by: Arora, Aashish, et al.
Published: (2024)

Computationally Assisted Quality Control for Public Health Data Streams
by: Joshi, Ananya, et al.
Published: (2023)

ForecastBench: A Dynamic Benchmark of AI Forecasting Capabilities
by: Karger, Ezra, et al.
Published: (2024)

Healthcare LLM Benchmarks Are Only as Good as Their Explicit Assumptions
by: Raman, Naveen, et al.
Published: (2026)

ET-Plan-Bench: Embodied Task-level Planning Benchmark Towards Spatial-Temporal Cognition with Foundation Models
by: Zhang, Lingfeng, et al.
Published: (2024)

EpiQAL: Benchmarking Large Language Models in Epidemiological Question Answering and Reasoning
by: Wei, Mingyang, et al.
Published: (2026)

Spatial CAPTCHA: Generatively Benchmarking Spatial Reasoning for Human-Machine Differentiation
by: Kharlamova, Arina, et al.
Published: (2025)

Linear Attention is Enough in Spatial-Temporal Forecasting
by: Ning, Xinyu
Published: (2024)

Verifiable Benchmarking of Long-Horizon Spatial Biology
by: Diks, Ian, et al.
Published: (2026)

SpatialText: A Pure-Text Cognitive Benchmark for Spatial Understanding in Large Language Models
by: Jiang, Peiyao, et al.
Published: (2026)

Leaving the Nest: Going Beyond Local Loss Functions for Predict-Then-Optimize
by: Shah, Sanket, et al.
Published: (2023)

Bench to the Future: A Pastcasting Benchmark for Forecasting Agents
by: FutureSearch, et al.
Published: (2025)

ViewSpatial-Bench: Evaluating Multi-perspective Spatial Localization in Vision-Language Models
by: Li, Dingming, et al.
Published: (2025)

Reward Engineering for Spatial Epidemic Simulations: A Reinforcement Learning Platform for Individual Behavioral Learning
by: Rakhshandehroo, Radman, et al.
Published: (2025)

FHIR-AgentBench: Benchmarking LLM Agents for Realistic Interoperable EHR Question Answering
by: Lee, Gyubok, et al.
Published: (2025)

IV Co-Scientist: Multi-Agent LLM Framework for Causal Instrumental Variable Discovery
by: Sheth, Ivaxi, et al.
Published: (2026)

The Limits of AI-Driven Allocation: Optimal Screening under Aleatoric Uncertainty
by: Cortes-Gomez, Santiago, et al.
Published: (2026)

PRISM: A Benchmark for Programmatic Spatial-Temporal Reasoning
by: Zhang, Qiran, et al.
Published: (2026)

AirQualityBench: A Realistic Evaluation Benchmark for Global Air Quality Forecasting
by: Xu, Xing, et al.
Published: (2026)

From Spatial to Actions: Grounding Vision-Language-Action Model in Spatial Foundation Priors
by: Zhang, Zhengshen, et al.
Published: (2025)

Reinforcement learning with combinatorial actions for coupled restless bandits
by: Xu, Lily, et al.
Published: (2025)