| Main Authors: | Lyu, Ruiqi, Turcan, Alistair, Wilder, Bryan |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Online Access: | https://arxiv.org/abs/2605.06530 |
Similar Items
Federated Epidemic Surveillance
by: Lyu, Ruiqi, et al.
Published: (2023)
Explaining Concept Shift with Interpretable Feature Attribution
by: Lyu, Ruiqi, et al.
Published: (2025)
Combining digital data streams and epidemic networks for real time outbreak detection
by: Lyu, Ruiqi, et al.
Published: (2025)
EpiCastBench: Datasets and Benchmarks for Multivariate Epidemic Forecasting
by: Panja, Madhurima, et al.
Published: (2026)
Improving constraint-based discovery with robust propagation and reliable LLM priors
by: Lyu, Ruiqi, et al.
Published: (2025)
EpiLLM: Unlocking the Potential of Large Language Models in Epidemic Forecasting
by: Gong, Chenghua, et al.
Published: (2025)
Epi$^2$-Net: Advancing Epidemic Dynamics Forecasting with Physics-Inspired Neural Networks
by: Sun, Rui, et al.
Published: (2025)
SpatialBench: Benchmarking Multimodal Large Language Models for Spatial Cognition
by: Xu, Peiran, et al.
Published: (2025)
EpiPlanAgent: Agentic Automated Epidemic Response Planning
by: Mao, Kangkun, et al.
Published: (2025)
TusoAI: Agentic Optimization for Scientific Methods
by: Turcan, Alistair, et al.
Published: (2025)
EarthSpatialBench: Benchmarking Spatial Reasoning Capabilities of Multimodal LLMs on Earth Imagery
by: Xu, Zelin, et al.
Published: (2026)
SKILLFOUNDRY: Building Self-Evolving Agent Skill Libraries from Heterogeneous Scientific Resources
by: Shen, Shuaike, et al.
Published: (2026)
Cube Bench: A Benchmark for Spatial Visual Reasoning in MLLMs
by: Anand, Dhruv, et al.
Published: (2025)
EmbSpatial-Bench: Benchmarking Spatial Understanding for Embodied Tasks with Large Vision-Language Models
by: Du, Mengfei, et al.
Published: (2024)
GeoAgentBench: A Dynamic Execution Benchmark for Tool-Augmented Agents in Spatial Analysis
by: Yu, Bo, et al.
Published: (2026)
SpatialBench-UC: Uncertainty-Aware Evaluation of Spatial Prompt Following in Text-to-Image Generation
by: Rostane, Amine
Published: (2026)
MMSI-Video-Bench: A Holistic Benchmark for Video-Based Spatial Intelligence
by: Lin, Jingli, et al.
Published: (2025)
Can LLMs Reconcile Knowledge Conflicts in Counterfactual Reasoning
by: Yamin, Khurram, et al.
Published: (2025)
Spatial Competence Benchmark
by: Vira, Jash, et al.
Published: (2026)
Evaluating the Effectiveness of Data Augmentation for Emotion Classification in Low-Resource Settings
by: Arora, Aashish, et al.
Published: (2024)
Computationally Assisted Quality Control for Public Health Data Streams
by: Joshi, Ananya, et al.
Published: (2023)
ForecastBench: A Dynamic Benchmark of AI Forecasting Capabilities
by: Karger, Ezra, et al.
Published: (2024)
Healthcare LLM Benchmarks Are Only as Good as Their Explicit Assumptions
by: Raman, Naveen, et al.
Published: (2026)
ET-Plan-Bench: Embodied Task-level Planning Benchmark Towards Spatial-Temporal Cognition with Foundation Models
by: Zhang, Lingfeng, et al.
Published: (2024)
EpiQAL: Benchmarking Large Language Models in Epidemiological Question Answering and Reasoning
by: Wei, Mingyang, et al.
Published: (2026)
Spatial CAPTCHA: Generatively Benchmarking Spatial Reasoning for Human-Machine Differentiation
by: Kharlamova, Arina, et al.
Published: (2025)
Linear Attention is Enough in Spatial-Temporal Forecasting
by: Ning, Xinyu
Published: (2024)
Verifiable Benchmarking of Long-Horizon Spatial Biology
by: Diks, Ian, et al.
Published: (2026)
SpatialText: A Pure-Text Cognitive Benchmark for Spatial Understanding in Large Language Models
by: Jiang, Peiyao, et al.
Published: (2026)
Leaving the Nest: Going Beyond Local Loss Functions for Predict-Then-Optimize
by: Shah, Sanket, et al.
Published: (2023)
Bench to the Future: A Pastcasting Benchmark for Forecasting Agents
by: FutureSearch, et al.
Published: (2025)
ViewSpatial-Bench: Evaluating Multi-perspective Spatial Localization in Vision-Language Models
by: Li, Dingming, et al.
Published: (2025)
Reward Engineering for Spatial Epidemic Simulations: A Reinforcement Learning Platform for Individual Behavioral Learning
by: Rakhshandehroo, Radman, et al.
Published: (2025)
FHIR-AgentBench: Benchmarking LLM Agents for Realistic Interoperable EHR Question Answering
by: Lee, Gyubok, et al.
Published: (2025)
IV Co-Scientist: Multi-Agent LLM Framework for Causal Instrumental Variable Discovery
by: Sheth, Ivaxi, et al.
Published: (2026)
The Limits of AI-Driven Allocation: Optimal Screening under Aleatoric Uncertainty
by: Cortes-Gomez, Santiago, et al.
Published: (2026)
PRISM: A Benchmark for Programmatic Spatial-Temporal Reasoning
by: Zhang, Qiran, et al.
Published: (2026)
AirQualityBench: A Realistic Evaluation Benchmark for Global Air Quality Forecasting
by: Xu, Xing, et al.
Published: (2026)
From Spatial to Actions: Grounding Vision-Language-Action Model in Spatial Foundation Priors
by: Zhang, Zhengshen, et al.
Published: (2025)
Reinforcement learning with combinatorial actions for coupled restless bandits
by: Xu, Lily, et al.
Published: (2025)