Saved in:
| Main Author: | Long, Yanan |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2504.15211 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Learning Causal Representations from General Environments: Identifiability and Intrinsic Ambiguity
by: Jin, Jikai, et al.
Published: (2023)
by: Jin, Jikai, et al.
Published: (2023)
Bayesian Nonparametric Dynamical Clustering of Time Series
by: Pérez-Herrero, Adrián, et al.
Published: (2025)
by: Pérez-Herrero, Adrián, et al.
Published: (2025)
HiBayES: A Hierarchical Bayesian Modeling Framework for AI Evaluation Statistics
by: Luettgau, Lennart, et al.
Published: (2025)
by: Luettgau, Lennart, et al.
Published: (2025)
Data-Driven Bayesian Network Models of Hurricane Evacuation Decision Making
by: Wang, Hui Sophie, et al.
Published: (2023)
by: Wang, Hui Sophie, et al.
Published: (2023)
Bayesian Networks for Causal Analysis in Socioecological Systems
by: Cabañas, Rafael, et al.
Published: (2024)
by: Cabañas, Rafael, et al.
Published: (2024)
Conformal Safety Monitoring for Flight Testing: A Case Study in Data-Driven Safety Learning
by: Feldman, Aaron O., et al.
Published: (2025)
by: Feldman, Aaron O., et al.
Published: (2025)
Process-Aware Analysis of Treatment Paths in Heart Failure Patients: A Case Study
by: Beyel, Harry H., et al.
Published: (2024)
by: Beyel, Harry H., et al.
Published: (2024)
Decision Quality Evaluation Framework at Pinterest
by: Tian, Yuqi, et al.
Published: (2026)
by: Tian, Yuqi, et al.
Published: (2026)
Automated Vehicles at Unsignalized Intersections: Safety and Efficiency Implications of Mixed Human and Automated Traffic
by: Rahmani, Saeed, et al.
Published: (2024)
by: Rahmani, Saeed, et al.
Published: (2024)
Performance Evaluation of Large Language Models in Statistical Programming
by: Song, Xinyi, et al.
Published: (2025)
by: Song, Xinyi, et al.
Published: (2025)
Wilcoxon Nonparametric CFAR Scheme for Ship Detection in SAR Image
by: Meng, Xiangwei
Published: (2024)
by: Meng, Xiangwei
Published: (2024)
Causal inference approach to appraise long-term effects of maintenance policy on functional performance of asphalt pavements
by: You, Lingyun, et al.
Published: (2024)
by: You, Lingyun, et al.
Published: (2024)
StatLLM: A Dataset for Evaluating the Performance of Large Language Models in Statistical Analysis
by: Song, Xinyi, et al.
Published: (2025)
by: Song, Xinyi, et al.
Published: (2025)
Evaluating the Use of Large Language Models as Synthetic Social Agents in Social Science Research
by: Madden, Emma Rose
Published: (2025)
by: Madden, Emma Rose
Published: (2025)
A Bayesian Approach to Harnessing the Power of LLMs in Authorship Attribution
by: Hu, Zhengmian, et al.
Published: (2024)
by: Hu, Zhengmian, et al.
Published: (2024)
Eligibility-Aware Evidence Synthesis: An Agentic Framework for Clinical Trial Meta-Analysis
by: Zhao, Yao, et al.
Published: (2026)
by: Zhao, Yao, et al.
Published: (2026)
What If They Took the Shot? A Hierarchical Bayesian Framework for Counterfactual Expected Goals
by: Mahmudlu, Mikayil, et al.
Published: (2025)
by: Mahmudlu, Mikayil, et al.
Published: (2025)
Bridging Data Barriers among Participants: Assessing the Potential of Geoenergy through Federated Learning
by: Peng, Weike, et al.
Published: (2024)
by: Peng, Weike, et al.
Published: (2024)
Bayesian Networks and Machine Learning for COVID-19 Severity Explanation and Demographic Symptom Classification
by: Ajayi, Oluwaseun T., et al.
Published: (2024)
by: Ajayi, Oluwaseun T., et al.
Published: (2024)
Scalable Spatiotemporal Prediction with Bayesian Neural Fields
by: Saad, Feras, et al.
Published: (2024)
by: Saad, Feras, et al.
Published: (2024)
Bayesian inversion of GPR waveforms for sub-surface material characterization: an uncertainty-aware retrieval of soil moisture and overlaying biomass properties
by: Aziz, Ishfaq, et al.
Published: (2023)
by: Aziz, Ishfaq, et al.
Published: (2023)
Error-Controlled Borrowing from External Data Using Wasserstein Ambiguity Sets
by: Kimura, Yui, et al.
Published: (2026)
by: Kimura, Yui, et al.
Published: (2026)
Adaptive Quantum-Safe Cryptography for 6G Vehicular Networks via Context-Aware Optimization
by: Sengupta, Poushali, et al.
Published: (2026)
by: Sengupta, Poushali, et al.
Published: (2026)
"All that Glitters": Approaches to Evaluations with Unreliable Model and Human Annotations
by: Hardy, Michael
Published: (2024)
by: Hardy, Michael
Published: (2024)
Towards Reliable LLM Evaluation: Correcting the Winner's Curse in Adaptive Benchmarking
by: Xu, Yang, et al.
Published: (2026)
by: Xu, Yang, et al.
Published: (2026)
Reinforcement Learning from Human Feedback with High-Confidence Safety Constraints
by: Chittepu, Yaswanth, et al.
Published: (2025)
by: Chittepu, Yaswanth, et al.
Published: (2025)
A More Realistic Evaluation of Cross-Frequency Transfer Learning and Foundation Forecasting Models
by: Olivares, Kin G., et al.
Published: (2025)
by: Olivares, Kin G., et al.
Published: (2025)
SureMap: Simultaneous Mean Estimation for Single-Task and Multi-Task Disaggregated Evaluation
by: Khodak, Mikhail, et al.
Published: (2024)
by: Khodak, Mikhail, et al.
Published: (2024)
On the Mechanistic Interpretability of Neural Networks for Causality in Bio-statistics
by: Conan, Jean-Baptiste A.
Published: (2025)
by: Conan, Jean-Baptiste A.
Published: (2025)
Decade-long Emission Forecasting with an Ensemble Model in Taiwan
by: Hung, Gordon, et al.
Published: (2025)
by: Hung, Gordon, et al.
Published: (2025)
Calculating Customer Lifetime Value and Churn using Beta Geometric Negative Binomial and Gamma-Gamma Distribution in a NFT based setting
by: Das, Sagarnil
Published: (2025)
by: Das, Sagarnil
Published: (2025)
Unlocking the Potential of Past Research: Using Generative AI to Reconstruct Healthcare Simulation Models
by: Monks, Thomas, et al.
Published: (2025)
by: Monks, Thomas, et al.
Published: (2025)
Bridging the Data Gap in AI Reliability Research and Establishing DR-AIR, a Comprehensive Data Repository for AI Reliability
by: Zheng, Simin, et al.
Published: (2025)
by: Zheng, Simin, et al.
Published: (2025)
Scalable Stewardship of an LLM-Assisted Clinical Benchmark with Physician Oversight
by: Ye, Junze, et al.
Published: (2025)
by: Ye, Junze, et al.
Published: (2025)
Space evaluation at the starting point of soccer transitions
by: Ogawa, Yohei, et al.
Published: (2025)
by: Ogawa, Yohei, et al.
Published: (2025)
A survey of using EHR as real-world evidence for discovering and validating new drug indications
by: Talukdar, Nabasmita, et al.
Published: (2025)
by: Talukdar, Nabasmita, et al.
Published: (2025)
Classifying Metamorphic versus Single-Fold Proteins with Statistical Learning and AlphaFold2
by: Chen, Yongkai, et al.
Published: (2025)
by: Chen, Yongkai, et al.
Published: (2025)
Towards a rigorous evaluation of RAG systems: the challenge of due diligence
by: Martinon, Grégoire, et al.
Published: (2025)
by: Martinon, Grégoire, et al.
Published: (2025)
Cinder: A fast and fair matchmaking system
by: Pal, Saurav
Published: (2025)
by: Pal, Saurav
Published: (2025)
Using large language models to produce literature reviews: Usages and systematic biases of microphysics parametrizations in 2699 publications
by: Zhang, Tianhang, et al.
Published: (2025)
by: Zhang, Tianhang, et al.
Published: (2025)
Similar Items
-
Learning Causal Representations from General Environments: Identifiability and Intrinsic Ambiguity
by: Jin, Jikai, et al.
Published: (2023) -
Bayesian Nonparametric Dynamical Clustering of Time Series
by: Pérez-Herrero, Adrián, et al.
Published: (2025) -
HiBayES: A Hierarchical Bayesian Modeling Framework for AI Evaluation Statistics
by: Luettgau, Lennart, et al.
Published: (2025) -
Data-Driven Bayesian Network Models of Hurricane Evacuation Decision Making
by: Wang, Hui Sophie, et al.
Published: (2023) -
Bayesian Networks for Causal Analysis in Socioecological Systems
by: Cabañas, Rafael, et al.
Published: (2024)