Saved in:
| Main Authors: | Gaebler, Johann D., Goel, Sharad, Huq, Aziz, Tambe, Prasanna |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2404.03086 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
A Simple, Statistically Robust Test of Discrimination
by: Gaebler, Johann D., et al.
Published: (2024)
by: Gaebler, Johann D., et al.
Published: (2024)
Mitigating Included- and Omitted-Variable Bias in Estimates of Disparate Impact
by: Jung, Jongbin, et al.
Published: (2018)
by: Jung, Jongbin, et al.
Published: (2018)
Blocks as geographic discontinuities: The effect of polling place assignment on voting
by: Tomkins, Sabina, et al.
Published: (2021)
by: Tomkins, Sabina, et al.
Published: (2021)
Bias in Language Models: Beyond Trick Tests and Toward RUTEd Evaluation
by: Lum, Kristian, et al.
Published: (2024)
by: Lum, Kristian, et al.
Published: (2024)
Does a Large Language Model Really Speak in Human-Like Language?
by: Park, Mose, et al.
Published: (2025)
by: Park, Mose, et al.
Published: (2025)
LAVA: Language Model Assisted Verbal Autopsy for Cause-of-Death Determination
by: Chen, Yiqun T., et al.
Published: (2025)
by: Chen, Yiqun T., et al.
Published: (2025)
Adding Error Bars to Evals: A Statistical Approach to Language Model Evaluations
by: Miller, Evan
Published: (2024)
by: Miller, Evan
Published: (2024)
Enhancing Systematic Reviews with Large Language Models: Using GPT-4 and Kimi
by: Kaptur, Dandan Chen, et al.
Published: (2025)
by: Kaptur, Dandan Chen, et al.
Published: (2025)
Language Hierarchization Provides the Optimal Solution to Human Working Memory Limits
by: Chen, Luyao, et al.
Published: (2026)
by: Chen, Luyao, et al.
Published: (2026)
A Novel Metric for Measuring the Robustness of Large Language Models in Non-adversarial Scenarios
by: Ackerman, Samuel, et al.
Published: (2024)
by: Ackerman, Samuel, et al.
Published: (2024)
Judging It, Washing It: Scoring and Greenwashing Corporate Climate Disclosures using Large Language Models
by: Chuang, Marianne, et al.
Published: (2025)
by: Chuang, Marianne, et al.
Published: (2025)
A Profit-Based Measure of Lending Discrimination
by: Coots, Madison, et al.
Published: (2025)
by: Coots, Madison, et al.
Published: (2025)
Large Language Models for Full-Text Methods Assessment: A Case Study on Mediation Analysis
by: Zhang, Wenqing, et al.
Published: (2025)
by: Zhang, Wenqing, et al.
Published: (2025)
Personalized Prediction of Perceived Message Effectiveness Using Large Language Model Based Digital Twins
by: Han, Jasmin, et al.
Published: (2026)
by: Han, Jasmin, et al.
Published: (2026)
Systematic Evaluation of Uncertainty Estimation Methods in Large Language Models
by: Hobelsberger, Christian, et al.
Published: (2025)
by: Hobelsberger, Christian, et al.
Published: (2025)
The Use of a Large Language Model for Cyberbullying Detection
by: Ogunleye, Bayode, et al.
Published: (2024)
by: Ogunleye, Bayode, et al.
Published: (2024)
Repeated Sequences Reveal Gaps between Large Language Models and Natural Language
by: Tanaka-Ishii, Kumiko
Published: (2026)
by: Tanaka-Ishii, Kumiko
Published: (2026)
Sampling the Swadesh List to Identify Similar Languages with Tree Spaces
by: Ordway, Garett, et al.
Published: (2024)
by: Ordway, Garett, et al.
Published: (2024)
Bayesian Evaluation of Large Language Model Behavior
by: Longjohn, Rachel, et al.
Published: (2025)
by: Longjohn, Rachel, et al.
Published: (2025)
Language Markers of Emotion Flexibility Predict Depression and Anxiety Treatment Outcomes
by: Brindle, Benjamin, et al.
Published: (2026)
by: Brindle, Benjamin, et al.
Published: (2026)
Metacognitive Myopia in Large Language Models
by: Scholten, Florian, et al.
Published: (2024)
by: Scholten, Florian, et al.
Published: (2024)
Dynamic Topic Language Model on Heterogeneous Children's Mental Health Clinical Notes
by: Ye, Hanwen, et al.
Published: (2023)
by: Ye, Hanwen, et al.
Published: (2023)
TransitGPT: A Generative AI-based framework for interacting with GTFS data using Large Language Models
by: Devunuri, Saipraneeth, et al.
Published: (2024)
by: Devunuri, Saipraneeth, et al.
Published: (2024)
Advanced Crash Causation Analysis for Freeway Safety: A Large Language Model Approach to Identifying Key Contributing Factors
by: Abdelrahman, Ahmed S., et al.
Published: (2025)
by: Abdelrahman, Ahmed S., et al.
Published: (2025)
Limits of Large Language Models in Debating Humans
by: Flamino, James, et al.
Published: (2024)
by: Flamino, James, et al.
Published: (2024)
Using Twitter Data to Understand Public Perceptions of Approved versus Off-label Use for COVID-19-related Medications
by: Hua, Yining, et al.
Published: (2022)
by: Hua, Yining, et al.
Published: (2022)
Improving Probabilistic Models in Text Classification via Active Learning
by: Bosley, Mitchell, et al.
Published: (2022)
by: Bosley, Mitchell, et al.
Published: (2022)
A Design-based Solution for Causal Inference with Text: Can a Language Model Be Too Large?
by: Tierney, Graham, et al.
Published: (2025)
by: Tierney, Graham, et al.
Published: (2025)
LLMs and Agentic AI in Insurance Decision-Making: Opportunities and Challenges For Africa
by: Hill, Graham, et al.
Published: (2025)
by: Hill, Graham, et al.
Published: (2025)
Domain-Shift-Aware Conformal Prediction for Large Language Models
by: Lin, Zhexiao, et al.
Published: (2025)
by: Lin, Zhexiao, et al.
Published: (2025)
The GPT Surprise: Offering Large Language Model Chat in a Massive Coding Class Reduced Engagement but Increased Adopters Exam Performances
by: Nie, Allen, et al.
Published: (2024)
by: Nie, Allen, et al.
Published: (2024)
How to Choose a Threshold for an Evaluation Metric for Large Language Models
by: Sarmah, Bhaskarjit, et al.
Published: (2024)
by: Sarmah, Bhaskarjit, et al.
Published: (2024)
From Traditional Taggers to LLMs: A Comparative Study of POS Tagging for Medieval Romance Languages
by: Schöffel, Matthias, et al.
Published: (2026)
by: Schöffel, Matthias, et al.
Published: (2026)
Language Models as Causal Effect Generators
by: Bynum, Lucius E. J., et al.
Published: (2024)
by: Bynum, Lucius E. J., et al.
Published: (2024)
A Latent Dirichlet Allocation (LDA) Semantic Text Analytics Approach to Explore Topical Features in Charity Crowdfunding Campaigns
by: Muzumdar, Prathamesh, et al.
Published: (2024)
by: Muzumdar, Prathamesh, et al.
Published: (2024)
Exploring the Comprehension of ChatGPT in Traditional Chinese Medicine Knowledge
by: Yizhen, Li, et al.
Published: (2024)
by: Yizhen, Li, et al.
Published: (2024)
The Multi-Range Theory of Translation Quality Measurement: MQM scoring models and Statistical Quality Control
by: Lommel, Arle, et al.
Published: (2024)
by: Lommel, Arle, et al.
Published: (2024)
Less than one percent of words would be affected by gender-inclusive language in German press texts
by: Müller-Spitzer, Carolin, et al.
Published: (2024)
by: Müller-Spitzer, Carolin, et al.
Published: (2024)
Emotion Detection with Transformers: A Comparative Study
by: Rezapour, Mahdi
Published: (2024)
by: Rezapour, Mahdi
Published: (2024)
Agent Q-Mix: Selecting the Right Action for LLM Multi-Agent Systems through Reinforcement Learning
by: Jiang, Eric Hanchen, et al.
Published: (2026)
by: Jiang, Eric Hanchen, et al.
Published: (2026)
Similar Items
-
A Simple, Statistically Robust Test of Discrimination
by: Gaebler, Johann D., et al.
Published: (2024) -
Mitigating Included- and Omitted-Variable Bias in Estimates of Disparate Impact
by: Jung, Jongbin, et al.
Published: (2018) -
Blocks as geographic discontinuities: The effect of polling place assignment on voting
by: Tomkins, Sabina, et al.
Published: (2021) -
Bias in Language Models: Beyond Trick Tests and Toward RUTEd Evaluation
by: Lum, Kristian, et al.
Published: (2024) -
Does a Large Language Model Really Speak in Human-Like Language?
by: Park, Mose, et al.
Published: (2025)