:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Gaebler, Johann D., Goel, Sharad, Huq, Aziz, Tambe, Prasanna
Format:	Preprint
Published:	2024
Subjects:	Applications Computation and Language
Online Access:	https://arxiv.org/abs/2404.03086
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

A Simple, Statistically Robust Test of Discrimination
by: Gaebler, Johann D., et al.
Published: (2024)

Mitigating Included- and Omitted-Variable Bias in Estimates of Disparate Impact
by: Jung, Jongbin, et al.
Published: (2018)

Blocks as geographic discontinuities: The effect of polling place assignment on voting
by: Tomkins, Sabina, et al.
Published: (2021)

Bias in Language Models: Beyond Trick Tests and Toward RUTEd Evaluation
by: Lum, Kristian, et al.
Published: (2024)

Does a Large Language Model Really Speak in Human-Like Language?
by: Park, Mose, et al.
Published: (2025)

LAVA: Language Model Assisted Verbal Autopsy for Cause-of-Death Determination
by: Chen, Yiqun T., et al.
Published: (2025)

Adding Error Bars to Evals: A Statistical Approach to Language Model Evaluations
by: Miller, Evan
Published: (2024)

Enhancing Systematic Reviews with Large Language Models: Using GPT-4 and Kimi
by: Kaptur, Dandan Chen, et al.
Published: (2025)

Language Hierarchization Provides the Optimal Solution to Human Working Memory Limits
by: Chen, Luyao, et al.
Published: (2026)

A Novel Metric for Measuring the Robustness of Large Language Models in Non-adversarial Scenarios
by: Ackerman, Samuel, et al.
Published: (2024)

Judging It, Washing It: Scoring and Greenwashing Corporate Climate Disclosures using Large Language Models
by: Chuang, Marianne, et al.
Published: (2025)

A Profit-Based Measure of Lending Discrimination
by: Coots, Madison, et al.
Published: (2025)

Large Language Models for Full-Text Methods Assessment: A Case Study on Mediation Analysis
by: Zhang, Wenqing, et al.
Published: (2025)

Personalized Prediction of Perceived Message Effectiveness Using Large Language Model Based Digital Twins
by: Han, Jasmin, et al.
Published: (2026)

Systematic Evaluation of Uncertainty Estimation Methods in Large Language Models
by: Hobelsberger, Christian, et al.
Published: (2025)

The Use of a Large Language Model for Cyberbullying Detection
by: Ogunleye, Bayode, et al.
Published: (2024)

Repeated Sequences Reveal Gaps between Large Language Models and Natural Language
by: Tanaka-Ishii, Kumiko
Published: (2026)

Sampling the Swadesh List to Identify Similar Languages with Tree Spaces
by: Ordway, Garett, et al.
Published: (2024)

Bayesian Evaluation of Large Language Model Behavior
by: Longjohn, Rachel, et al.
Published: (2025)

Language Markers of Emotion Flexibility Predict Depression and Anxiety Treatment Outcomes
by: Brindle, Benjamin, et al.
Published: (2026)

Metacognitive Myopia in Large Language Models
by: Scholten, Florian, et al.
Published: (2024)

Dynamic Topic Language Model on Heterogeneous Children's Mental Health Clinical Notes
by: Ye, Hanwen, et al.
Published: (2023)

TransitGPT: A Generative AI-based framework for interacting with GTFS data using Large Language Models
by: Devunuri, Saipraneeth, et al.
Published: (2024)

Advanced Crash Causation Analysis for Freeway Safety: A Large Language Model Approach to Identifying Key Contributing Factors
by: Abdelrahman, Ahmed S., et al.
Published: (2025)

Limits of Large Language Models in Debating Humans
by: Flamino, James, et al.
Published: (2024)

Using Twitter Data to Understand Public Perceptions of Approved versus Off-label Use for COVID-19-related Medications
by: Hua, Yining, et al.
Published: (2022)

Improving Probabilistic Models in Text Classification via Active Learning
by: Bosley, Mitchell, et al.
Published: (2022)

A Design-based Solution for Causal Inference with Text: Can a Language Model Be Too Large?
by: Tierney, Graham, et al.
Published: (2025)

LLMs and Agentic AI in Insurance Decision-Making: Opportunities and Challenges For Africa
by: Hill, Graham, et al.
Published: (2025)

Domain-Shift-Aware Conformal Prediction for Large Language Models
by: Lin, Zhexiao, et al.
Published: (2025)

The GPT Surprise: Offering Large Language Model Chat in a Massive Coding Class Reduced Engagement but Increased Adopters Exam Performances
by: Nie, Allen, et al.
Published: (2024)

How to Choose a Threshold for an Evaluation Metric for Large Language Models
by: Sarmah, Bhaskarjit, et al.
Published: (2024)

From Traditional Taggers to LLMs: A Comparative Study of POS Tagging for Medieval Romance Languages
by: Schöffel, Matthias, et al.
Published: (2026)

Language Models as Causal Effect Generators
by: Bynum, Lucius E. J., et al.
Published: (2024)

A Latent Dirichlet Allocation (LDA) Semantic Text Analytics Approach to Explore Topical Features in Charity Crowdfunding Campaigns
by: Muzumdar, Prathamesh, et al.
Published: (2024)

Exploring the Comprehension of ChatGPT in Traditional Chinese Medicine Knowledge
by: Yizhen, Li, et al.
Published: (2024)

The Multi-Range Theory of Translation Quality Measurement: MQM scoring models and Statistical Quality Control
by: Lommel, Arle, et al.
Published: (2024)

Less than one percent of words would be affected by gender-inclusive language in German press texts
by: Müller-Spitzer, Carolin, et al.
Published: (2024)

Emotion Detection with Transformers: A Comparative Study
by: Rezapour, Mahdi
Published: (2024)

Agent Q-Mix: Selecting the Right Action for LLM Multi-Agent Systems through Reinforcement Learning
by: Jiang, Eric Hanchen, et al.
Published: (2026)