:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Shichman, Mollie, Bonial, Claire, Blodgett, Austin, Hudson, Taylor, Ferraro, Francis, Rudinger, Rachel
Format:	Preprint
Published:	2025
Subjects:	Computation and Language Artificial Intelligence
Online Access:	https://arxiv.org/abs/2502.18452
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Beyond Memorization: Assessing Semantic Generalization in Large Language Models Using Phrasal Constructions
by: Scivetti, Wesley, et al.
Published: (2025)

Understanding Common Ground Misalignment in Goal-Oriented Dialog: A Case-Study with Ubuntu Chat Logs
by: Sarkar, Rupak, et al.
Published: (2025)

Neither Stochastic Parroting nor AGI: LLMs Solve Tasks through Context-Directed Extrapolation from Training Data Priors
by: Madabushi, Harish Tayyar, et al.
Published: (2025)

Evaluating CxG Generalisation in LLMs via Construction-Based NLI Fine Tuning
by: Mackintosh, Tom, et al.
Published: (2025)

Test-Time Reasoners Are Strategic Multiple-Choice Test-Takers
by: Balepur, Nishant, et al.
Published: (2025)

It's Not Easy Being Wrong: Large Language Models Struggle with Process of Elimination Reasoning
by: Balepur, Nishant, et al.
Published: (2023)

Dancing with Deer: A Constructional Perspective on MWEs in the Era of LLMs
by: Bonial, Claire, et al.
Published: (2025)

Is Your Large Language Model Knowledgeable or a Choices-Only Cheater?
by: Balepur, Nishant, et al.
Published: (2024)

NLI under the Microscope: What Atomic Hypothesis Decomposition Reveals
by: Srikanth, Neha, et al.
Published: (2025)

Human-Robot Dialogue Annotation for Multi-Modal Common Ground
by: Bonial, Claire, et al.
Published: (2024)

RescueADI: Adaptive Disaster Interpretation in Remote Sensing Images with Autonomous Agents
by: Liu, Zhuoran, et al.
Published: (2024)

Susu Box or Piggy Bank: Assessing Cultural Commonsense Knowledge between Ghana and the U.S
by: Acquaye, Christabel, et al.
Published: (2024)

On the Influence of Gender and Race in Romantic Relationship Prediction from Large Language Models
by: Sancheti, Abhilasha, et al.
Published: (2024)

Beyond Math: Stories as a Testbed for Memorization-Constrained Reasoning in LLMs
by: Jiang, Yuxuan, et al.
Published: (2024)

Bridging Reasoning Trajectories in On-Policy Distillation via Near-Future Guidance
by: Jiang, Yuxuan, et al.
Published: (2026)

CoRE: Condition-based Reasoning for Identifying Outcome Variance in Complex Events
by: Vallurupalli, Sai, et al.
Published: (2025)

GPS-Independent Localization Techniques for Disaster Rescue
by: Li, Yingquan, et al.
Published: (2025)

OPTO-MECHANICAL DESIGN OF FRIDA
by: V. Bringas
Published: (2013)

Coronagraph feasibility studies on FRIDA
by: M. N´Diaye
Published: (2007)

HRI Challenges Influencing Low Usage of Robotic Systems in Disaster Response and Rescue Operations
by: Hoque, Shahinul, et al.
Published: (2024)

DRP: Distilled Reasoning Pruning with Skill-aware Step Decomposition for Efficient Large Reasoning Models
by: Jiang, Yuxuan, et al.
Published: (2025)

How often are errors in natural language reasoning due to paraphrastic variability?
by: Srikanth, Neha, et al.
Published: (2024)

Artifacts or Abduction: How Do LLMs Answer Multiple-Choice Questions Without the Question?
by: Balepur, Nishant, et al.
Published: (2024)

Retracted: Earthquake Disaster Rescue Model Based on Complex Adaptive System Theory
by: Complexity
Published: (2024)

A SUBLIMAÇÃO EM FRIDA KAHLO
by: Marli Miranda Bastos
Published: (2008)

Undergraduate Student Success and Library Use: A Multimethod Approach
by: Mayer, Jennifer, et al.
Published: (2020)

Application of Big Data in Disaster Rescue: Coupling Model, Technical System and Effect Evaluation
by: Genli Tang, et al.
Published: (2025)

Plausibly Problematic Questions in Multiple-Choice Benchmarks for Commonsense Reasoning
by: Palta, Shramay, et al.
Published: (2024)

Psychometric Evaluation of the Core Competencies in Disaster Nursing Scale for Disaster Rescue Nurses in Mainland China
by: Jinjia Lai, et al.
Published: (2025)

DiscoTrace: Representing and Comparing Answering Strategies of Humans and LLMs in Information-Seeking Question Answering
by: Srikanth, Neha, et al.
Published: (2026)

On the Mutual Influence of Gender and Occupation in LLM Representations
by: An, Haozhe, et al.
Published: (2025)

Language Models Predict Empathy Gaps Between Social In-groups and Out-groups
by: Hou, Yu, et al.
Published: (2025)

HIGH CONTRAST IMAGING OPTICAL SPECIFICATIONS FOR FRIDA
by: S. Cuevas
Published: (2013)

Petri Net Relaxation for Infeasibility Explanation and Sequential Task Planning
by: Le, Nguyen Cong Nhat, et al.
Published: (2026)

The Electronic School Library Resource Center: Facilities Planning for the New Information Technologies.
by: Blodgett, Teresa, et al.
Published: (1995)

RescueLens: LLM-Powered Triage and Action on Volunteer Feedback for Food Rescue
by: Raman, Naveen, et al.
Published: (2025)

Which of These Best Describes Multiple Choice Evaluation with LLMs? A) Forced B) Flawed C) Fixable D) All of the Above
by: Balepur, Nishant, et al.
Published: (2025)

Multiple LLM Agents Debate for Equitable Cultural Alignment
by: Ki, Dayeon, et al.
Published: (2025)

Speaking the Right Language: The Impact of Expertise Alignment in User-AI Interactions
by: Palta, Shramay, et al.
Published: (2025)

Everything is Plausible: Investigating the Impact of LLM Rationales on Human Notions of Plausibility
by: Palta, Shramay, et al.
Published: (2025)