Saved in:
| Main Authors: | Shichman, Mollie, Bonial, Claire, Blodgett, Austin, Hudson, Taylor, Ferraro, Francis, Rudinger, Rachel |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2502.18452 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Beyond Memorization: Assessing Semantic Generalization in Large Language Models Using Phrasal Constructions
by: Scivetti, Wesley, et al.
Published: (2025)
by: Scivetti, Wesley, et al.
Published: (2025)
Understanding Common Ground Misalignment in Goal-Oriented Dialog: A Case-Study with Ubuntu Chat Logs
by: Sarkar, Rupak, et al.
Published: (2025)
by: Sarkar, Rupak, et al.
Published: (2025)
Neither Stochastic Parroting nor AGI: LLMs Solve Tasks through Context-Directed Extrapolation from Training Data Priors
by: Madabushi, Harish Tayyar, et al.
Published: (2025)
by: Madabushi, Harish Tayyar, et al.
Published: (2025)
Evaluating CxG Generalisation in LLMs via Construction-Based NLI Fine Tuning
by: Mackintosh, Tom, et al.
Published: (2025)
by: Mackintosh, Tom, et al.
Published: (2025)
Test-Time Reasoners Are Strategic Multiple-Choice Test-Takers
by: Balepur, Nishant, et al.
Published: (2025)
by: Balepur, Nishant, et al.
Published: (2025)
It's Not Easy Being Wrong: Large Language Models Struggle with Process of Elimination Reasoning
by: Balepur, Nishant, et al.
Published: (2023)
by: Balepur, Nishant, et al.
Published: (2023)
Dancing with Deer: A Constructional Perspective on MWEs in the Era of LLMs
by: Bonial, Claire, et al.
Published: (2025)
by: Bonial, Claire, et al.
Published: (2025)
Is Your Large Language Model Knowledgeable or a Choices-Only Cheater?
by: Balepur, Nishant, et al.
Published: (2024)
by: Balepur, Nishant, et al.
Published: (2024)
NLI under the Microscope: What Atomic Hypothesis Decomposition Reveals
by: Srikanth, Neha, et al.
Published: (2025)
by: Srikanth, Neha, et al.
Published: (2025)
Human-Robot Dialogue Annotation for Multi-Modal Common Ground
by: Bonial, Claire, et al.
Published: (2024)
by: Bonial, Claire, et al.
Published: (2024)
RescueADI: Adaptive Disaster Interpretation in Remote Sensing Images with Autonomous Agents
by: Liu, Zhuoran, et al.
Published: (2024)
by: Liu, Zhuoran, et al.
Published: (2024)
Susu Box or Piggy Bank: Assessing Cultural Commonsense Knowledge between Ghana and the U.S
by: Acquaye, Christabel, et al.
Published: (2024)
by: Acquaye, Christabel, et al.
Published: (2024)
On the Influence of Gender and Race in Romantic Relationship Prediction from Large Language Models
by: Sancheti, Abhilasha, et al.
Published: (2024)
by: Sancheti, Abhilasha, et al.
Published: (2024)
Beyond Math: Stories as a Testbed for Memorization-Constrained Reasoning in LLMs
by: Jiang, Yuxuan, et al.
Published: (2024)
by: Jiang, Yuxuan, et al.
Published: (2024)
Bridging Reasoning Trajectories in On-Policy Distillation via Near-Future Guidance
by: Jiang, Yuxuan, et al.
Published: (2026)
by: Jiang, Yuxuan, et al.
Published: (2026)
CoRE: Condition-based Reasoning for Identifying Outcome Variance in Complex Events
by: Vallurupalli, Sai, et al.
Published: (2025)
by: Vallurupalli, Sai, et al.
Published: (2025)
GPS-Independent Localization Techniques for Disaster Rescue
by: Li, Yingquan, et al.
Published: (2025)
by: Li, Yingquan, et al.
Published: (2025)
OPTO-MECHANICAL DESIGN OF FRIDA
by: V. Bringas
Published: (2013)
by: V. Bringas
Published: (2013)
Coronagraph feasibility studies on FRIDA
by: M. N´Diaye
Published: (2007)
by: M. N´Diaye
Published: (2007)
HRI Challenges Influencing Low Usage of Robotic Systems in Disaster Response and Rescue Operations
by: Hoque, Shahinul, et al.
Published: (2024)
by: Hoque, Shahinul, et al.
Published: (2024)
DRP: Distilled Reasoning Pruning with Skill-aware Step Decomposition for Efficient Large Reasoning Models
by: Jiang, Yuxuan, et al.
Published: (2025)
by: Jiang, Yuxuan, et al.
Published: (2025)
How often are errors in natural language reasoning due to paraphrastic variability?
by: Srikanth, Neha, et al.
Published: (2024)
by: Srikanth, Neha, et al.
Published: (2024)
Artifacts or Abduction: How Do LLMs Answer Multiple-Choice Questions Without the Question?
by: Balepur, Nishant, et al.
Published: (2024)
by: Balepur, Nishant, et al.
Published: (2024)
Retracted: Earthquake Disaster Rescue Model Based on Complex Adaptive System Theory
by: Complexity
Published: (2024)
by: Complexity
Published: (2024)
A SUBLIMAÇÃO EM FRIDA KAHLO
by: Marli Miranda Bastos
Published: (2008)
by: Marli Miranda Bastos
Published: (2008)
Undergraduate Student Success and Library Use: A Multimethod Approach
by: Mayer, Jennifer, et al.
Published: (2020)
by: Mayer, Jennifer, et al.
Published: (2020)
Application of Big Data in Disaster Rescue: Coupling Model, Technical System and Effect Evaluation
by: Genli Tang, et al.
Published: (2025)
by: Genli Tang, et al.
Published: (2025)
Plausibly Problematic Questions in Multiple-Choice Benchmarks for Commonsense Reasoning
by: Palta, Shramay, et al.
Published: (2024)
by: Palta, Shramay, et al.
Published: (2024)
Psychometric Evaluation of the Core Competencies in Disaster Nursing Scale for Disaster Rescue Nurses in Mainland China
by: Jinjia Lai, et al.
Published: (2025)
by: Jinjia Lai, et al.
Published: (2025)
DiscoTrace: Representing and Comparing Answering Strategies of Humans and LLMs in Information-Seeking Question Answering
by: Srikanth, Neha, et al.
Published: (2026)
by: Srikanth, Neha, et al.
Published: (2026)
On the Mutual Influence of Gender and Occupation in LLM Representations
by: An, Haozhe, et al.
Published: (2025)
by: An, Haozhe, et al.
Published: (2025)
Language Models Predict Empathy Gaps Between Social In-groups and Out-groups
by: Hou, Yu, et al.
Published: (2025)
by: Hou, Yu, et al.
Published: (2025)
HIGH CONTRAST IMAGING OPTICAL SPECIFICATIONS FOR FRIDA
by: S. Cuevas
Published: (2013)
by: S. Cuevas
Published: (2013)
Petri Net Relaxation for Infeasibility Explanation and Sequential Task Planning
by: Le, Nguyen Cong Nhat, et al.
Published: (2026)
by: Le, Nguyen Cong Nhat, et al.
Published: (2026)
The Electronic School Library Resource Center: Facilities Planning for the New Information Technologies.
by: Blodgett, Teresa, et al.
Published: (1995)
by: Blodgett, Teresa, et al.
Published: (1995)
RescueLens: LLM-Powered Triage and Action on Volunteer Feedback for Food Rescue
by: Raman, Naveen, et al.
Published: (2025)
by: Raman, Naveen, et al.
Published: (2025)
Which of These Best Describes Multiple Choice Evaluation with LLMs? A) Forced B) Flawed C) Fixable D) All of the Above
by: Balepur, Nishant, et al.
Published: (2025)
by: Balepur, Nishant, et al.
Published: (2025)
Multiple LLM Agents Debate for Equitable Cultural Alignment
by: Ki, Dayeon, et al.
Published: (2025)
by: Ki, Dayeon, et al.
Published: (2025)
Speaking the Right Language: The Impact of Expertise Alignment in User-AI Interactions
by: Palta, Shramay, et al.
Published: (2025)
by: Palta, Shramay, et al.
Published: (2025)
Everything is Plausible: Investigating the Impact of LLM Rationales on Human Notions of Plausibility
by: Palta, Shramay, et al.
Published: (2025)
by: Palta, Shramay, et al.
Published: (2025)
Similar Items
-
Beyond Memorization: Assessing Semantic Generalization in Large Language Models Using Phrasal Constructions
by: Scivetti, Wesley, et al.
Published: (2025) -
Understanding Common Ground Misalignment in Goal-Oriented Dialog: A Case-Study with Ubuntu Chat Logs
by: Sarkar, Rupak, et al.
Published: (2025) -
Neither Stochastic Parroting nor AGI: LLMs Solve Tasks through Context-Directed Extrapolation from Training Data Priors
by: Madabushi, Harish Tayyar, et al.
Published: (2025) -
Evaluating CxG Generalisation in LLMs via Construction-Based NLI Fine Tuning
by: Mackintosh, Tom, et al.
Published: (2025) -
Test-Time Reasoners Are Strategic Multiple-Choice Test-Takers
by: Balepur, Nishant, et al.
Published: (2025)