:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Cuneo, Nicole, Graves, Eleanor, Rakshit, Supantho, Goldberg, Adele E.
Format:	Preprint
Published:	2025
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2505.09005
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Meaning-infused grammar: Gradient Acceptability Shapes the Geometric Representations of Constructions in LLMs
by: Rakshit, Supantho, et al.
Published: (2025)

A suite of LMs comprehend puzzle statements as well as humans
by: Goldberg, Adele E, et al.
Published: (2025)

Learning Unacceptability: Repeated Exposure to Acceptable Sentences Improves Adult Learners’ Recognition of Unacceptable Sentences
by: Karina Tachihara, et al.
Published: (2024)

Linguistic Generalizations are not Rules: Impacts on Evaluation of LMs
by: Weissweiler, Leonie, et al.
Published: (2025)

Role of Dependency Distance in Text Simplification: A Human vs ChatGPT Simplification Comparison
by: Lee, Sumi, et al.
Published: (2024)

From Human Judgements to Predictive Models: Unravelling Acceptability in Code-Mixed Sentences
by: Kodali, Prashant, et al.
Published: (2024)

Predicting Sentence Acceptability Judgments in Multimodal Contexts
by: Jang, Hyewon, et al.
Published: (2026)

Text Understanding in GPT-4 vs Humans
by: Shultz, Thomas R., et al.
Published: (2024)

If Probable, Then Acceptable? Understanding Conditional Acceptability Judgments in Large Language Models
by: Orth, Jasmin, et al.
Published: (2025)

Parallelograms Strike Back: LLMs Generate Better Analogies than People
by: Liu, Qiawen Ella, et al.
Published: (2026)

Generative AI-Based Text Generation Methods Using Pre-Trained GPT-2 Model
by: Pandey, Rohit, et al.
Published: (2024)

Inner Speech as Behavior Guides: Steerable Imitation of Diverse Behaviors for Human-AI coordination
by: Trivedi, Rakshit, et al.
Published: (2026)

Leveraging Collection-Wide Similarities for Unsupervised Document Structure Extraction
by: Lior, Gili, et al.
Published: (2024)

A comparison of Human, GPT-3.5, and GPT-4 Performance in a University-Level Coding Course
by: Yeadon, Will, et al.
Published: (2024)

The Distribution of Dependency Distance and Hierarchical Distance in Contemporary Written Japanese and Its Influencing Factors
by: Wang, Linxuan, et al.
Published: (2025)

Humans Perceive Wrong Narratives from AI Reasoning Texts
by: Levy, Mosh, et al.
Published: (2025)

Quantum Transfer Learning for Acceptability Judgements
by: Buonaiuto, Giuseppe, et al.
Published: (2024)

COLD-Steer: Steering Large Language Models via In-Context One-step Learning Dynamics
by: Sharma, Kartik, et al.
Published: (2026)

Distance between Relevant Information Pieces Causes Bias in Long-Context LLMs
by: Tian, Runchu, et al.
Published: (2024)

MELA: Multilingual Evaluation of Linguistic Acceptability
by: Zhang, Ziyin, et al.
Published: (2023)

Leveraging a Cognitive Model to Measure Subjective Similarity of Human and GPT-4 Written Content
by: Malloy, Tailia, et al.
Published: (2024)

Adversarial Robustness through Dynamic Ensemble Learning
by: Waghela, Hetvi, et al.
Published: (2024)

Pseudo-Deliberation in Language Models: When Reasoning Fails to Align Values and Actions
by: Rakshit, Sushrita, et al.
Published: (2026)

Affective Experiences of International and Home Students during the Information Search Process
by: Haley, Adele Nicole, et al.
Published: (2017)

A Comparison of Human and ChatGPT Classification Performance on Complex Social Media Data
by: Green, Breanna E., et al.
Published: (2025)

GPT-4's One-Dimensional Mapping of Morality: How the Accuracy of Country-Estimates Depends on Moral Domain
by: Strimling, Pontus, et al.
Published: (2024)

Contextualising (Im)plausible Events Triggers Figurative Language
by: Eichel, Annerose, et al.
Published: (2026)

100% Elimination of Hallucinations on RAGTruth for GPT-4 and GPT-3.5 Turbo
by: Wood, Michael C., et al.
Published: (2024)

SWAN-GPT: An Efficient and Scalable Approach for Long-Context Language Modeling
by: Puvvada, Krishna C., et al.
Published: (2025)

From General to Targeted Rewards: Surpassing GPT-4 in Open-Ended Long-Context Generation
by: Guo, Zhihan, et al.
Published: (2025)

The Human and the Mechanical: logos, truthfulness, and ChatGPT
by: Giannakidou, Anastasia, et al.
Published: (2024)

Long Context is Not Long at All: A Prospector of Long-Dependency Data for Large Language Models
by: Chen, Longze, et al.
Published: (2024)

GPT-4V(ision) for Robotics: Multimodal Task Planning from Human Demonstration
by: Wake, Naoki, et al.
Published: (2023)

DeID-GPT: Zero-shot Medical Text De-Identification by GPT-4
by: Liu, Zhengliang, et al.
Published: (2023)

Is ChatGPT More Empathetic than Humans?
by: Welivita, Anuradha, et al.
Published: (2024)

Is GPT-4 Less Politically Biased than GPT-3.5? A Renewed Investigation of ChatGPT's Political Biases
by: Weber, Erik, et al.
Published: (2024)

On the Role of Context in Reading Time Prediction
by: Opedal, Andreas, et al.
Published: (2024)

Better Bill GPT: Comparing Large Language Models against Legal Invoice Reviewers
by: Whitehouse, Nick, et al.
Published: (2025)

GPTEval: A Survey on Assessments of ChatGPT and GPT-4
by: Mao, Rui, et al.
Published: (2023)

Classification performance and reproducibility of GPT-4 omni for information extraction from veterinary electronic health records
by: Wulcan, Judit M, et al.
Published: (2024)