Saved in:
| Main Authors: | Cuneo, Nicole, Graves, Eleanor, Rakshit, Supantho, Goldberg, Adele E. |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2505.09005 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Meaning-infused grammar: Gradient Acceptability Shapes the Geometric Representations of Constructions in LLMs
by: Rakshit, Supantho, et al.
Published: (2025)
by: Rakshit, Supantho, et al.
Published: (2025)
A suite of LMs comprehend puzzle statements as well as humans
by: Goldberg, Adele E, et al.
Published: (2025)
by: Goldberg, Adele E, et al.
Published: (2025)
Learning Unacceptability: Repeated Exposure to Acceptable Sentences Improves Adult Learners’ Recognition of Unacceptable Sentences
by: Karina Tachihara, et al.
Published: (2024)
by: Karina Tachihara, et al.
Published: (2024)
Linguistic Generalizations are not Rules: Impacts on Evaluation of LMs
by: Weissweiler, Leonie, et al.
Published: (2025)
by: Weissweiler, Leonie, et al.
Published: (2025)
Role of Dependency Distance in Text Simplification: A Human vs ChatGPT Simplification Comparison
by: Lee, Sumi, et al.
Published: (2024)
by: Lee, Sumi, et al.
Published: (2024)
From Human Judgements to Predictive Models: Unravelling Acceptability in Code-Mixed Sentences
by: Kodali, Prashant, et al.
Published: (2024)
by: Kodali, Prashant, et al.
Published: (2024)
Predicting Sentence Acceptability Judgments in Multimodal Contexts
by: Jang, Hyewon, et al.
Published: (2026)
by: Jang, Hyewon, et al.
Published: (2026)
Text Understanding in GPT-4 vs Humans
by: Shultz, Thomas R., et al.
Published: (2024)
by: Shultz, Thomas R., et al.
Published: (2024)
If Probable, Then Acceptable? Understanding Conditional Acceptability Judgments in Large Language Models
by: Orth, Jasmin, et al.
Published: (2025)
by: Orth, Jasmin, et al.
Published: (2025)
Parallelograms Strike Back: LLMs Generate Better Analogies than People
by: Liu, Qiawen Ella, et al.
Published: (2026)
by: Liu, Qiawen Ella, et al.
Published: (2026)
Generative AI-Based Text Generation Methods Using Pre-Trained GPT-2 Model
by: Pandey, Rohit, et al.
Published: (2024)
by: Pandey, Rohit, et al.
Published: (2024)
Inner Speech as Behavior Guides: Steerable Imitation of Diverse Behaviors for Human-AI coordination
by: Trivedi, Rakshit, et al.
Published: (2026)
by: Trivedi, Rakshit, et al.
Published: (2026)
Leveraging Collection-Wide Similarities for Unsupervised Document Structure Extraction
by: Lior, Gili, et al.
Published: (2024)
by: Lior, Gili, et al.
Published: (2024)
A comparison of Human, GPT-3.5, and GPT-4 Performance in a University-Level Coding Course
by: Yeadon, Will, et al.
Published: (2024)
by: Yeadon, Will, et al.
Published: (2024)
The Distribution of Dependency Distance and Hierarchical Distance in Contemporary Written Japanese and Its Influencing Factors
by: Wang, Linxuan, et al.
Published: (2025)
by: Wang, Linxuan, et al.
Published: (2025)
Humans Perceive Wrong Narratives from AI Reasoning Texts
by: Levy, Mosh, et al.
Published: (2025)
by: Levy, Mosh, et al.
Published: (2025)
Quantum Transfer Learning for Acceptability Judgements
by: Buonaiuto, Giuseppe, et al.
Published: (2024)
by: Buonaiuto, Giuseppe, et al.
Published: (2024)
COLD-Steer: Steering Large Language Models via In-Context One-step Learning Dynamics
by: Sharma, Kartik, et al.
Published: (2026)
by: Sharma, Kartik, et al.
Published: (2026)
Distance between Relevant Information Pieces Causes Bias in Long-Context LLMs
by: Tian, Runchu, et al.
Published: (2024)
by: Tian, Runchu, et al.
Published: (2024)
MELA: Multilingual Evaluation of Linguistic Acceptability
by: Zhang, Ziyin, et al.
Published: (2023)
by: Zhang, Ziyin, et al.
Published: (2023)
Leveraging a Cognitive Model to Measure Subjective Similarity of Human and GPT-4 Written Content
by: Malloy, Tailia, et al.
Published: (2024)
by: Malloy, Tailia, et al.
Published: (2024)
Adversarial Robustness through Dynamic Ensemble Learning
by: Waghela, Hetvi, et al.
Published: (2024)
by: Waghela, Hetvi, et al.
Published: (2024)
Pseudo-Deliberation in Language Models: When Reasoning Fails to Align Values and Actions
by: Rakshit, Sushrita, et al.
Published: (2026)
by: Rakshit, Sushrita, et al.
Published: (2026)
Affective Experiences of International and Home Students during the Information Search Process
by: Haley, Adele Nicole, et al.
Published: (2017)
by: Haley, Adele Nicole, et al.
Published: (2017)
A Comparison of Human and ChatGPT Classification Performance on Complex Social Media Data
by: Green, Breanna E., et al.
Published: (2025)
by: Green, Breanna E., et al.
Published: (2025)
GPT-4's One-Dimensional Mapping of Morality: How the Accuracy of Country-Estimates Depends on Moral Domain
by: Strimling, Pontus, et al.
Published: (2024)
by: Strimling, Pontus, et al.
Published: (2024)
Contextualising (Im)plausible Events Triggers Figurative Language
by: Eichel, Annerose, et al.
Published: (2026)
by: Eichel, Annerose, et al.
Published: (2026)
100% Elimination of Hallucinations on RAGTruth for GPT-4 and GPT-3.5 Turbo
by: Wood, Michael C., et al.
Published: (2024)
by: Wood, Michael C., et al.
Published: (2024)
SWAN-GPT: An Efficient and Scalable Approach for Long-Context Language Modeling
by: Puvvada, Krishna C., et al.
Published: (2025)
by: Puvvada, Krishna C., et al.
Published: (2025)
From General to Targeted Rewards: Surpassing GPT-4 in Open-Ended Long-Context Generation
by: Guo, Zhihan, et al.
Published: (2025)
by: Guo, Zhihan, et al.
Published: (2025)
The Human and the Mechanical: logos, truthfulness, and ChatGPT
by: Giannakidou, Anastasia, et al.
Published: (2024)
by: Giannakidou, Anastasia, et al.
Published: (2024)
Long Context is Not Long at All: A Prospector of Long-Dependency Data for Large Language Models
by: Chen, Longze, et al.
Published: (2024)
by: Chen, Longze, et al.
Published: (2024)
GPT-4V(ision) for Robotics: Multimodal Task Planning from Human Demonstration
by: Wake, Naoki, et al.
Published: (2023)
by: Wake, Naoki, et al.
Published: (2023)
DeID-GPT: Zero-shot Medical Text De-Identification by GPT-4
by: Liu, Zhengliang, et al.
Published: (2023)
by: Liu, Zhengliang, et al.
Published: (2023)
Is ChatGPT More Empathetic than Humans?
by: Welivita, Anuradha, et al.
Published: (2024)
by: Welivita, Anuradha, et al.
Published: (2024)
Is GPT-4 Less Politically Biased than GPT-3.5? A Renewed Investigation of ChatGPT's Political Biases
by: Weber, Erik, et al.
Published: (2024)
by: Weber, Erik, et al.
Published: (2024)
On the Role of Context in Reading Time Prediction
by: Opedal, Andreas, et al.
Published: (2024)
by: Opedal, Andreas, et al.
Published: (2024)
Better Bill GPT: Comparing Large Language Models against Legal Invoice Reviewers
by: Whitehouse, Nick, et al.
Published: (2025)
by: Whitehouse, Nick, et al.
Published: (2025)
GPTEval: A Survey on Assessments of ChatGPT and GPT-4
by: Mao, Rui, et al.
Published: (2023)
by: Mao, Rui, et al.
Published: (2023)
Classification performance and reproducibility of GPT-4 omni for information extraction from veterinary electronic health records
by: Wulcan, Judit M, et al.
Published: (2024)
by: Wulcan, Judit M, et al.
Published: (2024)
Similar Items
-
Meaning-infused grammar: Gradient Acceptability Shapes the Geometric Representations of Constructions in LLMs
by: Rakshit, Supantho, et al.
Published: (2025) -
A suite of LMs comprehend puzzle statements as well as humans
by: Goldberg, Adele E, et al.
Published: (2025) -
Learning Unacceptability: Repeated Exposure to Acceptable Sentences Improves Adult Learners’ Recognition of Unacceptable Sentences
by: Karina Tachihara, et al.
Published: (2024) -
Linguistic Generalizations are not Rules: Impacts on Evaluation of LMs
by: Weissweiler, Leonie, et al.
Published: (2025) -
Role of Dependency Distance in Text Simplification: A Human vs ChatGPT Simplification Comparison
by: Lee, Sumi, et al.
Published: (2024)