:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	West, Peter, Potts, Christopher
Format:	Preprint
Published:	2025
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2505.00047
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Let the Poem Hit the Rhythm: Using a Byte-Based Transformer for Beat-Aligned Poetry Generation
by: Elzohbi, Mohamad, et al.
Published: (2024)

Counterfactual Simulation Training for Chain-of-Thought Faithfulness
by: Hase, Peter, et al.
Published: (2026)

Large Language Models Align with the Human Brain during Creative Thinking
by: Ismayilzada, Mete, et al.
Published: (2026)

Demystifying Verbatim Memorization in Large Language Models
by: Huang, Jing, et al.
Published: (2024)

Beyond Divergent Creativity: A Human-Based Evaluation of Creativity in Large Language Models
by: Nakajima, Kumiko, et al.
Published: (2026)

Improved Representation Steering for Language Models
by: Wu, Zhengxuan, et al.
Published: (2025)

False Friends Are Not Foes: Investigating Vocabulary Overlap in Multilingual Language Models
by: Kallini, Julie, et al.
Published: (2025)

Baba Is AI: Break the Rules to Beat the Benchmark
by: Cloos, Nathan, et al.
Published: (2024)

MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions
by: Zhong, Zexuan, et al.
Published: (2023)

Routing Absorption in Sparse Attention: Why Random Gates Are Hard to Beat
by: Aquino-Michaels, Keston
Published: (2026)

Rewarding Creativity: A Human-Aligned Generative Reward Model for Reinforcement Learning in Storytelling
by: Li, Zhaoyan, et al.
Published: (2026)

Language models as tools for investigating the distinction between possible and impossible natural languages
by: Kallini, Julie, et al.
Published: (2025)

A paradox of AI fluency
by: Potts, Christopher, et al.
Published: (2026)

Invisible failures in human-AI interactions
by: Potts, Christopher, et al.
Published: (2026)

SLMEval: Entropy-Based Calibration for Human-Aligned Evaluation of Large Language Models
by: Daynauth, Roland, et al.
Published: (2025)

ClinicalBench: Can LLMs Beat Traditional ML Models in Clinical Prediction?
by: Chen, Canyu, et al.
Published: (2024)

On the Creativity of Large Language Models
by: Franceschelli, Giorgio, et al.
Published: (2023)

Characterising the Creative Process in Humans and Large Language Models
by: Nath, Surabhi S., et al.
Published: (2024)

Knowledge Access Beats Model Size: Memory Augmented Routing for Persistent AI Agents
by: Liu, Xunzhuo, et al.
Published: (2026)

ReCOGS: How Incidental Details of a Logical Form Overshadow an Evaluation of Semantic Interpretation
by: Wu, Zhengxuan, et al.
Published: (2023)

CantTalkAboutThis: Aligning Language Models to Stay on Topic in Dialogues
by: Sreedhar, Makesh Narsimhan, et al.
Published: (2024)

RAVEL: Evaluating Interpretability Methods on Disentangling Language Model Representations
by: Huang, Jing, et al.
Published: (2024)

Predicting vs. Acting: A Trade-off Between World Modeling & Agent Modeling
by: Li, Margaret, et al.
Published: (2024)

Blind Baselines Beat Membership Inference Attacks for Foundation Models
by: Das, Debeshee, et al.
Published: (2024)

CreativEval: Evaluating Creativity of LLM-Based Hardware Code Generation
by: DeLorenzo, Matthew, et al.
Published: (2024)

I am a Strange Dataset: Metalinguistic Tests for Language Models
by: Thrush, Tristan, et al.
Published: (2024)

Mission: Impossible Language Models
by: Kallini, Julie, et al.
Published: (2024)

Large Language Models Might Not Care What You Are Saying: Prompt Format Beats Descriptions
by: Tang, Chenming, et al.
Published: (2024)

AlignCultura: Towards Culturally Aligned Large Language Models?
by: Kashyap, Gautam Siddharth, et al.
Published: (2026)

MrT5: Dynamic Token Merging for Efficient Byte-level Language Models
by: Kallini, Julie, et al.
Published: (2024)

DSPy Assertions: Computational Constraints for Self-Refining Language Model Pipelines
by: Singhvi, Arnav, et al.
Published: (2023)

CreativityPrism: A Holistic Evaluation Framework for Large Language Model Creativity
by: Hou, Zhaoyi Joey, et al.
Published: (2025)

Outcome Rewards Do Not Guarantee Verifiable or Causally Important Reasoning
by: Yu, Qinan, et al.
Published: (2026)

ScoNe: Benchmarking Negation Reasoning in Language Models With Fine-Tuning and In-Context Learning
by: She, Jingyuan Selena, et al.
Published: (2023)

On Recipe Memorization and Creativity in Large Language Models: Is Your Model a Creative Cook, a Bad Cook, or Merely a Plagiator?
by: Kvapil, Jan, et al.
Published: (2025)

Beat-Based Rhythm Quantization of MIDI Performances
by: Wachter, Maximilian, et al.
Published: (2025)

Privacy-Preserving Instructions for Aligning Large Language Models
by: Yu, Da, et al.
Published: (2024)

Blackbox Model Provenance via Palimpsestic Membership Inference
by: Kuditipudi, Rohith, et al.
Published: (2025)

What Shapes a Creative Machine Mind? Comprehensively Benchmarking Creativity in Foundation Models
by: He, Zicong, et al.
Published: (2025)

Automated Creativity Evaluation for Large Language Models: A Reference-Based Approach
by: Li, Ruizhe, et al.
Published: (2025)