Saved in:
| Main Authors: | West, Peter, Potts, Christopher |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2505.00047 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Let the Poem Hit the Rhythm: Using a Byte-Based Transformer for Beat-Aligned Poetry Generation
by: Elzohbi, Mohamad, et al.
Published: (2024)
by: Elzohbi, Mohamad, et al.
Published: (2024)
Counterfactual Simulation Training for Chain-of-Thought Faithfulness
by: Hase, Peter, et al.
Published: (2026)
by: Hase, Peter, et al.
Published: (2026)
Large Language Models Align with the Human Brain during Creative Thinking
by: Ismayilzada, Mete, et al.
Published: (2026)
by: Ismayilzada, Mete, et al.
Published: (2026)
Demystifying Verbatim Memorization in Large Language Models
by: Huang, Jing, et al.
Published: (2024)
by: Huang, Jing, et al.
Published: (2024)
Beyond Divergent Creativity: A Human-Based Evaluation of Creativity in Large Language Models
by: Nakajima, Kumiko, et al.
Published: (2026)
by: Nakajima, Kumiko, et al.
Published: (2026)
Improved Representation Steering for Language Models
by: Wu, Zhengxuan, et al.
Published: (2025)
by: Wu, Zhengxuan, et al.
Published: (2025)
False Friends Are Not Foes: Investigating Vocabulary Overlap in Multilingual Language Models
by: Kallini, Julie, et al.
Published: (2025)
by: Kallini, Julie, et al.
Published: (2025)
Baba Is AI: Break the Rules to Beat the Benchmark
by: Cloos, Nathan, et al.
Published: (2024)
by: Cloos, Nathan, et al.
Published: (2024)
MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions
by: Zhong, Zexuan, et al.
Published: (2023)
by: Zhong, Zexuan, et al.
Published: (2023)
Routing Absorption in Sparse Attention: Why Random Gates Are Hard to Beat
by: Aquino-Michaels, Keston
Published: (2026)
by: Aquino-Michaels, Keston
Published: (2026)
Rewarding Creativity: A Human-Aligned Generative Reward Model for Reinforcement Learning in Storytelling
by: Li, Zhaoyan, et al.
Published: (2026)
by: Li, Zhaoyan, et al.
Published: (2026)
Language models as tools for investigating the distinction between possible and impossible natural languages
by: Kallini, Julie, et al.
Published: (2025)
by: Kallini, Julie, et al.
Published: (2025)
A paradox of AI fluency
by: Potts, Christopher, et al.
Published: (2026)
by: Potts, Christopher, et al.
Published: (2026)
Invisible failures in human-AI interactions
by: Potts, Christopher, et al.
Published: (2026)
by: Potts, Christopher, et al.
Published: (2026)
SLMEval: Entropy-Based Calibration for Human-Aligned Evaluation of Large Language Models
by: Daynauth, Roland, et al.
Published: (2025)
by: Daynauth, Roland, et al.
Published: (2025)
ClinicalBench: Can LLMs Beat Traditional ML Models in Clinical Prediction?
by: Chen, Canyu, et al.
Published: (2024)
by: Chen, Canyu, et al.
Published: (2024)
On the Creativity of Large Language Models
by: Franceschelli, Giorgio, et al.
Published: (2023)
by: Franceschelli, Giorgio, et al.
Published: (2023)
Characterising the Creative Process in Humans and Large Language Models
by: Nath, Surabhi S., et al.
Published: (2024)
by: Nath, Surabhi S., et al.
Published: (2024)
Knowledge Access Beats Model Size: Memory Augmented Routing for Persistent AI Agents
by: Liu, Xunzhuo, et al.
Published: (2026)
by: Liu, Xunzhuo, et al.
Published: (2026)
ReCOGS: How Incidental Details of a Logical Form Overshadow an Evaluation of Semantic Interpretation
by: Wu, Zhengxuan, et al.
Published: (2023)
by: Wu, Zhengxuan, et al.
Published: (2023)
CantTalkAboutThis: Aligning Language Models to Stay on Topic in Dialogues
by: Sreedhar, Makesh Narsimhan, et al.
Published: (2024)
by: Sreedhar, Makesh Narsimhan, et al.
Published: (2024)
RAVEL: Evaluating Interpretability Methods on Disentangling Language Model Representations
by: Huang, Jing, et al.
Published: (2024)
by: Huang, Jing, et al.
Published: (2024)
Predicting vs. Acting: A Trade-off Between World Modeling & Agent Modeling
by: Li, Margaret, et al.
Published: (2024)
by: Li, Margaret, et al.
Published: (2024)
Blind Baselines Beat Membership Inference Attacks for Foundation Models
by: Das, Debeshee, et al.
Published: (2024)
by: Das, Debeshee, et al.
Published: (2024)
CreativEval: Evaluating Creativity of LLM-Based Hardware Code Generation
by: DeLorenzo, Matthew, et al.
Published: (2024)
by: DeLorenzo, Matthew, et al.
Published: (2024)
I am a Strange Dataset: Metalinguistic Tests for Language Models
by: Thrush, Tristan, et al.
Published: (2024)
by: Thrush, Tristan, et al.
Published: (2024)
Mission: Impossible Language Models
by: Kallini, Julie, et al.
Published: (2024)
by: Kallini, Julie, et al.
Published: (2024)
Large Language Models Might Not Care What You Are Saying: Prompt Format Beats Descriptions
by: Tang, Chenming, et al.
Published: (2024)
by: Tang, Chenming, et al.
Published: (2024)
AlignCultura: Towards Culturally Aligned Large Language Models?
by: Kashyap, Gautam Siddharth, et al.
Published: (2026)
by: Kashyap, Gautam Siddharth, et al.
Published: (2026)
MrT5: Dynamic Token Merging for Efficient Byte-level Language Models
by: Kallini, Julie, et al.
Published: (2024)
by: Kallini, Julie, et al.
Published: (2024)
DSPy Assertions: Computational Constraints for Self-Refining Language Model Pipelines
by: Singhvi, Arnav, et al.
Published: (2023)
by: Singhvi, Arnav, et al.
Published: (2023)
CreativityPrism: A Holistic Evaluation Framework for Large Language Model Creativity
by: Hou, Zhaoyi Joey, et al.
Published: (2025)
by: Hou, Zhaoyi Joey, et al.
Published: (2025)
Outcome Rewards Do Not Guarantee Verifiable or Causally Important Reasoning
by: Yu, Qinan, et al.
Published: (2026)
by: Yu, Qinan, et al.
Published: (2026)
ScoNe: Benchmarking Negation Reasoning in Language Models With Fine-Tuning and In-Context Learning
by: She, Jingyuan Selena, et al.
Published: (2023)
by: She, Jingyuan Selena, et al.
Published: (2023)
On Recipe Memorization and Creativity in Large Language Models: Is Your Model a Creative Cook, a Bad Cook, or Merely a Plagiator?
by: Kvapil, Jan, et al.
Published: (2025)
by: Kvapil, Jan, et al.
Published: (2025)
Beat-Based Rhythm Quantization of MIDI Performances
by: Wachter, Maximilian, et al.
Published: (2025)
by: Wachter, Maximilian, et al.
Published: (2025)
Privacy-Preserving Instructions for Aligning Large Language Models
by: Yu, Da, et al.
Published: (2024)
by: Yu, Da, et al.
Published: (2024)
Blackbox Model Provenance via Palimpsestic Membership Inference
by: Kuditipudi, Rohith, et al.
Published: (2025)
by: Kuditipudi, Rohith, et al.
Published: (2025)
What Shapes a Creative Machine Mind? Comprehensively Benchmarking Creativity in Foundation Models
by: He, Zicong, et al.
Published: (2025)
by: He, Zicong, et al.
Published: (2025)
Automated Creativity Evaluation for Large Language Models: A Reference-Based Approach
by: Li, Ruizhe, et al.
Published: (2025)
by: Li, Ruizhe, et al.
Published: (2025)
Similar Items
-
Let the Poem Hit the Rhythm: Using a Byte-Based Transformer for Beat-Aligned Poetry Generation
by: Elzohbi, Mohamad, et al.
Published: (2024) -
Counterfactual Simulation Training for Chain-of-Thought Faithfulness
by: Hase, Peter, et al.
Published: (2026) -
Large Language Models Align with the Human Brain during Creative Thinking
by: Ismayilzada, Mete, et al.
Published: (2026) -
Demystifying Verbatim Memorization in Large Language Models
by: Huang, Jing, et al.
Published: (2024) -
Beyond Divergent Creativity: A Human-Based Evaluation of Creativity in Large Language Models
by: Nakajima, Kumiko, et al.
Published: (2026)