:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Verhagen, Mark D., Stroebl, Benedikt, Liu, Tiffany, Liu, Lydia T., Salganik, Matthew J.
Format:	Preprint
Published:	2025
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2507.03027
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

CORE-Bench: Fostering the Credibility of Published Research Through a Computational Reproducibility Agent Benchmark
by: Siegel, Zachary S., et al.
Published: (2024)

Localized Cultural Knowledge is Conserved and Controllable in Large Language Models
by: Veselovsky, Veniamin, et al.
Published: (2025)

Emergent inabilities? Inverse scaling over the course of pretraining
by: Michaelov, James A., et al.
Published: (2023)

Information Retrieval Induced Safety Degradation in AI Agents
by: Yu, Cheng, et al.
Published: (2025)

Stories of Your Life as Others: A Round-Trip Evaluation of LLM-Generated Life Stories Conditioned on Rich Psychometric Profiles
by: Wigler, Ben, et al.
Published: (2026)

Which course? Discourse! Teaching Discourse and Generation in the Era of LLMs
by: Li, Junyi Jessy, et al.
Published: (2026)

TopicENA: Enabling Epistemic Network Analysis at Scale through Automated Topic-Based Coding
by: Lu, Owen H. T., et al.
Published: (2026)

MMR-Life: Piecing Together Real-life Scenes for Multimodal Multi-image Reasoning
by: Li, Jiachun, et al.
Published: (2026)

The Limits of Inference Scaling Through Resampling
by: Stroebl, Benedikt, et al.
Published: (2024)

Facts Do Care About Your Language: Assessing Answer Quality of Multilingual LLMs
by: Kansal, Yuval, et al.
Published: (2025)

GLeMM: A large-scale multilingual dataset for morphological research
by: Nabil, Hathout, et al.
Published: (2026)

Towards Enabling FAIR Dataspaces Using Large Language Models
by: Arnold, Benedikt T., et al.
Published: (2024)

Social Media for Mental Health: Data, Methods, and Findings
by: Kamarudin, Nur Shazwani, et al.
Published: (2025)

Towards a Comparative Framework for Compositional AI Models
by: Duneau, Tiffany
Published: (2025)

Stan: An LLM-based thermodynamics course assistant
by: Furst, Eric M., et al.
Published: (2026)

The Devil Is in the Word Alignment Details: On Translation-Based Cross-Lingual Transfer for Token Classification Tasks
by: Ebing, Benedikt, et al.
Published: (2025)

To Translate or Not to Translate: A Systematic Investigation of Translation-Based Cross-Lingual Transfer to Low-Resource Languages
by: Ebing, Benedikt, et al.
Published: (2023)

CQA-Eval: Designing Reliable Evaluations of Multi-paragraph Clinical QA under Resource Constraints
by: Bologna, Federica, et al.
Published: (2025)

CL-bench Life: Can Language Models Learn from Real-Life Context?
by: Dou, Shihan, et al.
Published: (2026)

The High Cost of Incivility: Quantifying Interaction Inefficiency via Multi-Agent Monte Carlo Simulations
by: Mangold, Benedikt
Published: (2025)

Taking a turn for the better: Conversation redirection throughout the course of mental-health therapy
by: Nguyen, Vivian, et al.
Published: (2024)

TRN-R1-Zero: Text-rich Network Reasoning via LLMs with Reinforcement Learning Only
by: Liu, Yilun, et al.
Published: (2026)

Practising responsibility: Ethics in NLP as a hands-on course
by: Nissim, Malvina, et al.
Published: (2025)

Hazards in Daily Life? Enabling Robots to Proactively Detect and Resolve Anomalies
by: Song, Zirui, et al.
Published: (2024)

Capturing research literature attitude towards Sustainable Development Goals: an LLM-based topic modeling approach
by: Invernici, Francesco, et al.
Published: (2024)

Large-scale User Game Lifecycle Representation Learning
by: Gou, Yanjie, et al.
Published: (2025)

TransAlign: Machine Translation Encoders are Strong Word Aligners, Too
by: Ebing, Benedikt, et al.
Published: (2025)

One Script Instead of Hundreds? On Pretraining Romanized Encoder Language Models
by: Ebing, Benedikt, et al.
Published: (2026)

Read it in Two Steps: Translating Extremely Low-Resource Languages with Code-Augmented Grammar Books
by: Zhang, Chen, et al.
Published: (2025)

Adversarial Negotiation Dynamics in Generative Language Models
by: Kolbeinsson, Arinbjörn, et al.
Published: (2024)

Runtime Verification: Monitoring, Knowledge, and Uncertainty (Lecture Notes)
by: Bollig, Benedikt
Published: (2026)

Causal Past Logic for Runtime Verification of Distributed LLM Agent Workflows
by: Bollig, Benedikt
Published: (2026)

Ask Patients with Patience: Enabling LLMs for Human-Centric Medical Dialogue with Grounded Reasoning
by: Zhu, Jiayuan, et al.
Published: (2025)

Named Entity Recognition Under Domain Shift via Metric Learning for Life Sciences
by: Liu, Hongyi, et al.
Published: (2024)

Exploring Diachronic and Diatopic Changes in Dialect Continua: Tasks, Datasets and Challenges
by: Çelikkol, Melis, et al.
Published: (2024)

Making Sentence Embeddings Robust to User-Generated Content
by: Nishimwe, Lydia, et al.
Published: (2024)

Semantic Pivots Enable Cross-Lingual Transfer in Large Language Models
by: He, Kaiyu, et al.
Published: (2025)

Xmodel-1.5: An 1B-scale Multilingual LLM
by: Qun, Wang, et al.
Published: (2024)

JE-IRT: A Geometric Lens on LLM Abilities through Joint Embedding Item Response Theory
by: Yao, Louie Hong, et al.
Published: (2025)

Rethinking ChatGPT's Success: Usability and Cognitive Behaviors Enabled by Auto-regressive LLMs' Prompting
by: Li, Xinzhe, et al.
Published: (2024)