| Main Authors: | Verhagen, Mark D., Stroebl, Benedikt, Liu, Tiffany, Liu, Lydia T., Salganik, Matthew J. |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Online Access: | https://arxiv.org/abs/2507.03027 |
Similar Items
CORE-Bench: Fostering the Credibility of Published Research Through a Computational Reproducibility Agent Benchmark
by: Siegel, Zachary S., et al.
Published: (2024)
Localized Cultural Knowledge is Conserved and Controllable in Large Language Models
by: Veselovsky, Veniamin, et al.
Published: (2025)
Emergent inabilities? Inverse scaling over the course of pretraining
by: Michaelov, James A., et al.
Published: (2023)
Information Retrieval Induced Safety Degradation in AI Agents
by: Yu, Cheng, et al.
Published: (2025)
Stories of Your Life as Others: A Round-Trip Evaluation of LLM-Generated Life Stories Conditioned on Rich Psychometric Profiles
by: Wigler, Ben, et al.
Published: (2026)
Which course? Discourse! Teaching Discourse and Generation in the Era of LLMs
by: Li, Junyi Jessy, et al.
Published: (2026)
TopicENA: Enabling Epistemic Network Analysis at Scale through Automated Topic-Based Coding
by: Lu, Owen H. T., et al.
Published: (2026)
MMR-Life: Piecing Together Real-life Scenes for Multimodal Multi-image Reasoning
by: Li, Jiachun, et al.
Published: (2026)
The Limits of Inference Scaling Through Resampling
by: Stroebl, Benedikt, et al.
Published: (2024)
Facts Do Care About Your Language: Assessing Answer Quality of Multilingual LLMs
by: Kansal, Yuval, et al.
Published: (2025)
GLeMM: A large-scale multilingual dataset for morphological research
by: Nabil, Hathout, et al.
Published: (2026)
Towards Enabling FAIR Dataspaces Using Large Language Models
by: Arnold, Benedikt T., et al.
Published: (2024)
Social Media for Mental Health: Data, Methods, and Findings
by: Kamarudin, Nur Shazwani, et al.
Published: (2025)
Towards a Comparative Framework for Compositional AI Models
by: Duneau, Tiffany
Published: (2025)
Stan: An LLM-based thermodynamics course assistant
by: Furst, Eric M., et al.
Published: (2026)
The Devil Is in the Word Alignment Details: On Translation-Based Cross-Lingual Transfer for Token Classification Tasks
by: Ebing, Benedikt, et al.
Published: (2025)
To Translate or Not to Translate: A Systematic Investigation of Translation-Based Cross-Lingual Transfer to Low-Resource Languages
by: Ebing, Benedikt, et al.
Published: (2023)
CQA-Eval: Designing Reliable Evaluations of Multi-paragraph Clinical QA under Resource Constraints
by: Bologna, Federica, et al.
Published: (2025)
CL-bench Life: Can Language Models Learn from Real-Life Context?
by: Dou, Shihan, et al.
Published: (2026)
The High Cost of Incivility: Quantifying Interaction Inefficiency via Multi-Agent Monte Carlo Simulations
by: Mangold, Benedikt
Published: (2025)
Taking a turn for the better: Conversation redirection throughout the course of mental-health therapy
by: Nguyen, Vivian, et al.
Published: (2024)
TRN-R1-Zero: Text-rich Network Reasoning via LLMs with Reinforcement Learning Only
by: Liu, Yilun, et al.
Published: (2026)
Practising responsibility: Ethics in NLP as a hands-on course
by: Nissim, Malvina, et al.
Published: (2025)
Hazards in Daily Life? Enabling Robots to Proactively Detect and Resolve Anomalies
by: Song, Zirui, et al.
Published: (2024)
Capturing research literature attitude towards Sustainable Development Goals: an LLM-based topic modeling approach
by: Invernici, Francesco, et al.
Published: (2024)
Large-scale User Game Lifecycle Representation Learning
by: Gou, Yanjie, et al.
Published: (2025)
TransAlign: Machine Translation Encoders are Strong Word Aligners, Too
by: Ebing, Benedikt, et al.
Published: (2025)
One Script Instead of Hundreds? On Pretraining Romanized Encoder Language Models
by: Ebing, Benedikt, et al.
Published: (2026)
Read it in Two Steps: Translating Extremely Low-Resource Languages with Code-Augmented Grammar Books
by: Zhang, Chen, et al.
Published: (2025)
Adversarial Negotiation Dynamics in Generative Language Models
by: Kolbeinsson, Arinbjörn, et al.
Published: (2024)
Runtime Verification: Monitoring, Knowledge, and Uncertainty (Lecture Notes)
by: Bollig, Benedikt
Published: (2026)
Causal Past Logic for Runtime Verification of Distributed LLM Agent Workflows
by: Bollig, Benedikt
Published: (2026)
Ask Patients with Patience: Enabling LLMs for Human-Centric Medical Dialogue with Grounded Reasoning
by: Zhu, Jiayuan, et al.
Published: (2025)
Named Entity Recognition Under Domain Shift via Metric Learning for Life Sciences
by: Liu, Hongyi, et al.
Published: (2024)
Exploring Diachronic and Diatopic Changes in Dialect Continua: Tasks, Datasets and Challenges
by: Çelikkol, Melis, et al.
Published: (2024)
Making Sentence Embeddings Robust to User-Generated Content
by: Nishimwe, Lydia, et al.
Published: (2024)
Semantic Pivots Enable Cross-Lingual Transfer in Large Language Models
by: He, Kaiyu, et al.
Published: (2025)
Xmodel-1.5: An 1B-scale Multilingual LLM
by: Qun, Wang, et al.
Published: (2024)
JE-IRT: A Geometric Lens on LLM Abilities through Joint Embedding Item Response Theory
by: Yao, Louie Hong, et al.
Published: (2025)
Rethinking ChatGPT's Success: Usability and Cognitive Behaviors Enabled by Auto-regressive LLMs' Prompting
by: Li, Xinzhe, et al.
Published: (2024)