| Main Authors: | Viteri, Scott; Lamparth, Max; Chatain, Peter; Barrett, Clark |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2404.18988 |
Similar Items
Analyzing And Editing Inner Mechanisms Of Backdoored Language Models
by: Lamparth, Max, et al.
Published: (2023)
Risks from Language Models for Automated Mental Healthcare: Ethics and Structure for Implementation
by: Grabb, Declan, et al.
Published: (2024)
Measuring Free-Form Decision-Making Inconsistency of Language Models in Military Crisis Simulations
by: Shrivastava, Aryan, et al.
Published: (2024)
Uncovering Latent Chain of Thought Vectors in Language Models
by: Zhang, Jason, et al.
Published: (2024)
One Bias After Another: Mechanistic Reward Shaping and Persistent Biases in Language Reward Models
by: Fein, Daniel, et al.
Published: (2026)
Human vs. Machine: Behavioral Differences Between Expert Humans and Language Models in Wargame Simulations
by: Lamparth, Max, et al.
Published: (2024)
Escalation Risks from Language Models in Military and Diplomatic Decision-Making
by: Rivera, Juan-Pablo, et al.
Published: (2024)
Lemur: Integrating Large Language Models in Automated Program Verification
by: Wu, Haoze, et al.
Published: (2023)
Moving Beyond Medical Exams: A Clinician-Annotated Fairness Dataset of Real-World Tasks and Ambiguity in Mental Healthcare
by: Lamparth, Max, et al.
Published: (2025)
The termination of Nielsen transformations applied to word equations with length constraints
by: Przybocki, Benjamin, et al.
Published: (2025)
Language Modeling by Language Models
by: Cheng, Junyan, et al.
Published: (2025)
Faithful Autoformalization via Roundtrip Verification and Repair
by: Amrollahi, Daneshvar, et al.
Published: (2026)
Taking Complete Finite Prefixes To High Level, Symbolically
by: Würdemann, Nick, et al.
Published: (2023)
TransformerRanker: A Tool for Efficiently Finding the Best-Suited Language Models for Downstream Classification Tasks
by: Garbas, Lukas, et al.
Published: (2024)
Markovian Generation Chains in Large Language Models
by: Geng, Mingmeng, et al.
Published: (2026)
Analysing Differences in Persuasive Language in LLM-Generated Text: Uncovering Stereotypical Gender Patterns
by: Pauli, Amalie Brogaard, et al.
Published: (2026)
Non-Markovian Discrete Diffusion with Causal Language Models
by: Zhang, Yangtian, et al.
Published: (2025)
Word Meanings in Transformer Language Models
by: Grindrod, Jumbly, et al.
Published: (2025)
Automatic Pruning of Fine-tuning Datasets for Transformer-based Language Models
by: Tayaranian, Mohammadreza, et al.
Published: (2024)
Can Language Models Serve as Text-Based World Simulators?
by: Wang, Ruoyao, et al.
Published: (2024)
Searching for Structure: Investigating Emergent Communication with Large Language Models
by: Kouwenhoven, Tom, et al.
Published: (2024)
TyDi QA-WANA: A Benchmark for Information-Seeking Question Answering in Languages of West Asia and North Africa
by: Riley, Parker, et al.
Published: (2025)
Traces of Social Competence in Large Language Models
by: Kouwenhoven, Tom, et al.
Published: (2026)
Shaping Shared Languages: Human and Large Language Models' Inductive Biases in Emergent Communication
by: Kouwenhoven, Tom, et al.
Published: (2025)
Vocabulary Expansion of Large Language Models via Kullback-Leibler-Based Self-Distillation
by: Linder, Max Rehman
Published: (2025)
Automated Generation of High-Quality Medical Simulation Scenarios Through Integration of Semi-Structured Data and Large Language Models
by: Sumpter, Scott
Published: (2024)
Linear Recency Bias During Training Improves Transformers' Fit to Reading Times
by: Clark, Christian, et al.
Published: (2024)
BEAR: A Unified Framework for Evaluating Relational Knowledge in Causal and Masked Language Models
by: Wiland, Jacek, et al.
Published: (2024)
Language Models Represent Space and Time
by: Gurnee, Wes, et al.
Published: (2023)
Fishing for Magikarp: Automatically Detecting Under-trained Tokens in Large Language Models
by: Land, Sander, et al.
Published: (2024)
Skill Set Optimization: Reinforcing Language Model Behavior via Transferable Skills
by: Nottingham, Kolby, et al.
Published: (2024)
Exploratory Semantic Reliability Analysis of Wind Turbine Maintenance Logs using Large Language Models
by: Malyi, Max, et al.
Published: (2025)
Weak Supervision Dynamic KL-Weighted Diffusion Models Guided by Large Language Models
by: Perry, Julian, et al.
Published: (2025)
ARWKV: Pretrain is not what we need, an RNN-Attention-Based Language Model Born from Transformer
by: Yueyu, Lin, et al.
Published: (2025)
DaKultur: Evaluating the Cultural Awareness of Language Models for Danish with Native Speakers
by: Müller-Eberstein, Max, et al.
Published: (2025)
Language Models Use Trigonometry to Do Addition
by: Kantamneni, Subhash, et al.
Published: (2025)
Will Large Language Models Transform Clinical Prediction?
by: Yildiz, Yusuf, et al.
Published: (2025)
Lattice Annotated Temporal (LAT) Logic for Non-Markovian Reasoning
by: Mukherji, Kaustuv, et al.
Published: (2025)
Improving Reward Models with Synthetic Critiques
by: Ye, Zihuiwen, et al.
Published: (2024)
Semformer: Transformer Language Models with Semantic Planning
by: Yin, Yongjing, et al.
Published: (2024)