Saved in:
| Main Authors: | Levine, Lionel, Santerre, John, Young, Alex S., Levine, T. Barry, Campion, Francis, Sarrafzadeh, Majid |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2506.11082 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
PRISM-Consult: A Panel-of-Experts Architecture for Clinician-Aligned Diagnosis
by: Levine, Lionel, et al.
Published: (2025)
by: Levine, Lionel, et al.
Published: (2025)
Q-SFT: Q-Learning for Language Models via Supervised Fine-Tuning
by: Hong, Joey, et al.
Published: (2024)
by: Hong, Joey, et al.
Published: (2024)
Self-Challenging Language Model Agents
by: Zhou, Yifei, et al.
Published: (2025)
by: Zhou, Yifei, et al.
Published: (2025)
Fundamental Limitations of Alignment in Large Language Models
by: Wolf, Yotam, et al.
Published: (2023)
by: Wolf, Yotam, et al.
Published: (2023)
EigenBench: A Comparative Behavioral Measure of Value Alignment
by: Chang, Jonathn, et al.
Published: (2025)
by: Chang, Jonathn, et al.
Published: (2025)
Tradeoffs Between Alignment and Helpfulness in Language Models with Steering Methods
by: Wolf, Yotam, et al.
Published: (2024)
by: Wolf, Yotam, et al.
Published: (2024)
PRISM: Patient Records Interpretation for Semantic Clinical Trial Matching using Large Language Models
by: Gupta, Shashi Kant, et al.
Published: (2024)
by: Gupta, Shashi Kant, et al.
Published: (2024)
Planning without Search: Refining Frontier LLMs with Offline Goal-Conditioned RL
by: Hong, Joey, et al.
Published: (2025)
by: Hong, Joey, et al.
Published: (2025)
Leveraging ChatGPT and Other NLP Methods for Identifying Risk and Protective Behaviors in MSM: Social Media and Dating apps Text Analysis
by: Beikzadeh, Mehrab, et al.
Published: (2026)
by: Beikzadeh, Mehrab, et al.
Published: (2026)
Unfamiliar Finetuning Examples Control How Language Models Hallucinate
by: Kang, Katie, et al.
Published: (2024)
by: Kang, Katie, et al.
Published: (2024)
ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL
by: Zhou, Yifei, et al.
Published: (2024)
by: Zhou, Yifei, et al.
Published: (2024)
PRISM: A Methodology for Auditing Biases in Large Language Models
by: Azzopardi, Leif, et al.
Published: (2024)
by: Azzopardi, Leif, et al.
Published: (2024)
Evaluating & Reducing Deceptive Dialogue From Language Models with Multi-turn RL
by: Abdulhai, Marwa, et al.
Published: (2025)
by: Abdulhai, Marwa, et al.
Published: (2025)
Hypothesis-Driven Theory-of-Mind Reasoning for Large Language Models
by: Kim, Hyunwoo, et al.
Published: (2025)
by: Kim, Hyunwoo, et al.
Published: (2025)
Generating Benchmarks for Factuality Evaluation of Language Models
by: Muhlgay, Dor, et al.
Published: (2023)
by: Muhlgay, Dor, et al.
Published: (2023)
Exploring the Impact of Dataset Statistical Effect Size on Model Performance and Data Sample Size Sufficiency
by: Hatamian, Arya, et al.
Published: (2025)
by: Hatamian, Arya, et al.
Published: (2025)
Interactive Dialogue Agents via Reinforcement Learning on Hindsight Regenerations
by: Hong, Joey, et al.
Published: (2024)
by: Hong, Joey, et al.
Published: (2024)
Language Guided Skill Discovery
by: Rho, Seungeun, et al.
Published: (2024)
by: Rho, Seungeun, et al.
Published: (2024)
Consistently Simulating Human Personas with Multi-Turn Reinforcement Learning
by: Abdulhai, Marwa, et al.
Published: (2025)
by: Abdulhai, Marwa, et al.
Published: (2025)
Training Large Language Models to Predict Clinical Events
by: Turtel, Benjamin, et al.
Published: (2026)
by: Turtel, Benjamin, et al.
Published: (2026)
RLAC: Reinforcement Learning with Adversarial Critic for Free-Form Generation Tasks
by: Wu, Mian, et al.
Published: (2025)
by: Wu, Mian, et al.
Published: (2025)
Is It Good Data for Multilingual Instruction Tuning or Just Bad Multilingual Evaluation for Large Language Models?
by: Chen, Pinzhen, et al.
Published: (2024)
by: Chen, Pinzhen, et al.
Published: (2024)
Dependency Transformer Grammars: Integrating Dependency Structures into Transformer Language Models
by: Zhao, Yida, et al.
Published: (2024)
by: Zhao, Yida, et al.
Published: (2024)
Story2MIDI: Emotionally Aligned Music Generation from Text
by: Shokri, Mohammad, et al.
Published: (2025)
by: Shokri, Mohammad, et al.
Published: (2025)
Non-Markovian Discrete Diffusion with Causal Language Models
by: Zhang, Yangtian, et al.
Published: (2025)
by: Zhang, Yangtian, et al.
Published: (2025)
Exploring Cross-model Neuronal Correlations in the Context of Predicting Model Performance and Generalizability
by: Oskouie, Haniyeh Ehsani, et al.
Published: (2024)
by: Oskouie, Haniyeh Ehsani, et al.
Published: (2024)
PRISM: Probing Reasoning, Instruction, and Source Memory in LLM Hallucinations
by: Wu, Yuhe, et al.
Published: (2026)
by: Wu, Yuhe, et al.
Published: (2026)
PRISM-MCTS: Learning from Reasoning Trajectories with Metacognitive Reflection
by: Cheng, Siyuan, et al.
Published: (2026)
by: Cheng, Siyuan, et al.
Published: (2026)
PRISM: Parametrically Refactoring Inference for Speculative Sampling Draft Models
by: Wang, Xuliang, et al.
Published: (2026)
by: Wang, Xuliang, et al.
Published: (2026)
MI-to-Mid Distilled Compression (M2M-DC): An Hybrid-Information-Guided-Block Pruning with Progressive Inner Slicing Approach to Model Compression
by: Levine, Lionel, et al.
Published: (2025)
by: Levine, Lionel, et al.
Published: (2025)
A Monosemantic Attribution Framework for Stable Interpretability in Clinical Neuroscience Transformer-Based Language Models
by: Mamalakis, Michail, et al.
Published: (2026)
by: Mamalakis, Michail, et al.
Published: (2026)
Zero-Overhead Introspection for Adaptive Test-Time Compute
by: Manvi, Rohin, et al.
Published: (2025)
by: Manvi, Rohin, et al.
Published: (2025)
Chain of Code: Reasoning with a Language Model-Augmented Code Emulator
by: Li, Chengshu, et al.
Published: (2023)
by: Li, Chengshu, et al.
Published: (2023)
Transformer-based Causal Language Models Perform Clustering
by: Wu, Xinbo, et al.
Published: (2024)
by: Wu, Xinbo, et al.
Published: (2024)
Iterative Translation Refinement with Large Language Models
by: Chen, Pinzhen, et al.
Published: (2023)
by: Chen, Pinzhen, et al.
Published: (2023)
PluriHarms: Benchmarking the Full Spectrum of Human Judgments on AI Harm
by: Li, Jing-Jing, et al.
Published: (2026)
by: Li, Jing-Jing, et al.
Published: (2026)
Controlled LLM-based Reasoning for Clinical Trial Retrieval
by: Jullien, Mael, et al.
Published: (2024)
by: Jullien, Mael, et al.
Published: (2024)
Generative Pretrained Structured Transformers: Unsupervised Syntactic Language Models at Scale
by: Hu, Xiang, et al.
Published: (2024)
by: Hu, Xiang, et al.
Published: (2024)
Edinburgh Clinical NLP at MEDIQA-CORR 2024: Guiding Large Language Models with Hints
by: Gema, Aryo Pradipta, et al.
Published: (2024)
by: Gema, Aryo Pradipta, et al.
Published: (2024)
Querying Structured Data Through Natural Language Using Language Models
by: Valentin-Micu, Hontan, et al.
Published: (2026)
by: Valentin-Micu, Hontan, et al.
Published: (2026)
Similar Items
-
PRISM-Consult: A Panel-of-Experts Architecture for Clinician-Aligned Diagnosis
by: Levine, Lionel, et al.
Published: (2025) -
Q-SFT: Q-Learning for Language Models via Supervised Fine-Tuning
by: Hong, Joey, et al.
Published: (2024) -
Self-Challenging Language Model Agents
by: Zhou, Yifei, et al.
Published: (2025) -
Fundamental Limitations of Alignment in Large Language Models
by: Wolf, Yotam, et al.
Published: (2023) -
EigenBench: A Comparative Behavioral Measure of Value Alignment
by: Chang, Jonathn, et al.
Published: (2025)