Library Catalog

Bibliographic Details
Main Authors:	Levine, Lionel; Santerre, John; Young, Alex S.; Levine, T. Barry; Campion, Francis; Sarrafzadeh, Majid
Format:	Preprint
Published:	2025
Subjects:	Computation and Language; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2506.11082

Similar Items

PRISM-Consult: A Panel-of-Experts Architecture for Clinician-Aligned Diagnosis
by: Levine, Lionel, et al.
Published: (2025)

Q-SFT: Q-Learning for Language Models via Supervised Fine-Tuning
by: Hong, Joey, et al.
Published: (2024)

Self-Challenging Language Model Agents
by: Zhou, Yifei, et al.
Published: (2025)

Fundamental Limitations of Alignment in Large Language Models
by: Wolf, Yotam, et al.
Published: (2023)

EigenBench: A Comparative Behavioral Measure of Value Alignment
by: Chang, Jonathn, et al.
Published: (2025)

Tradeoffs Between Alignment and Helpfulness in Language Models with Steering Methods
by: Wolf, Yotam, et al.
Published: (2024)

PRISM: Patient Records Interpretation for Semantic Clinical Trial Matching using Large Language Models
by: Gupta, Shashi Kant, et al.
Published: (2024)

Planning without Search: Refining Frontier LLMs with Offline Goal-Conditioned RL
by: Hong, Joey, et al.
Published: (2025)

Leveraging ChatGPT and Other NLP Methods for Identifying Risk and Protective Behaviors in MSM: Social Media and Dating apps Text Analysis
by: Beikzadeh, Mehrab, et al.
Published: (2026)

Unfamiliar Finetuning Examples Control How Language Models Hallucinate
by: Kang, Katie, et al.
Published: (2024)

ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL
by: Zhou, Yifei, et al.
Published: (2024)

PRISM: A Methodology for Auditing Biases in Large Language Models
by: Azzopardi, Leif, et al.
Published: (2024)

Evaluating & Reducing Deceptive Dialogue From Language Models with Multi-turn RL
by: Abdulhai, Marwa, et al.
Published: (2025)

Hypothesis-Driven Theory-of-Mind Reasoning for Large Language Models
by: Kim, Hyunwoo, et al.
Published: (2025)

Generating Benchmarks for Factuality Evaluation of Language Models
by: Muhlgay, Dor, et al.
Published: (2023)

Exploring the Impact of Dataset Statistical Effect Size on Model Performance and Data Sample Size Sufficiency
by: Hatamian, Arya, et al.
Published: (2025)

Interactive Dialogue Agents via Reinforcement Learning on Hindsight Regenerations
by: Hong, Joey, et al.
Published: (2024)

Language Guided Skill Discovery
by: Rho, Seungeun, et al.
Published: (2024)

Consistently Simulating Human Personas with Multi-Turn Reinforcement Learning
by: Abdulhai, Marwa, et al.
Published: (2025)

Training Large Language Models to Predict Clinical Events
by: Turtel, Benjamin, et al.
Published: (2026)

RLAC: Reinforcement Learning with Adversarial Critic for Free-Form Generation Tasks
by: Wu, Mian, et al.
Published: (2025)

Is It Good Data for Multilingual Instruction Tuning or Just Bad Multilingual Evaluation for Large Language Models?
by: Chen, Pinzhen, et al.
Published: (2024)

Dependency Transformer Grammars: Integrating Dependency Structures into Transformer Language Models
by: Zhao, Yida, et al.
Published: (2024)

Story2MIDI: Emotionally Aligned Music Generation from Text
by: Shokri, Mohammad, et al.
Published: (2025)

Non-Markovian Discrete Diffusion with Causal Language Models
by: Zhang, Yangtian, et al.
Published: (2025)

Exploring Cross-model Neuronal Correlations in the Context of Predicting Model Performance and Generalizability
by: Oskouie, Haniyeh Ehsani, et al.
Published: (2024)

PRISM: Probing Reasoning, Instruction, and Source Memory in LLM Hallucinations
by: Wu, Yuhe, et al.
Published: (2026)

PRISM-MCTS: Learning from Reasoning Trajectories with Metacognitive Reflection
by: Cheng, Siyuan, et al.
Published: (2026)

PRISM: Parametrically Refactoring Inference for Speculative Sampling Draft Models
by: Wang, Xuliang, et al.
Published: (2026)

MI-to-Mid Distilled Compression (M2M-DC): An Hybrid-Information-Guided-Block Pruning with Progressive Inner Slicing Approach to Model Compression
by: Levine, Lionel, et al.
Published: (2025)

A Monosemantic Attribution Framework for Stable Interpretability in Clinical Neuroscience Transformer-Based Language Models
by: Mamalakis, Michail, et al.
Published: (2026)

Zero-Overhead Introspection for Adaptive Test-Time Compute
by: Manvi, Rohin, et al.
Published: (2025)

Chain of Code: Reasoning with a Language Model-Augmented Code Emulator
by: Li, Chengshu, et al.
Published: (2023)

Transformer-based Causal Language Models Perform Clustering
by: Wu, Xinbo, et al.
Published: (2024)

Iterative Translation Refinement with Large Language Models
by: Chen, Pinzhen, et al.
Published: (2023)

PluriHarms: Benchmarking the Full Spectrum of Human Judgments on AI Harm
by: Li, Jing-Jing, et al.
Published: (2026)

Controlled LLM-based Reasoning for Clinical Trial Retrieval
by: Jullien, Mael, et al.
Published: (2024)

Generative Pretrained Structured Transformers: Unsupervised Syntactic Language Models at Scale
by: Hu, Xiang, et al.
Published: (2024)

Edinburgh Clinical NLP at MEDIQA-CORR 2024: Guiding Large Language Models with Hints
by: Gema, Aryo Pradipta, et al.
Published: (2024)

Querying Structured Data Through Natural Language Using Language Models
by: Valentin-Micu, Hontan, et al.
Published: (2026)