| Main Author: | Lee, TK |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Online Access: | https://arxiv.org/abs/2512.13762 |
Similar Items
Influencing Humans to Conform to Preference Models for RLHF
by: Hatgis-Kessell, Stephane, et al.
Published: (2025)
Beyond Compliance: How AI Could Help Creative Writers by Refusing Them
by: Qin, Hua Xuan, et al.
Published: (2026)
Refusal as Silence: Gendered Disparities in Vision-Language Model Responses
by: Luo, Sha, et al.
Published: (2024)
Oyster-I: Beyond Refusal -- Constructive Safety Alignment for Responsible Language Models
by: Duan, Ranjie, et al.
Published: (2025)
Are Large Language Models Aligned with People's Social Intuitions for Human-Robot Interactions?
by: Wachowiak, Lennart, et al.
Published: (2024)
Aligning Model Evaluations with Human Preferences: Mitigating Token Count Bias in Language Model Assessments
by: Daynauth, Roland, et al.
Published: (2024)
TalkToAgent: A Human-centric Explanation of Reinforcement Learning Agents with Large Language Models
by: Kim, Haechang, et al.
Published: (2025)
Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback
by: Yuan, Yifu, et al.
Published: (2024)
Interactive Design by Integrating a Large Pre-Trained Language Model and Building Information Modeling
by: Jang, Suhyung, et al.
Published: (2023)
As Confidence Aligns: Exploring the Effect of AI Confidence on Human Self-confidence in Human-AI Decision Making
by: Li, Jingshu, et al.
Published: (2025)
Script-Strategy Aligned Generation: Aligning LLMs with Expert-Crafted Dialogue Scripts and Therapeutic Strategies for Psychotherapy
by: Sun, Xin, et al.
Published: (2024)
Do Metrics for Counterfactual Explanations Align with User Perception?
by: Liedeker, Felix, et al.
Published: (2026)
Sasha: Creative Goal-Oriented Reasoning in Smart Homes with Large Language Models
by: King, Evan, et al.
Published: (2023)
Inertia in Moral and Value Judgments of Large Language Models
by: Lee, Bruce W., et al.
Published: (2024)
Beyond Fixed Psychological Personas: State Beats Trait, but Language Models are State-Blind
by: Harry, Tamunotonye, et al.
Published: (2026)
AVIN-Chat: An Audio-Visual Interactive Chatbot System with Emotional State Tuning
by: Park, Chanhyuk, et al.
Published: (2024)
Dreaming to Assist: Learning to Align with Human Objectives for Shared Control in High-Speed Racing
by: DeCastro, Jonathan, et al.
Published: (2024)
Chain of Empathy: Enhancing Empathetic Response of Large Language Models Based on Psychotherapy Models
by: Lee, Yoon Kyung, et al.
Published: (2023)
Design Generative AI for Practitioners: Exploring Interaction Approaches Aligned with Creative Practice
by: Peng, Xiaohan, et al.
Published: (2026)
CoLyricist: Enhancing Lyric Writing with AI through Workflow-Aligned Support
by: Yoshida, Masahiro, et al.
Published: (2026)
Evalet: Evaluating Large Language Models through Functional Fragmentation
by: Kim, Tae Soo, et al.
Published: (2025)
Large Language Model-Based Interpretable Machine Learning Control in Building Energy Systems
by: Zhang, Liang, et al.
Published: (2024)
MindfulAgents: Personalizing Mindfulness Meditation via an Expert-Aligned Multi-Agent System
by: Wu, Mengyuan Millie, et al.
Published: (2026)
Who Validates the Validators? Aligning LLM-Assisted Evaluation of LLM Outputs with Human Preferences
by: Shankar, Shreya, et al.
Published: (2024)
From Promising Capability to Pervasive Bias: Assessing Large Language Models for Emergency Department Triage
by: Lee, Joseph, et al.
Published: (2025)
Orchestrating Attention: Bringing Harmony to the 'Chaos' of Neurodivergent Learning States
by: Navneet, Satyam Kumar, et al.
Published: (2026)
A Multi-Agent Conversational Bandit Approach to Online Evaluation and Selection of User-Aligned LLM Responses
by: Dai, Xiangxiang, et al.
Published: (2025)
Aligning LLMs with Individual Preferences via Interaction
by: Wu, Shujin, et al.
Published: (2024)
Literary Narrative as Moral Probe : A Cross-System Framework for Evaluating AI Ethical Reasoning and Refusal Behavior
by: Flynn, David C.
Published: (2026)
Moving Beyond Review: Applying Language Models to Planning and Translation in Reflection
by: Neshaei, Seyed Parsa, et al.
Published: (2026)
From Keyboard to Chatbot: An AI-powered Integration Platform with Large-Language Models for Teaching Computational Thinking for Young Children
by: Lee, Changjae, et al.
Published: (2024)
Secret Use of Large Language Model (LLM)
by: Zhang, Zhiping, et al.
Published: (2024)
Layout Generation Agents with Large Language Models
by: Sasazawa, Yuichi, et al.
Published: (2024)
Automatic Large Language Models Creation of Interactive Learning Lessons
by: Lin, Jionghao, et al.
Published: (2025)
From Static to Interactive: Authoring Interactive Visualizations via Natural Language
by: Liu, Can, et al.
Published: (2026)
Navigating the Path of Writing: Outline-guided Text Generation with Large Language Models
by: Lee, Yukyung, et al.
Published: (2024)
Direct Advantage Regression: Aligning LLMs with Online AI Reward
by: He, Li, et al.
Published: (2025)
Advancing Building Energy Modeling with Large Language Models: Exploration and Case Studies
by: Zhang, Liang, et al.
Published: (2024)
Metacognition and Uncertainty Communication in Humans and Large Language Models
by: Steyvers, Mark, et al.
Published: (2025)
Anthropomorphism and Trust in Human-Large Language Model interactions
by: Kadambi, Akila, et al.
Published: (2026)