Library Catalog

Bibliographic Details
Main Authors:	Schwitzgebel, Eric; Sebo, Jeff
Format:	Preprint
Published:	2025
Subjects:	Computers and Society; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2507.06263

Similar Items

Subjective Experience in AI Systems: What Do AI Researchers and the Public Believe?
by: Dreksler, Noemi, et al.
Published: (2025)

AI and Consciousness
by: Schwitzgebel, Eric
Published: (2025)

Taking AI Welfare Seriously
by: Long, Robert, et al.
Published: (2024)

Artificial Intelligence as Strange Intelligence: Against Linear Models of Intelligence
by: Chilson, Kendra, et al.
Published: (2026)

COMPASS: A Framework for Evaluating Organization-Specific Policy Alignment in LLMs
by: Choi, Dasol, et al.
Published: (2026)

Social World Model-Augmented Mechanism Design Policy Learning
by: Zhang, Xiaoyuan, et al.
Published: (2025)

Designing Ethical Learning for Agentic AI: Toegye Yi Hwang's Ethical Emotion Regulation Framework
by: Kim, Ji Yeon
Published: (2026)

Towards Integrated Alignment
by: Reis, Ben Y., et al.
Published: (2025)

The AI Alignment Paradox
by: West, Robert, et al.
Published: (2024)

The Alignment Target Problem: Divergent Moral Judgments of Humans, AI Systems, and Their Designers
by: Chen, Benjamin Minhao, et al.
Published: (2026)

ELIZA Reinterpreted: The world's first chatbot was not intended as a chatbot at all
by: Shrager, Jeff
Published: (2024)

Societal Alignment Frameworks Can Improve LLM Alignment
by: Stańczak, Karolina, et al.
Published: (2025)

Rethinking AI Cultural Alignment
by: Bravansky, Michal, et al.
Published: (2025)

EFO: the Emotion Frame Ontology
by: De Giorgis, Stefano, et al.
Published: (2024)

Scopes of Alignment
by: Varshney, Kush R., et al.
Published: (2025)

An Evaluation of Cultural Value Alignment in LLM
by: Sukiennik, Nicholas, et al.
Published: (2025)

Justifications for Democratizing AI Alignment and Their Prospects
by: Steingrüber, André, et al.
Published: (2025)

Understanding the Process of Human-AI Value Alignment
by: McKinlay, Jack, et al.
Published: (2025)

Fluent but Foreign: Even Regional LLMs Lack Cultural Alignment
by: Agarwal, Dhruv, et al.
Published: (2025)

Whose Truth? Pluralistic Geo-Alignment for (Agentic) AI
by: Janowicz, Krzysztof, et al.
Published: (2025)

Simple Role Assignment is Extraordinarily Effective for Safety Alignment
by: Ziheng, Zhou, et al.
Published: (2026)

Dynamic Normativity: Necessary and Sufficient Conditions for Value Alignment
by: Corrêa, Nicholas Kluge
Published: (2024)

Assessing the State of AI Policy
by: DeFranco, Joanna F., et al.
Published: (2024)

Emotional Manipulation by AI Companions
by: De Freitas, Julian, et al.
Published: (2025)

AI and Human Oversight: A Risk-Based Framework for Alignment
by: Kandikatla, Laxmiraju, et al.
Published: (2025)

Chain of Alignment: Integrating Public Will with Expert Intelligence for Language Model Alignment
by: Konya, Andrew, et al.
Published: (2024)

Alignment as Jurisprudence
by: Caputo, Nicholas
Published: (2026)

RLVER: Reinforcement Learning with Verifiable Emotion Rewards for Empathetic Agents
by: Wang, Peisong, et al.
Published: (2025)

Relative Principals, Pluralistic Alignment, and the Structural Value Alignment Problem
by: LaCroix, Travis
Published: (2026)

Alignment, Agency and Autonomy in Frontier AI: A Systems Engineering Perspective
by: Tallam, Krti
Published: (2025)

Using AI Alignment Theory to understand the potential pitfalls of regulatory frameworks
by: Tlaie, Alejandro
Published: (2024)

Cross-Cultural Simulation of Citizen Emotional Responses to Bureaucratic Red Tape Using LLM Agents
by: Ni, Wanchun, et al.
Published: (2026)

Reinforcing Stereotypes of Anger: Emotion AI on African American Vernacular English
by: Dorn, Rebecca, et al.
Published: (2025)

Wide Reflective Equilibrium in LLM Alignment: Bridging Moral Epistemology and AI Safety
by: Brophy, Matthew
Published: (2025)

Privacy Ethics Alignment in AI: A Stakeholder-Centric Framework for Ethical AI
by: Barthwal, Ankur, et al.
Published: (2025)

Co-Designing Interdisciplinary Design Projects with AI
by: Liow, Wei Ting, et al.
Published: (2025)

Are Dilemmas and Conflicts in LLM Alignment Solvable? A View from Priority Graph
by: Tang, Zhenheng, et al.
Published: (2026)

Alignment Is Not Enough: A Relational Framework for Moral Standing in Human-AI Interaction
by: Pasandi, Faezeh B., et al.
Published: (2026)

ELIZA Reanimated: The world's first chatbot restored on the world's first time sharing system
by: Lane, Rupert, et al.
Published: (2025)

Deliberative Alignment: Reasoning Enables Safer Language Models
by: Guan, Melody Y., et al.
Published: (2024)