| Main Authors: | Schwitzgebel, Eric; Sebo, Jeff |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2507.06263 |
Similar Items
Subjective Experience in AI Systems: What Do AI Researchers and the Public Believe?
by: Dreksler, Noemi, et al.
Published: (2025)
AI and Consciousness
by: Schwitzgebel, Eric
Published: (2025)
Taking AI Welfare Seriously
by: Long, Robert, et al.
Published: (2024)
Artificial Intelligence as Strange Intelligence: Against Linear Models of Intelligence
by: Chilson, Kendra, et al.
Published: (2026)
COMPASS: A Framework for Evaluating Organization-Specific Policy Alignment in LLMs
by: Choi, Dasol, et al.
Published: (2026)
Social World Model-Augmented Mechanism Design Policy Learning
by: Zhang, Xiaoyuan, et al.
Published: (2025)
Designing Ethical Learning for Agentic AI: Toegye Yi Hwang's Ethical Emotion Regulation Framework
by: Kim, Ji Yeon
Published: (2026)
Towards Integrated Alignment
by: Reis, Ben Y., et al.
Published: (2025)
The AI Alignment Paradox
by: West, Robert, et al.
Published: (2024)
The Alignment Target Problem: Divergent Moral Judgments of Humans, AI Systems, and Their Designers
by: Chen, Benjamin Minhao, et al.
Published: (2026)
ELIZA Reinterpreted: The world's first chatbot was not intended as a chatbot at all
by: Shrager, Jeff
Published: (2024)
Societal Alignment Frameworks Can Improve LLM Alignment
by: Stańczak, Karolina, et al.
Published: (2025)
Rethinking AI Cultural Alignment
by: Bravansky, Michal, et al.
Published: (2025)
EFO: the Emotion Frame Ontology
by: De Giorgis, Stefano, et al.
Published: (2024)
Scopes of Alignment
by: Varshney, Kush R., et al.
Published: (2025)
An Evaluation of Cultural Value Alignment in LLM
by: Sukiennik, Nicholas, et al.
Published: (2025)
Justifications for Democratizing AI Alignment and Their Prospects
by: Steingrüber, André, et al.
Published: (2025)
Understanding the Process of Human-AI Value Alignment
by: McKinlay, Jack, et al.
Published: (2025)
Fluent but Foreign: Even Regional LLMs Lack Cultural Alignment
by: Agarwal, Dhruv, et al.
Published: (2025)
Whose Truth? Pluralistic Geo-Alignment for (Agentic) AI
by: Janowicz, Krzysztof, et al.
Published: (2025)
Simple Role Assignment is Extraordinarily Effective for Safety Alignment
by: Ziheng, Zhou, et al.
Published: (2026)
Dynamic Normativity: Necessary and Sufficient Conditions for Value Alignment
by: Corrêa, Nicholas Kluge
Published: (2024)
Assessing the State of AI Policy
by: DeFranco, Joanna F., et al.
Published: (2024)
Emotional Manipulation by AI Companions
by: De Freitas, Julian, et al.
Published: (2025)
AI and Human Oversight: A Risk-Based Framework for Alignment
by: Kandikatla, Laxmiraju, et al.
Published: (2025)
Chain of Alignment: Integrating Public Will with Expert Intelligence for Language Model Alignment
by: Konya, Andrew, et al.
Published: (2024)
Alignment as Jurisprudence
by: Caputo, Nicholas
Published: (2026)
RLVER: Reinforcement Learning with Verifiable Emotion Rewards for Empathetic Agents
by: Wang, Peisong, et al.
Published: (2025)
Relative Principals, Pluralistic Alignment, and the Structural Value Alignment Problem
by: LaCroix, Travis
Published: (2026)
Alignment, Agency and Autonomy in Frontier AI: A Systems Engineering Perspective
by: Tallam, Krti
Published: (2025)
Using AI Alignment Theory to understand the potential pitfalls of regulatory frameworks
by: Tlaie, Alejandro
Published: (2024)
Cross-Cultural Simulation of Citizen Emotional Responses to Bureaucratic Red Tape Using LLM Agents
by: Ni, Wanchun, et al.
Published: (2026)
Reinforcing Stereotypes of Anger: Emotion AI on African American Vernacular English
by: Dorn, Rebecca, et al.
Published: (2025)
Wide Reflective Equilibrium in LLM Alignment: Bridging Moral Epistemology and AI Safety
by: Brophy, Matthew
Published: (2025)
Privacy Ethics Alignment in AI: A Stakeholder-Centric Framework for Ethical AI
by: Barthwal, Ankur, et al.
Published: (2025)
Co-Designing Interdisciplinary Design Projects with AI
by: Liow, Wei Ting, et al.
Published: (2025)
Are Dilemmas and Conflicts in LLM Alignment Solvable? A View from Priority Graph
by: Tang, Zhenheng, et al.
Published: (2026)
Alignment Is Not Enough: A Relational Framework for Moral Standing in Human-AI Interaction
by: Pasandi, Faezeh B., et al.
Published: (2026)
ELIZA Reanimated: The world's first chatbot restored on the world's first time sharing system
by: Lane, Rupert, et al.
Published: (2025)
Deliberative Alignment: Reasoning Enables Safer Language Models
by: Guan, Melody Y., et al.
Published: (2024)