:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Author:	Tlaie, Alejandro
Format:	Preprint
Published:	2024
Subjects:	Computers and Society Artificial Intelligence
Online Access:	https://arxiv.org/abs/2410.19749
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Large language models in medicine: the potentials and pitfalls
by: Omiye, Jesutofunmi A., et al.
Published: (2023)

Exploring and steering the moral compass of Large Language Models
by: Tlaie, Alejandro
Published: (2024)

Promises and pitfalls of artificial intelligence for legal applications
by: Kapoor, Sayash, et al.
Published: (2024)

Rules, Cases, and Reasoning: Positivist Legal Theory as a Framework for Pluralistic AI Alignment
by: Caputo, Nicholas A.
Published: (2024)

Securing External Deeper-than-black-box GPAI Evaluations
by: Tlaie, Alejandro, et al.
Published: (2025)

The AI Alignment Paradox
by: West, Robert, et al.
Published: (2024)

Rethinking AI Cultural Alignment
by: Bravansky, Michal, et al.
Published: (2025)

Justifications for Democratizing AI Alignment and Their Prospects
by: Steingrüber, André, et al.
Published: (2025)

AI and Social Theory
by: Mokander, Jakob, et al.
Published: (2024)

Understanding the Process of Human-AI Value Alignment
by: McKinlay, Jack, et al.
Published: (2025)

Privacy Ethics Alignment in AI: A Stakeholder-Centric Framework for Ethical AI
by: Barthwal, Ankur, et al.
Published: (2025)

Whose Truth? Pluralistic Geo-Alignment for (Agentic) AI
by: Janowicz, Krzysztof, et al.
Published: (2025)

Addressing the regulatory gap: moving towards an EU AI audit ecosystem beyond the AI Act by including civil society
by: Hartmann, David, et al.
Published: (2024)

Security, privacy, and agentic AI in a regulatory view: From definitions and distinctions to provisions and reflections
by: Zhang, Shiliang, et al.
Published: (2026)

Building the ethical AI framework of the future: from philosophy to practice
by: Catapang, Jasper Kyle
Published: (2026)

The Value of Gen-AI Conversations: A bottom-up Framework for AI Value Alignment
by: Motnikar, Lenart, et al.
Published: (2025)

AI and Human Oversight: A Risk-Based Framework for Alignment
by: Kandikatla, Laxmiraju, et al.
Published: (2025)

AI threats to national security can be countered through an incident regime
by: Ortega, Alejandro
Published: (2025)

AI Alignment at Your Discretion
by: Buyl, Maarten, et al.
Published: (2025)

Alignment, Agency and Autonomy in Frontier AI: A Systems Engineering Perspective
by: Tallam, Krti
Published: (2025)

Representative Social Choice: From Learning Theory to AI Alignment
by: Qiu, Tianyi
Published: (2024)

Characterizing AI Agents for Alignment and Governance
by: Kasirzadeh, Atoosa, et al.
Published: (2025)

AI for bureaucratic productivity: Measuring the potential of AI to help automate 143 million UK government transactions
by: Straub, Vincent J., et al.
Published: (2024)

Alignment Is Not Enough: A Relational Framework for Moral Standing in Human-AI Interaction
by: Pasandi, Faezeh B., et al.
Published: (2026)

Wide Reflective Equilibrium in LLM Alignment: Bridging Moral Epistemology and AI Safety
by: Brophy, Matthew
Published: (2025)

Large language models eroding science understanding: an experimental study
by: Collins, Harry, et al.
Published: (2026)

ELEPHANT: Measuring and understanding social sycophancy in LLMs
by: Cheng, Myra, et al.
Published: (2025)

Teacher agency in the age of generative AI: towards a framework of hybrid intelligence for learning design
by: Frøsig, Thomas B, et al.
Published: (2024)

AI-Educational Development Loop (AI-EDL): A Conceptual Framework to Bridge AI Capabilities with Classical Educational Theories
by: Yu, Ning, et al.
Published: (2025)

The Coming Crisis of Multi-Agent Misalignment: AI Alignment Must Be a Dynamic and Social Process
by: Carichon, Florian, et al.
Published: (2025)

Alignment Debt: The Hidden Work of Making AI Usable
by: Oyemike, Cumi, et al.
Published: (2025)

From tools to thieves: Measuring and understanding public perceptions of AI through crowdsourced metaphors
by: Cheng, Myra, et al.
Published: (2025)

Standing on FURM ground -- A framework for evaluating Fair, Useful, and Reliable AI Models in healthcare systems
by: Callahan, Alison, et al.
Published: (2024)

AI Thinking: A framework for rethinking artificial intelligence in practice
by: Newman-Griffis, Denis
Published: (2024)

Balancing Safety and Helpfulness in Healthcare AI Assistants through Iterative Preference Alignment
by: Nghiem, Huy, et al.
Published: (2025)

Deconstructing Student Perceptions of Generative AI (GenAI) through an Expectancy Value Theory (EVT)-based Instrument
by: Chan, Cecilia Ka Yuk, et al.
Published: (2023)

Why AI Alignment Failure Is Structural: Learned Human Interaction Structures and AGI as an Endogenous Evolutionary Shock
by: Sornette, Didier, et al.
Published: (2026)

Risk Alignment in Agentic AI Systems
by: Clatterbuck, Hayley, et al.
Published: (2024)

Commercial Persuasion in AI-Mediated Conversations
by: Salvi, Francesco, et al.
Published: (2026)

Bidirectional Human-AI Alignment in Education for Trustworthy Learning Environments
by: Shen, Hua
Published: (2025)