Saved in:
| Main Author: | Tlaie, Alejandro |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2410.19749 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Large language models in medicine: the potentials and pitfalls
by: Omiye, Jesutofunmi A., et al.
Published: (2023)
by: Omiye, Jesutofunmi A., et al.
Published: (2023)
Exploring and steering the moral compass of Large Language Models
by: Tlaie, Alejandro
Published: (2024)
by: Tlaie, Alejandro
Published: (2024)
Promises and pitfalls of artificial intelligence for legal applications
by: Kapoor, Sayash, et al.
Published: (2024)
by: Kapoor, Sayash, et al.
Published: (2024)
Rules, Cases, and Reasoning: Positivist Legal Theory as a Framework for Pluralistic AI Alignment
by: Caputo, Nicholas A.
Published: (2024)
by: Caputo, Nicholas A.
Published: (2024)
Securing External Deeper-than-black-box GPAI Evaluations
by: Tlaie, Alejandro, et al.
Published: (2025)
by: Tlaie, Alejandro, et al.
Published: (2025)
The AI Alignment Paradox
by: West, Robert, et al.
Published: (2024)
by: West, Robert, et al.
Published: (2024)
Rethinking AI Cultural Alignment
by: Bravansky, Michal, et al.
Published: (2025)
by: Bravansky, Michal, et al.
Published: (2025)
Justifications for Democratizing AI Alignment and Their Prospects
by: Steingrüber, André, et al.
Published: (2025)
by: Steingrüber, André, et al.
Published: (2025)
AI and Social Theory
by: Mokander, Jakob, et al.
Published: (2024)
by: Mokander, Jakob, et al.
Published: (2024)
Understanding the Process of Human-AI Value Alignment
by: McKinlay, Jack, et al.
Published: (2025)
by: McKinlay, Jack, et al.
Published: (2025)
Privacy Ethics Alignment in AI: A Stakeholder-Centric Framework for Ethical AI
by: Barthwal, Ankur, et al.
Published: (2025)
by: Barthwal, Ankur, et al.
Published: (2025)
Whose Truth? Pluralistic Geo-Alignment for (Agentic) AI
by: Janowicz, Krzysztof, et al.
Published: (2025)
by: Janowicz, Krzysztof, et al.
Published: (2025)
Addressing the regulatory gap: moving towards an EU AI audit ecosystem beyond the AI Act by including civil society
by: Hartmann, David, et al.
Published: (2024)
by: Hartmann, David, et al.
Published: (2024)
Security, privacy, and agentic AI in a regulatory view: From definitions and distinctions to provisions and reflections
by: Zhang, Shiliang, et al.
Published: (2026)
by: Zhang, Shiliang, et al.
Published: (2026)
Building the ethical AI framework of the future: from philosophy to practice
by: Catapang, Jasper Kyle
Published: (2026)
by: Catapang, Jasper Kyle
Published: (2026)
The Value of Gen-AI Conversations: A bottom-up Framework for AI Value Alignment
by: Motnikar, Lenart, et al.
Published: (2025)
by: Motnikar, Lenart, et al.
Published: (2025)
AI and Human Oversight: A Risk-Based Framework for Alignment
by: Kandikatla, Laxmiraju, et al.
Published: (2025)
by: Kandikatla, Laxmiraju, et al.
Published: (2025)
AI threats to national security can be countered through an incident regime
by: Ortega, Alejandro
Published: (2025)
by: Ortega, Alejandro
Published: (2025)
AI Alignment at Your Discretion
by: Buyl, Maarten, et al.
Published: (2025)
by: Buyl, Maarten, et al.
Published: (2025)
Alignment, Agency and Autonomy in Frontier AI: A Systems Engineering Perspective
by: Tallam, Krti
Published: (2025)
by: Tallam, Krti
Published: (2025)
Representative Social Choice: From Learning Theory to AI Alignment
by: Qiu, Tianyi
Published: (2024)
by: Qiu, Tianyi
Published: (2024)
Characterizing AI Agents for Alignment and Governance
by: Kasirzadeh, Atoosa, et al.
Published: (2025)
by: Kasirzadeh, Atoosa, et al.
Published: (2025)
AI for bureaucratic productivity: Measuring the potential of AI to help automate 143 million UK government transactions
by: Straub, Vincent J., et al.
Published: (2024)
by: Straub, Vincent J., et al.
Published: (2024)
Alignment Is Not Enough: A Relational Framework for Moral Standing in Human-AI Interaction
by: Pasandi, Faezeh B., et al.
Published: (2026)
by: Pasandi, Faezeh B., et al.
Published: (2026)
Wide Reflective Equilibrium in LLM Alignment: Bridging Moral Epistemology and AI Safety
by: Brophy, Matthew
Published: (2025)
by: Brophy, Matthew
Published: (2025)
Large language models eroding science understanding: an experimental study
by: Collins, Harry, et al.
Published: (2026)
by: Collins, Harry, et al.
Published: (2026)
ELEPHANT: Measuring and understanding social sycophancy in LLMs
by: Cheng, Myra, et al.
Published: (2025)
by: Cheng, Myra, et al.
Published: (2025)
Teacher agency in the age of generative AI: towards a framework of hybrid intelligence for learning design
by: Frøsig, Thomas B, et al.
Published: (2024)
by: Frøsig, Thomas B, et al.
Published: (2024)
AI-Educational Development Loop (AI-EDL): A Conceptual Framework to Bridge AI Capabilities with Classical Educational Theories
by: Yu, Ning, et al.
Published: (2025)
by: Yu, Ning, et al.
Published: (2025)
The Coming Crisis of Multi-Agent Misalignment: AI Alignment Must Be a Dynamic and Social Process
by: Carichon, Florian, et al.
Published: (2025)
by: Carichon, Florian, et al.
Published: (2025)
Alignment Debt: The Hidden Work of Making AI Usable
by: Oyemike, Cumi, et al.
Published: (2025)
by: Oyemike, Cumi, et al.
Published: (2025)
From tools to thieves: Measuring and understanding public perceptions of AI through crowdsourced metaphors
by: Cheng, Myra, et al.
Published: (2025)
by: Cheng, Myra, et al.
Published: (2025)
Standing on FURM ground -- A framework for evaluating Fair, Useful, and Reliable AI Models in healthcare systems
by: Callahan, Alison, et al.
Published: (2024)
by: Callahan, Alison, et al.
Published: (2024)
AI Thinking: A framework for rethinking artificial intelligence in practice
by: Newman-Griffis, Denis
Published: (2024)
by: Newman-Griffis, Denis
Published: (2024)
Balancing Safety and Helpfulness in Healthcare AI Assistants through Iterative Preference Alignment
by: Nghiem, Huy, et al.
Published: (2025)
by: Nghiem, Huy, et al.
Published: (2025)
Deconstructing Student Perceptions of Generative AI (GenAI) through an Expectancy Value Theory (EVT)-based Instrument
by: Chan, Cecilia Ka Yuk, et al.
Published: (2023)
by: Chan, Cecilia Ka Yuk, et al.
Published: (2023)
Why AI Alignment Failure Is Structural: Learned Human Interaction Structures and AGI as an Endogenous Evolutionary Shock
by: Sornette, Didier, et al.
Published: (2026)
by: Sornette, Didier, et al.
Published: (2026)
Risk Alignment in Agentic AI Systems
by: Clatterbuck, Hayley, et al.
Published: (2024)
by: Clatterbuck, Hayley, et al.
Published: (2024)
Commercial Persuasion in AI-Mediated Conversations
by: Salvi, Francesco, et al.
Published: (2026)
by: Salvi, Francesco, et al.
Published: (2026)
Bidirectional Human-AI Alignment in Education for Trustworthy Learning Environments
by: Shen, Hua
Published: (2025)
by: Shen, Hua
Published: (2025)
Similar Items
-
Large language models in medicine: the potentials and pitfalls
by: Omiye, Jesutofunmi A., et al.
Published: (2023) -
Exploring and steering the moral compass of Large Language Models
by: Tlaie, Alejandro
Published: (2024) -
Promises and pitfalls of artificial intelligence for legal applications
by: Kapoor, Sayash, et al.
Published: (2024) -
Rules, Cases, and Reasoning: Positivist Legal Theory as a Framework for Pluralistic AI Alignment
by: Caputo, Nicholas A.
Published: (2024) -
Securing External Deeper-than-black-box GPAI Evaluations
by: Tlaie, Alejandro, et al.
Published: (2025)