Saved in:
| Main Authors: | Reis, Ben Y., La Cava, William |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2508.06592 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Culturally Grounded Personas in Large Language Models: Characterization and Alignment with Socio-Psychological Value Frameworks
by: Greco, Candida M., et al.
Published: (2026)
by: Greco, Candida M., et al.
Published: (2026)
Luminol-AIDetect: Fast Zero-shot Machine-Generated Text Detection based on Perplexity under Text Shuffling
by: La Cava, Lucio, et al.
Published: (2026)
by: La Cava, Lucio, et al.
Published: (2026)
Toward Preference-aligned Large Language Models via Residual-based Model Steering
by: La Cava, Lucio, et al.
Published: (2025)
by: La Cava, Lucio, et al.
Published: (2025)
OpenTuringBench: An Open-Model-based Benchmark and Framework for Machine-Generated Text Detection and Attribution
by: La Cava, Lucio, et al.
Published: (2025)
by: La Cava, Lucio, et al.
Published: (2025)
Open Models, Closed Minds? On Agents Capabilities in Mimicking Human Personalities through Open Large Language Models
by: La Cava, Lucio, et al.
Published: (2024)
by: La Cava, Lucio, et al.
Published: (2024)
Is Contrasting All You Need? Contrastive Learning for the Detection and Attribution of AI-generated Text
by: La Cava, Lucio, et al.
Published: (2024)
by: La Cava, Lucio, et al.
Published: (2024)
Talking the Talk Does Not Entail Walking the Walk: On the Limits of Large Language Models in Lexical Entailment Recognition
by: Greco, Candida M., et al.
Published: (2024)
by: Greco, Candida M., et al.
Published: (2024)
Authorship Attribution in Multilingual Machine-Generated Texts
by: La Cava, Lucio, et al.
Published: (2025)
by: La Cava, Lucio, et al.
Published: (2025)
Relative Principals, Pluralistic Alignment, and the Structural Value Alignment Problem
by: LaCroix, Travis
Published: (2026)
by: LaCroix, Travis
Published: (2026)
Visually Wired NFTs: Exploring the Role of Inspiration in Non-Fungible Tokens
by: La Cava, Lucio, et al.
Published: (2023)
by: La Cava, Lucio, et al.
Published: (2023)
Chain of Alignment: Integrating Public Will with Expert Intelligence for Language Model Alignment
by: Konya, Andrew, et al.
Published: (2024)
by: Konya, Andrew, et al.
Published: (2024)
Slurry-as-a-Service: A Modest Proposal on Scalable Pluralistic Alignment for Nutrient Optimization
by: Hong, Rachel, et al.
Published: (2026)
by: Hong, Rachel, et al.
Published: (2026)
The AI Alignment Paradox
by: West, Robert, et al.
Published: (2024)
by: West, Robert, et al.
Published: (2024)
Being Considerate as a Pathway Towards Pluralistic Alignment for Agentic AI
by: Alamdari, Parand A., et al.
Published: (2024)
by: Alamdari, Parand A., et al.
Published: (2024)
Societal Alignment Frameworks Can Improve LLM Alignment
by: Stańczak, Karolina, et al.
Published: (2025)
by: Stańczak, Karolina, et al.
Published: (2025)
Rethinking AI Cultural Alignment
by: Bravansky, Michal, et al.
Published: (2025)
by: Bravansky, Michal, et al.
Published: (2025)
The Emotional Alignment Design Policy
by: Schwitzgebel, Eric, et al.
Published: (2025)
by: Schwitzgebel, Eric, et al.
Published: (2025)
Scopes of Alignment
by: Varshney, Kush R., et al.
Published: (2025)
by: Varshney, Kush R., et al.
Published: (2025)
Ethical Challenges and Evolving Strategies in the Integration of Artificial Intelligence into Clinical Practice
by: Weiner, Ellison B., et al.
Published: (2024)
by: Weiner, Ellison B., et al.
Published: (2024)
An Evaluation of Cultural Value Alignment in LLM
by: Sukiennik, Nicholas, et al.
Published: (2025)
by: Sukiennik, Nicholas, et al.
Published: (2025)
Justifications for Democratizing AI Alignment and Their Prospects
by: Steingrüber, André, et al.
Published: (2025)
by: Steingrüber, André, et al.
Published: (2025)
Resurrecting Socrates in the Age of AI: A Study Protocol for Evaluating a Socratic Tutor to Support Research Question Development in Higher Education
by: Degen, Ben
Published: (2025)
by: Degen, Ben
Published: (2025)
Understanding the Process of Human-AI Value Alignment
by: McKinlay, Jack, et al.
Published: (2025)
by: McKinlay, Jack, et al.
Published: (2025)
Fluent but Foreign: Even Regional LLMs Lack Cultural Alignment
by: Agarwal, Dhruv, et al.
Published: (2025)
by: Agarwal, Dhruv, et al.
Published: (2025)
Whose Truth? Pluralistic Geo-Alignment for (Agentic) AI
by: Janowicz, Krzysztof, et al.
Published: (2025)
by: Janowicz, Krzysztof, et al.
Published: (2025)
Simple Role Assignment is Extraordinarily Effective for Safety Alignment
by: Ziheng, Zhou, et al.
Published: (2026)
by: Ziheng, Zhou, et al.
Published: (2026)
Dynamic Normativity: Necessary and Sufficient Conditions for Value Alignment
by: Corrêa, Nicholas Kluge
Published: (2024)
by: Corrêa, Nicholas Kluge
Published: (2024)
AI and Human Oversight: A Risk-Based Framework for Alignment
by: Kandikatla, Laxmiraju, et al.
Published: (2025)
by: Kandikatla, Laxmiraju, et al.
Published: (2025)
Alignment as Jurisprudence
by: Caputo, Nicholas
Published: (2026)
by: Caputo, Nicholas
Published: (2026)
Alignment, Agency and Autonomy in Frontier AI: A Systems Engineering Perspective
by: Tallam, Krti
Published: (2025)
by: Tallam, Krti
Published: (2025)
COMPASS: A Framework for Evaluating Organization-Specific Policy Alignment in LLMs
by: Choi, Dasol, et al.
Published: (2026)
by: Choi, Dasol, et al.
Published: (2026)
Using AI Alignment Theory to understand the potential pitfalls of regulatory frameworks
by: Tlaie, Alejandro
Published: (2024)
by: Tlaie, Alejandro
Published: (2024)
Towards Experience-Centered AI: A Framework for Integrating Lived Experience in Design and Development
by: Gautam, Sanjana, et al.
Published: (2025)
by: Gautam, Sanjana, et al.
Published: (2025)
Safeguarding Decentralized Social Media: LLM Agents for Automating Community Rule Compliance
by: La Cava, Lucio, et al.
Published: (2024)
by: La Cava, Lucio, et al.
Published: (2024)
Wide Reflective Equilibrium in LLM Alignment: Bridging Moral Epistemology and AI Safety
by: Brophy, Matthew
Published: (2025)
by: Brophy, Matthew
Published: (2025)
Privacy Ethics Alignment in AI: A Stakeholder-Centric Framework for Ethical AI
by: Barthwal, Ankur, et al.
Published: (2025)
by: Barthwal, Ankur, et al.
Published: (2025)
Are Dilemmas and Conflicts in LLM Alignment Solvable? A View from Priority Graph
by: Tang, Zhenheng, et al.
Published: (2026)
by: Tang, Zhenheng, et al.
Published: (2026)
Alignment Is Not Enough: A Relational Framework for Moral Standing in Human-AI Interaction
by: Pasandi, Faezeh B., et al.
Published: (2026)
by: Pasandi, Faezeh B., et al.
Published: (2026)
Toward AI Systems That Understand Self and Others: A Multi-Phase Inference Framework for Human Cognitive Diversity and World-Model Alignment
by: Takahashi, Toru
Published: (2026)
by: Takahashi, Toru
Published: (2026)
Dropouts in Confidence: Moral Uncertainty in Human-LLM Alignment
by: Kwon, Jea, et al.
Published: (2025)
by: Kwon, Jea, et al.
Published: (2025)
Similar Items
-
Culturally Grounded Personas in Large Language Models: Characterization and Alignment with Socio-Psychological Value Frameworks
by: Greco, Candida M., et al.
Published: (2026) -
Luminol-AIDetect: Fast Zero-shot Machine-Generated Text Detection based on Perplexity under Text Shuffling
by: La Cava, Lucio, et al.
Published: (2026) -
Toward Preference-aligned Large Language Models via Residual-based Model Steering
by: La Cava, Lucio, et al.
Published: (2025) -
OpenTuringBench: An Open-Model-based Benchmark and Framework for Machine-Generated Text Detection and Attribution
by: La Cava, Lucio, et al.
Published: (2025) -
Open Models, Closed Minds? On Agents Capabilities in Mimicking Human Personalities through Open Large Language Models
by: La Cava, Lucio, et al.
Published: (2024)