| Main Authors: | Rienecker, Jasmine, Mpofu, Katarina, Goel, Naman, Datta, Siddhartha, Zhao, Jun, Danielsson, Oscar, Thorsen, Fredrik |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2603.18300 |
Similar Items
AI vs. Human Moderators: A Comparative Evaluation of Multimodal LLMs in Content Moderation for Brand Safety
by: Levi, Adi, et al.
Published: (2025)
Policy Cards: Machine-Readable Runtime Governance for Autonomous AI Agents
by: Mavračić, Juraj
Published: (2025)
Integration of Contextual Descriptors in Ontology Alignment for Enrichment of Semantic Correspondence
by: Manziuk, Eduard, et al.
Published: (2024)
Why we need an AI-resilient society
by: Bartz-Beielstein, Thomas
Published: (2019)
The Company You Keep: How LLMs Respond to Dark Triad Traits
by: Lu, Zeyi, et al.
Published: (2026)
The Language Labyrinth: Constructive Critique on the Terminology Used in the AI Discourse
by: Rehak, Rainer
Published: (2023)
PoliCon: Evaluating LLMs on Achieving Diverse Political Consensus Objectives
by: Zhang, Zhaowei, et al.
Published: (2025)
The Hall of AI Fears and Hopes: Comparing the Views of AI Influencers and those of Members of the U.S. Public Through an Interactive Platform
by: Moreira, Gustavo, et al.
Published: (2025)
AI Narrative Breakdown. A Critical Assessment of Power and Promise
by: Rehak, Rainer
Published: (2026)
Fairness Is Not Enough: Auditing Competence and Intersectional Bias in AI-powered Resume Screening
by: Webster, Kevin T
Published: (2025)
Big Help or Big Brother? Auditing Tracking, Profiling, and Personalization in Generative AI Assistants
by: Vekaria, Yash, et al.
Published: (2025)
"What Is It That You Don't Understand?" Language Games and Black Box Algorithms
by: Demichelis, Remy
Published: (2026)
AI Ethics Principles in Practice: Perspectives of Designers and Developers
by: Sanderson, Conrad, et al.
Published: (2021)
Governed Auditable Decisioning Under Uncertainty: Synthesis and Agentic Extension
by: Solozobov, Oleg
Published: (2026)
A Real-Time Diminished Reality Approach to Privacy in MR Collaboration
by: Fane, Christian
Published: (2025)
Cultural Encoding in Large Language Models: The Existence Gap in AI-Mediated Brand Discovery
by: Junyao, Huang, et al.
Published: (2025)
From Native Memes to Global Moderation: Cross-Cultural Evaluation of Vision-Language Models for Hateful Meme Detection
by: Wang, Mo, et al.
Published: (2026)
The Cultural Gene of Large Language Models: A Study on the Impact of Cross-Corpus Training on Model Values and Biases
by: Fenech-Borg, Emanuel Z., et al.
Published: (2025)
The Necessity of AI Audit Standards Boards
by: Manheim, David, et al.
Published: (2024)
Harmful Terms and Where to Find Them: Measuring and Modeling Unfavorable Financial Terms and Conditions in Shopping Websites at Scale
by: Tsai, Elisa, et al.
Published: (2025)
AI-Augmented Science and the New Institutional Scarcities
by: Lovén, Lauri
Published: (2026)
Generative AI and the Future of the Digital Commons: Five Open Questions and Knowledge Gaps
by: Noroozian, Arman, et al.
Published: (2025)
Co-design for Trustworthy AI: An Interpretable and Explainable Tool for Type 2 Diabetes Prediction Using Genomic Polygenic Risk Scores
by: Beuthan, Ralf, et al.
Published: (2026)
From Deception to Perception: The Surprising Benefits of Deepfakes for Detecting, Measuring, and Mitigating Bias
by: Liu, Yizhi, et al.
Published: (2025)
A Roadmap for Multilingual, Multimodal Domain Independent Deception Detection
by: Boumber, Dainis, et al.
Published: (2024)
The Invisible Coalition Partner: How LLMs Vote When Democracy Gets Concrete
by: Barmettler, Joel
Published: (2026)
Closing the SNAP Gap: Identifying Under-Enrollment in High-Poverty ZIP Codes
by: Ray, Auyona
Published: (2025)
The Accountability Paradox: How Platform API Restrictions Undermine AI Transparency Mandates
by: Burnat, Florian A. D., et al.
Published: (2025)
HybridVFL: Disentangled Feature Learning for Edge-Enabled Vertical Federated Multimodal Classification
by: Anoosha, Mostafa, et al.
Published: (2025)
AI to Learn 2.0: A Deliverable-Oriented Governance Framework and Maturity Rubric for Opaque AI in Learning-Intensive Domains
by: Shintani, Seine A.
Published: (2026)
Formal Proofs as Structured Explanations: Proposing Several Tasks on Explainable Natural Language Inference
by: Abzianidze, Lasha
Published: (2023)
Institutions for the Post-Scarcity of Judgment
by: Lovén, Lauri
Published: (2026)
Uncovering Bugs in Formal Explainers: A Case Study with PyXAI
by: Huang, Xuanxiang, et al.
Published: (2025)
Beyond Imperfect Alternatives with Rulemapping: A Neuro-Symbolic Case Study on Online Hate Speech
by: von Cossel, Oskar
Published: (2026)
REMIND: Input Loss Landscapes Reveal Residual Memorization in Post-Unlearning LLMs
by: Cohen, Liran, et al.
Published: (2025)
Can LLMs Understand What We Cannot Say? Measuring Multilevel Alignment Through Abortion Stigma Across Cognitive, Interpersonal, and Structural Levels
by: Sharma, Anika, et al.
Published: (2025)
Decoding Memes: A Comparative Study of Machine Learning Models for Template Identification
by: Murgás, Levente, et al.
Published: (2024)
Synthetic Sociality: How Generative Models Privatize the Social Fabric
by: Dodik, Ana, et al.
Published: (2026)
MetaCloak-JPEG: JPEG-Robust Adversarial Perturbation for Preventing Unauthorized DreamBooth-Based Deepfake Generation
by: Fardin, Tanjim Rahaman, et al.
Published: (2026)
A Three Steps Methodological Approach to Legal Governance Validation
by: Casanovas, Pompeu, et al.
Published: (2024)