Saved in:
| Main Authors: | Ungless, Eddie L., Vitsakis, Nikolas, Talat, Zeerak, Garforth, James, Ross, Björn, Onken, Arno, Kasirzadeh, Atoosa, Birch, Alexandra |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2412.16022 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Ethics Whitepaper: Whitepaper on Ethical Research into Large Language Models
by: Ungless, Eddie L., et al.
Published: (2024)
by: Ungless, Eddie L., et al.
Published: (2024)
Just Because We Camp, Doesn't Mean We Should: The Ethics of Modelling Queer Voices
by: Sigurgeirsson, Atli, et al.
Published: (2024)
by: Sigurgeirsson, Atli, et al.
Published: (2024)
A Capabilities Approach to Studying Bias and Harm in Language Technologies
by: Nigatu, Hellina Hailu, et al.
Published: (2024)
by: Nigatu, Hellina Hailu, et al.
Published: (2024)
Experiences of Censorship on TikTok Across Marginalised Identities
by: Ungless, Eddie L., et al.
Published: (2024)
by: Ungless, Eddie L., et al.
Published: (2024)
"Till I can get my satisfaction": Open Questions in the Public Desire to Punish AI
by: Ungless, Eddie L., et al.
Published: (2025)
by: Ungless, Eddie L., et al.
Published: (2025)
CIVICS: Building a Dataset for Examining Culturally-Informed Values in Large Language Models
by: Pistilli, Giada, et al.
Published: (2024)
by: Pistilli, Giada, et al.
Published: (2024)
Impoverished Language Technology: The Lack of (Social) Class in NLP
by: Curry, Amanda Cercas, et al.
Published: (2024)
by: Curry, Amanda Cercas, et al.
Published: (2024)
Online Learning Defense against Iterative Jailbreak Attacks via Prompt Optimization
by: Kaneko, Masahiro, et al.
Published: (2025)
by: Kaneko, Masahiro, et al.
Published: (2025)
Subjective $\textit{Isms}$? On the Danger of Conflating Hate and Offence in Abusive Language Detection
by: Curry, Amanda Cercas, et al.
Published: (2024)
by: Curry, Amanda Cercas, et al.
Published: (2024)
Understanding "Democratization" in NLP and ML Research
by: Subramonian, Arjun, et al.
Published: (2024)
by: Subramonian, Arjun, et al.
Published: (2024)
Unrequited Emotions: Investigating the Gaps in Motivation and Practice in Speech Emotion Recognition Research
by: Wong, Taryn, et al.
Published: (2026)
by: Wong, Taryn, et al.
Published: (2026)
Voices in a Crowd: Searching for Clusters of Unique Perspectives
by: Vitsakis, Nikolas, et al.
Published: (2024)
by: Vitsakis, Nikolas, et al.
Published: (2024)
Measurement challenges in AI catastrophic risk governance and safety frameworks
by: Kasirzadeh, Atoosa
Published: (2024)
by: Kasirzadeh, Atoosa
Published: (2024)
Two Types of AI Existential Risk: Decisive and Accumulative
by: Kasirzadeh, Atoosa
Published: (2024)
by: Kasirzadeh, Atoosa
Published: (2024)
Zero-shot Sentiment Analysis in Low-Resource Languages Using a Multilingual Sentiment Lexicon
by: Koto, Fajri, et al.
Published: (2024)
by: Koto, Fajri, et al.
Published: (2024)
Evaluating Long-Term Memory for Long-Context Question Answering
by: Terranova, Alessandra, et al.
Published: (2025)
by: Terranova, Alessandra, et al.
Published: (2025)
Exploitation All the Way Down: Calling out the Root Cause of Bad Online Experiences for Users of the "Majority World"
by: Nigatu, Hellina Hailu, et al.
Published: (2024)
by: Nigatu, Hellina Hailu, et al.
Published: (2024)
Classist Tools: Social Class Correlates with Performance in NLP
by: Curry, Amanda Cercas, et al.
Published: (2024)
by: Curry, Amanda Cercas, et al.
Published: (2024)
Investigating the Role of Instruction Variety and Task Difficulty in Robotic Manipulation Tasks
by: Parekh, Amit, et al.
Published: (2024)
by: Parekh, Amit, et al.
Published: (2024)
CROPE: Evaluating In-Context Adaptation of Vision and Language Models to Culture-Specific Concepts
by: Nikandrou, Malvina, et al.
Published: (2024)
by: Nikandrou, Malvina, et al.
Published: (2024)
FedMental: Evaluating Federated Learning for Mental Health Detection from Social Media Data
by: Abdelkadir, Nuredin Ali, et al.
Published: (2026)
by: Abdelkadir, Nuredin Ali, et al.
Published: (2026)
The Perspectivist Paradigm Shift: Assumptions and Challenges of Capturing Human Labels
by: Fleisig, Eve, et al.
Published: (2024)
by: Fleisig, Eve, et al.
Published: (2024)
Re-examining Sexism and Misogyny Classification with Annotator Attitudes
by: Jiang, Aiqi, et al.
Published: (2024)
by: Jiang, Aiqi, et al.
Published: (2024)
AI Safety for Everyone
by: Gyevnar, Balint, et al.
Published: (2025)
by: Gyevnar, Balint, et al.
Published: (2025)
Beyond Model Interpretability: Socio-Structural Explanations in Machine Learning
by: Smart, Andrew, et al.
Published: (2024)
by: Smart, Andrew, et al.
Published: (2024)
Explanation Hacking: The perils of algorithmic recourse
by: Sullivan, Emily, et al.
Published: (2024)
by: Sullivan, Emily, et al.
Published: (2024)
Characterizing AI Agents for Alignment and Governance
by: Kasirzadeh, Atoosa, et al.
Published: (2025)
by: Kasirzadeh, Atoosa, et al.
Published: (2025)
Bridging the Gap in the Responsible AI Divides
by: Gyevnár, Bálint, et al.
Published: (2026)
by: Gyevnár, Bálint, et al.
Published: (2026)
Code-Switching in End-to-End Automatic Speech Recognition: A Systematic Literature Review
by: Agro, Maha Tufail, et al.
Published: (2025)
by: Agro, Maha Tufail, et al.
Published: (2025)
Generative Value Conflicts Reveal LLM Priorities
by: Liu, Andy, et al.
Published: (2025)
by: Liu, Andy, et al.
Published: (2025)
IYKYK: Using language models to decode extremist cryptolects
by: de Kock, Christine, et al.
Published: (2025)
by: de Kock, Christine, et al.
Published: (2025)
Exploring the Limitations of Detecting Machine-Generated Text
by: Doughman, Jad, et al.
Published: (2024)
by: Doughman, Jad, et al.
Published: (2024)
Personal Attribute Leakage in Federated Speech Models
by: Al-Ali, Hamdan, et al.
Published: (2025)
by: Al-Ali, Hamdan, et al.
Published: (2025)
EVALUESTEER: Measuring Reward Model Steerability Towards Values and Preferences
by: Ghate, Kshitish, et al.
Published: (2025)
by: Ghate, Kshitish, et al.
Published: (2025)
Epistemic Injustice in Generative AI
by: Kay, Jackie, et al.
Published: (2024)
by: Kay, Jackie, et al.
Published: (2024)
AI, Digital Platforms, and the New Systemic Risk
by: Hacker, Philipp, et al.
Published: (2025)
by: Hacker, Philipp, et al.
Published: (2025)
Optimising Calls to Large Language Models with Uncertainty-Based Two-Tier Selection
by: Ramírez, Guillem, et al.
Published: (2024)
by: Ramírez, Guillem, et al.
Published: (2024)
The Ups and Downs of Large Language Model Inference with Vocabulary Trimming by Language Heuristics
by: Bogoychev, Nikolay, et al.
Published: (2023)
by: Bogoychev, Nikolay, et al.
Published: (2023)
Bridging the Language Gaps in Large Language Models with Inference-Time Cross-Lingual Intervention
by: Wang, Weixuan, et al.
Published: (2024)
by: Wang, Weixuan, et al.
Published: (2024)
Aligning Stuttered-Speech Research with End-User Needs: Scoping Review, Survey, and Guidelines
by: Toyin, Hawau Olamide, et al.
Published: (2026)
by: Toyin, Hawau Olamide, et al.
Published: (2026)
Similar Items
-
Ethics Whitepaper: Whitepaper on Ethical Research into Large Language Models
by: Ungless, Eddie L., et al.
Published: (2024) -
Just Because We Camp, Doesn't Mean We Should: The Ethics of Modelling Queer Voices
by: Sigurgeirsson, Atli, et al.
Published: (2024) -
A Capabilities Approach to Studying Bias and Harm in Language Technologies
by: Nigatu, Hellina Hailu, et al.
Published: (2024) -
Experiences of Censorship on TikTok Across Marginalised Identities
by: Ungless, Eddie L., et al.
Published: (2024) -
"Till I can get my satisfaction": Open Questions in the Public Desire to Punish AI
by: Ungless, Eddie L., et al.
Published: (2025)