:: Library Catalog

Bibliographic Details
Main Authors:	Ungless, Eddie L., Vitsakis, Nikolas, Talat, Zeerak, Garforth, James, Ross, Björn, Onken, Arno, Kasirzadeh, Atoosa, Birch, Alexandra
Format:	Preprint
Published:	2024
Subjects:	Computation and Language; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2412.16022

Similar Items

Ethics Whitepaper: Whitepaper on Ethical Research into Large Language Models
by: Ungless, Eddie L., et al.
Published: (2024)

Just Because We Camp, Doesn't Mean We Should: The Ethics of Modelling Queer Voices
by: Sigurgeirsson, Atli, et al.
Published: (2024)

A Capabilities Approach to Studying Bias and Harm in Language Technologies
by: Nigatu, Hellina Hailu, et al.
Published: (2024)

Experiences of Censorship on TikTok Across Marginalised Identities
by: Ungless, Eddie L., et al.
Published: (2024)

"Till I can get my satisfaction": Open Questions in the Public Desire to Punish AI
by: Ungless, Eddie L., et al.
Published: (2025)

CIVICS: Building a Dataset for Examining Culturally-Informed Values in Large Language Models
by: Pistilli, Giada, et al.
Published: (2024)

Impoverished Language Technology: The Lack of (Social) Class in NLP
by: Curry, Amanda Cercas, et al.
Published: (2024)

Online Learning Defense against Iterative Jailbreak Attacks via Prompt Optimization
by: Kaneko, Masahiro, et al.
Published: (2025)

Subjective $\textit{Isms}$? On the Danger of Conflating Hate and Offence in Abusive Language Detection
by: Curry, Amanda Cercas, et al.
Published: (2024)

Understanding "Democratization" in NLP and ML Research
by: Subramonian, Arjun, et al.
Published: (2024)

Unrequited Emotions: Investigating the Gaps in Motivation and Practice in Speech Emotion Recognition Research
by: Wong, Taryn, et al.
Published: (2026)

Voices in a Crowd: Searching for Clusters of Unique Perspectives
by: Vitsakis, Nikolas, et al.
Published: (2024)

Measurement challenges in AI catastrophic risk governance and safety frameworks
by: Kasirzadeh, Atoosa
Published: (2024)

Two Types of AI Existential Risk: Decisive and Accumulative
by: Kasirzadeh, Atoosa
Published: (2024)

Zero-shot Sentiment Analysis in Low-Resource Languages Using a Multilingual Sentiment Lexicon
by: Koto, Fajri, et al.
Published: (2024)

Evaluating Long-Term Memory for Long-Context Question Answering
by: Terranova, Alessandra, et al.
Published: (2025)

Exploitation All the Way Down: Calling out the Root Cause of Bad Online Experiences for Users of the "Majority World"
by: Nigatu, Hellina Hailu, et al.
Published: (2024)

Classist Tools: Social Class Correlates with Performance in NLP
by: Curry, Amanda Cercas, et al.
Published: (2024)

Investigating the Role of Instruction Variety and Task Difficulty in Robotic Manipulation Tasks
by: Parekh, Amit, et al.
Published: (2024)

CROPE: Evaluating In-Context Adaptation of Vision and Language Models to Culture-Specific Concepts
by: Nikandrou, Malvina, et al.
Published: (2024)

FedMental: Evaluating Federated Learning for Mental Health Detection from Social Media Data
by: Abdelkadir, Nuredin Ali, et al.
Published: (2026)

The Perspectivist Paradigm Shift: Assumptions and Challenges of Capturing Human Labels
by: Fleisig, Eve, et al.
Published: (2024)

Re-examining Sexism and Misogyny Classification with Annotator Attitudes
by: Jiang, Aiqi, et al.
Published: (2024)

AI Safety for Everyone
by: Gyevnar, Balint, et al.
Published: (2025)

Beyond Model Interpretability: Socio-Structural Explanations in Machine Learning
by: Smart, Andrew, et al.
Published: (2024)

Explanation Hacking: The perils of algorithmic recourse
by: Sullivan, Emily, et al.
Published: (2024)

Characterizing AI Agents for Alignment and Governance
by: Kasirzadeh, Atoosa, et al.
Published: (2025)

Bridging the Gap in the Responsible AI Divides
by: Gyevnár, Bálint, et al.
Published: (2026)

Code-Switching in End-to-End Automatic Speech Recognition: A Systematic Literature Review
by: Agro, Maha Tufail, et al.
Published: (2025)

Generative Value Conflicts Reveal LLM Priorities
by: Liu, Andy, et al.
Published: (2025)

IYKYK: Using language models to decode extremist cryptolects
by: de Kock, Christine, et al.
Published: (2025)

Exploring the Limitations of Detecting Machine-Generated Text
by: Doughman, Jad, et al.
Published: (2024)

Personal Attribute Leakage in Federated Speech Models
by: Al-Ali, Hamdan, et al.
Published: (2025)

EVALUESTEER: Measuring Reward Model Steerability Towards Values and Preferences
by: Ghate, Kshitish, et al.
Published: (2025)

Epistemic Injustice in Generative AI
by: Kay, Jackie, et al.
Published: (2024)

AI, Digital Platforms, and the New Systemic Risk
by: Hacker, Philipp, et al.
Published: (2025)

Optimising Calls to Large Language Models with Uncertainty-Based Two-Tier Selection
by: Ramírez, Guillem, et al.
Published: (2024)

The Ups and Downs of Large Language Model Inference with Vocabulary Trimming by Language Heuristics
by: Bogoychev, Nikolay, et al.
Published: (2023)

Bridging the Language Gaps in Large Language Models with Inference-Time Cross-Lingual Intervention
by: Wang, Weixuan, et al.
Published: (2024)

Aligning Stuttered-Speech Research with End-User Needs: Scoping Review, Survey, and Guidelines
by: Toyin, Hawau Olamide, et al.
Published: (2026)