:: Library Catalog

Imagen de Portada

Guardado en:

Detalles Bibliográficos
Autores principales:	Lupo, Lorenzo, Bose, Paul, Habibi, Mahyar, Hovy, Dirk, Schwarz, Carlo
Formato:	Preprint
Publicado:	2024
Materias:	Computation and Language
Acceso en línea:	https://arxiv.org/abs/2403.05700
Etiquetas:	Agregar Etiqueta Sin Etiquetas, Sea el primero en etiquetar este registro!

Ejemplares similares

The Content Moderator's Dilemma: Removal of Toxic Content and Distortions to Online Discourse
por: Habibi, Mahyar, et al.
Publicado: (2024)

Compromesso! Italian Many-Shot Jailbreaks Undermine the Safety of Large Language Models
por: Pernisi, Fabio, et al.
Publicado: (2024)

Towards Human-Level Text Coding with LLMs: The Case of Fatherhood Roles in Public Policy Documents
por: Lupo, Lorenzo, et al.
Publicado: (2023)

SimBench: Benchmarking the Ability of Large Language Models to Simulate Human Behaviors
por: Hu, Tiancheng, et al.
Publicado: (2025)

Beyond Demographics: Fine-tuning Large Language Models to Predict Individuals' Subjective Text Perceptions
por: Orlikowski, Matthias, et al.
Publicado: (2025)

Beyond Flesch-Kincaid: Prompt-based Metrics Improve Difficulty Classification of Educational Texts
por: Rooein, Donya, et al.
Publicado: (2024)

SafetyPrompts: a Systematic Review of Open Datasets for Evaluating and Improving Large Language Model Safety
por: Röttger, Paul, et al.
Publicado: (2024)

Conversations as a Source for Teaching Scientific Concepts at Different Education Levels
por: Rooein, Donya, et al.
Publicado: (2024)

Narratives at Conflict: Computational Analysis of News Framing in Multilingual Disinformation Campaigns
por: Sinelnik, Antonina, et al.
Publicado: (2024)

ITALIC: An Italian Intent Classification Dataset
por: Koudounas, Alkis, et al.
Publicado: (2023)

The Ecological Fallacy in Annotation: Modelling Human Label Variation goes beyond Sociodemographics
por: Orlikowski, Matthias, et al.
Publicado: (2023)

Do Prompts Reshape Representations? An Empirical Study of Prompting Effects on Embeddings
por: Gonzalez-Gutierrez, Cesar, et al.
Publicado: (2025)

The Pluralistic Moral Gap: Understanding Judgment and Value Differences between Humans and Large Language Models
por: Russo, Giuseppe, et al.
Publicado: (2025)

The AI Gap: How Socioeconomic Status Affects Language Technology Interactions
por: Bassignana, Elisa, et al.
Publicado: (2025)

Impoverished Language Technology: The Lack of (Social) Class in NLP
por: Curry, Amanda Cercas, et al.
Publicado: (2024)

Principled Personas: Defining and Measuring the Intended Effects of Persona Prompting on Task Performance
por: de Araujo, Pedro Henrique Luz, et al.
Publicado: (2025)

The Call for Socially Aware Language Technologies
por: Yang, Diyi, et al.
Publicado: (2024)

Twists, Humps, and Pebbles: Multilingual Speech Recognition Models Exhibit Gender Performance Gaps
por: Attanasio, Giuseppe, et al.
Publicado: (2024)

Biased Tales: Cultural and Topic Bias in Generating Children's Stories
por: Rooein, Donya, et al.
Publicado: (2025)

Classist Tools: Social Class Correlates with Performance in NLP
por: Curry, Amanda Cercas, et al.
Publicado: (2024)

Wisdom of Instruction-Tuned Language Model Crowds. Exploring Model Label Variation
por: Plaza-del-Arco, Flor Miriam, et al.
Publicado: (2023)

Do Large Language Models Adapt to Language Variation across Socioeconomic Status?
por: Bassignana, Elisa, et al.
Publicado: (2026)

LLMs for Argument Mining: Detection, Extraction, and Relationship Classification of pre-defined Arguments in Online Comments
por: Guida, Matteo, et al.
Publicado: (2025)

XSTest: A Test Suite for Identifying Exaggerated Safety Behaviours in Large Language Models
por: Röttger, Paul, et al.
Publicado: (2023)

Exploring Subjective Tasks in Farsi: A Survey Analysis and Evaluation of Language Models
por: Rooein, Donya, et al.
Publicado: (2025)

Diffusion Language Models Are Natively Length-Aware
por: Rossi, Vittorio, et al.
Publicado: (2026)

IssueBench: Millions of Realistic Prompts for Measuring Issue Bias in LLM Writing Assistance
por: Röttger, Paul, et al.
Publicado: (2025)

Consistency is Key: Disentangling Label Variation in Natural Language Processing with Intra-Annotator Agreement
por: Abercrombie, Gavin, et al.
Publicado: (2023)

EcoVerse: An Annotated Twitter Dataset for Eco-Relevance Classification, Environmental Impact Analysis, and Stance Detection
por: Grasso, Francesca, et al.
Publicado: (2024)

No for Some, Yes for Others: Persona Prompts and Other Sources of False Refusal in Language Models
por: Plaza-del-Arco, Flor Miriam, et al.
Publicado: (2025)

Comparing Pre-trained Human Language Models: Is it Better with Human Context as Groups, Individual Traits, or Both?
por: Soni, Nikita, et al.
Publicado: (2024)

Emotion Analysis in NLP: Trends, Gaps and Roadmap for Future Directions
por: Plaza-del-Arco, Flor Miriam, et al.
Publicado: (2024)

HateDay: Insights from a Global Hate Speech Dataset Representative of a Day on Twitter
por: Tonneau, Manuel, et al.
Publicado: (2024)

RTI-Bench: A Structured Dataset for Indian Right-to-Information Decision Analysis
por: Bose, Joy
Publicado: (2026)

Triggered: A Statistical Analysis of Environmental Influences on Extremist Groups
por: de Kock, Christine, et al.
Publicado: (2026)

Political Compass or Spinning Arrow? Towards More Meaningful Evaluations for Values and Opinions in Large Language Models
por: Röttger, Paul, et al.
Publicado: (2024)

"My Answer is C": First-Token Probabilities Do Not Match Text Answers in Instruction-Tuned Language Models
por: Wang, Xinpeng, et al.
Publicado: (2024)

ProvocationProbe: Instigating Hate Speech Dataset from Twitter
por: Kumar, Abhay, et al.
Publicado: (2024)

Divine LLaMAs: Bias, Stereotypes, Stigmatization, and Emotion Representation of Religion in Large Language Models
por: Plaza-del-Arco, Flor Miriam, et al.
Publicado: (2024)

Angry Men, Sad Women: Large Language Models Reflect Gendered Stereotypes in Emotion Attribution
por: Plaza-del-Arco, Flor Miriam, et al.
Publicado: (2024)