Guardado en:
| Autores principales: | Lupo, Lorenzo, Bose, Paul, Habibi, Mahyar, Hovy, Dirk, Schwarz, Carlo |
|---|---|
| Formato: | Preprint |
| Publicado: |
2024
|
| Materias: | |
| Acceso en línea: | https://arxiv.org/abs/2403.05700 |
| Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
Ejemplares similares
The Content Moderator's Dilemma: Removal of Toxic Content and Distortions to Online Discourse
por: Habibi, Mahyar, et al.
Publicado: (2024)
por: Habibi, Mahyar, et al.
Publicado: (2024)
Compromesso! Italian Many-Shot Jailbreaks Undermine the Safety of Large Language Models
por: Pernisi, Fabio, et al.
Publicado: (2024)
por: Pernisi, Fabio, et al.
Publicado: (2024)
Towards Human-Level Text Coding with LLMs: The Case of Fatherhood Roles in Public Policy Documents
por: Lupo, Lorenzo, et al.
Publicado: (2023)
por: Lupo, Lorenzo, et al.
Publicado: (2023)
SimBench: Benchmarking the Ability of Large Language Models to Simulate Human Behaviors
por: Hu, Tiancheng, et al.
Publicado: (2025)
por: Hu, Tiancheng, et al.
Publicado: (2025)
Beyond Demographics: Fine-tuning Large Language Models to Predict Individuals' Subjective Text Perceptions
por: Orlikowski, Matthias, et al.
Publicado: (2025)
por: Orlikowski, Matthias, et al.
Publicado: (2025)
Beyond Flesch-Kincaid: Prompt-based Metrics Improve Difficulty Classification of Educational Texts
por: Rooein, Donya, et al.
Publicado: (2024)
por: Rooein, Donya, et al.
Publicado: (2024)
SafetyPrompts: a Systematic Review of Open Datasets for Evaluating and Improving Large Language Model Safety
por: Röttger, Paul, et al.
Publicado: (2024)
por: Röttger, Paul, et al.
Publicado: (2024)
Conversations as a Source for Teaching Scientific Concepts at Different Education Levels
por: Rooein, Donya, et al.
Publicado: (2024)
por: Rooein, Donya, et al.
Publicado: (2024)
Narratives at Conflict: Computational Analysis of News Framing in Multilingual Disinformation Campaigns
por: Sinelnik, Antonina, et al.
Publicado: (2024)
por: Sinelnik, Antonina, et al.
Publicado: (2024)
ITALIC: An Italian Intent Classification Dataset
por: Koudounas, Alkis, et al.
Publicado: (2023)
por: Koudounas, Alkis, et al.
Publicado: (2023)
The Ecological Fallacy in Annotation: Modelling Human Label Variation goes beyond Sociodemographics
por: Orlikowski, Matthias, et al.
Publicado: (2023)
por: Orlikowski, Matthias, et al.
Publicado: (2023)
Do Prompts Reshape Representations? An Empirical Study of Prompting Effects on Embeddings
por: Gonzalez-Gutierrez, Cesar, et al.
Publicado: (2025)
por: Gonzalez-Gutierrez, Cesar, et al.
Publicado: (2025)
The Pluralistic Moral Gap: Understanding Judgment and Value Differences between Humans and Large Language Models
por: Russo, Giuseppe, et al.
Publicado: (2025)
por: Russo, Giuseppe, et al.
Publicado: (2025)
The AI Gap: How Socioeconomic Status Affects Language Technology Interactions
por: Bassignana, Elisa, et al.
Publicado: (2025)
por: Bassignana, Elisa, et al.
Publicado: (2025)
Impoverished Language Technology: The Lack of (Social) Class in NLP
por: Curry, Amanda Cercas, et al.
Publicado: (2024)
por: Curry, Amanda Cercas, et al.
Publicado: (2024)
Principled Personas: Defining and Measuring the Intended Effects of Persona Prompting on Task Performance
por: de Araujo, Pedro Henrique Luz, et al.
Publicado: (2025)
por: de Araujo, Pedro Henrique Luz, et al.
Publicado: (2025)
The Call for Socially Aware Language Technologies
por: Yang, Diyi, et al.
Publicado: (2024)
por: Yang, Diyi, et al.
Publicado: (2024)
Twists, Humps, and Pebbles: Multilingual Speech Recognition Models Exhibit Gender Performance Gaps
por: Attanasio, Giuseppe, et al.
Publicado: (2024)
por: Attanasio, Giuseppe, et al.
Publicado: (2024)
Biased Tales: Cultural and Topic Bias in Generating Children's Stories
por: Rooein, Donya, et al.
Publicado: (2025)
por: Rooein, Donya, et al.
Publicado: (2025)
Classist Tools: Social Class Correlates with Performance in NLP
por: Curry, Amanda Cercas, et al.
Publicado: (2024)
por: Curry, Amanda Cercas, et al.
Publicado: (2024)
Wisdom of Instruction-Tuned Language Model Crowds. Exploring Model Label Variation
por: Plaza-del-Arco, Flor Miriam, et al.
Publicado: (2023)
por: Plaza-del-Arco, Flor Miriam, et al.
Publicado: (2023)
Do Large Language Models Adapt to Language Variation across Socioeconomic Status?
por: Bassignana, Elisa, et al.
Publicado: (2026)
por: Bassignana, Elisa, et al.
Publicado: (2026)
LLMs for Argument Mining: Detection, Extraction, and Relationship Classification of pre-defined Arguments in Online Comments
por: Guida, Matteo, et al.
Publicado: (2025)
por: Guida, Matteo, et al.
Publicado: (2025)
XSTest: A Test Suite for Identifying Exaggerated Safety Behaviours in Large Language Models
por: Röttger, Paul, et al.
Publicado: (2023)
por: Röttger, Paul, et al.
Publicado: (2023)
Exploring Subjective Tasks in Farsi: A Survey Analysis and Evaluation of Language Models
por: Rooein, Donya, et al.
Publicado: (2025)
por: Rooein, Donya, et al.
Publicado: (2025)
Diffusion Language Models Are Natively Length-Aware
por: Rossi, Vittorio, et al.
Publicado: (2026)
por: Rossi, Vittorio, et al.
Publicado: (2026)
IssueBench: Millions of Realistic Prompts for Measuring Issue Bias in LLM Writing Assistance
por: Röttger, Paul, et al.
Publicado: (2025)
por: Röttger, Paul, et al.
Publicado: (2025)
Consistency is Key: Disentangling Label Variation in Natural Language Processing with Intra-Annotator Agreement
por: Abercrombie, Gavin, et al.
Publicado: (2023)
por: Abercrombie, Gavin, et al.
Publicado: (2023)
EcoVerse: An Annotated Twitter Dataset for Eco-Relevance Classification, Environmental Impact Analysis, and Stance Detection
por: Grasso, Francesca, et al.
Publicado: (2024)
por: Grasso, Francesca, et al.
Publicado: (2024)
No for Some, Yes for Others: Persona Prompts and Other Sources of False Refusal in Language Models
por: Plaza-del-Arco, Flor Miriam, et al.
Publicado: (2025)
por: Plaza-del-Arco, Flor Miriam, et al.
Publicado: (2025)
Comparing Pre-trained Human Language Models: Is it Better with Human Context as Groups, Individual Traits, or Both?
por: Soni, Nikita, et al.
Publicado: (2024)
por: Soni, Nikita, et al.
Publicado: (2024)
Emotion Analysis in NLP: Trends, Gaps and Roadmap for Future Directions
por: Plaza-del-Arco, Flor Miriam, et al.
Publicado: (2024)
por: Plaza-del-Arco, Flor Miriam, et al.
Publicado: (2024)
HateDay: Insights from a Global Hate Speech Dataset Representative of a Day on Twitter
por: Tonneau, Manuel, et al.
Publicado: (2024)
por: Tonneau, Manuel, et al.
Publicado: (2024)
RTI-Bench: A Structured Dataset for Indian Right-to-Information Decision Analysis
por: Bose, Joy
Publicado: (2026)
por: Bose, Joy
Publicado: (2026)
Triggered: A Statistical Analysis of Environmental Influences on Extremist Groups
por: de Kock, Christine, et al.
Publicado: (2026)
por: de Kock, Christine, et al.
Publicado: (2026)
Political Compass or Spinning Arrow? Towards More Meaningful Evaluations for Values and Opinions in Large Language Models
por: Röttger, Paul, et al.
Publicado: (2024)
por: Röttger, Paul, et al.
Publicado: (2024)
"My Answer is C": First-Token Probabilities Do Not Match Text Answers in Instruction-Tuned Language Models
por: Wang, Xinpeng, et al.
Publicado: (2024)
por: Wang, Xinpeng, et al.
Publicado: (2024)
ProvocationProbe: Instigating Hate Speech Dataset from Twitter
por: Kumar, Abhay, et al.
Publicado: (2024)
por: Kumar, Abhay, et al.
Publicado: (2024)
Divine LLaMAs: Bias, Stereotypes, Stigmatization, and Emotion Representation of Religion in Large Language Models
por: Plaza-del-Arco, Flor Miriam, et al.
Publicado: (2024)
por: Plaza-del-Arco, Flor Miriam, et al.
Publicado: (2024)
Angry Men, Sad Women: Large Language Models Reflect Gendered Stereotypes in Emotion Attribution
por: Plaza-del-Arco, Flor Miriam, et al.
Publicado: (2024)
por: Plaza-del-Arco, Flor Miriam, et al.
Publicado: (2024)
Ejemplares similares
-
The Content Moderator's Dilemma: Removal of Toxic Content and Distortions to Online Discourse
por: Habibi, Mahyar, et al.
Publicado: (2024) -
Compromesso! Italian Many-Shot Jailbreaks Undermine the Safety of Large Language Models
por: Pernisi, Fabio, et al.
Publicado: (2024) -
Towards Human-Level Text Coding with LLMs: The Case of Fatherhood Roles in Public Policy Documents
por: Lupo, Lorenzo, et al.
Publicado: (2023) -
SimBench: Benchmarking the Ability of Large Language Models to Simulate Human Behaviors
por: Hu, Tiancheng, et al.
Publicado: (2025) -
Beyond Demographics: Fine-tuning Large Language Models to Predict Individuals' Subjective Text Perceptions
por: Orlikowski, Matthias, et al.
Publicado: (2025)