:: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Dawkins, Hillary, Fraser, Kathleen C., Kiritchenko, Svetlana
Format:	Preprint
Published:	2025
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2506.09975
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Detecting AI-Generated Text: Factors Influencing Detectability with Current Methods
by: Fraser, Kathleen C., et al.
Published: (2024)

Fine-Tuning Lowers Safety and Disrupts Evaluation Consistency
by: Fraser, Kathleen C., et al.
Published: (2025)

Adaptable Moral Stances of Large Language Models on Sexist Content: Implications for Society and Gender Discourse
by: Guo, Rongchen, et al.
Published: (2024)

Examining Gender and Racial Bias in Large Vision-Language Models Using a Novel Dataset of Parallel Images
by: Fraser, Kathleen C., et al.
Published: (2024)

Challenging Negative Gender Stereotypes: A Study on the Effectiveness of Automated Counter-Stereotypes
by: Nejadgholi, Isar, et al.
Published: (2024)

The crime of being poor
by: Curto, Georgina, et al.
Published: (2023)

Tackling Social Bias against the Poor: A Dataset and Taxonomy on Aporophobia
by: Curto, Georgina, et al.
Published: (2025)

Uncovering Bias in Large Vision-Language Models at Scale with Counterfactuals
by: Howard, Phillip, et al.
Published: (2024)

Uncovering Bias in Large Vision-Language Models with Counterfactuals
by: Howard, Phillip, et al.
Published: (2024)

From Perceived Effectiveness to Measured Impact: Identity-Aware Evaluation of Automated Counter-Stereotypes
by: Kiritchenko, Svetlana, et al.
Published: (2025)

Gender-Neutral Machine Translation Strategies in Practice
by: Dawkins, Hillary, et al.
Published: (2025)

WMT24 Test Suite: Gender Resolution in Speaker-Listener Dialogue Roles
by: Dawkins, Hillary, et al.
Published: (2024)

Projective Methods for Mitigating Gender Bias in Pre-trained Language Models
by: Dawkins, Hillary, et al.
Published: (2024)

When Fine-Tuning Fails: Lessons from MS MARCO Passage Ranking
by: Pande, Manu, et al.
Published: (2025)

Stance Detection on Social Media with Fine-Tuned Large Language Models
by: Gül, İlker, et al.
Published: (2024)

Fine-Tuning or Fine-Failing? Debunking Performance Myths in Large Language Models
by: Barnett, Scott, et al.
Published: (2024)

Unveiling the Generalization Power of Fine-Tuned Large Language Models
by: Yang, Haoran, et al.
Published: (2024)

MultiSocial: Multilingual Benchmark of Machine-Generated Text Detection of Social-Media Texts
by: Macko, Dominik, et al.
Published: (2024)

Is Human-Like Text Liked by Humans? Multilingual Human Detection and Preference Against AI
by: Wang, Yuxia, et al.
Published: (2025)

Hidden Human-Like Nature of Machine-Generated Texts: Theory and Detection Enhancement
by: Wu, Chenwang, et al.
Published: (2026)

Fine-Tuning LLMs with Fine-Grained Human Feedback on Text Spans
by: CH-Wang, Sky, et al.
Published: (2025)

CrisisSense-LLM: Instruction Fine-Tuned Large Language Model for Multi-label Social Media Text Classification in Disaster Informatics
by: Yin, Kai, et al.
Published: (2024)

Incivility and Rigidity: Evaluating the Risks of Fine-Tuning LLMs for Political Argumentation
by: Churina, Svetlana, et al.
Published: (2024)

On the Effectiveness of LLM-Specific Fine-Tuning for Detecting AI-Generated Text
by: Gromadzki, Michał, et al.
Published: (2026)

Intention-Adaptive LLM Fine-Tuning for Text Revision Generation
by: Liu, Zhexiong, et al.
Published: (2026)

Why Supervised Fine-Tuning Fails to Learn: A Systematic Study of Incomplete Learning in Large Language Models
by: Xue, Chao, et al.
Published: (2026)

Online Social Support Detection in Spanish Social Media Texts
by: Tash, Moein Shahiki, et al.
Published: (2025)

Joint Localization and Activation Editing for Low-Resource Fine-Tuning
by: Lai, Wen, et al.
Published: (2025)

Modeling Pathology-Like Behavioral Patterns in Language Models Through Behavioral Fine-Tuning
by: Milano, Nicola, et al.
Published: (2026)

Cross-Cultural Value Awareness in Large Vision-Language Models
by: Howard, Phillip, et al.
Published: (2026)

Social Support Detection from Social Media Texts
by: Ahani, Zahra, et al.
Published: (2024)

The Effect of Language Diversity When Fine-Tuning Large Language Models for Translation
by: Stap, David, et al.
Published: (2025)

Fine-Tuning Pre-Trained Code Models for AI-Generated Code Detection
by: Ispas, Jany-Gabriel, et al.
Published: (2026)

Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation
by: Yuan, Huizhuo, et al.
Published: (2024)

Passing the Turing Test in Political Discourse: Fine-Tuning LLMs to Mimic Polarized Social Media Comments
by: Pazzaglia, ., et al.
Published: (2025)

EVALALIGN: Supervised Fine-Tuning Multimodal LLMs with Human-Aligned Data for Evaluating Text-to-Image Models
by: Tan, Zhiyu, et al.
Published: (2024)

Psychologically Potent, Computationally Invisible: LLMs Generate Social-Comparison-Eliciting Posts They Fail to Detect
by: Zhao, Hua, et al.
Published: (2026)

An Attention-Based Denoising Framework for Personality Detection in Social Media Texts
by: Lin, Lei, et al.
Published: (2023)

When Safety Fails Before the Answer: Benchmarking Harmful Behavior Detection in Reasoning Chains
by: Kakkar, Ishita, et al.
Published: (2026)

SiamGPT: Quality-First Fine-Tuning for Stable Thai Text Generation
by: Pairatsuppawat, Thittipat, et al.
Published: (2025)