| Main Authors: | Dawkins, Hillary, Fraser, Kathleen C., Kiritchenko, Svetlana |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2506.09975 |
Similar Items
Detecting AI-Generated Text: Factors Influencing Detectability with Current Methods
by: Fraser, Kathleen C., et al.
Published: (2024)
Fine-Tuning Lowers Safety and Disrupts Evaluation Consistency
by: Fraser, Kathleen C., et al.
Published: (2025)
Adaptable Moral Stances of Large Language Models on Sexist Content: Implications for Society and Gender Discourse
by: Guo, Rongchen, et al.
Published: (2024)
Examining Gender and Racial Bias in Large Vision-Language Models Using a Novel Dataset of Parallel Images
by: Fraser, Kathleen C., et al.
Published: (2024)
Challenging Negative Gender Stereotypes: A Study on the Effectiveness of Automated Counter-Stereotypes
by: Nejadgholi, Isar, et al.
Published: (2024)
The crime of being poor
by: Curto, Georgina, et al.
Published: (2023)
Tackling Social Bias against the Poor: A Dataset and Taxonomy on Aporophobia
by: Curto, Georgina, et al.
Published: (2025)
Uncovering Bias in Large Vision-Language Models at Scale with Counterfactuals
by: Howard, Phillip, et al.
Published: (2024)
Uncovering Bias in Large Vision-Language Models with Counterfactuals
by: Howard, Phillip, et al.
Published: (2024)
From Perceived Effectiveness to Measured Impact: Identity-Aware Evaluation of Automated Counter-Stereotypes
by: Kiritchenko, Svetlana, et al.
Published: (2025)
Gender-Neutral Machine Translation Strategies in Practice
by: Dawkins, Hillary, et al.
Published: (2025)
WMT24 Test Suite: Gender Resolution in Speaker-Listener Dialogue Roles
by: Dawkins, Hillary, et al.
Published: (2024)
Projective Methods for Mitigating Gender Bias in Pre-trained Language Models
by: Dawkins, Hillary, et al.
Published: (2024)
When Fine-Tuning Fails: Lessons from MS MARCO Passage Ranking
by: Pande, Manu, et al.
Published: (2025)
Stance Detection on Social Media with Fine-Tuned Large Language Models
by: Gül, İlker, et al.
Published: (2024)
Fine-Tuning or Fine-Failing? Debunking Performance Myths in Large Language Models
by: Barnett, Scott, et al.
Published: (2024)
Unveiling the Generalization Power of Fine-Tuned Large Language Models
by: Yang, Haoran, et al.
Published: (2024)
MultiSocial: Multilingual Benchmark of Machine-Generated Text Detection of Social-Media Texts
by: Macko, Dominik, et al.
Published: (2024)
Is Human-Like Text Liked by Humans? Multilingual Human Detection and Preference Against AI
by: Wang, Yuxia, et al.
Published: (2025)
Hidden Human-Like Nature of Machine-Generated Texts: Theory and Detection Enhancement
by: Wu, Chenwang, et al.
Published: (2026)
Fine-Tuning LLMs with Fine-Grained Human Feedback on Text Spans
by: CH-Wang, Sky, et al.
Published: (2025)
CrisisSense-LLM: Instruction Fine-Tuned Large Language Model for Multi-label Social Media Text Classification in Disaster Informatics
by: Yin, Kai, et al.
Published: (2024)
Incivility and Rigidity: Evaluating the Risks of Fine-Tuning LLMs for Political Argumentation
by: Churina, Svetlana, et al.
Published: (2024)
On the Effectiveness of LLM-Specific Fine-Tuning for Detecting AI-Generated Text
by: Gromadzki, Michał, et al.
Published: (2026)
Intention-Adaptive LLM Fine-Tuning for Text Revision Generation
by: Liu, Zhexiong, et al.
Published: (2026)
Why Supervised Fine-Tuning Fails to Learn: A Systematic Study of Incomplete Learning in Large Language Models
by: Xue, Chao, et al.
Published: (2026)
Online Social Support Detection in Spanish Social Media Texts
by: Tash, Moein Shahiki, et al.
Published: (2025)
Joint Localization and Activation Editing for Low-Resource Fine-Tuning
by: Lai, Wen, et al.
Published: (2025)
Modeling Pathology-Like Behavioral Patterns in Language Models Through Behavioral Fine-Tuning
by: Milano, Nicola, et al.
Published: (2026)
Cross-Cultural Value Awareness in Large Vision-Language Models
by: Howard, Phillip, et al.
Published: (2026)
Social Support Detection from Social Media Texts
by: Ahani, Zahra, et al.
Published: (2024)
The Effect of Language Diversity When Fine-Tuning Large Language Models for Translation
by: Stap, David, et al.
Published: (2025)
Fine-Tuning Pre-Trained Code Models for AI-Generated Code Detection
by: Ispas, Jany-Gabriel, et al.
Published: (2026)
Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation
by: Yuan, Huizhuo, et al.
Published: (2024)
Passing the Turing Test in Political Discourse: Fine-Tuning LLMs to Mimic Polarized Social Media Comments
by: Pazzaglia, ., et al.
Published: (2025)
EVALALIGN: Supervised Fine-Tuning Multimodal LLMs with Human-Aligned Data for Evaluating Text-to-Image Models
by: Tan, Zhiyu, et al.
Published: (2024)
Psychologically Potent, Computationally Invisible: LLMs Generate Social-Comparison-Eliciting Posts They Fail to Detect
by: Zhao, Hua, et al.
Published: (2026)
An Attention-Based Denoising Framework for Personality Detection in Social Media Texts
by: Lin, Lei, et al.
Published: (2023)
When Safety Fails Before the Answer: Benchmarking Harmful Behavior Detection in Reasoning Chains
by: Kakkar, Ishita, et al.
Published: (2026)
SiamGPT: Quality-First Fine-Tuning for Stable Thai Text Generation
by: Pairatsuppawat, Thittipat, et al.
Published: (2025)