:: Library Catalog

Bibliographic Details
Main authors:	Cheng, Xiaoqing; Chen, Ruizhe; Zan, Hongying; Jia, Yuxiang; Peng, Min
Format:	Preprint
Published:	2025
Subjects:	Computation and Language
Online access:	https://arxiv.org/abs/2505.23829
Similar Items

Detection, Classification, and Mitigation of Gender Bias in Large Language Models
by: Cheng, Xiaoqing, et al.
Published: (2025)

MRC-based Nested Medical NER with Co-prediction and Adaptive Pre-training
by: Du, Xiaojing, et al.
Published: (2024)

BiasGuard: A Reasoning-enhanced Bias Detection Tool For Large Language Models
by: Fan, Zhiting, et al.
Published: (2025)

FairSteer: Inference Time Debiasing for LLMs with Dynamic Activation Steering
by: Li, Yichen, et al.
Published: (2025)

A Corpus for Named Entity Recognition in Chinese Novels with Multi-genres
by: Zhao, Hanjie, et al.
Published: (2023)

ZZU-NLP at SIGHAN-2024 dimABSA Task: Aspect-Based Sentiment Analysis with Coarse-to-Fine In-context Learning
by: Zhu, Senbin, et al.
Published: (2024)

Race, Ethnicity and Their Implication on Bias in Large Language Models
by: Hu, Shiyue, et al.
Published: (2026)

Large Language Model Bias Mitigation from the Perspective of Knowledge Editing
by: Chen, Ruizhe, et al.
Published: (2024)

SILC-EFSA: Self-aware In-context Learning Correction for Entity-level Financial Sentiment Analysis
by: Zhu, Senbin, et al.
Published: (2024)

BiasFreeBench: a Benchmark for Mitigating Bias in Large Language Model Responses
by: Xu, Xin, et al.
Published: (2025)

Bi-directional Bias Attribution: Debiasing Large Language Models without Modifying Prompts
by: Lin, Yujie, et al.
Published: (2026)

DiffPO: Diffusion-styled Preference Optimization for Efficient Inference-Time Alignment of Large Language Models
by: Chen, Ruizhe, et al.
Published: (2025)

On Bias and Fairness in NLP: Investigating the Impact of Bias and Debiasing in Language Models on the Fairness of Toxicity Detection
by: Elsafoury, Fatma, et al.
Published: (2023)

JOLT-SQL: Joint Loss Tuning of Text-to-SQL with Confusion-aware Noisy Schema Sampling
by: Song, Jinwang, et al.
Published: (2025)

Inference-Time Decontamination: Reusing Leaked Benchmarks for Large Language Model Evaluation
by: Zhu, Qin, et al.
Published: (2024)

Teacher-Student Training for Debiasing: General Permutation Debiasing for Large Language Models
by: Liusie, Adian, et al.
Published: (2024)

Inference-Time Reasoning Selectively Reduces Implicit Social Bias in Large Language Models
by: Apsel, Molly, et al.
Published: (2026)

LIDAO: Towards Limited Interventions for Debiasing (Large) Language Models
by: Liu, Tianci, et al.
Published: (2024)

Any Large Language Model Can Be a Reliable Judge: Debiasing with a Reasoning-based Bias Detector
by: Yang, Haoyan, et al.
Published: (2025)

BiasEdit: Debiasing Stereotyped Language Models via Model Editing
by: Xu, Xin, et al.
Published: (2025)

Gender Bias in Large Language Models across Multiple Languages
by: Zhao, Jinman, et al.
Published: (2024)

Identifying and Mitigating Social Bias Knowledge in Language Models
by: Chen, Ruizhe, et al.
Published: (2024)

BiasAlert: A Plug-and-play Tool for Social Bias Detection in LLMs
by: Fan, Zhiting, et al.
Published: (2024)

Filter-then-Generate: Large Language Models with Structure-Text Adapter for Knowledge Graph Completion
by: Liu, Ben, et al.
Published: (2024)

Rethinking Prompt-based Debiasing in Large Language Models
by: Yang, Xinyi, et al.
Published: (2025)

Religious Bias Landscape in Language and Text-to-Image Models: Analysis, Detection, and Debiasing Strategies
by: Abrar, Ajwad, et al.
Published: (2025)

GenderAlign: An Alignment Dataset for Mitigating Gender Bias in Large Language Models
by: Zhang, Tao, et al.
Published: (2024)

Towards Multimodal Sentiment Analysis Debiasing via Bias Purification
by: Yang, Dingkang, et al.
Published: (2024)

Debiasing Reward Models via Causally Motivated Inference-Time Intervention
by: Shinoda, Kazutoshi, et al.
Published: (2026)

Inference-Time Selective Debiasing to Enhance Fairness in Text Classification Models
by: Kuzmin, Gleb, et al.
Published: (2024)

Self-Adaptive Cognitive Debiasing for Large Language Models in Decision-Making
by: Lyu, Yougang, et al.
Published: (2025)

Training Matryoshka Mixture-of-Experts for Elastic Inference-Time Expert Utilization
by: Wang, Yaoxiang, et al.
Published: (2025)

Text Prompt Injection of Vision Language Models
by: Zhu, Ruizhe
Published: (2025)

From Measurement to Mitigation: Exploring the Transferability of Debiasing Approaches to Gender Bias in Maltese Language Models
by: Galea, Melanie, et al.
Published: (2025)

Invisible Filters: Cultural Bias in Hiring Evaluations Using Large Language Models
by: Rao, Pooja S. B., et al.
Published: (2025)

Self-Debias: Self-correcting for Debiasing Large Language Models
by: Feng, Xuan, et al.
Published: (2026)

Can the capability of Large Language Models be described by human ability? A Meta Study
by: Zan, Mingrui, et al.
Published: (2025)

Improving Natural Language Capability of Code Large Language Model
by: Li, Wei, et al.
Published: (2024)

Optimizing Large Language Model Training Using FP4 Quantization
by: Wang, Ruizhe, et al.
Published: (2025)

Self-Supervised Position Debiasing for Large Language Models
by: Liu, Zhongkun, et al.
Published: (2024)