| Main Authors: | Yang, Yi, Duan, Hanyu, Abbasi, Ahmed, Lalor, John P., Tam, Kar Yan |
|---|---|
| Format: | Preprint |
| Published: | 2023 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2311.10395 |
Similar Items
Do LLMs Know about Hallucination? An Empirical Investigation of LLM's Hidden States
by: Duan, Hanyu, et al.
Published: (2024)
LLM-Measure: Generating Valid, Consistent, and Reproducible Text-Based Measures for Social Science Research
by: Yang, Yi, et al.
Published: (2024)
PersonaTwin: A Multi-Tier Prompt Conditioning Framework for Generating and Evaluating Personalized Digital Twins
by: Chen, Sihan, et al.
Published: (2025)
Bridging the LLM Accessibility Divide? Performance, Fairness, and Cost of Closed versus Open LLMs for Automated Essay Scoring
by: Oketch, Kezia, et al.
Published: (2025)
Benchmarking Sociolinguistic Diversity in Swahili NLP: A Taxonomy-Guided Approach
by: Oketch, Kezia, et al.
Published: (2025)
Robust Predictive Modeling Under Unseen Data Distribution Shifts: A Methodological Commentary
by: Duan, Hanyu, et al.
Published: (2025)
Ready2Unlearn: A Learning-Time Approach for Preparing Models with Future Unlearning Readiness
by: Duan, Hanyu, et al.
Published: (2025)
Layer-wise Representation Dynamics: An Empirical Investigation Across Embedders and Base LLMs
by: Jiang, Jingzhou, et al.
Published: (2026)
BiasDPO: Mitigating Bias in Language Models through Direct Preference Optimization
by: Allam, Ahmed
Published: (2024)
Debiasing CLIP: Interpreting and Correcting Bias in Attention Heads
by: Yeo, Wei Jie, et al.
Published: (2025)
Beyond Surface Similarity: Detecting Subtle Semantic Shifts in Financial Narratives
by: Liu, Jiaxin, et al.
Published: (2024)
Bias in, Bias out: Annotation Bias in Multilingual Large Language Models
by: Cui, Xia, et al.
Published: (2025)
Surface Fairness, Deep Bias: A Comparative Study of Bias in Language Models
by: Sorokovikova, Aleksandra, et al.
Published: (2025)
Investigating Spatial Attention Bias in Vision-Language Models
by: Chaudhary, Aryan, et al.
Published: (2025)
JBBQ: Japanese Bias Benchmark for Analyzing Social Biases in Large Language Models
by: Yanaka, Hitomi, et al.
Published: (2024)
Modality Bias in LVLMs: Analyzing and Mitigating Object Hallucination via Attention Lens
by: Zheng, Haohan, et al.
Published: (2025)
Measuring Spiritual Values and Bias of Large Language Models
by: Liu, Songyuan, et al.
Published: (2024)
Analyzing Language Bias Between French and English in Conventional Multilingual Sentiment Analysis Models
by: Wong, Ethan Parker, et al.
Published: (2024)
Analyzing Bias in False Refusal Behavior of Large Language Models for Hate Speech Detoxification
by: Im, Kyuri, et al.
Published: (2026)
Analyzing Bias in Swiss Federal Supreme Court Judgments Using Facebook's Holistic Bias Dataset: Implications for Language Model Training
by: Wehnert, Sabine, et al.
Published: (2025)
FLARE: Task-agnostic embedding model evaluation through a normalization process
by: Jiang, Jingzhou, et al.
Published: (2026)
Unlocking Bias Detection: Leveraging Transformer-Based Models for Content Analysis
by: Raza, Shaina, et al.
Published: (2023)
Mitigating Biases in Language Models via Bias Unlearning
by: Liu, Dianqing, et al.
Published: (2025)
Evaluating and Aligning Human Economic Risk Preferences in LLMs
by: Liu, Jiaxin, et al.
Published: (2025)
Regional Bias in Large Language Models
by: Gopinadh, M P V S, et al.
Published: (2026)
Analyzing Multi-Head Attention on Trojan BERT Models
by: Wang, Jingwei
Published: (2024)
KLAAD: Refining Attention Mechanisms to Reduce Societal Bias in Generative Language Models
by: Kim, Seorin, et al.
Published: (2025)
Energy-Gated Attention: Spectral Salience as an Inductive Bias for Transformer Attention
by: Zeris, Athanasios
Published: (2026)
BiasJailbreak: Analyzing Ethical Biases and Jailbreak Vulnerabilities in Large Language Models
by: Lee, Isack, et al.
Published: (2024)
Attention Speaks Volumes: Localizing and Mitigating Bias in Language Models
by: Adiga, Rishabh, et al.
Published: (2024)
To Bias or Not to Bias: Detecting bias in News with bias-detector
by: Ghosh, Himel, et al.
Published: (2025)
RuBia: A Russian Language Bias Detection Dataset
by: Grigoreva, Veronika, et al.
Published: (2024)
ParlAI Vote: A Web Platform for Analyzing Gender and Political Bias in Large Language Models
by: Lin, Wenjie, et al.
Published: (2025)
BiasGuard: A Reasoning-enhanced Bias Detection Tool For Large Language Models
by: Fan, Zhiting, et al.
Published: (2025)
Mitigating the Bias of Large Language Model Evaluation
by: Zhou, Hongli, et al.
Published: (2024)
Open-DeBias: Toward Mitigating Open-Set Bias in Language Models
by: Rani, Arti, et al.
Published: (2025)
Large Language Model-based Role-Playing for Personalized Medical Jargon Extraction
by: Lim, Jung Hoon, et al.
Published: (2024)
AesBiasBench: Evaluating Bias and Alignment in Multimodal Language Models for Personalized Image Aesthetic Assessment
by: Li, Kun, et al.
Published: (2025)
Analyzing and Mitigating Object Hallucination: A Training Bias Perspective
by: Li, Yifan, et al.
Published: (2025)
PersonaFuse: A Personality Activation-Driven Framework for Enhancing Human-LLM Interactions
by: Tang, Yixuan, et al.
Published: (2025)