Saved in:
| Main Authors: | Kaneko, Masahiro, Bollegala, Danushka, Baldwin, Timothy |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2401.08511 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Evaluating Gender Bias in Large Language Models via Chain-of-Thought Prompting
by: Kaneko, Masahiro, et al.
Published: (2024)
by: Kaneko, Masahiro, et al.
Published: (2024)
Eagle: Ethical Dataset Given from Real Interactions
by: Kaneko, Masahiro, et al.
Published: (2024)
by: Kaneko, Masahiro, et al.
Published: (2024)
In-Contextual Gender Bias Suppression for Large Language Models
by: Oba, Daisuke, et al.
Published: (2023)
by: Oba, Daisuke, et al.
Published: (2023)
Stopping Computation for Converged Tokens in Masked Diffusion-LM Decoding
by: Oba, Daisuke, et al.
Published: (2026)
by: Oba, Daisuke, et al.
Published: (2026)
Bias Mitigation or Cultural Commonsense? Evaluating LLMs with a Japanese Dataset
by: Yamamoto, Taisei, et al.
Published: (2025)
by: Yamamoto, Taisei, et al.
Published: (2025)
Evaluating the Evaluation of Diversity in Commonsense Generation
by: Zhang, Tianhui, et al.
Published: (2025)
by: Zhang, Tianhui, et al.
Published: (2025)
A Semantic Distance Metric Learning approach for Lexical Semantic Change Detection
by: Aida, Taichi, et al.
Published: (2024)
by: Aida, Taichi, et al.
Published: (2024)
Investigating the Contextualised Word Embedding Dimensions Specified for Contextual and Temporal Semantic Changes
by: Aida, Taichi, et al.
Published: (2024)
by: Aida, Taichi, et al.
Published: (2024)
SCDTour: Embedding Axis Ordering and Merging for Interpretable Semantic Change Detection
by: Aida, Taichi, et al.
Published: (2025)
by: Aida, Taichi, et al.
Published: (2025)
Map of Encoders -- Mapping Sentence Encoders using Quantum Relative Entropy
by: Zhang, Gaifan, et al.
Published: (2026)
by: Zhang, Gaifan, et al.
Published: (2026)
Evaluating the Effect of Retrieval Augmentation on Social Biases
by: Zhang, Tianhui, et al.
Published: (2025)
by: Zhang, Tianhui, et al.
Published: (2025)
Evaluating Unsupervised Dimensionality Reduction Methods for Pretrained Sentence Embeddings
by: Zhang, Gaifan, et al.
Published: (2024)
by: Zhang, Gaifan, et al.
Published: (2024)
Evaluating Short-Term Temporal Fluctuations of Social Biases in Social Media Data and Masked Language Models
by: Zhou, Yi, et al.
Published: (2024)
by: Zhou, Yi, et al.
Published: (2024)
A Little Leak Will Sink a Great Ship: Survey of Transparency for Large Language Models from Start to Finish
by: Kaneko, Masahiro, et al.
Published: (2024)
by: Kaneko, Masahiro, et al.
Published: (2024)
Improving Diversity of Commonsense Generation by Large Language Models via In-Context Learning
by: Zhang, Tianhui, et al.
Published: (2024)
by: Zhang, Tianhui, et al.
Published: (2024)
Synthetic Data Generation for Training Diversified Commonsense Reasoning Models
by: Zhang, Tianhui, et al.
Published: (2026)
by: Zhang, Tianhui, et al.
Published: (2026)
Annotating Training Data for Conditional Semantic Textual Similarity Measurement using Large Language Models
by: Zhang, Gaifan, et al.
Published: (2025)
by: Zhang, Gaifan, et al.
Published: (2025)
CASE -- Condition-Aware Sentence Embeddings for Conditional Semantic Textual Similarity Measurement
by: Zhang, Gaifan, et al.
Published: (2025)
by: Zhang, Gaifan, et al.
Published: (2025)
Evaluating Gender Bias of Pre-trained Language Models in Natural Language Inference by Considering All Labels
by: Anantaprayoon, Panatchakorn, et al.
Published: (2023)
by: Anantaprayoon, Panatchakorn, et al.
Published: (2023)
Bits Leaked per Query: Information-Theoretic Bounds on Adversarial Attacks against LLMs
by: Kaneko, Masahiro, et al.
Published: (2025)
by: Kaneko, Masahiro, et al.
Published: (2025)
Improving Pre-trained Language Model Sensitivity via Mask Specific losses: A case study on Biomedical NER
by: Abaho, Micheal, et al.
Published: (2024)
by: Abaho, Micheal, et al.
Published: (2024)
Online Learning Defense against Iterative Jailbreak Attacks via Prompt Optimization
by: Kaneko, Masahiro, et al.
Published: (2025)
by: Kaneko, Masahiro, et al.
Published: (2025)
Beyond the Resumé: A Rubric-Aware Automatic Interview System for Information Elicitation
by: Stuart, Harry, et al.
Published: (2026)
by: Stuart, Harry, et al.
Published: (2026)
Improving Unsupervised Constituency Parsing via Maximizing Semantic Information
by: Chen, Junjie, et al.
Published: (2024)
by: Chen, Junjie, et al.
Published: (2024)
Unsupervised Parsing by Searching for Frequent Word Sequences among Sentences with Equivalent Predicate-Argument Structures
by: Chen, Junjie, et al.
Published: (2024)
by: Chen, Junjie, et al.
Published: (2024)
Neuron-Level Analysis of Cultural Understanding in Large Language Models
by: Yamamoto, Taisei, et al.
Published: (2025)
by: Yamamoto, Taisei, et al.
Published: (2025)
JailNewsBench: Multi-Lingual and Regional Benchmark for Fake News Generation under Jailbreak Attacks
by: Kaneko, Masahiro, et al.
Published: (2026)
by: Kaneko, Masahiro, et al.
Published: (2026)
Balanced Multi-Factor In-Context Learning for Multilingual Large Language Models
by: Kaneko, Masahiro, et al.
Published: (2025)
by: Kaneko, Masahiro, et al.
Published: (2025)
Social Bias Evaluation for Large Language Models Requires Prompt Variations
by: Hida, Rem, et al.
Published: (2024)
by: Hida, Rem, et al.
Published: (2024)
A Japanese Benchmark for Evaluating Social Bias in Reasoning Based on Attribution Theory
by: Shiotani, Taihei, et al.
Published: (2026)
by: Shiotani, Taihei, et al.
Published: (2026)
Bias Beyond English: Evaluating Social Bias and Debiasing Methods in a Low-Resource Setting
by: Zhou, Ej, et al.
Published: (2025)
by: Zhou, Ej, et al.
Published: (2025)
Likelihood-based Mitigation of Evaluation Bias in Large Language Models
by: Oi, Masanari, et al.
Published: (2024)
by: Oi, Masanari, et al.
Published: (2024)
Inference-Time Selective Debiasing to Enhance Fairness in Text Classification Models
by: Kuzmin, Gleb, et al.
Published: (2024)
by: Kuzmin, Gleb, et al.
Published: (2024)
OffsetBias: Leveraging Debiased Data for Tuning Evaluators
by: Park, Junsoo, et al.
Published: (2024)
by: Park, Junsoo, et al.
Published: (2024)
Applying Intrinsic Debiasing on Downstream Tasks: Challenges and Considerations for Machine Translation
by: Iluz, Bar, et al.
Published: (2024)
by: Iluz, Bar, et al.
Published: (2024)
Evaluating Gender Bias Transfer between Pre-trained and Prompt-Adapted Language Models
by: Mackraz, Natalie, et al.
Published: (2024)
by: Mackraz, Natalie, et al.
Published: (2024)
Connecting the Dots in News Analysis: Bridging the Cross-Disciplinary Disparities in Media Bias and Framing
by: Vallejo, Gisela, et al.
Published: (2023)
by: Vallejo, Gisela, et al.
Published: (2023)
Paraphrasing Adversarial Attack on LLM-as-a-Reviewer
by: Kaneko, Masahiro
Published: (2026)
by: Kaneko, Masahiro
Published: (2026)
Benchmarking Gender and Political Bias in Large Language Models
by: Yang, Jinrui, et al.
Published: (2025)
by: Yang, Jinrui, et al.
Published: (2025)
Language Bias in Information Retrieval: The Nature of the Beast and Mitigation Methods
by: Yang, Jinrui, et al.
Published: (2025)
by: Yang, Jinrui, et al.
Published: (2025)
Similar Items
-
Evaluating Gender Bias in Large Language Models via Chain-of-Thought Prompting
by: Kaneko, Masahiro, et al.
Published: (2024) -
Eagle: Ethical Dataset Given from Real Interactions
by: Kaneko, Masahiro, et al.
Published: (2024) -
In-Contextual Gender Bias Suppression for Large Language Models
by: Oba, Daisuke, et al.
Published: (2023) -
Stopping Computation for Converged Tokens in Masked Diffusion-LM Decoding
by: Oba, Daisuke, et al.
Published: (2026) -
Bias Mitigation or Cultural Commonsense? Evaluating LLMs with a Japanese Dataset
by: Yamamoto, Taisei, et al.
Published: (2025)