:: Library Catalog

Bibliographic Details
Main Authors:	Kaneko, Masahiro; Bollegala, Danushka; Baldwin, Timothy
Format:	Preprint
Published:	2024
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2401.08511

Similar Items

Evaluating Gender Bias in Large Language Models via Chain-of-Thought Prompting
by: Kaneko, Masahiro, et al.
Published: (2024)

Eagle: Ethical Dataset Given from Real Interactions
by: Kaneko, Masahiro, et al.
Published: (2024)

In-Contextual Gender Bias Suppression for Large Language Models
by: Oba, Daisuke, et al.
Published: (2023)

Stopping Computation for Converged Tokens in Masked Diffusion-LM Decoding
by: Oba, Daisuke, et al.
Published: (2026)

Bias Mitigation or Cultural Commonsense? Evaluating LLMs with a Japanese Dataset
by: Yamamoto, Taisei, et al.
Published: (2025)

Evaluating the Evaluation of Diversity in Commonsense Generation
by: Zhang, Tianhui, et al.
Published: (2025)

A Semantic Distance Metric Learning approach for Lexical Semantic Change Detection
by: Aida, Taichi, et al.
Published: (2024)

Investigating the Contextualised Word Embedding Dimensions Specified for Contextual and Temporal Semantic Changes
by: Aida, Taichi, et al.
Published: (2024)

SCDTour: Embedding Axis Ordering and Merging for Interpretable Semantic Change Detection
by: Aida, Taichi, et al.
Published: (2025)

Map of Encoders -- Mapping Sentence Encoders using Quantum Relative Entropy
by: Zhang, Gaifan, et al.
Published: (2026)

Evaluating the Effect of Retrieval Augmentation on Social Biases
by: Zhang, Tianhui, et al.
Published: (2025)

Evaluating Unsupervised Dimensionality Reduction Methods for Pretrained Sentence Embeddings
by: Zhang, Gaifan, et al.
Published: (2024)

Evaluating Short-Term Temporal Fluctuations of Social Biases in Social Media Data and Masked Language Models
by: Zhou, Yi, et al.
Published: (2024)

A Little Leak Will Sink a Great Ship: Survey of Transparency for Large Language Models from Start to Finish
by: Kaneko, Masahiro, et al.
Published: (2024)

Improving Diversity of Commonsense Generation by Large Language Models via In-Context Learning
by: Zhang, Tianhui, et al.
Published: (2024)

Synthetic Data Generation for Training Diversified Commonsense Reasoning Models
by: Zhang, Tianhui, et al.
Published: (2026)

Annotating Training Data for Conditional Semantic Textual Similarity Measurement using Large Language Models
by: Zhang, Gaifan, et al.
Published: (2025)

CASE -- Condition-Aware Sentence Embeddings for Conditional Semantic Textual Similarity Measurement
by: Zhang, Gaifan, et al.
Published: (2025)

Evaluating Gender Bias of Pre-trained Language Models in Natural Language Inference by Considering All Labels
by: Anantaprayoon, Panatchakorn, et al.
Published: (2023)

Bits Leaked per Query: Information-Theoretic Bounds on Adversarial Attacks against LLMs
by: Kaneko, Masahiro, et al.
Published: (2025)

Improving Pre-trained Language Model Sensitivity via Mask Specific losses: A case study on Biomedical NER
by: Abaho, Micheal, et al.
Published: (2024)

Online Learning Defense against Iterative Jailbreak Attacks via Prompt Optimization
by: Kaneko, Masahiro, et al.
Published: (2025)

Beyond the Resumé: A Rubric-Aware Automatic Interview System for Information Elicitation
by: Stuart, Harry, et al.
Published: (2026)

Improving Unsupervised Constituency Parsing via Maximizing Semantic Information
by: Chen, Junjie, et al.
Published: (2024)

Unsupervised Parsing by Searching for Frequent Word Sequences among Sentences with Equivalent Predicate-Argument Structures
by: Chen, Junjie, et al.
Published: (2024)

Neuron-Level Analysis of Cultural Understanding in Large Language Models
by: Yamamoto, Taisei, et al.
Published: (2025)

JailNewsBench: Multi-Lingual and Regional Benchmark for Fake News Generation under Jailbreak Attacks
by: Kaneko, Masahiro, et al.
Published: (2026)

Balanced Multi-Factor In-Context Learning for Multilingual Large Language Models
by: Kaneko, Masahiro, et al.
Published: (2025)

Social Bias Evaluation for Large Language Models Requires Prompt Variations
by: Hida, Rem, et al.
Published: (2024)

A Japanese Benchmark for Evaluating Social Bias in Reasoning Based on Attribution Theory
by: Shiotani, Taihei, et al.
Published: (2026)

Bias Beyond English: Evaluating Social Bias and Debiasing Methods in a Low-Resource Setting
by: Zhou, Ej, et al.
Published: (2025)

Likelihood-based Mitigation of Evaluation Bias in Large Language Models
by: Oi, Masanari, et al.
Published: (2024)

Inference-Time Selective Debiasing to Enhance Fairness in Text Classification Models
by: Kuzmin, Gleb, et al.
Published: (2024)

OffsetBias: Leveraging Debiased Data for Tuning Evaluators
by: Park, Junsoo, et al.
Published: (2024)

Applying Intrinsic Debiasing on Downstream Tasks: Challenges and Considerations for Machine Translation
by: Iluz, Bar, et al.
Published: (2024)

Evaluating Gender Bias Transfer between Pre-trained and Prompt-Adapted Language Models
by: Mackraz, Natalie, et al.
Published: (2024)

Connecting the Dots in News Analysis: Bridging the Cross-Disciplinary Disparities in Media Bias and Framing
by: Vallejo, Gisela, et al.
Published: (2023)

Paraphrasing Adversarial Attack on LLM-as-a-Reviewer
by: Kaneko, Masahiro
Published: (2026)

Benchmarking Gender and Political Bias in Large Language Models
by: Yang, Jinrui, et al.
Published: (2025)

Language Bias in Information Retrieval: The Nature of the Beast and Mitigation Methods
by: Yang, Jinrui, et al.
Published: (2025)