:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Schelb, Julian, Borin, Orr, Garcia, David, Spitz, Andreas
Format:	Preprint
Published:	2025
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2503.10229
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Assessing In-context Learning and Fine-tuning for Topic Classification of German Web Data
by: Schelb, Julian, et al.
Published: (2024)

Loci Similes: A Benchmark for Extracting Intertextualities in Latin Literature
by: Schelb, Julian, et al.
Published: (2026)

Only a Little to the Left: A Theory-grounded Measure of Political Bias in Large Language Models
by: Faulborn, Mats, et al.
Published: (2025)

PsychoLex: Unveiling the Psychological Mind of Large Language Models
by: Abbasi, Mohammad Amin, et al.
Published: (2024)

Quantifying the Risks of Tool-assisted Rephrasing to Linguistic Diversity
by: Wang, Mengying, et al.
Published: (2024)

Do Psychometric Tests Work for Large Language Models? Evaluation of Tests on Sexism, Racism, and Morality
by: Jung, Jana, et al.
Published: (2025)

Generative Psycho-Lexical Approach for Constructing Value Systems in Large Language Models
by: Ye, Haoran, et al.
Published: (2025)

Tag-Pag: A Dedicated Tool for Systematic Web Page Annotations
by: Pogrebnjak, Anton, et al.
Published: (2025)

Evaluating Large Language Models with Psychometrics
by: Li, Yuan, et al.
Published: (2024)

Preference Learning Unlocks LLMs' Psycho-Counseling Skills
by: Zhang, Mian, et al.
Published: (2025)

Psychometric Predictive Power of Large Language Models
by: Kuribayashi, Tatsuki, et al.
Published: (2023)

Who is ChatGPT? Benchmarking LLMs' Psychological Portrayal Using PsychoBench
by: Huang, Jen-tse, et al.
Published: (2023)

Statistical Hypothesis Testing for Auditing Robustness in Language Models
by: Rauba, Paulius, et al.
Published: (2025)

Psycho-linguistic Experiment on Universal Semantic Components of Verbal Humor: System Description and Annotation
by: Mikhalkova, Elena, et al.
Published: (2024)

Do LLMs Give Psychometrically Plausible Responses in Educational Assessments?
by: Säuberli, Andreas, et al.
Published: (2025)

Psychometric Personality Shaping Modulates Capabilities and Safety in Language Models
by: Fitz, Stephen, et al.
Published: (2025)

Unified Enhancement of the Generalization and Robustness of Language Models via Bi-Stage Optimization
by: Sun, Yudao, et al.
Published: (2025)

Test-Time Fairness and Robustness in Large Language Models
by: Cotta, Leonardo, et al.
Published: (2024)

Psychometric Alignment: Capturing Human Knowledge Distributions via Language Models
by: He-Yueya, Joy, et al.
Published: (2024)

Automatic Generation and Evaluation of Reading Comprehension Test Items with Large Language Models
by: Säuberli, Andreas, et al.
Published: (2024)

Adaptive Testing for LLM Evaluation: A Psychometric Alternative to Static Benchmarks
by: Li, Peiyu, et al.
Published: (2025)

United States Politicians' Tone Became More Negative with 2016 Primary Campaigns
by: Külz, Jonathan, et al.
Published: (2022)

Measuring Human and AI Values Based on Generative Psychometrics with Large Language Models
by: Ye, Haoran, et al.
Published: (2024)

Large Language Model Psychometrics: A Systematic Review of Evaluation, Validation, and Enhancement
by: Ye, Haoran, et al.
Published: (2025)

The Impact of Image Resolution on Biomedical Multimodal Large Language Models
by: Chen, Liangyu, et al.
Published: (2025)

The Expressions of Depression and Anxiety in Chinese Psycho-counseling: Usage of First-person Singular Pronoun and Negative Emotional Words
by: Ma, Lizhi, et al.
Published: (2025)

Evaluating Implicit Bias in Large Language Models by Attacking From a Psychometric Perspective
by: Wen, Yuchen, et al.
Published: (2024)

Defining and Evaluating Visual Language Models' Basic Spatial Abilities: A Perspective from Psychometrics
by: Xu, Wenrui, et al.
Published: (2025)

Political Alignment in Large Language Models: A Multidimensional Audit of Psychometric Identity and Behavioral Bias
by: Sakhawat, Adib, et al.
Published: (2026)

VideoAgent: Long-form Video Understanding with Large Language Model as Agent
by: Wang, Xiaohan, et al.
Published: (2024)

Camouflage is all you need: Evaluating and Enhancing Language Model Robustness Against Camouflage Adversarial Attacks
by: Huertas-García, Álvaro, et al.
Published: (2024)

A Novel Psychometrics-Based Approach to Developing Professional Competency Benchmark for Large Language Models
by: Kardanova, Elena, et al.
Published: (2024)

Mixed-R1: Unified Reward Perspective For Reasoning Capability in Multimodal Large Language Models
by: Xu, Shilin, et al.
Published: (2025)

Synchronic and Diachronic Aspects of Kanashi
by: Saxena, Anju, et al.
Published: (2022)

You don't need a personality test to know these models are unreliable: Assessing the Reliability of Large Language Models on Psychometric Instruments
by: Shu, Bangzhao, et al.
Published: (2023)

Do GPT Language Models Suffer From Split Personality Disorder? The Advent Of Substrate-Free Psychometrics
by: Romero, Peter, et al.
Published: (2024)

Search-R3: Unifying Reasoning and Embedding in Large Language Models
by: Gui, Yuntao, et al.
Published: (2025)

PsychoGAT: A Novel Psychological Measurement Paradigm through Interactive Fiction Games with LLM Agents
by: Yang, Qisen, et al.
Published: (2024)

On Non-interactive Evaluation of Animal Communication Translators
by: Paradise, Orr, et al.
Published: (2025)

Langformers: Unified NLP Pipelines for Language Models
by: Lamsal, Rabindra, et al.
Published: (2025)