Saved in:
| Main Authors: | Schelb, Julian, Borin, Orr, Garcia, David, Spitz, Andreas |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2503.10229 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Assessing In-context Learning and Fine-tuning for Topic Classification of German Web Data
by: Schelb, Julian, et al.
Published: (2024)
by: Schelb, Julian, et al.
Published: (2024)
Loci Similes: A Benchmark for Extracting Intertextualities in Latin Literature
by: Schelb, Julian, et al.
Published: (2026)
by: Schelb, Julian, et al.
Published: (2026)
Only a Little to the Left: A Theory-grounded Measure of Political Bias in Large Language Models
by: Faulborn, Mats, et al.
Published: (2025)
by: Faulborn, Mats, et al.
Published: (2025)
PsychoLex: Unveiling the Psychological Mind of Large Language Models
by: Abbasi, Mohammad Amin, et al.
Published: (2024)
by: Abbasi, Mohammad Amin, et al.
Published: (2024)
Quantifying the Risks of Tool-assisted Rephrasing to Linguistic Diversity
by: Wang, Mengying, et al.
Published: (2024)
by: Wang, Mengying, et al.
Published: (2024)
Do Psychometric Tests Work for Large Language Models? Evaluation of Tests on Sexism, Racism, and Morality
by: Jung, Jana, et al.
Published: (2025)
by: Jung, Jana, et al.
Published: (2025)
Generative Psycho-Lexical Approach for Constructing Value Systems in Large Language Models
by: Ye, Haoran, et al.
Published: (2025)
by: Ye, Haoran, et al.
Published: (2025)
Tag-Pag: A Dedicated Tool for Systematic Web Page Annotations
by: Pogrebnjak, Anton, et al.
Published: (2025)
by: Pogrebnjak, Anton, et al.
Published: (2025)
Evaluating Large Language Models with Psychometrics
by: Li, Yuan, et al.
Published: (2024)
by: Li, Yuan, et al.
Published: (2024)
Preference Learning Unlocks LLMs' Psycho-Counseling Skills
by: Zhang, Mian, et al.
Published: (2025)
by: Zhang, Mian, et al.
Published: (2025)
Psychometric Predictive Power of Large Language Models
by: Kuribayashi, Tatsuki, et al.
Published: (2023)
by: Kuribayashi, Tatsuki, et al.
Published: (2023)
Who is ChatGPT? Benchmarking LLMs' Psychological Portrayal Using PsychoBench
by: Huang, Jen-tse, et al.
Published: (2023)
by: Huang, Jen-tse, et al.
Published: (2023)
Statistical Hypothesis Testing for Auditing Robustness in Language Models
by: Rauba, Paulius, et al.
Published: (2025)
by: Rauba, Paulius, et al.
Published: (2025)
Psycho-linguistic Experiment on Universal Semantic Components of Verbal Humor: System Description and Annotation
by: Mikhalkova, Elena, et al.
Published: (2024)
by: Mikhalkova, Elena, et al.
Published: (2024)
Do LLMs Give Psychometrically Plausible Responses in Educational Assessments?
by: Säuberli, Andreas, et al.
Published: (2025)
by: Säuberli, Andreas, et al.
Published: (2025)
Psychometric Personality Shaping Modulates Capabilities and Safety in Language Models
by: Fitz, Stephen, et al.
Published: (2025)
by: Fitz, Stephen, et al.
Published: (2025)
Unified Enhancement of the Generalization and Robustness of Language Models via Bi-Stage Optimization
by: Sun, Yudao, et al.
Published: (2025)
by: Sun, Yudao, et al.
Published: (2025)
Test-Time Fairness and Robustness in Large Language Models
by: Cotta, Leonardo, et al.
Published: (2024)
by: Cotta, Leonardo, et al.
Published: (2024)
Psychometric Alignment: Capturing Human Knowledge Distributions via Language Models
by: He-Yueya, Joy, et al.
Published: (2024)
by: He-Yueya, Joy, et al.
Published: (2024)
Automatic Generation and Evaluation of Reading Comprehension Test Items with Large Language Models
by: Säuberli, Andreas, et al.
Published: (2024)
by: Säuberli, Andreas, et al.
Published: (2024)
Adaptive Testing for LLM Evaluation: A Psychometric Alternative to Static Benchmarks
by: Li, Peiyu, et al.
Published: (2025)
by: Li, Peiyu, et al.
Published: (2025)
United States Politicians' Tone Became More Negative with 2016 Primary Campaigns
by: Külz, Jonathan, et al.
Published: (2022)
by: Külz, Jonathan, et al.
Published: (2022)
Measuring Human and AI Values Based on Generative Psychometrics with Large Language Models
by: Ye, Haoran, et al.
Published: (2024)
by: Ye, Haoran, et al.
Published: (2024)
Large Language Model Psychometrics: A Systematic Review of Evaluation, Validation, and Enhancement
by: Ye, Haoran, et al.
Published: (2025)
by: Ye, Haoran, et al.
Published: (2025)
The Impact of Image Resolution on Biomedical Multimodal Large Language Models
by: Chen, Liangyu, et al.
Published: (2025)
by: Chen, Liangyu, et al.
Published: (2025)
The Expressions of Depression and Anxiety in Chinese Psycho-counseling: Usage of First-person Singular Pronoun and Negative Emotional Words
by: Ma, Lizhi, et al.
Published: (2025)
by: Ma, Lizhi, et al.
Published: (2025)
Evaluating Implicit Bias in Large Language Models by Attacking From a Psychometric Perspective
by: Wen, Yuchen, et al.
Published: (2024)
by: Wen, Yuchen, et al.
Published: (2024)
Defining and Evaluating Visual Language Models' Basic Spatial Abilities: A Perspective from Psychometrics
by: Xu, Wenrui, et al.
Published: (2025)
by: Xu, Wenrui, et al.
Published: (2025)
Political Alignment in Large Language Models: A Multidimensional Audit of Psychometric Identity and Behavioral Bias
by: Sakhawat, Adib, et al.
Published: (2026)
by: Sakhawat, Adib, et al.
Published: (2026)
VideoAgent: Long-form Video Understanding with Large Language Model as Agent
by: Wang, Xiaohan, et al.
Published: (2024)
by: Wang, Xiaohan, et al.
Published: (2024)
Camouflage is all you need: Evaluating and Enhancing Language Model Robustness Against Camouflage Adversarial Attacks
by: Huertas-García, Álvaro, et al.
Published: (2024)
by: Huertas-García, Álvaro, et al.
Published: (2024)
A Novel Psychometrics-Based Approach to Developing Professional Competency Benchmark for Large Language Models
by: Kardanova, Elena, et al.
Published: (2024)
by: Kardanova, Elena, et al.
Published: (2024)
Mixed-R1: Unified Reward Perspective For Reasoning Capability in Multimodal Large Language Models
by: Xu, Shilin, et al.
Published: (2025)
by: Xu, Shilin, et al.
Published: (2025)
Synchronic and Diachronic Aspects of Kanashi
by: Saxena, Anju, et al.
Published: (2022)
by: Saxena, Anju, et al.
Published: (2022)
You don't need a personality test to know these models are unreliable: Assessing the Reliability of Large Language Models on Psychometric Instruments
by: Shu, Bangzhao, et al.
Published: (2023)
by: Shu, Bangzhao, et al.
Published: (2023)
Do GPT Language Models Suffer From Split Personality Disorder? The Advent Of Substrate-Free Psychometrics
by: Romero, Peter, et al.
Published: (2024)
by: Romero, Peter, et al.
Published: (2024)
Search-R3: Unifying Reasoning and Embedding in Large Language Models
by: Gui, Yuntao, et al.
Published: (2025)
by: Gui, Yuntao, et al.
Published: (2025)
PsychoGAT: A Novel Psychological Measurement Paradigm through Interactive Fiction Games with LLM Agents
by: Yang, Qisen, et al.
Published: (2024)
by: Yang, Qisen, et al.
Published: (2024)
On Non-interactive Evaluation of Animal Communication Translators
by: Paradise, Orr, et al.
Published: (2025)
by: Paradise, Orr, et al.
Published: (2025)
Langformers: Unified NLP Pipelines for Language Models
by: Lamsal, Rabindra, et al.
Published: (2025)
by: Lamsal, Rabindra, et al.
Published: (2025)
Similar Items
-
Assessing In-context Learning and Fine-tuning for Topic Classification of German Web Data
by: Schelb, Julian, et al.
Published: (2024) -
Loci Similes: A Benchmark for Extracting Intertextualities in Latin Literature
by: Schelb, Julian, et al.
Published: (2026) -
Only a Little to the Left: A Theory-grounded Measure of Political Bias in Large Language Models
by: Faulborn, Mats, et al.
Published: (2025) -
PsychoLex: Unveiling the Psychological Mind of Large Language Models
by: Abbasi, Mohammad Amin, et al.
Published: (2024) -
Quantifying the Risks of Tool-assisted Rephrasing to Linguistic Diversity
by: Wang, Mengying, et al.
Published: (2024)