Saved in:
| Main Authors: | Choi, Minje, Pei, Jiaxin, Kumar, Sagar, Shu, Chang, Jurgens, David |
|---|---|
| Format: | Preprint |
| Published: |
2023
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2305.14938 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
MM-Soc: Benchmarking Multimodal Large Language Models in Social Media Platforms
by: Jin, Yiqiao, et al.
Published: (2024)
by: Jin, Yiqiao, et al.
Published: (2024)
Sociodemographic Prompting is Not Yet an Effective Approach for Simulating Subjective Judgments with LLMs
by: Sun, Huaman, et al.
Published: (2023)
by: Sun, Huaman, et al.
Published: (2023)
You don't need a personality test to know these models are unreliable: Assessing the Reliability of Large Language Models on Psychometric Instruments
by: Shu, Bangzhao, et al.
Published: (2023)
by: Shu, Bangzhao, et al.
Published: (2023)
When "A Helpful Assistant" Is Not Really Helpful: Personas in System Prompts Do Not Improve Performances of Large Language Models
by: Zheng, Mingqian, et al.
Published: (2023)
by: Zheng, Mingqian, et al.
Published: (2023)
Analyzing the Engagement of Social Relationships During Life Event Shocks in Social Media
by: Choi, Minje, et al.
Published: (2023)
by: Choi, Minje, et al.
Published: (2023)
Beyond Demographics: Fine-tuning Large Language Models to Predict Individuals' Subjective Text Perceptions
by: Orlikowski, Matthias, et al.
Published: (2025)
by: Orlikowski, Matthias, et al.
Published: (2025)
Modeling Public Perceptions of Science in Media
by: Pei, Jiaxin, et al.
Published: (2025)
by: Pei, Jiaxin, et al.
Published: (2025)
Are Rules Meant to be Broken? Understanding Multilingual Moral Reasoning as a Computational Pipeline with UniMoral
by: Kumar, Shivani, et al.
Published: (2025)
by: Kumar, Shivani, et al.
Published: (2025)
SocREval: Large Language Models with the Socratic Method for Reference-Free Reasoning Evaluation
by: He, Hangfeng, et al.
Published: (2023)
by: He, Hangfeng, et al.
Published: (2023)
More than Meets the Tie: Examining the Role of Interpersonal Relationships in Social Networks
by: Choi, Minje, et al.
Published: (2021)
by: Choi, Minje, et al.
Published: (2021)
Beyond Consensus: Perspectivist Modeling and Evaluation of Annotator Disagreement in NLP
by: Xu, Yinuo, et al.
Published: (2026)
by: Xu, Yinuo, et al.
Published: (2026)
The Call for Socially Aware Language Technologies
by: Yang, Diyi, et al.
Published: (2024)
by: Yang, Diyi, et al.
Published: (2024)
Benchmarking Local LLMs for Natural-Language-to-SQL Querying in Biopharmaceutical Manufacturing: An Empirical Benchmark on Consumer-Grade Hardware
by: Bhetwal, Sagar, et al.
Published: (2026)
by: Bhetwal, Sagar, et al.
Published: (2026)
Performance Evaluation of Tokenizers in Large Language Models for the Assamese Language
by: Tamang, Sagar, et al.
Published: (2024)
by: Tamang, Sagar, et al.
Published: (2024)
Developing a Pragmatic Benchmark for Assessing Korean Legal Language Understanding in Large Language Models
by: Kim, Yeeun, et al.
Published: (2024)
by: Kim, Yeeun, et al.
Published: (2024)
KET-QA: A Dataset for Knowledge Enhanced Table Question Answering
by: Hu, Mengkang, et al.
Published: (2024)
by: Hu, Mengkang, et al.
Published: (2024)
Are Economists Always More Introverted? Analyzing Consistency in Persona-Assigned LLMs
by: Reusens, Manon, et al.
Published: (2025)
by: Reusens, Manon, et al.
Published: (2025)
Can AI Truly Represent Your Voice in Deliberations? A Comprehensive Study of Large-Scale Opinion Aggregation with LLMs
by: Zhu, Shenzhe, et al.
Published: (2025)
by: Zhu, Shenzhe, et al.
Published: (2025)
SocialMaze: A Benchmark for Evaluating Social Reasoning in Large Language Models
by: Xu, Zixiang, et al.
Published: (2025)
by: Xu, Zixiang, et al.
Published: (2025)
Modeling Empathetic Alignment in Conversation
by: Yang, Jiamin, et al.
Published: (2024)
by: Yang, Jiamin, et al.
Published: (2024)
Synthetic Knowledge Ingestion: Towards Knowledge Refinement and Injection for Enhancing Large Language Models
by: Zhang, Jiaxin, et al.
Published: (2024)
by: Zhang, Jiaxin, et al.
Published: (2024)
How Do Large Language Models Acquire Factual Knowledge During Pretraining?
by: Chang, Hoyeon, et al.
Published: (2024)
by: Chang, Hoyeon, et al.
Published: (2024)
How Well Do LLMs Understand Drug Mechanisms? A Knowledge + Reasoning Evaluation Dataset
by: Mohan, Sunil, et al.
Published: (2025)
by: Mohan, Sunil, et al.
Published: (2025)
How Well Do LLMs Handle Cantonese? Benchmarking Cantonese Capabilities of Large Language Models
by: Jiang, Jiyue, et al.
Published: (2024)
by: Jiang, Jiyue, et al.
Published: (2024)
Cross-Modal Projection in Multimodal LLMs Doesn't Really Project Visual Attributes to Textual Space
by: Verma, Gaurav, et al.
Published: (2024)
by: Verma, Gaurav, et al.
Published: (2024)
Tokenization is Sensitive to Language Variation
by: Wegmann, Anna, et al.
Published: (2025)
by: Wegmann, Anna, et al.
Published: (2025)
SANSKRITI: A Comprehensive Benchmark for Evaluating Language Models' Knowledge of Indian Culture
by: Maji, Arijit, et al.
Published: (2025)
by: Maji, Arijit, et al.
Published: (2025)
Benchmarking Knowledge Boundary for Large Language Models: A Different Perspective on Model Evaluation
by: Yin, Xunjian, et al.
Published: (2024)
by: Yin, Xunjian, et al.
Published: (2024)
When LLMs Meet Cunning Texts: A Fallacy Understanding Benchmark for Large Language Models
by: Li, Yinghui, et al.
Published: (2024)
by: Li, Yinghui, et al.
Published: (2024)
Evaluating Cultural Knowledge Processing in Large Language Models: A Cognitive Benchmarking Framework Integrating Retrieval-Augmented Generation
by: Lee, Hung-Shin, et al.
Published: (2025)
by: Lee, Hung-Shin, et al.
Published: (2025)
SKA-Bench: A Fine-Grained Benchmark for Evaluating Structured Knowledge Understanding of LLMs
by: Liu, Zhiqiang, et al.
Published: (2025)
by: Liu, Zhiqiang, et al.
Published: (2025)
Do LLMs Know What Is Private Internally? Probing and Steering Contextual Privacy Norms in Large Language Model Representations
by: Wang, Haoran, et al.
Published: (2026)
by: Wang, Haoran, et al.
Published: (2026)
Multi-User Large Language Model Agents
by: Yang, Shu, et al.
Published: (2026)
by: Yang, Shu, et al.
Published: (2026)
The Muddy Waters of Modeling Empathy in Language: The Practical Impacts of Theoretical Constructs
by: Lahnala, Allison, et al.
Published: (2025)
by: Lahnala, Allison, et al.
Published: (2025)
Mitigating Geospatial Knowledge Hallucination in Large Language Models: Benchmarking and Dynamic Factuality Aligning
by: Wang, Shengyuan, et al.
Published: (2025)
by: Wang, Shengyuan, et al.
Published: (2025)
Do They Understand Them? An Updated Evaluation on Nonbinary Pronoun Handling in Large Language Models
by: Tang, Xushuo, et al.
Published: (2025)
by: Tang, Xushuo, et al.
Published: (2025)
Do LLMs Overcome Shortcut Learning? An Evaluation of Shortcut Challenges in Large Language Models
by: Yuan, Yu, et al.
Published: (2024)
by: Yuan, Yu, et al.
Published: (2024)
SPRIG: Improving Large Language Model Performance by System Prompt Optimization
by: Zhang, Lechen, et al.
Published: (2024)
by: Zhang, Lechen, et al.
Published: (2024)
Do Large Language Models Understand Word Senses?
by: Meconi, Domenico, et al.
Published: (2025)
by: Meconi, Domenico, et al.
Published: (2025)
SciGPT: A Large Language Model for Scientific Literature Understanding and Knowledge Discovery
by: She, Fengyu, et al.
Published: (2025)
by: She, Fengyu, et al.
Published: (2025)
Similar Items
-
MM-Soc: Benchmarking Multimodal Large Language Models in Social Media Platforms
by: Jin, Yiqiao, et al.
Published: (2024) -
Sociodemographic Prompting is Not Yet an Effective Approach for Simulating Subjective Judgments with LLMs
by: Sun, Huaman, et al.
Published: (2023) -
You don't need a personality test to know these models are unreliable: Assessing the Reliability of Large Language Models on Psychometric Instruments
by: Shu, Bangzhao, et al.
Published: (2023) -
When "A Helpful Assistant" Is Not Really Helpful: Personas in System Prompts Do Not Improve Performances of Large Language Models
by: Zheng, Mingqian, et al.
Published: (2023) -
Analyzing the Engagement of Social Relationships During Life Event Shocks in Social Media
by: Choi, Minje, et al.
Published: (2023)