Saved in:
| Main Authors: | Xue, Zhaoqian, Liu, Guanhong, Zhang, Chong, Wei, Kai, Zeng, Qingcheng, Hu, Songhua, Hua, Wenyue, Fan, Lizhou, Zhang, Yongfeng, Li, Lingyao |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2502.10641 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
NPHardEval: Dynamic Benchmark on Reasoning Ability of Large Language Models via Complexity Classes
by: Fan, Lizhou, et al.
Published: (2023)
by: Fan, Lizhou, et al.
Published: (2023)
Sympathy over Polarization: A Computational Discourse Analysis of Social Media Posts about the July 2024 Trump Assassination Attempt
by: Zeng, Qingcheng, et al.
Published: (2025)
by: Zeng, Qingcheng, et al.
Published: (2025)
War and Peace (WarAgent): Large Language Model-based Multi-Agent Simulation of World Wars
by: Hua, Wenyue, et al.
Published: (2023)
by: Hua, Wenyue, et al.
Published: (2023)
What if LLMs Have Different World Views: Simulating Alien Civilizations with LLM-based Agents
by: Xue, Zhaoqian, et al.
Published: (2024)
by: Xue, Zhaoqian, et al.
Published: (2024)
ADO: Automatic Data Optimization for Inputs in LLM Prompts
by: Lin, Sam, et al.
Published: (2025)
by: Lin, Sam, et al.
Published: (2025)
Crowdsourced reviews reveal substantial disparities in public perceptions of parking
by: Li, Lingyao, et al.
Published: (2024)
by: Li, Lingyao, et al.
Published: (2024)
Disentangling Logic: The Role of Context in Large Language Model Reasoning Capabilities
by: Hua, Wenyue, et al.
Published: (2024)
by: Hua, Wenyue, et al.
Published: (2024)
EmojiPrompt: Generative Prompt Obfuscation for Privacy-Preserving Communication with Cloud-based LLMs
by: Lin, Sam, et al.
Published: (2024)
by: Lin, Sam, et al.
Published: (2024)
A scoping review of using Large Language Models (LLMs) to investigate Electronic Health Records (EHRs)
by: Li, Lingyao, et al.
Published: (2024)
by: Li, Lingyao, et al.
Published: (2024)
Towards Trustworthy AI: Characterizing User-Reported Risks across LLMs "In the Wild"
by: Li, Lingyao, et al.
Published: (2025)
by: Li, Lingyao, et al.
Published: (2025)
Large Language Models in Biomedical and Health Informatics: A Review with Bibliometric Analysis
by: Yu, Huizi, et al.
Published: (2024)
by: Yu, Huizi, et al.
Published: (2024)
BattleAgent: Multi-modal Dynamic Emulation on Historical Battles to Complement Historical Analysis
by: Lin, Shuhang, et al.
Published: (2024)
by: Lin, Shuhang, et al.
Published: (2024)
Crowdsourcing-Based Knowledge Graph Construction for Drug Side Effects Using Large Language Models with an Application on Semaglutide
by: Duan, Zhijie, et al.
Published: (2025)
by: Duan, Zhijie, et al.
Published: (2025)
Crowdsourcing public attitudes toward local services through the lens of Google Maps reviews: An urban density-based perspective
by: Li, Lingyao, et al.
Published: (2024)
by: Li, Lingyao, et al.
Published: (2024)
Health-LLM: Personalized Retrieval-Augmented Disease Prediction System
by: Yu, Qinkai, et al.
Published: (2024)
by: Yu, Qinkai, et al.
Published: (2024)
A Multimodal, Multilingual, and Multidimensional Pipeline for Fine-grained Crowdsourcing Earthquake Damage Evaluation
by: Ma, Zihui, et al.
Published: (2025)
by: Ma, Zihui, et al.
Published: (2025)
NPHardEval4V: Dynamic Evaluation of Large Vision-Language Models with Effects of Vision
by: Li, Xiang, et al.
Published: (2024)
by: Li, Xiang, et al.
Published: (2024)
OpenP5: An Open-Source Platform for Developing, Training, and Evaluating LLM-based Recommender Systems
by: Xu, Shuyuan, et al.
Published: (2023)
by: Xu, Shuyuan, et al.
Published: (2023)
Game-theoretic LLM: Agent Workflow for Negotiation Games
by: Hua, Wenyue, et al.
Published: (2024)
by: Hua, Wenyue, et al.
Published: (2024)
Towards Robust Semantic Correspondence: A Benchmark and Insights
by: Chong, Wenyue
Published: (2025)
by: Chong, Wenyue
Published: (2025)
Invisible Prompts, Visible Threats: Malicious Font Injection in External Resources for Large Language Models
by: Xiong, Junjie, et al.
Published: (2025)
by: Xiong, Junjie, et al.
Published: (2025)
Deciphering Scientific Collaboration in Biomedical LLM Research: Dynamics, Institutional Participation, and Resource Disparities
by: Li, Lingyao, et al.
Published: (2025)
by: Li, Lingyao, et al.
Published: (2025)
Characterizing Online Toxicity During the 2022 Mpox Outbreak: A Computational Analysis of Topical and Network Dynamics
by: Fan, Lizhou, et al.
Published: (2024)
by: Fan, Lizhou, et al.
Published: (2024)
Patients Speak, AI Listens: LLM-based Analysis of Online Reviews Uncovers Key Drivers for Urgent Care Satisfaction
by: Xu, Xiaoran, et al.
Published: (2025)
by: Xu, Xiaoran, et al.
Published: (2025)
Exploring Concept Depth: How Large Language Models Acquire Knowledge and Concept at Different Layers?
by: Jin, Mingyu, et al.
Published: (2024)
by: Jin, Mingyu, et al.
Published: (2024)
AttackEval: How to Evaluate the Effectiveness of Jailbreak Attacking on Large Language Models
by: Shu, Dong, et al.
Published: (2024)
by: Shu, Dong, et al.
Published: (2024)
When AI Meets Finance (StockAgent): Large Language Model-based Stock Trading in Simulated Real-world Environments
by: Zhang, Chong, et al.
Published: (2024)
by: Zhang, Chong, et al.
Published: (2024)
PAP-REC: Personalized Automatic Prompt for Recommendation Language Model
by: Li, Zelong, et al.
Published: (2024)
by: Li, Zelong, et al.
Published: (2024)
UP5: Unbiased Foundation Model for Fairness-aware Recommendation
by: Hua, Wenyue, et al.
Published: (2023)
by: Hua, Wenyue, et al.
Published: (2023)
Formal-LLM: Integrating Formal Language and Natural Language for Controllable LLM-based Agents
by: Li, Zelong, et al.
Published: (2024)
by: Li, Zelong, et al.
Published: (2024)
TrustAgent: Towards Safe and Trustworthy LLM-based Agents
by: Hua, Wenyue, et al.
Published: (2024)
by: Hua, Wenyue, et al.
Published: (2024)
"HOT" ChatGPT: The promise of ChatGPT in detecting and discriminating hateful, offensive, and toxic comments on social media
by: Li, Lingyao, et al.
Published: (2023)
by: Li, Lingyao, et al.
Published: (2023)
Academic collaboration on large language model studies increases overall but varies across disciplines
by: Li, Lingyao, et al.
Published: (2024)
by: Li, Lingyao, et al.
Published: (2024)
Toward satisfactory public accessibility: A crowdsourcing approach through online reviews to inclusive urban design
by: Li, Lingyao, et al.
Published: (2024)
by: Li, Lingyao, et al.
Published: (2024)
LLM Use for Mental Health: Crowdsourcing Users' Sentiment-based Perspectives and Values from Social Discussions
by: Li, Lingyao, et al.
Published: (2025)
by: Li, Lingyao, et al.
Published: (2025)
Know the Ropes: A Heuristic Strategy for LLM-based Multi-Agent System Design
by: Li, Zhenkun, et al.
Published: (2025)
by: Li, Zhenkun, et al.
Published: (2025)
IDGenRec: LLM-RecSys Alignment with Textual ID Learning
by: Tan, Juntao, et al.
Published: (2024)
by: Tan, Juntao, et al.
Published: (2024)
MoralBench: Moral Evaluation of LLMs
by: Ji, Jianchao, et al.
Published: (2024)
by: Ji, Jianchao, et al.
Published: (2024)
Interactive Speculative Planning: Enhance Agent Efficiency through Co-design of System and User Interface
by: Hua, Wenyue, et al.
Published: (2024)
by: Hua, Wenyue, et al.
Published: (2024)
Leveraging Human Production-Interpretation Asymmetries to Test LLM Cognitive Plausibility
by: Lam, Suet-Ying, et al.
Published: (2025)
by: Lam, Suet-Ying, et al.
Published: (2025)
Similar Items
-
NPHardEval: Dynamic Benchmark on Reasoning Ability of Large Language Models via Complexity Classes
by: Fan, Lizhou, et al.
Published: (2023) -
Sympathy over Polarization: A Computational Discourse Analysis of Social Media Posts about the July 2024 Trump Assassination Attempt
by: Zeng, Qingcheng, et al.
Published: (2025) -
War and Peace (WarAgent): Large Language Model-based Multi-Agent Simulation of World Wars
by: Hua, Wenyue, et al.
Published: (2023) -
What if LLMs Have Different World Views: Simulating Alien Civilizations with LLM-based Agents
by: Xue, Zhaoqian, et al.
Published: (2024) -
ADO: Automatic Data Optimization for Inputs in LLM Prompts
by: Lin, Sam, et al.
Published: (2025)