| Main Authors: | Noori, Mobina, Chakraborti, Mahasweta, Zhang, Amy X, Frey, Seth |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Online Access: | https://arxiv.org/abs/2510.08956 |
Similar Items
Patterns in the Transition From Founder-Leadership to Community Governance of Open Source
by: Noori, Mobina, et al.
Published: (2025)
Responsible AI in Open Ecosystems: Reconciling Innovation with Risk Assessment and Disclosure
by: Chakraborti, Mahasweta, et al.
Published: (2024)
The Hidden AI Race: Tracking Environmental Costs of Innovation
by: Agarwal, Shyam, et al.
Published: (2025)
Self-reflection in Automated Qualitative Coding: Improving Text Annotation through Secondary LLM Critique
by: Dunivin, Zackary Okun, et al.
Published: (2026)
Do We Run How We Say We Run? Formalization and Practice of Governance in OSS Communities
by: Chakraborti, Mahasweta, et al.
Published: (2023)
BaichuanSEED: Sharing the Potential of ExtensivE Data Collection and Deduplication by Introducing a Competitive Large Language Model Baseline
by: Dong, Guosheng, et al.
Published: (2024)
Correcting misinformation on social media with a large language model
by: Zhou, Xinyi, et al.
Published: (2024)
SkillsVote: Lifecycle Governance of Agent Skills from Collection, Recommendation to Evolution
by: Liu, Hongyi, et al.
Published: (2026)
INTIMA: A Benchmark for Human-AI Companionship Behavior
by: Kaffee, Lucie-Aimée, et al.
Published: (2025)
DevEval: Evaluating Code Generation in Practical Software Projects
by: Li, Jia, et al.
Published: (2024)
Human Psychometric Questionnaires Mischaracterize LLM Behavior
by: Song, Woojung, et al.
Published: (2025)
From Sycophancy to Sensemaking: Premise Governance for Human-AI Decision Making
by: Jain, Raunak
Published: (2026)
NLP4Gov: A Comprehensive Library for Computational Policy Analysis
by: Chakraborti, Mahasweta, et al.
Published: (2024)
Impact of Stickers on Multimodal Sentiment and Intent in Social Media: A New Task, Dataset and Baseline
by: Shi, Yuanchen, et al.
Published: (2024)
Explainable Ethical Assessment on Human Behaviors by Generating Conflicting Social Norms
by: Sun, Yuxi, et al.
Published: (2025)
From Human-to-Human to Human-to-Bot Conversations in Software Engineering
by: Khojah, Ranim, et al.
Published: (2024)
Aligning Large Language Model Behavior with Human Citation Preferences
by: Ando, Kenichiro, et al.
Published: (2026)
Teaching Values to Machines: Simulating Human-Like Behavior in LLMs
by: Yehudai, Asaf, et al.
Published: (2026)
Dataset Creation and Baseline Models for Sexism Detection in Hausa
by: Muhammad, Fatima Adam, et al.
Published: (2025)
Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections
by: Borchmann, Łukasz, et al.
Published: (2026)
Can LLMs Truly Embody Human Personality? Analyzing AI and Human Behavior Alignment in Dispute Resolution
by: Kwon, Deuksin, et al.
Published: (2026)
The Third Ambition: Artificial Intelligence and the Science of Human Behavior
by: Neuman, W. Russell, et al.
Published: (2026)
The Real, the Better: Aligning Large Language Models with Online Human Behaviors
by: Jiang, Guanying, et al.
Published: (2024)
Keep Guessing? When Considering Inference Scaling, Mind the Baselines
by: Yona, Gal, et al.
Published: (2024)
AgentOccam: A Simple Yet Strong Baseline for LLM-Based Web Agents
by: Yang, Ke, et al.
Published: (2024)
A Joint Neural Baseline for Concept, Assertion, and Relation Extraction from Clinical Text
by: Cheng, Fei, et al.
Published: (2026)
LLM Nepotism in Organizational Governance
by: Mao, Shunqi, et al.
Published: (2026)
'Neural howlround' in large language models: a self-reinforcing bias phenomenon, and a dynamic attenuation solution
by: Drake, Seth
Published: (2025)
Human vs. Machine: Behavioral Differences Between Expert Humans and Language Models in Wargame Simulations
by: Lamparth, Max, et al.
Published: (2024)
The Right Model for the Job: An Evaluation of Legal Multi-Label Classification Baselines
by: Forster, Martina, et al.
Published: (2024)
Leveraging Large Language Models in Code Question Answering: Baselines and Issues
by: Andryushchenko, Georgy, et al.
Published: (2024)
Can Large Language Models put 2 and 2 together? Probing for Entailed Arithmetical Relationships
by: Panas, D., et al.
Published: (2024)
EigenBench: A Comparative Behavioral Measure of Value Alignment
by: Chang, Jonathn, et al.
Published: (2025)
Be.FM: Open Foundation Models for Human Behavior
by: Xie, Yutong, et al.
Published: (2025)
Inner Speech as Behavior Guides: Steerable Imitation of Diverse Behaviors for Human-AI coordination
by: Trivedi, Rakshit, et al.
Published: (2026)
Language Models are Few-Shot Graders
by: Zhao, Chenyan, et al.
Published: (2025)
Siren: A Learning-Based Multi-Turn Attack Framework for Simulating Real-World Human Jailbreak Behaviors
by: Zhao, Yi, et al.
Published: (2025)
Strings from the Library of Babel: Random Sampling as a Strong Baseline for Prompt Optimisation
by: Lu, Yao, et al.
Published: (2023)
SPA: A Simple but Tough-to-Beat Baseline for Knowledge Injection
by: Tang, Kexian, et al.
Published: (2026)
Lightweight Baselines for Medical Abstract Classification: DistilBERT with Cross-Entropy as a Strong Default
by: Liu, Jiaqi, et al.
Published: (2025)