:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Noori, Mobina, Chakraborti, Mahasweta, Zhang, Amy X, Frey, Seth
Format:	Preprint
Published:	2025
Subjects:	Computation and Language Artificial Intelligence
Online Access:	https://arxiv.org/abs/2510.08956
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Patterns in the Transition From Founder-Leadership to Community Governance of Open Source
by: Noori, Mobina, et al.
Published: (2025)

Responsible AI in Open Ecosystems: Reconciling Innovation with Risk Assessment and Disclosure
by: Chakraborti, Mahasweta, et al.
Published: (2024)

The Hidden AI Race: Tracking Environmental Costs of Innovation
by: Agarwal, Shyam, et al.
Published: (2025)

Self-reflection in Automated Qualitative Coding: Improving Text Annotation through Secondary LLM Critique
by: Dunivin, Zackary Okun, et al.
Published: (2026)

Do We Run How We Say We Run? Formalization and Practice of Governance in OSS Communities
by: Chakraborti, Mahasweta, et al.
Published: (2023)

BaichuanSEED: Sharing the Potential of ExtensivE Data Collection and Deduplication by Introducing a Competitive Large Language Model Baseline
by: Dong, Guosheng, et al.
Published: (2024)

Correcting misinformation on social media with a large language model
by: Zhou, Xinyi, et al.
Published: (2024)

SkillsVote: Lifecycle Governance of Agent Skills from Collection, Recommendation to Evolution
by: Liu, Hongyi, et al.
Published: (2026)

INTIMA: A Benchmark for Human-AI Companionship Behavior
by: Kaffee, Lucie-Aimée, et al.
Published: (2025)

DevEval: Evaluating Code Generation in Practical Software Projects
by: Li, Jia, et al.
Published: (2024)

Human Psychometric Questionnaires Mischaracterize LLM Behavior
by: Song, Woojung, et al.
Published: (2025)

From Sycophancy to Sensemaking: Premise Governance for Human-AI Decision Making
by: Jain, Raunak
Published: (2026)

NLP4Gov: A Comprehensive Library for Computational Policy Analysis
by: Chakraborti, Mahasweta, et al.
Published: (2024)

Impact of Stickers on Multimodal Sentiment and Intent in Social Media: A New Task, Dataset and Baseline
by: Shi, Yuanchen, et al.
Published: (2024)

Explainable Ethical Assessment on Human Behaviors by Generating Conflicting Social Norms
by: Sun, Yuxi, et al.
Published: (2025)

From Human-to-Human to Human-to-Bot Conversations in Software Engineering
by: Khojah, Ranim, et al.
Published: (2024)

Aligning Large Language Model Behavior with Human Citation Preferences
by: Ando, Kenichiro, et al.
Published: (2026)

Teaching Values to Machines: Simulating Human-Like Behavior in LLMs
by: Yehudai, Asaf, et al.
Published: (2026)

Dataset Creation and Baseline Models for Sexism Detection in Hausa
by: Muhammad, Fatima Adam, et al.
Published: (2025)

Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections
by: Borchmann, Łukasz, et al.
Published: (2026)

Can LLMs Truly Embody Human Personality? Analyzing AI and Human Behavior Alignment in Dispute Resolution
by: Kwon, Deuksin, et al.
Published: (2026)

The Third Ambition: Artificial Intelligence and the Science of Human Behavior
by: Neuman, W. Russell, et al.
Published: (2026)

The Real, the Better: Aligning Large Language Models with Online Human Behaviors
by: Jiang, Guanying, et al.
Published: (2024)

Keep Guessing? When Considering Inference Scaling, Mind the Baselines
by: Yona, Gal, et al.
Published: (2024)

AgentOccam: A Simple Yet Strong Baseline for LLM-Based Web Agents
by: Yang, Ke, et al.
Published: (2024)

A Joint Neural Baseline for Concept, Assertion, and Relation Extraction from Clinical Text
by: Cheng, Fei, et al.
Published: (2026)

LLM Nepotism in Organizational Governance
by: Mao, Shunqi, et al.
Published: (2026)

'Neural howlround' in large language models: a self-reinforcing bias phenomenon, and a dynamic attenuation solution
by: Drake, Seth
Published: (2025)

Human vs. Machine: Behavioral Differences Between Expert Humans and Language Models in Wargame Simulations
by: Lamparth, Max, et al.
Published: (2024)

The Right Model for the Job: An Evaluation of Legal Multi-Label Classification Baselines
by: Forster, Martina, et al.
Published: (2024)

Leveraging Large Language Models in Code Question Answering: Baselines and Issues
by: Andryushchenko, Georgy, et al.
Published: (2024)

Can Large Language Models put 2 and 2 together? Probing for Entailed Arithmetical Relationships
by: Panas, D., et al.
Published: (2024)

EigenBench: A Comparative Behavioral Measure of Value Alignment
by: Chang, Jonathn, et al.
Published: (2025)

Be.FM: Open Foundation Models for Human Behavior
by: Xie, Yutong, et al.
Published: (2025)

Inner Speech as Behavior Guides: Steerable Imitation of Diverse Behaviors for Human-AI coordination
by: Trivedi, Rakshit, et al.
Published: (2026)

Language Models are Few-Shot Graders
by: Zhao, Chenyan, et al.
Published: (2025)

Siren: A Learning-Based Multi-Turn Attack Framework for Simulating Real-World Human Jailbreak Behaviors
by: Zhao, Yi, et al.
Published: (2025)

Strings from the Library of Babel: Random Sampling as a Strong Baseline for Prompt Optimisation
by: Lu, Yao, et al.
Published: (2023)

SPA: A Simple but Tough-to-Beat Baseline for Knowledge Injection
by: Tang, Kexian, et al.
Published: (2026)

Lightweight Baselines for Medical Abstract Classification: DistilBERT with Cross-Entropy as a Strong Default
by: Liu, Jiaqi, et al.
Published: (2025)