| Main Authors: | Davies, Adam, Nguyen, Elisa, Simeone, Michael, Johnston, Erik, Gubri, Martin |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Online Access: | https://arxiv.org/abs/2412.16355 |
Similar Items
Is Multilingual LLM Watermarking Truly Multilingual? Scaling Robustness to 100+ Languages via Back-Translation
by: Mohamed, Asim, et al.
Published: (2025)
Time Series Foundation Models for Energy Load Forecasting on Consumer Hardware: A Multi-Dimensional Zero-Shot Benchmark
by: Simeone, Luigi
Published: (2026)
DISCO: Diversifying Sample Condensation for Efficient Model Evaluation
by: Rubinstein, Alexander, et al.
Published: (2025)
Scaling Up Membership Inference: When and How Attacks Succeed on Large Language Models
by: Puerto, Haritz, et al.
Published: (2024)
Physics-Guided Multimodal Transformers are the Necessary Foundation for the Next Generation of Meteorological Science
by: Han, Jing, et al.
Published: (2025)
Operationalizing Perceptions of Agent Gender: Foundations and Guidelines
by: Seaborn, Katie, et al.
Published: (2026)
Calibrating Large Language Models Using Their Generations Only
by: Ulmer, Dennis, et al.
Published: (2024)
Leaky Thoughts: Large Reasoning Models Are Not Private Thinkers
by: Green, Tommaso, et al.
Published: (2025)
RAIL in the Wild: Operationalizing Responsible AI Evaluation Using Anthropic's Value Dataset
by: Verma, Sumit, et al.
Published: (2025)
The Emergence of Social Science of Large Language Models
by: Jia, Xiao, et al.
Published: (2025)
An Agentic Operationalization of DISARM for FIMI Investigation on Social Media
by: Tseng, Kevin, et al.
Published: (2026)
ReplicatorBench: Benchmarking LLM Agents for Replicability in Social and Behavioral Sciences
by: Nguyen, Bang, et al.
Published: (2026)
Evaluating the Use of Large Language Models as Synthetic Social Agents in Social Science Research
by: Madden, Emma Rose
Published: (2025)
C-SEO Bench: Does Conversational SEO Work?
by: Puerto, Haritz, et al.
Published: (2025)
Dr.LLM: Dynamic Layer Routing in LLMs
by: Heakl, Ahmed, et al.
Published: (2025)
Robotics-Inspired Guardrails for Foundation Models in Socially Sensitive Domains
by: Ramnauth, Rebecca, et al.
Published: (2026)
Ecosystem Graphs: The Social Footprint of Foundation Models
by: Bommasani, Rishi, et al.
Published: (2023)
Operationalizing Contextual Integrity in Privacy-Conscious Assistants
by: Ghalebikesabi, Sahra, et al.
Published: (2024)
Levels of AGI for Operationalizing Progress on the Path to AGI
by: Morris, Meredith Ringel, et al.
Published: (2023)
MASEval: Extending Multi-Agent Evaluation from Models to Systems
by: Emde, Cornelius, et al.
Published: (2026)
GeoAI in Social Science
by: Li, Wenwen
Published: (2023)
TRAP: Targeted Random Adversarial Prompt Honeypot for Black-Box Identification
by: Gubri, Martin, et al.
Published: (2024)
When Chain of Thought is Necessary, Language Models Struggle to Evade Monitors
by: Emmons, Scott, et al.
Published: (2025)
Social Human Robot Embodied Conversation (SHREC) Dataset: Benchmarking Foundational Models' Social Reasoning
by: Lee, Dong Won, et al.
Published: (2025)
Word Embedding for Social Sciences: An Interdisciplinary Survey
by: Matsui, Akira, et al.
Published: (2022)
OmniSapiens: A Foundation Model for Social Behavior Processing via Heterogeneity-Aware Relative Policy Optimization
by: Ong, Keane, et al.
Published: (2026)
SocialNav: Training Human-Inspired Foundation Model for Socially-Aware Embodied Navigation
by: Chen, Ziyi, et al.
Published: (2025)
ExpProof : Operationalizing Explanations for Confidential Models with ZKPs
by: Yadav, Chhavi, et al.
Published: (2025)
LLM Agents as Social Scientists: A Human-AI Collaborative Platform for Social Science Automation
by: Wang, Lei, et al.
Published: (2026)
Conceptual Logical Foundations of Artificial Social Intelligence
by: Werner, Eric
Published: (2025)
Specification, Application, and Operationalization of a Metamodel of Fairness
by: Mendez, Julian Alfredo, et al.
Published: (2025)
The Science of Evaluating Foundation Models
by: Yuan, Jiayi, et al.
Published: (2025)
Is Monotonic Sampling Necessary in Diffusion Models?
by: Khan, Muhammad Haris
Published: (2026)
Towards Operationalizing Right to Data Protection
by: Java, Abhinav, et al.
Published: (2024)
Social Behaviour Understanding using Deep Neural Networks: Development of Social Intelligence Systems
by: Feng, Ethan Lim Ding, et al.
Published: (2021)
A Grounded Observer Framework for Establishing Guardrails for Foundation Models in Socially Sensitive Domains
by: Ramnauth, Rebecca, et al.
Published: (2024)
LLM-Assisted Replication for Quantitative Social Science
by: Kubota, So, et al.
Published: (2026)
Assessing and Mitigating Miscalibration in LLM-Based Social Science Measurement
by: Wang, Jinyuan, et al.
Published: (2026)
AI, Climate, and Transparency: Operationalizing and Improving the AI Act
by: Alder, Nicolas, et al.
Published: (2024)
Operationalizing Data Minimization for Privacy-Preserving LLM Prompting
by: Zhou, Jijie, et al.
Published: (2025)