:: Library Catalog

Copertina

Salvato in:

Dettagli Bibliografici
Autori principali:	Schoenberg, William, Girard, Davidson, Chung, Saras, O'Neill, Ellen, Velasquez, Janet, Metcalf, Sara
Natura:	Preprint
Pubblicazione:	2025
Soggetti:	Artificial Intelligence
Accesso online:	https://arxiv.org/abs/2503.15580
Tags:	Aggiungi Tag Nessun Tag, puoi essere il primo ad aggiungerne!!

Documenti analoghi

The Qualitative Engine: Creating and Evaluating an Iterative AI Modeling Tool
di: William Schoenberg, et al.
Pubblicazione: (2026)

BEAMS: Benchmarking and Evaluating AI for Modeling and Simulation
di: Metcalf, Sara, et al.
Pubblicazione: (2026)

Regulating AI: Applying insights from behavioural economics and psychology to the application of article 5 of the EU AI Act
di: Zhong, Huixin, et al.
Pubblicazione: (2023)

Building and Learning With Models Using AI
di: William Schoenberg
Pubblicazione: (2026)

Efficiency Will Not Lead to Sustainable Reasoning AI
di: Wiesner, Philipp, et al.
Pubblicazione: (2025)

How to Train Your Advisor: Steering Black-Box LLMs with Advisor Models
di: Asawa, Parth, et al.
Pubblicazione: (2025)

Paired Completion: Flexible Quantification of Issue-framing at Scale with LLMs
di: Angus, Simon D, et al.
Pubblicazione: (2024)

How Can Generative AI Enhance the Well-being of Blind?
di: Bendel, Oliver
Pubblicazione: (2024)

Cooperate to Compete: Strategic Coordination in Multi-Agent Conquest
di: O'Neill, Abigail, et al.
Pubblicazione: (2026)

How Well Can Transformers Emulate In-context Newton's Method?
di: Giannou, Angeliki, et al.
Pubblicazione: (2024)

How Well Do Models Follow Their Constitutions?
di: Jakkli, Arya, et al.
Pubblicazione: (2026)

“Can you help me think this through?” How pediatric hospitalists learn from informal peer consultation
di: Laura B. O'Neill, et al.
Pubblicazione: (2024)

AI reasoning effort predicts human decision time in content moderation
di: Davidson, Thomas
Pubblicazione: (2025)

How Clinicians Think and What AI Can Learn From It
di: Sengupta, Dipayan, et al.
Pubblicazione: (2026)

AIBuildAI: An AI Agent for Automatically Building AI Models
di: Zhang, Ruiyi, et al.
Pubblicazione: (2026)

How Well Can LLMs Negotiate? NegotiationArena Platform and Analysis
di: Bianchi, Federico, et al.
Pubblicazione: (2024)

Dukawalla: Voice Interfaces for Small Businesses in Africa
di: Ankrah, Elizabeth, et al.
Pubblicazione: (2025)

Can OpenAI o1 Reason Well in Ophthalmology? A 6,990-Question Head-to-Head Evaluation Study
di: Srinivasan, Sahana, et al.
Pubblicazione: (2025)

Why the Center Can't Hold: A Diagnosis of Puritanized America
di: O’Neill, Tom
Pubblicazione: (2019)

Towards Measuring and Modeling "Culture" in LLMs: A Survey
di: Adilazuarda, Muhammad Farid, et al.
Pubblicazione: (2024)

How Well Do Large Language Models Truly Ground?
di: Lee, Hyunji, et al.
Pubblicazione: (2023)

How Well Do Multimodal Models Reason on ECG Signals?
di: Xu, Maxwell A., et al.
Pubblicazione: (2026)

How Well Can a Long Sequence Model Model Long Sequences? Comparing Architechtural Inductive Biases on Long-Context Abilities
di: Huang, Jerry
Pubblicazione: (2024)

How Well Can Vison-Language Models Understand Humans' Intention? An Open-ended Theory of Mind Question Evaluation Benchmark
di: Wen, Ximing, et al.
Pubblicazione: (2025)

MedLoRD: A Medical Low-Resource Diffusion Model for High-Resolution 3D CT Image Synthesis
di: Seyfarth, Marvin, et al.
Pubblicazione: (2025)

"I know myself better, but not really greatly": How Well Can LLMs Detect and Explain LLM-Generated Texts?
di: Ji, Jiazhou, et al.
Pubblicazione: (2025)

VibeServe: Can AI Agents Build Bespoke LLM Serving Systems?
di: Kamahori, Keisuke, et al.
Pubblicazione: (2026)

Let the Barbarians In: How AI Can Accelerate Systems Performance Research
di: Cheng, Audrey, et al.
Pubblicazione: (2025)

How Well Can LLM Agents Simulate End-User Security and Privacy Attitudes and Behaviors?
di: Li, Yuxuan, et al.
Pubblicazione: (2026)

The AI Co-Ethnographer: How Far Can Automation Take Qualitative Research?
di: Retkowski, Fabian, et al.
Pubblicazione: (2025)

A Survey on Test-Time Scaling in Large Language Models: What, How, Where, and How Well?
di: Zhang, Qiyuan, et al.
Pubblicazione: (2025)

SD-MoE: Spectral Decomposition for Effective Expert Specialization
di: Huang, Ruijun, et al.
Pubblicazione: (2026)

SD-VLM: Spatial Measuring and Understanding with Depth-Encoded Vision-Language Models
di: Chen, Pingyi, et al.
Pubblicazione: (2025)

Two Online Map Matching Algorithms Based on Analytic Hierarchy Process and Fuzzy Logic
di: Lin, Jeremy J., et al.
Pubblicazione: (2024)

How Small Can 6G Reason? Scaling Tiny Language Models for AI-Native Networks
di: Ferrag, Mohamed Amine, et al.
Pubblicazione: (2026)

SD$^2$: Self-Distilled Sparse Drafters
di: Lasby, Mike, et al.
Pubblicazione: (2025)

AMR-SD: Asymmetric Meta-Reflective Self-Distillation for Token-Level Credit Assignment
di: Wei, Zhenlin, et al.
Pubblicazione: (2026)

How Group Lives Go Well
di: Beverley, John, et al.
Pubblicazione: (2025)

How to Build AI Agents by Augmenting LLMs with Codified Human Expert Domain Knowledge? A Software Engineering Framework
di: uulu, Choro Ulan, et al.
Pubblicazione: (2026)

How Well Does Agent Development Reflect Real-World Work?
di: Wang, Zora Zhiruo, et al.
Pubblicazione: (2026)