:: Library Catalog

Bibliographic Details
Main authors:	Goethals, Sofie; Rhue, Lauren
Format:	Preprint
Published:	2024
Subjects:	Computation and Language
Online access:	https://arxiv.org/abs/2412.10281

Similar Items

Evaluating LLMs for Gender Disparities in Notable Persons
by: Rhue, Lauren, et al.
Published: (2024)

Prompt-Counterfactual Explanations for Generative AI System Behavior
by: Goethals, Sofie, et al.
Published: (2026)

Would a Large Language Model Pay Extra for a View? Inferring Willingness to Pay from Subjective Choices
by: Reusens, Manon, et al.
Published: (2026)

Cash or Comfort? How LLMs Value Your Inconvenience
by: Cedro, Mateusz, et al.
Published: (2025)

The Basic B*** Effect: The Use of LLM-based Agents Reduces the Distinctiveness and Diversity of People's Choices
by: Matz, Sandra C., et al.
Published: (2025)

Reranking individuals: The effect of fair classification within-groups
by: Goethals, Sofie, et al.
Published: (2024)

Resource-constrained Fairness
by: Goethals, Sofie, et al.
Published: (2024)

Manson superstar
Published: (2006)

When One LLM Drools, Multi-LLM Collaboration Rules
by: Feng, Shangbin, et al.
Published: (2025)

DiscoTrack: A Multilingual LLM Benchmark for Discourse Tracking
by: Bu, Lanni, et al.
Published: (2025)

A Customer Journey in the Land of Oz: Leveraging the Wizard of Oz Technique to Model Emotions in Customer Service Interactions
by: Labat, Sofie, et al.
Published: (2025)

LLMBridge: An LLM Pipeline for End-to-end Referential Bridging Resolution in English
by: Levine, Lauren, et al.
Published: (2026)

Brevity is the soul of sustainability: Characterizing LLM response lengths
by: Poddar, Soham, et al.
Published: (2025)

LCO: LLM-based Constraint Optimization for Safer Agentic LLMs in Real-world Tasks
by: Wan, Jiayong, et al.
Published: (2026)

ReZero: Enhancing LLM search ability by trying one-more-time
by: Dao, Alan, et al.
Published: (2025)

One-Eval: An Agentic System for Automated and Traceable LLM Evaluation
by: Shen, Chengyu, et al.
Published: (2026)

One Token to Fool LLM-as-a-Judge
by: Zhao, Yulai, et al.
Published: (2025)

LLM one-shot style transfer for Authorship Attribution and Verification
by: Miralles-González, Pablo, et al.
Published: (2025)

Researchers waste 80% of LLM annotation costs by classifying one text at a time
by: Pipal, Christian, et al.
Published: (2026)

TRUEBench: Can LLM Response Meet Real-world Constraints as Productivity Assistant?
by: Park, Jiho, et al.
Published: (2025)

MultiZebraLogic: A Multilingual Logical Reasoning Benchmark
by: Bruun, Sofie Helene, et al.
Published: (2025)

Beyond One Path: Evaluating and Enhancing Divergent Thinking in Interactive LLM Agents
by: Park, Jihyeong, et al.
Published: (2026)

One Language, Two Scripts: Probing Script-Invariance in LLM Concept Representations
by: Karne, Sripad
Published: (2026)

OneLLM: One Framework to Align All Modalities with Language
by: Han, Jiaming, et al.
Published: (2023)

VitaBench: Benchmarking LLM Agents with Versatile Interactive Tasks in Real-world Applications
by: He, Wei, et al.
Published: (2025)

StockBench: Can LLM Agents Trade Stocks Profitably In Real-world Markets?
by: Chen, Yanxu, et al.
Published: (2025)

OneEval: Benchmarking LLM Knowledge-intensive Reasoning over Diverse Knowledge Bases
by: Chen, Yongrui, et al.
Published: (2025)

Skill is Not One-Size-Fits-All: Model-Aware Skill Alignment for LLM Agents
by: Yu, Jianxiang, et al.
Published: (2026)

One LLM to Train Them All: Multi-Task Learning Framework for Fact-Checking
by: Larsson, Malin Astrid, et al.
Published: (2026)

OneShield -- the Next Generation of LLM Guardrails
by: DeLuca, Chad, et al.
Published: (2025)

Bias in the Mirror: Are LLMs opinions robust to their own adversarial attacks ?
by: Rennard, Virgile, et al.
Published: (2024)

SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator
by: Chen, Guoxuan, et al.
Published: (2024)

Are LLM-Judges Robust to Expressions of Uncertainty? Investigating the effect of Epistemic Markers on LLM-based Evaluation
by: Lee, Dongryeol, et al.
Published: (2024)

One Persona, Many Cues, Different Results: How Sociodemographic Cues Impact LLM Personalization
by: Weeber, Franziska, et al.
Published: (2026)

One Adapts to Any: Meta Reward Modeling for Personalized LLM Alignment
by: Cai, Hongru, et al.
Published: (2026)

One Size Fits None: Heuristic Collapse in LLM Investment Advice
by: Ross, Jillian, et al.
Published: (2026)

DepressLLM: Interpretable domain-adapted language model for depression detection from real-world narratives
by: Moon, Sehwan, et al.
Published: (2025)

From Jack of All Trades to Master of One: Specializing LLM-based Autoraters to a Test Set
by: Finkelstein, Mara, et al.
Published: (2024)

One Word at a Time: Incremental Completion Decomposition Breaks LLM Safety
by: Arif, Samee, et al.
Published: (2026)

CodeNav: Beyond tool-use to using real-world codebases with LLM agents
by: Gupta, Tanmay, et al.
Published: (2024)