| Main Authors: | Bignotti, Camilla, Camassa, Carolina |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Online Access: | https://arxiv.org/abs/2407.19760 |
Similar Items
Prompting for Policy: Forecasting Macroeconomic Scenarios with Synthetic LLM Personas
by: Iadisernia, Giulia, et al.
Published: (2025)
Chat Bankman-Fried: an Exploration of LLM Alignment in Finance
by: Biancotti, Claudia, et al.
Published: (2024)
Do as I Say, Not as I Do: Instruction-Induction Conflict in LLMs
by: Camassa, Carolina, et al.
Published: (2026)
LLMs Provide Unstable Answers to Legal Questions
by: Blair-Stanek, Andrew, et al.
Published: (2025)
Towards Next-Generation Medical Agent: How o1 is Reshaping Decision-Making in Medical Scenarios
by: Xu, Shaochen, et al.
Published: (2024)
Seeing Like an AI: How LLMs Apply (and Misapply) Wikipedia Neutrality Norms
by: Ashkinaze, Joshua, et al.
Published: (2024)
Assessing LLMs in Art Contexts: Critique Generation and Theory of Mind Evaluation
by: Arita, Takaya, et al.
Published: (2025)
Are LLMs Court-Ready? Evaluating Frontier Models on Indian Legal Reasoning
by: Juvekar, Kush, et al.
Published: (2025)
RTP-LX: Can LLMs Evaluate Toxicity in Multilingual Scenarios?
by: de Wynter, Adrian, et al.
Published: (2024)
Leveraging Large Language Models (LLMs) for Traffic Management at Urban Intersections: The Case of Mixed Traffic Scenarios
by: Masri, Sari, et al.
Published: (2024)
Evaluating the Promise and Pitfalls of LLMs in Hiring Decisions
by: Anzenberg, Eitan, et al.
Published: (2025)
RAGAT-Mind: A Multi-Granular Modeling Approach for Rumor Detection Based on MindSpore
by: Qin, Zhenkai, et al.
Published: (2025)
Benchmarking the Legal Reasoning of LLMs in Arabic Islamic Inheritance Cases
by: AlDahoul, Nouar, et al.
Published: (2025)
PLawBench: A Rubric-Based Benchmark for Evaluating LLMs in Real-World Legal Practice
by: Shi, Yuzhen, et al.
Published: (2026)
Does Claude's Constitution Have a Culture?
by: Pourdavood, Parham
Published: (2026)
Epistemic Constitutionalism Or: how to avoid coherence bias
by: Loi, Michele
Published: (2026)
Legal Fact Prediction: The Missing Piece in Legal Judgment Prediction
by: Liu, Junkai, et al.
Published: (2024)
Are Models Trained on Indian Legal Data Fair?
by: Girhepuje, Sahil, et al.
Published: (2023)
Towards Grammatical Tagging for the Legal Language of Cybersecurity
by: Castiglione, Gianpietro, et al.
Published: (2023)
Mining Legal Arguments to Study Judicial Formalism
by: Koref, Tomáš, et al.
Published: (2025)
Toward Robust Legal Text Formalization into Defeasible Deontic Logic using LLMs
by: Horner, Elias, et al.
Published: (2025)
RoleConflictBench: A Benchmark of Role Conflict Scenarios for Evaluating LLMs' Contextual Sensitivity
by: Shin, Jisu, et al.
Published: (2025)
Gender Bias in LLMs: Preliminary Evidence from Shared Parenting Scenario in Czech Family Law
by: Harasta, Jakub, et al.
Published: (2026)
Artificial Intelligence and Civil Discourse: How LLMs Moderate Climate Change Conversations
by: Fan, Wenlu, et al.
Published: (2025)
Minding the Politeness Gap in Cross-cultural Communication
by: Machino, Yuka, et al.
Published: (2025)
Large Legal Fictions: Profiling Legal Hallucinations in Large Language Models
by: Dahl, Matthew, et al.
Published: (2024)
Algorithmic Fairness in NLP: Persona-Infused LLMs for Human-Centric Hate Speech Detection
by: Gajewska, Ewelina, et al.
Published: (2025)
Caveat Lector: Large Language Models in Legal Practice
by: Mik, Eliza
Published: (2024)
Beyond Accuracy: Diagnosing Algebraic Reasoning Failures in LLMs Across Nine Complexity Dimensions
by: Patil, Parth, et al.
Published: (2026)
How Large Language Models (LLMs) Extrapolate: From Guided Missiles to Guided Prompts
by: Cao, Xuenan
Published: (2024)
Bridging Legal Interpretation and Formal Logic: Faithfulness, Assumption, and the Future of AI Legal Reasoning
by: Wang, Olivia Peiyu, et al.
Published: (2026)
Mind the Gap: Pitfalls of LLM Alignment with Asian Public Opinion
by: Shankar, Hari, et al.
Published: (2026)
From Perceptions to Decisions: Wildfire Evacuation Decision Prediction with Behavioral Theory-informed LLMs
by: Chen, Ruxiao, et al.
Published: (2025)
Hallucination-Free? Assessing the Reliability of Leading AI Legal Research Tools
by: Magesh, Varun, et al.
Published: (2024)
How Far Are LLMs from Believable AI? A Benchmark for Evaluating the Believability of Human Behavior Simulation
by: Xiao, Yang, et al.
Published: (2023)
Red Lines and Grey Zones in the Fog of War: Benchmarking Legal Risk, Moral Harm, and Regional Bias in Large Language Model Military Decision-Making
by: Drinkall, Toby
Published: (2025)
Persuadability and LLMs as Legal Decision Tools
by: Suttle, Oisin, et al.
Published: (2026)
Few-shot Hate Speech Detection Based on the MindSpore Framework
by: Qin, Zhenkai, et al.
Published: (2025)
ArabLegalEval: A Multitask Benchmark for Assessing Arabic Legal Knowledge in Large Language Models
by: Hijazi, Faris, et al.
Published: (2024)
CLERC: A Dataset for Legal Case Retrieval and Retrieval-Augmented Analysis Generation
by: Hou, Abe Bohan, et al.
Published: (2024)