| Main Authors: | Amirizaniani, Maryam, Martin, Elias, Roosta, Tanya, Chadha, Aman, Shah, Chirag |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2402.09334 |
Similar Items
LLMAuditor: A Framework for Auditing Large Language Models Using Human-in-the-Loop
by: Amirizaniani, Maryam, et al.
Published: (2024)
Do LLMs Exhibit Human-Like Reasoning? Evaluating Theory of Mind in LLMs for Open-Ended Responses
by: Amirizaniani, Maryam, et al.
Published: (2024)
I Think, Therefore I Am Under-Qualified? A Benchmark for Evaluating Linguistic Shibboleth Detection in LLM Hiring Evaluations
by: Kharchenko, Julia, et al.
Published: (2025)
How Well Do LLMs Represent Values Across Cultures? Empirical Analysis of LLM Responses Based on Hofstede Cultural Dimensions
by: Kharchenko, Julia, et al.
Published: (2024)
Rudder: Steering Prefetching in Distributed GNN Training using LLM Agents
by: Sarkar, Aishwarya, et al.
Published: (2026)
AuditWen: An Open-Source Large Language Model for Audit
by: Huang, Jiajia, et al.
Published: (2024)
Are Small Language Models Ready to Compete with Large Language Models for Practical Applications?
by: Sinha, Neelabh, et al.
Published: (2024)
Learning to Reason for Multi-Step Retrieval of Personal Context in Personalized Question Answering
by: Amirizaniani, Maryam, et al.
Published: (2026)
Privacy Auditing of Large Language Models
by: Panda, Ashwinee, et al.
Published: (2025)
Reasoning or Rhetoric? An Empirical Analysis of Moral Reasoning Explanations in Large Language Models
by: Kasat, Aryan, et al.
Published: (2026)
Exploring the Impact of Large Language Models on Recommender Systems: An Extensive Review
by: Vats, Arpita, et al.
Published: (2024)
Auditing the Ethical Logic of Generative AI Models
by: Neuman, W. Russell, et al.
Published: (2025)
CALM: Curiosity-Driven Auditing for Large Language Models
by: Zheng, Xiang, et al.
Published: (2025)
TRUST: A Decentralized Framework for Auditing Large Language Model Reasoning
by: Huang, Morris Yu-Chao, et al.
Published: (2025)
PromptAudit: Auditing Prompt Sensitivity in LLM-Based Vulnerability Detection
by: Camarato, Steffen J., et al.
Published: (2026)
A Comprehensive Survey of Accelerated Generation Techniques in Large Language Models
by: Khoshnoodi, Mahsa, et al.
Published: (2024)
Human-Readable Adversarial Prompts: An Investigation into LLM Vulnerabilities Using Situational Context
by: Das, Nilanjana, et al.
Published: (2024)
Born With a Silver Spoon? Investigating Socioeconomic Bias in Large Language Models
by: Singh, Smriti, et al.
Published: (2024)
Auditing Pay-Per-Token in Large Language Models
by: Velasco, Ander Artola, et al.
Published: (2025)
Automating Security Audit Using Large Language Model based Agent: An Exploration Experiment
by: Chin, Jia Hui, et al.
Published: (2025)
Output Scouting: Auditing Large Language Models for Catastrophic Responses
by: Bell, Andrew, et al.
Published: (2024)
PRISM: A Methodology for Auditing Biases in Large Language Models
by: Azzopardi, Leif, et al.
Published: (2024)
Can Large Language Models Infer Causal Relationships from Real-World Text?
by: Saklad, Ryan, et al.
Published: (2025)
Auditing Disability Representation in Vision-Language Models
by: Panda, Srikant, et al.
Published: (2026)
Simulating Meaning, Nevermore! Introducing ICR: A Semiotic-Hermeneutic Metric for Evaluating Meaning in LLM Text Summaries
by: Perez, Natalie, et al.
Published: (2026)
AlignMerge - Alignment-Preserving Large Language Model Merging via Fisher-Guided Geometric Constraints
by: Roy, Aniruddha, et al.
Published: (2025)
RAudit: A Blind Auditing Protocol for Large Language Model Reasoning
by: Chang, Edward Y., et al.
Published: (2026)
The Refusal–Compliance Tradeoff: A Large-Scale Safety Behavior Audit of Large Language Models
by: Hasan, Alif Al, et al.
Published: (2026)
The Illusion of Fairness: Auditing Fairness Interventions with Audit Studies
by: Sariola, Disa, et al.
Published: (2025)
Explicit Cognitive Allocation: A Principle for Governed and Auditable Inference in Large Language Models
by: Manzanilla-Granados, Héctor Manuel, et al.
Published: (2026)
Counterfactual Trace Auditing of LLM Agent Skills
by: Zhou, Xiaolin, et al.
Published: (2026)
Large Language Model based Smart Contract Auditing with LLMBugScanner
by: Yuan, Yining, et al.
Published: (2025)
Training LLMs with Reinforcement Learning for Intent-Aware Personalized Question Answering
by: Amirizaniani, Maryam, et al.
Published: (2026)
Affording Process Auditability with QualAnalyzer: An Atomistic LLM Analysis Tool for Qualitative Research
by: Lu, Max Hao, et al.
Published: (2026)
Auditing Games for Sandbagging
by: Taylor, Jordan, et al.
Published: (2025)
Auditable Agents
by: Nian, Yi, et al.
Published: (2026)
INACIA: Integrating Large Language Models in Brazilian Audit Courts: Opportunities and Challenges
by: Pereira, Jayr, et al.
Published: (2024)
Who Gets Left Behind? Auditing Disability Inclusivity in Large Language Models
by: Dash, Deepika, et al.
Published: (2025)
Don't Change My View: Ideological Bias Auditing in Large Language Models
by: Kröger, Paul, et al.
Published: (2025)
Soft Token Attacks Cannot Reliably Audit Unlearning in Large Language Models
by: Chen, Haokun, et al.
Published: (2025)