Saved in:
| Main Authors: | Yashwanth, Tadisetty Sai, Royal, Yangalasetty Sruthi, Shreya, Vankayala Rajeshwari, Kashyap, Mayank, N, Divyaprabha K |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2508.11690 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
A Multi-Agent Pokemon Tournament for Evaluating Strategic Reasoning of Large Language Models
by: Yashwanth, Tadisetty Sai, et al.
Published: (2025)
by: Yashwanth, Tadisetty Sai, et al.
Published: (2025)
On the Structure of Floating-Point Noise in Batch-Invariant GPU Matrix Multiplication
by: Yashwanth, Tadisetty Sai
Published: (2025)
by: Yashwanth, Tadisetty Sai
Published: (2025)
BioBlue: Systematic runaway-optimiser-like LLM failure modes on biologically and economically aligned AI safety benchmarks for LLMs with simplified observation format
by: Pihlakas, Roland, et al.
Published: (2025)
by: Pihlakas, Roland, et al.
Published: (2025)
To Build or Not to Build? Factors that Lead to Non-Development or Abandonment of AI Systems
by: Chappidi, Shreya, et al.
Published: (2026)
by: Chappidi, Shreya, et al.
Published: (2026)
Safe for Whom? Rethinking How We Evaluate the Safety of LLMs for Real Users
by: Kempermann, Manon, et al.
Published: (2025)
by: Kempermann, Manon, et al.
Published: (2025)
An Analysis of Artificial Intelligence Adoption in NIH-Funded Research
by: Nananukul, Navapat, et al.
Published: (2026)
by: Nananukul, Navapat, et al.
Published: (2026)
Leveraging LLM-Respondents for Item Evaluation: a Psychometric Analysis
by: Liu, Yunting, et al.
Published: (2024)
by: Liu, Yunting, et al.
Published: (2024)
The Silicon Psyche: Anthropomorphic Vulnerabilities in Large Language Models
by: Canale, Giuseppe, et al.
Published: (2025)
by: Canale, Giuseppe, et al.
Published: (2025)
Accountability Capture: How Record-Keeping to Support AI Transparency and Accountability (Re)shapes Algorithmic Oversight
by: Chappidi, Shreya, et al.
Published: (2025)
by: Chappidi, Shreya, et al.
Published: (2025)
Scrapyard AI
by: Böhlen, Marc, et al.
Published: (2026)
by: Böhlen, Marc, et al.
Published: (2026)
Building Interpretable Models for Moral Decision-Making
by: Goel, Mayank, et al.
Published: (2026)
by: Goel, Mayank, et al.
Published: (2026)
LLM-Guided Synthetic Augmentation (LGSA) for Mitigating Bias in AI Systems
by: Karri, Sai Suhruth Reddy, et al.
Published: (2025)
by: Karri, Sai Suhruth Reddy, et al.
Published: (2025)
EthicsMH: A Pilot Benchmark for Ethical Reasoning in Mental Health AI
by: Kasu, Sai Kartheek Reddy
Published: (2025)
by: Kasu, Sai Kartheek Reddy
Published: (2025)
Criminal Liability of Generative Artificial Intelligence Providers for User-Generated Child Sexual Abuse Material
by: Mojica-Hanke, Anamaria, et al.
Published: (2026)
by: Mojica-Hanke, Anamaria, et al.
Published: (2026)
Deploying ADVISER: Impact and Lessons from Using Artificial Intelligence for Child Vaccination Uptake in Nigeria
by: Kehinde, Opadele, et al.
Published: (2023)
by: Kehinde, Opadele, et al.
Published: (2023)
Stop Testing Attacks, Start Diagnosing Defenses: The Four-Checkpoint Framework Reveals Where LLM Safety Breaks
by: Dhabhi, Hayfa, et al.
Published: (2026)
by: Dhabhi, Hayfa, et al.
Published: (2026)
A Checklist for Trustworthy, Safe, and User-Friendly Mental Health Chatbots
by: Haran, Shreya, et al.
Published: (2026)
by: Haran, Shreya, et al.
Published: (2026)
InterveneBench: Benchmarking LLMs for Intervention Reasoning and Causal Study Design in Real Social Systems
by: Shi, Shaojie, et al.
Published: (2026)
by: Shi, Shaojie, et al.
Published: (2026)
AI Generated Child Sexual Abuse Material -- What's the Harm?
by: Ciardha, Caoilte Ó, et al.
Published: (2025)
by: Ciardha, Caoilte Ó, et al.
Published: (2025)
Unveiling AI's Threats to Child Protection: Regulatory efforts to Criminalize AI-Generated CSAM and Emerging Children's Rights Violations
by: Kokolaki, Emmanouela, et al.
Published: (2025)
by: Kokolaki, Emmanouela, et al.
Published: (2025)
Listening with Language Models: Using LLMs to Collect and Interpret Classroom Feedback
by: Maram, Sai Siddartha, et al.
Published: (2025)
by: Maram, Sai Siddartha, et al.
Published: (2025)
The Impact of Artificial Intelligence on Traditional Art Forms: A Disruption or Enhancement
by: Marella, Viswa Chaitanya, et al.
Published: (2025)
by: Marella, Viswa Chaitanya, et al.
Published: (2025)
Comparative Analysis of Multi-Agent Reinforcement Learning Policies for Crop Planning Decision Support
by: Mahajan, Anubha, et al.
Published: (2024)
by: Mahajan, Anubha, et al.
Published: (2024)
Sparks of Rationality: Do Reasoning LLMs Align with Human Judgment and Choice?
by: Tak, Ala N., et al.
Published: (2026)
by: Tak, Ala N., et al.
Published: (2026)
An LLM-Powered Agent for Real-Time Analysis of the Vietnamese IT Job Market
by: Nguyen, Minh-Thuan, et al.
Published: (2025)
by: Nguyen, Minh-Thuan, et al.
Published: (2025)
Are LLMs Court-Ready? Evaluating Frontier Models on Indian Legal Reasoning
by: Juvekar, Kush, et al.
Published: (2025)
by: Juvekar, Kush, et al.
Published: (2025)
Classification for everyone : Building geography agnostic models for fairer recognition
by: Jindal, Akshat, et al.
Published: (2023)
by: Jindal, Akshat, et al.
Published: (2023)
SentinelSphere: Integrating AI-Powered Real-Time Threat Detection with Cybersecurity Awareness Training
by: Tantaroudas, Nikolaos D., et al.
Published: (2026)
by: Tantaroudas, Nikolaos D., et al.
Published: (2026)
SV3.3B: A Sports Video Understanding Model for Action Recognition
by: Kodathala, Sai Varun, et al.
Published: (2025)
by: Kodathala, Sai Varun, et al.
Published: (2025)
From Cloud to Edge: Rethinking Generative AI for Low-Resource Design Challenges
by: Vuruma, Sai Krishna Revanth, et al.
Published: (2024)
by: Vuruma, Sai Krishna Revanth, et al.
Published: (2024)
Justice in Judgment: Unveiling (Hidden) Bias in LLM-assisted Peer Reviews
by: Vasu, Sai Suresh Macharla, et al.
Published: (2025)
by: Vasu, Sai Suresh Macharla, et al.
Published: (2025)
Complexity of Faceted Explanations in Propositional Abduction
by: Schmidt, Johannes, et al.
Published: (2025)
by: Schmidt, Johannes, et al.
Published: (2025)
Manimator: Transforming Research Papers into Visual Explanations
by: P, Samarth, et al.
Published: (2025)
by: P, Samarth, et al.
Published: (2025)
FairJob: A Real-World Dataset for Fairness in Online Systems
by: Vladimirova, Mariia, et al.
Published: (2024)
by: Vladimirova, Mariia, et al.
Published: (2024)
How English Print Media Frames Human-Elephant Conflicts in India
by: Punith, Bonala Sai, et al.
Published: (2026)
by: Punith, Bonala Sai, et al.
Published: (2026)
Sentiment Analysis of Cyberbullying Data in Social Media
by: Susmitha, Arvapalli Sai, et al.
Published: (2024)
by: Susmitha, Arvapalli Sai, et al.
Published: (2024)
Is Your AI Truly Yours? Leveraging Blockchain for Copyrights, Provenance, and Lineage
by: Wang, Qin, et al.
Published: (2024)
by: Wang, Qin, et al.
Published: (2024)
Enhancing Debugging Skills with AI-Powered Assistance: A Real-Time Tool for Debugging Support
by: Artser, Elizaveta, et al.
Published: (2026)
by: Artser, Elizaveta, et al.
Published: (2026)
Wearable Device-Based Real-Time Monitoring of Physiological Signals: Evaluating Cognitive Load Across Different Tasks
by: He, Ling, et al.
Published: (2024)
by: He, Ling, et al.
Published: (2024)
The Systems Engineering Approach in Times of Large Language Models
by: Cabrera, Christian, et al.
Published: (2024)
by: Cabrera, Christian, et al.
Published: (2024)
Similar Items
-
A Multi-Agent Pokemon Tournament for Evaluating Strategic Reasoning of Large Language Models
by: Yashwanth, Tadisetty Sai, et al.
Published: (2025) -
On the Structure of Floating-Point Noise in Batch-Invariant GPU Matrix Multiplication
by: Yashwanth, Tadisetty Sai
Published: (2025) -
BioBlue: Systematic runaway-optimiser-like LLM failure modes on biologically and economically aligned AI safety benchmarks for LLMs with simplified observation format
by: Pihlakas, Roland, et al.
Published: (2025) -
To Build or Not to Build? Factors that Lead to Non-Development or Abandonment of AI Systems
by: Chappidi, Shreya, et al.
Published: (2026) -
Safe for Whom? Rethinking How We Evaluate the Safety of LLMs for Real Users
by: Kempermann, Manon, et al.
Published: (2025)