:: Library Catalog

Buchumschlag

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Yeung, Joshua Au, Dalmasso, Jacopo, Foschini, Luca, Dobson, Richard JB, Kraljevic, Zeljko
Format:	Preprint
Veröffentlicht:	2025
Schlagworte:	Machine Learning Artificial Intelligence
Online-Zugang:	https://arxiv.org/abs/2509.10970
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Ähnliche Einträge

Large Language Models for Medical Forecasting -- Foresight 2
von: Kraljevic, Zeljko, et al.
Veröffentlicht: (2024)

Hallucinating with AI: AI Psychosis as Distributed Delusions
von: Osler, Lucy
Veröffentlicht: (2025)

AI Psychosis: Does Conversational AI Amplify Delusion-Related Language?
von: Shimgekar, Soorya Ram, et al.
Veröffentlicht: (2026)

Delusions of Large Language Models
von: Xu, Hongshen, et al.
Veröffentlicht: (2025)

Unlocking Electronic Health Records: A Hybrid Graph RAG Approach to Safe Clinical AI for Patient QA
von: Thio, Samuel, et al.
Veröffentlicht: (2025)

FastOMOP: A Foundational Architecture for Reliable Agentic Real-World Evidence Generation on OMOP CDM data
von: Moeller-Grell, Niko, et al.
Veröffentlicht: (2026)

Grounding Large Language Models in Clinical Evidence: A Retrieval-Augmented Generation System for Querying UK NICE Clinical Guidelines
von: Lewis, Matthew, et al.
Veröffentlicht: (2025)

Self-HarmLLM: Can Large Language Model Harm Itself?
von: Kim, Heehwan, et al.
Veröffentlicht: (2025)

Purposefully Induced Psychosis (PIP): Embracing Hallucination as Imagination in Large Language Models
von: Pilcher, Kris, et al.
Veröffentlicht: (2025)

A Novel Zero-Touch, Zero-Trust, AI/ML Enablement Framework for IoT Network Security
von: Shakya, Sushil, et al.
Veröffentlicht: (2025)

AI Meets the Classroom: When Do Large Language Models Harm Learning?
von: Lehmann, Matthias, et al.
Veröffentlicht: (2024)

LLM Spirals of Delusion: A Benchmarking Audit Study of AI Chatbot Interfaces
von: Kirgis, Peter, et al.
Veröffentlicht: (2026)

Booster: Tackling Harmful Fine-tuning for Large Language Models via Attenuating Harmful Perturbation
von: Huang, Tiansheng, et al.
Veröffentlicht: (2024)

Why Do Large Language Models Generate Harmful Content?
von: Ganguli, Rajesh, et al.
Veröffentlicht: (2026)

Developing and Evaluating an AI-Assisted Prediction Model for Unplanned Intensive Care Admissions following Elective Neurosurgery using Natural Language Processing within an Electronic Healthcare Record System
von: Ive, Julia, et al.
Veröffentlicht: (2025)

The Depth Delusion: Why Transformers Should Be Wider, Not Deeper
von: Fahim, Md Muhtasim Munif, et al.
Veröffentlicht: (2026)

LLM Psychosis: A Theoretical and Diagnostic Framework for Reality-Boundary Failures in Large Language Models
von: Raj, Ashutosh
Veröffentlicht: (2026)

Manipulation and the AI Act: Large Language Model Chatbots and the Danger of Mirrors
von: Krook, Joshua
Veröffentlicht: (2025)

The DCR Delusion: Measuring the Privacy Risk of Synthetic Data
von: Yao, Zexi, et al.
Veröffentlicht: (2025)

Knowledge as Enablement
Veröffentlicht: (2023)

Harm in AI-Driven Societies: An Audit of Toxicity Adoption on Chirper.ai
von: Coppolillo, Erica, et al.
Veröffentlicht: (2026)

AI for Distributed Systems Design: Scalable Cloud Optimization Through Repeated LLMs Sampling And Simulators
von: Tagliabue, Jacopo
Veröffentlicht: (2025)

Prefix Probing: Lightweight Harmful Content Detection for Large Language Models
von: Yang, Jirui, et al.
Veröffentlicht: (2025)

RealHarm: A Collection of Real-World Language Model Application Failures
von: Jeune, Pierre Le, et al.
Veröffentlicht: (2025)

Evaluating Language Models for Harmful Manipulation
von: Akbulut, Canfer, et al.
Veröffentlicht: (2026)

Safe2Harm: Semantic Isomorphism Attacks for Jailbreaking Large Language Models
von: Yang, Fan
Veröffentlicht: (2025)

Large Language Models are Vulnerable to Bait-and-Switch Attacks for Generating Harmful Content
von: Bianchi, Federico, et al.
Veröffentlicht: (2024)

Surgery: Mitigating Harmful Fine-Tuning for Large Language Models via Attention Sink
von: Liu, Guozhi, et al.
Veröffentlicht: (2026)

OSPC: Detecting Harmful Memes with Large Language Model as a Catalyst
von: Cao, Jingtao, et al.
Veröffentlicht: (2024)

CSAI: Conditional Self-Attention Imputation for Healthcare Time-series
von: Qian, Linglong, et al.
Veröffentlicht: (2023)

Interactive Navigation in Environments with Traversable Obstacles Using Large Language and Vision-Language Models
von: Zhang, Zhen, et al.
Veröffentlicht: (2023)

Model-Based Reinforcement Learning under Random Observation Delays
von: Karamzade, Armin, et al.
Veröffentlicht: (2025)

Uncertainty-Aware Deep Attention Recurrent Neural Network for Heterogeneous Time Series Imputation
von: Qian, Linglong, et al.
Veröffentlicht: (2024)

Harmful Fine-tuning Attacks and Defenses for Large Language Models: A Survey
von: Huang, Tiansheng, et al.
Veröffentlicht: (2024)

Helpful or Harmful? Exploring the Efficacy of Large Language Models for Online Grooming Prevention
von: Prosser, Ellie, et al.
Veröffentlicht: (2024)

AdamMeme: Adaptively Probe the Reasoning Capacity of Multimodal Large Language Models on Harmfulness
von: Chen, Zixin, et al.
Veröffentlicht: (2025)

Moderating Harm: Benchmarking Large Language Models for Cyberbullying Detection in YouTube Comments
von: Muminovic, Amel
Veröffentlicht: (2025)

Code Red! On the Harmfulness of Applying Off-the-shelf Large Language Models to Programming Tasks
von: Al-Kaswan, Ali, et al.
Veröffentlicht: (2025)

VaccineRAG: Boosting Multimodal Large Language Models' Immunity to Harmful RAG Samples
von: Sun, Qixin, et al.
Veröffentlicht: (2025)

Harmful Traits of AI Companions
von: Knox, W. Bradley, et al.
Veröffentlicht: (2025)