| Main Author: | Howard, Austin |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Online Access: | https://arxiv.org/abs/2505.18156 |
Similar Items
SECURE: Benchmarking Large Language Models for Cybersecurity
by: Bhusal, Dipkamal, et al.
Published: (2024)
Human-Centered Privacy Research in the Age of Large Language Models
by: Li, Tianshi, et al.
Published: (2024)
Adversarial VR: An Open-Source Testbed for Evaluating Adversarial Robustness of VR Cybersickness Detection and Mitigation
by: Ahmed, Istiak, et al.
Published: (2025)
The Silicon Psyche: Anthropomorphic Vulnerabilities in Large Language Models
by: Canale, Giuseppe, et al.
Published: (2025)
Privacy Leakage Overshadowed by Views of AI: A Study on Human Oversight of Privacy in Language Model Agent
by: Zhang, Zhiping, et al.
Published: (2024)
Beyond Words: On Large Language Models Actionability in Mission-Critical Risk Analysis
by: Esposito, Matteo, et al.
Published: (2024)
From Assistants to Adversaries: Exploring the Security Risks of Mobile LLM Agents
by: Wu, Liangxuan, et al.
Published: (2025)
RLHFPoison: Reward Poisoning Attack for Reinforcement Learning with Human Feedback in Large Language Models
by: Wang, Jiongxiao, et al.
Published: (2023)
Persuasion and Phishing: Analysing the Interplay of Persuasion Tactics in Cyber Threats
by: Khadka, Kalam
Published: (2024)
Manipulation Attacks by Misaligned AI: Risk Analysis and Safety Case Framework
by: Dassanayake, Rishane, et al.
Published: (2025)
Personalised Feedback Framework for Online Education Programmes Using Generative AI
by: Kuzminykh, Ievgeniia, et al.
Published: (2024)
"Impressively Scary:" Exploring User Perceptions and Reactions to Unraveling Machine Learning Models in Social Media Applications
by: West, Jack, et al.
Published: (2025)
A Security Risk Taxonomy for Prompt-Based Interaction With Large Language Models
by: Derner, Erik, et al.
Published: (2023)
Adversarial Attacks on Machine Learning-Aided Visualizations
by: Fujiwara, Takanori, et al.
Published: (2024)
Risk Psychology & Cyber-Attack Tactics
by: Kim, Rubens, et al.
Published: (2025)
Watch Your Language: Investigating Content Moderation with Large Language Models
by: Kumar, Deepak, et al.
Published: (2023)
MeAJOR Corpus: A Multi-Source Dataset for Phishing Email Detection
by: Mendes, Paulo, et al.
Published: (2025)
BounTCHA: A CAPTCHA Utilizing Boundary Identification in Guided Generative AI-extended Videos
by: Lin, Lehao, et al.
Published: (2025)
Cyri: A Conversational AI-based Assistant for Supporting the Human User in Detecting and Responding to Phishing Attacks
by: La Torre, Antonio, et al.
Published: (2025)
OpenAI's Approach to External Red Teaming for AI Models and Systems
by: Ahmad, Lama, et al.
Published: (2025)
PrivateXR: Defending Privacy Attacks in Extended Reality Through Explainable AI-Guided Differential Privacy
by: Kundu, Ripan Kumar, et al.
Published: (2025)
Autonomy Reshapes How Personalization Affects Privacy Concerns and Trust in LLM Agents
by: Zhang, Zhiping, et al.
Published: (2025)
JEEVHITAA -- An End-to-End HCAI System to Support Collective Care
by: Srinivasan, Shyama Sastha Krishnamoorthy, et al.
Published: (2025)
Human-AI Collaboration in Cloud Security: Cognitive Hierarchy-Driven Deep Reinforcement Learning
by: Aref, Zahra, et al.
Published: (2025)
Agentic AI and the Industrialization of Cyber Offense: Forecast, Consequences, and Defensive Priorities for Enterprises and the Mittelstand
by: Koch, Christopher
Published: (2026)
Decision-Aware Trust Signal Alignment for SOC Alert Triage
by: Chowdhury, Israt Jahan, et al.
Published: (2026)
Current state of LLM Risks and AI Guardrails
by: Ayyamperumal, Suriya Ganesh, et al.
Published: (2024)
Rescriber: Smaller-LLM-Powered User-Led Data Minimization for LLM-Based Chatbots
by: Zhou, Jijie, et al.
Published: (2024)
Empowering Users in Digital Privacy Management through Interactive LLM-Based Agents
by: Sun, Bolun, et al.
Published: (2024)
"It's a Fair Game", or Is It? Examining How Users Navigate Disclosure Risks and Benefits When Using LLM-Based Conversational Agents
by: Zhang, Zhiping, et al.
Published: (2023)
Play Guessing Game with LLM: Indirect Jailbreak Attack with Implicit Clues
by: Chang, Zhiyuan, et al.
Published: (2024)
Towards Secure AI-driven Industrial Metaverse with NFT Digital Twins
by: Prakash, Ravi, et al.
Published: (2024)
AI-Assisted Adaptive Rendering for High-Frequency Security Telemetry in Web Interfaces
by: Rajhans, Mona
Published: (2026)
Human-Centered Explainability in AI-Enhanced UI Security Interfaces: Designing Trustworthy Copilots for Cybersecurity Analysts
by: Rajhans, Mona
Published: (2026)
JailbreakLens: Visual Analysis of Jailbreak Attacks Against Large Language Models
by: Feng, Yingchaojie, et al.
Published: (2024)
XRZoo: A Large-Scale and Versatile Dataset of Extended Reality (XR) Applications
by: Li, Shuqing, et al.
Published: (2024)
Identify As A Human Does: A Pathfinder of Next-Generation Anti-Cheat Framework for First-Person Shooter Games
by: Zhang, Jiayi, et al.
Published: (2024)
DASH: Deception-Augmented Shared Mental Model for a Human-Machine Teaming System
by: Wan, Zelin, et al.
Published: (2025)
Emergent misalignment as prompt sensitivity: A research note
by: Wyse, Tim, et al.
Published: (2025)
PRISM: A Personalized, Rapid, and Immersive Skill Mastery framework for personalizing experiential learning through Generative AI
by: Lin, Yu-Zheng, et al.
Published: (2024)