:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Gelman, Haywood, Hastings, John D.
Format:	Preprint
Published:	2025
Subjects:	Cryptography and Security Artificial Intelligence Computation and Language Computers and Society C.2.0; I.2.7; K.4.1; H.3.3
Online Access:	https://arxiv.org/abs/2502.07045
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

An Ethically Grounded LLM-Based Approach to Insider Threat Synthesis and Detection
by: Gelman, Haywood, et al.
Published: (2025)

Harmful Terms and Where to Find Them: Measuring and Modeling Unfavorable Financial Terms and Conditions in Shopping Websites at Scale
by: Tsai, Elisa, et al.
Published: (2025)

Toward an Insider Threat Education Platform: A Theoretical Literature Review
by: Gelman, Haywood, et al.
Published: (2024)

Who Shares What? An Empirical Analysis of Security Conference Content Across Academia and Industry
by: Walter, Lukas, et al.
Published: (2024)

From Native Memes to Global Moderation: Cross-Cultural Evaluation of Vision-Language Models for Hateful Meme Detection
by: Wang, Mo, et al.
Published: (2026)

The Cultural Gene of Large Language Models: A Study on the Impact of Cross-Corpus Training on Model Values and Biases
by: Fenech-Borg, Emanuel Z., et al.
Published: (2025)

Safeguarding Virtual Healthcare: A Novel Attacker-Centric Model for Data Security and Privacy
by: Herath, Suvineetha, et al.
Published: (2024)

AI vs. Human Moderators: A Comparative Evaluation of Multimodal LLMs in Content Moderation for Brand Safety
by: Levi, Adi, et al.
Published: (2025)

Adaptive Defense Orchestration for RAG: A Sentinel-Strategist Architecture against Multi-Vector Attacks
by: Pallerla, Pranav, et al.
Published: (2026)

CAMP: Cumulative Agentic Masking and Pruning for Privacy Protection in Multi-Turn LLM Conversations
by: Panjwani, Aman
Published: (2026)

CEKER: A Generalizable LLM Framework for Literature Analysis with a Case Study in Unikernel Security
by: Wollman, Alex, et al.
Published: (2024)

Deterministic Fuzzy Triage for Legal Compliance Classification and Evidence Retrieval
by: Atri, Rian
Published: (2026)

Cultural Encoding in Large Language Models: The Existence Gap in AI-Mediated Brand Discovery
by: Junyao, Huang, et al.
Published: (2025)

Toward Secure and Compliant AI: Organizational Standards and Protocols for NLP Model Lifecycle Management
by: Arora, Sunil, et al.
Published: (2025)

A Systematic Review and Taxonomy for Privacy Breach Classification: Trends, Gaps, and Future Directions
by: Fuchs, Clint, et al.
Published: (2025)

VectorSmuggle: Steganographic Exfiltration in Embedding Stores and a Cryptographic Provenance Defense
by: Wanger, Jascha
Published: (2026)

Retrieval Augmented Classification for Confidential Documents
by: Chang, Yeseul E., et al.
Published: (2026)

Towards Reliable Retrieval in RAG Systems for Large Legal Datasets
by: Reuter, Markus, et al.
Published: (2025)

Gender and Race Bias in Consumer Product Recommendations by Large Language Models
by: Xu, Ke, et al.
Published: (2026)

LLM Output Drift: Cross-Provider Validation & Mitigation for Financial Workflows
by: Khatchadourian, Raffi, et al.
Published: (2025)

Autonomous Penetration Testing: Solving Capture-the-Flag Challenges with LLMs
by: Bakker, Isabelle, et al.
Published: (2025)

LLMLogAnalyzer: A Clustering-Based Log Analysis Chatbot using Large Language Models
by: Cai, Peng, et al.
Published: (2025)

AlignDP: Hybrid Differential Privacy with Rarity-Aware Protection for LLMs
by: Gaikwad, Madhava
Published: (2025)

Incorporating Legal Structure in Retrieval-Augmented Generation: A Case Study on Copyright Fair Use
by: Ho, Justin, et al.
Published: (2025)

Big Help or Big Brother? Auditing Tracking, Profiling, and Personalization in Generative AI Assistants
by: Vekaria, Yash, et al.
Published: (2025)

Auditing Preferences for Brands and Cultures in LLMs
by: Rienecker, Jasmine, et al.
Published: (2026)

Tatemae: Detecting Alignment Faking via Tool Selection in LLMs
by: Leonesi, Matteo, et al.
Published: (2026)

Who Leads in the Shadows? ERGM and Centrality Analysis of Congressional Democrats on Bluesky
by: Hew, Gordon, et al.
Published: (2025)

Cross-Subreddit Behavior as Open-Source Indicators of Coordinated Influence: A Case Study of r/Sino & r/China
by: Pilaud, Manon, et al.
Published: (2025)

Whisper Leak: a side-channel attack on Large Language Models
by: McDonald, Geoff, et al.
Published: (2025)

Transforming Computer Security and Public Trust Through the Exploration of Fine-Tuning Large Language Models
by: Crumrine, Garrett, et al.
Published: (2024)

Powerful Training-Free Membership Inference Against Autoregressive Language Models
by: Ilić, David, et al.
Published: (2026)

MODP: Multi Objective Directional Prompting
by: Nema, Aashutosh, et al.
Published: (2025)

Privately Fine-Tuned LLMs Preserve Temporal Dynamics in Tabular Data
by: Rosenblatt, Lucas, et al.
Published: (2026)

VulCPE: Context-Aware Cybersecurity Vulnerability Retrieval and Management
by: Jiang, Yuning, et al.
Published: (2025)

FACTUM: Mechanistic Detection of Citation Hallucination in Long-Form RAG
by: Dassen, Maxime, et al.
Published: (2026)

Recipient Profiling: Predicting Characteristics from Messages
by: Borquez, Martin, et al.
Published: (2024)

Lightweight LLMs for Network Attack Detection in IoT Networks
by: Sudasinghe, Piyumi Bhagya, et al.
Published: (2026)

NameBERT: Scaling Name-Based Nationality Classification with LLM-Augmented Open Academic Data
by: Ming, Cong, et al.
Published: (2026)

Can LLMs Understand What We Cannot Say? Measuring Multilevel Alignment Through Abortion Stigma Across Cognitive, Interpersonal, and Structural Levels
by: Sharma, Anika, et al.
Published: (2025)