Saved in:
| Main Authors: | Gelman, Haywood, Hastings, John D. |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2502.07045 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
An Ethically Grounded LLM-Based Approach to Insider Threat Synthesis and Detection
by: Gelman, Haywood, et al.
Published: (2025)
by: Gelman, Haywood, et al.
Published: (2025)
Harmful Terms and Where to Find Them: Measuring and Modeling Unfavorable Financial Terms and Conditions in Shopping Websites at Scale
by: Tsai, Elisa, et al.
Published: (2025)
by: Tsai, Elisa, et al.
Published: (2025)
Toward an Insider Threat Education Platform: A Theoretical Literature Review
by: Gelman, Haywood, et al.
Published: (2024)
by: Gelman, Haywood, et al.
Published: (2024)
Who Shares What? An Empirical Analysis of Security Conference Content Across Academia and Industry
by: Walter, Lukas, et al.
Published: (2024)
by: Walter, Lukas, et al.
Published: (2024)
From Native Memes to Global Moderation: Cross-Cultural Evaluation of Vision-Language Models for Hateful Meme Detection
by: Wang, Mo, et al.
Published: (2026)
by: Wang, Mo, et al.
Published: (2026)
The Cultural Gene of Large Language Models: A Study on the Impact of Cross-Corpus Training on Model Values and Biases
by: Fenech-Borg, Emanuel Z., et al.
Published: (2025)
by: Fenech-Borg, Emanuel Z., et al.
Published: (2025)
Safeguarding Virtual Healthcare: A Novel Attacker-Centric Model for Data Security and Privacy
by: Herath, Suvineetha, et al.
Published: (2024)
by: Herath, Suvineetha, et al.
Published: (2024)
AI vs. Human Moderators: A Comparative Evaluation of Multimodal LLMs in Content Moderation for Brand Safety
by: Levi, Adi, et al.
Published: (2025)
by: Levi, Adi, et al.
Published: (2025)
Adaptive Defense Orchestration for RAG: A Sentinel-Strategist Architecture against Multi-Vector Attacks
by: Pallerla, Pranav, et al.
Published: (2026)
by: Pallerla, Pranav, et al.
Published: (2026)
CAMP: Cumulative Agentic Masking and Pruning for Privacy Protection in Multi-Turn LLM Conversations
by: Panjwani, Aman
Published: (2026)
by: Panjwani, Aman
Published: (2026)
CEKER: A Generalizable LLM Framework for Literature Analysis with a Case Study in Unikernel Security
by: Wollman, Alex, et al.
Published: (2024)
by: Wollman, Alex, et al.
Published: (2024)
Deterministic Fuzzy Triage for Legal Compliance Classification and Evidence Retrieval
by: Atri, Rian
Published: (2026)
by: Atri, Rian
Published: (2026)
Cultural Encoding in Large Language Models: The Existence Gap in AI-Mediated Brand Discovery
by: Junyao, Huang, et al.
Published: (2025)
by: Junyao, Huang, et al.
Published: (2025)
Toward Secure and Compliant AI: Organizational Standards and Protocols for NLP Model Lifecycle Management
by: Arora, Sunil, et al.
Published: (2025)
by: Arora, Sunil, et al.
Published: (2025)
A Systematic Review and Taxonomy for Privacy Breach Classification: Trends, Gaps, and Future Directions
by: Fuchs, Clint, et al.
Published: (2025)
by: Fuchs, Clint, et al.
Published: (2025)
VectorSmuggle: Steganographic Exfiltration in Embedding Stores and a Cryptographic Provenance Defense
by: Wanger, Jascha
Published: (2026)
by: Wanger, Jascha
Published: (2026)
Retrieval Augmented Classification for Confidential Documents
by: Chang, Yeseul E., et al.
Published: (2026)
by: Chang, Yeseul E., et al.
Published: (2026)
Towards Reliable Retrieval in RAG Systems for Large Legal Datasets
by: Reuter, Markus, et al.
Published: (2025)
by: Reuter, Markus, et al.
Published: (2025)
Gender and Race Bias in Consumer Product Recommendations by Large Language Models
by: Xu, Ke, et al.
Published: (2026)
by: Xu, Ke, et al.
Published: (2026)
LLM Output Drift: Cross-Provider Validation & Mitigation for Financial Workflows
by: Khatchadourian, Raffi, et al.
Published: (2025)
by: Khatchadourian, Raffi, et al.
Published: (2025)
Autonomous Penetration Testing: Solving Capture-the-Flag Challenges with LLMs
by: Bakker, Isabelle, et al.
Published: (2025)
by: Bakker, Isabelle, et al.
Published: (2025)
LLMLogAnalyzer: A Clustering-Based Log Analysis Chatbot using Large Language Models
by: Cai, Peng, et al.
Published: (2025)
by: Cai, Peng, et al.
Published: (2025)
AlignDP: Hybrid Differential Privacy with Rarity-Aware Protection for LLMs
by: Gaikwad, Madhava
Published: (2025)
by: Gaikwad, Madhava
Published: (2025)
Incorporating Legal Structure in Retrieval-Augmented Generation: A Case Study on Copyright Fair Use
by: Ho, Justin, et al.
Published: (2025)
by: Ho, Justin, et al.
Published: (2025)
Big Help or Big Brother? Auditing Tracking, Profiling, and Personalization in Generative AI Assistants
by: Vekaria, Yash, et al.
Published: (2025)
by: Vekaria, Yash, et al.
Published: (2025)
Auditing Preferences for Brands and Cultures in LLMs
by: Rienecker, Jasmine, et al.
Published: (2026)
by: Rienecker, Jasmine, et al.
Published: (2026)
Tatemae: Detecting Alignment Faking via Tool Selection in LLMs
by: Leonesi, Matteo, et al.
Published: (2026)
by: Leonesi, Matteo, et al.
Published: (2026)
Who Leads in the Shadows? ERGM and Centrality Analysis of Congressional Democrats on Bluesky
by: Hew, Gordon, et al.
Published: (2025)
by: Hew, Gordon, et al.
Published: (2025)
Cross-Subreddit Behavior as Open-Source Indicators of Coordinated Influence: A Case Study of r/Sino & r/China
by: Pilaud, Manon, et al.
Published: (2025)
by: Pilaud, Manon, et al.
Published: (2025)
Whisper Leak: a side-channel attack on Large Language Models
by: McDonald, Geoff, et al.
Published: (2025)
by: McDonald, Geoff, et al.
Published: (2025)
Transforming Computer Security and Public Trust Through the Exploration of Fine-Tuning Large Language Models
by: Crumrine, Garrett, et al.
Published: (2024)
by: Crumrine, Garrett, et al.
Published: (2024)
Powerful Training-Free Membership Inference Against Autoregressive Language Models
by: Ilić, David, et al.
Published: (2026)
by: Ilić, David, et al.
Published: (2026)
MODP: Multi Objective Directional Prompting
by: Nema, Aashutosh, et al.
Published: (2025)
by: Nema, Aashutosh, et al.
Published: (2025)
Privately Fine-Tuned LLMs Preserve Temporal Dynamics in Tabular Data
by: Rosenblatt, Lucas, et al.
Published: (2026)
by: Rosenblatt, Lucas, et al.
Published: (2026)
VulCPE: Context-Aware Cybersecurity Vulnerability Retrieval and Management
by: Jiang, Yuning, et al.
Published: (2025)
by: Jiang, Yuning, et al.
Published: (2025)
FACTUM: Mechanistic Detection of Citation Hallucination in Long-Form RAG
by: Dassen, Maxime, et al.
Published: (2026)
by: Dassen, Maxime, et al.
Published: (2026)
Recipient Profiling: Predicting Characteristics from Messages
by: Borquez, Martin, et al.
Published: (2024)
by: Borquez, Martin, et al.
Published: (2024)
Lightweight LLMs for Network Attack Detection in IoT Networks
by: Sudasinghe, Piyumi Bhagya, et al.
Published: (2026)
by: Sudasinghe, Piyumi Bhagya, et al.
Published: (2026)
NameBERT: Scaling Name-Based Nationality Classification with LLM-Augmented Open Academic Data
by: Ming, Cong, et al.
Published: (2026)
by: Ming, Cong, et al.
Published: (2026)
Can LLMs Understand What We Cannot Say? Measuring Multilevel Alignment Through Abortion Stigma Across Cognitive, Interpersonal, and Structural Levels
by: Sharma, Anika, et al.
Published: (2025)
by: Sharma, Anika, et al.
Published: (2025)
Similar Items
-
An Ethically Grounded LLM-Based Approach to Insider Threat Synthesis and Detection
by: Gelman, Haywood, et al.
Published: (2025) -
Harmful Terms and Where to Find Them: Measuring and Modeling Unfavorable Financial Terms and Conditions in Shopping Websites at Scale
by: Tsai, Elisa, et al.
Published: (2025) -
Toward an Insider Threat Education Platform: A Theoretical Literature Review
by: Gelman, Haywood, et al.
Published: (2024) -
Who Shares What? An Empirical Analysis of Security Conference Content Across Academia and Industry
by: Walter, Lukas, et al.
Published: (2024) -
From Native Memes to Global Moderation: Cross-Cultural Evaluation of Vision-Language Models for Hateful Meme Detection
by: Wang, Mo, et al.
Published: (2026)